and formal language theory Sylvain Salvati INRIA Bordeaux - - PowerPoint PPT Presentation

and formal language theory
SMART_READER_LITE
LIVE PREVIEW

and formal language theory Sylvain Salvati INRIA Bordeaux - - PowerPoint PPT Presentation

The word problem in Z 2 and formal language theory Sylvain Salvati INRIA Bordeaux Sud-Ouest Topology and languages June 22-24 Outline The group language of Z 2 A similar problem in computational linguistics Multiple Context Free Grammars


slide-1
SLIDE 1

The word problem in Z2 and formal language theory

Sylvain Salvati

INRIA Bordeaux Sud-Ouest

Topology and languages June 22-24

slide-2
SLIDE 2

Outline

The group language of Z2 A similar problem in computational linguistics Multiple Context Free Grammars (MCFGs) A grammar for O2 Proof of the Theorem A Theorem on Jordan curves Conjectures

slide-3
SLIDE 3

Outline

The group language of Z2 A similar problem in computational linguistics Multiple Context Free Grammars (MCFGs) A grammar for O2 Proof of the Theorem A Theorem on Jordan curves Conjectures

slide-4
SLIDE 4

Group languages

Group finite presentation:

◮ a finite set of generators Σ ◮ a finite set of defining equations E

slide-5
SLIDE 5

Group languages

Group finite presentation:

◮ a finite set of generators Σ ◮ a finite set of defining equations E

Word problem: given w in Σ∗, is w =E 1? Group language: {w ∈ Σ∗ | w =E 1}

slide-6
SLIDE 6

Group languages

Group finite presentation:

◮ a finite set of generators Σ ◮ a finite set of defining equations E

Word problem: given w in Σ∗, is w =E 1? Group language: {w ∈ Σ∗ | w =E 1}

◮ the word problem is in general undecidable (Novikov 1955,

Boone 1958)

◮ the languages of different representation of a group a

rationally equivalent

◮ relate algebraic properties of groups to language-theoretic

properties of their group languages

slide-7
SLIDE 7

Group languages

Group finite presentation:

◮ a finite set of generators Σ ◮ a finite set of defining equations E

Word problem: given w in Σ∗, is w =E 1? Group language: {w ∈ Σ∗ | w =E 1}

◮ the word problem is in general undecidable (Novikov 1955,

Boone 1958)

◮ the languages of different representation of a group a

rationally equivalent

◮ relate algebraic properties of groups to language-theoretic

properties of their group languages Example: a group language is context free iff its underlying group is virtually free (Muller Schupp 1983)

slide-8
SLIDE 8

A simple presentation of Z2

◮ Generators: {a; a; b; b} ◮ Defining equations: a−1 = a, b−1 = b, xy = yx a a b b

The associated group language is O2 = {w ∈ {a; a; b; b}∗||w|a = |w|a ∧ |w|b = |w|b}

slide-9
SLIDE 9

O2 and computational group theory

◮ Gilman (2005)

slide-10
SLIDE 10

Outline

The group language of Z2 A similar problem in computational linguistics Multiple Context Free Grammars (MCFGs) A grammar for O2 Proof of the Theorem A Theorem on Jordan curves Conjectures

slide-11
SLIDE 11

MIX

MIX = {w ∈ {a; b; c}∗||w|a = |w|b = |w|c} MIX and O2 are rationally equivalent

slide-12
SLIDE 12

The Bach language

◮ Bach (1981)

Wikipedia entry: http://en.wikipedia.org/wiki/Bach language

slide-13
SLIDE 13

The MIX language

◮ Marsh (1985)

Conjecture: MIX is not an indexed language.

slide-14
SLIDE 14

MIX and Tree Adjoining Grammars

◮ Joshi (1985)

[MIX] represents the extreme case of the degree of free word order permitted in a language. This extreme case is linguistically not

  • relevant. [. . . ] TAGs also cannot generate this language although

for TAGs the proof is not in hand yet.

slide-15
SLIDE 15

MIX and Tree Adjoining Grammars

◮ Vijay Shanker, Weir, Joshi (1991)

slide-16
SLIDE 16

MIX and mildly context sensitive languages

◮ Joshi, Vijay Shanker, Weir (1991)

slide-17
SLIDE 17

Outline

The group language of Z2 A similar problem in computational linguistics Multiple Context Free Grammars (MCFGs) A grammar for O2 Proof of the Theorem A Theorem on Jordan curves Conjectures

slide-18
SLIDE 18

Original paper

slide-19
SLIDE 19

A generalization of context-free grammars

Rule of a context free grammar: A → w1B1 . . . wnBnwn+1 with A, B1, . . . , Bn non-terminals and w1, . . . wn+1 string of terminals.

slide-20
SLIDE 20

A generalization of context-free grammars

Rule of a context free grammar: A → w1B1 . . . wnBnwn+1 with A, B1, . . . , Bn non-terminals and w1, . . . wn+1 string of terminals. A bottom-up view: A(w1x1 . . . wnxnwn+1) ← B1(x1), . . . , Bn(xn)

slide-21
SLIDE 21

A generalization of context-free grammars

Replace strings by tuple of strings: B(s1, . . . , sm) ← B1(x1

1, . . . , x1 k1), . . . , Bn(xn 1 , . . . , xn kn)

slide-22
SLIDE 22

A generalization of context-free grammars

Replace strings by tuple of strings: B(s1, . . . , sm) ← B1(x1

1, . . . , x1 k1), . . . , Bn(xn 1 , . . . , xn kn) ◮ the strings si are made of terminals and of the variables xi j ,

slide-23
SLIDE 23

A generalization of context-free grammars

Replace strings by tuple of strings: B(s1, . . . , sm) ← B1(x1

1, . . . , x1 k1), . . . , Bn(xn 1 , . . . , xn kn) ◮ the strings si are made of terminals and of the variables xi j , ◮ the variables xi j are pairwise distinct (otherwise we get

Groenink’s Literal Movement Grammars),

slide-24
SLIDE 24

A generalization of context-free grammars

Replace strings by tuple of strings: B(s1, . . . , sm) ← B1(x1

1, . . . , x1 k1), . . . , Bn(xn 1 , . . . , xn kn) ◮ the strings si are made of terminals and of the variables xi j , ◮ the variables xi j are pairwise distinct (otherwise we get

Groenink’s Literal Movement Grammars),

◮ each variable xi j has at most one occurrence in the string

s1 . . . sm (otherwise we get Parallel Multiple Context-Free Grammars).

slide-25
SLIDE 25

Formal definition

A m-MCFG(r) is a 4-tuple (N, T, P, S) such that:

◮ N is a ranked alphabet of non-terminals of max. rank m.

slide-26
SLIDE 26

Formal definition

A m-MCFG(r) is a 4-tuple (N, T, P, S) such that:

◮ N is a ranked alphabet of non-terminals of max. rank m. ◮ T is an alphabet of terminals

slide-27
SLIDE 27

Formal definition

A m-MCFG(r) is a 4-tuple (N, T, P, S) such that:

◮ N is a ranked alphabet of non-terminals of max. rank m. ◮ T is an alphabet of terminals ◮ P is a set of rules of the form:

A(s1, . . . , sk) ← B1(x1

1, . . . , x1 k1), . . . , Bn(xn 1 , . . . , xn kn)

where:

◮ A is a non-terminal of rank k, Bi is non-terminal of rank ki,

n ≤ r,

◮ the variables xi

j are pairwise distinct,

◮ the strings si are in (T ∪ X)∗ with X = n

i=1

ki

j=1{xi j },

◮ each variable xi

j has at most one occurrence in s1 . . . sk

slide-28
SLIDE 28

Formal definition

A m-MCFG(r) is a 4-tuple (N, T, P, S) such that:

◮ N is a ranked alphabet of non-terminals of max. rank m. ◮ T is an alphabet of terminals ◮ P is a set of rules of the form:

A(s1, . . . , sk) ← B1(x1

1, . . . , x1 k1), . . . , Bn(xn 1 , . . . , xn kn)

where:

◮ A is a non-terminal of rank k, Bi is non-terminal of rank ki,

n ≤ r,

◮ the variables xi

j are pairwise distinct,

◮ the strings si are in (T ∪ X)∗ with X = n

i=1

ki

j=1{xi j },

◮ each variable xi

j has at most one occurrence in s1 . . . sk

◮ S is a non-terminal of rank 1, the starting symbol.

slide-29
SLIDE 29

The language generated by an MCFG

Given an MCFG G = (N, T, P, S), if the following conditions holds:

◮ B1(s1 1, . . . , s1 k1), . . . , Bn(sn 1, . . . , sn kn) are derivable, ◮ A(s1, . . . , sk) ← B1(x1 1, . . . , x1 k1), . . . , Bn(xn 1 , . . . , xn kn) is a rule

in P then A(t1, . . . , tk) with ti = si[xi

j ← si j ]i∈[1;n],j∈[1;ki] is derivable.

slide-30
SLIDE 30

The language generated by an MCFG

Given an MCFG G = (N, T, P, S), if the following conditions holds:

◮ B1(s1 1, . . . , s1 k1), . . . , Bn(sn 1, . . . , sn kn) are derivable, ◮ A(s1, . . . , sk) ← B1(x1 1, . . . , x1 k1), . . . , Bn(xn 1 , . . . , xn kn) is a rule

in P then A(t1, . . . , tk) with ti = si[xi

j ← si j ]i∈[1;n],j∈[1;ki] is derivable.

The language define by G, L(G) is: {w | S(w) is derivable}

slide-31
SLIDE 31

An example

S(x1y1x2y2) ← P(x1, x2), Q(y1, y2) P(ax1, bx2) ← P(x1, x2) P(ǫ, ǫ) ← Q(cx1, dx2) ← Q(x1, x2) Q(ǫ, ǫ) ←

slide-32
SLIDE 32

An example

S(x1y1x2y2) ← P(x1, x2), Q(y1, y2) P(ax1, bx2) ← P(x1, x2) P(ǫ, ǫ) ← Q(cx1, dx2) ← Q(x1, x2) Q(ǫ, ǫ) ← Q(ǫ, ǫ)

slide-33
SLIDE 33

An example

S(x1y1x2y2) ← P(x1, x2), Q(y1, y2) P(ax1, bx2) ← P(x1, x2) P(ǫ, ǫ) ← Q(cx1, dx2) ← Q(x1, x2) Q(ǫ, ǫ) ← Q(ǫ, ǫ) Q(c, d)

slide-34
SLIDE 34

An example

S(x1y1x2y2) ← P(x1, x2), Q(y1, y2) P(ax1, bx2) ← P(x1, x2) P(ǫ, ǫ) ← Q(cx1, dx2) ← Q(x1, x2) Q(ǫ, ǫ) ← Q(ǫ, ǫ) Q(c, d) Q(cc, dd)

slide-35
SLIDE 35

An example

S(x1y1x2y2) ← P(x1, x2), Q(y1, y2) P(ax1, bx2) ← P(x1, x2) P(ǫ, ǫ) ← Q(cx1, dx2) ← Q(x1, x2) Q(ǫ, ǫ) ← Q(ǫ, ǫ) Q(c, d) Q(cc, dd) P(ǫ, ǫ)

slide-36
SLIDE 36

An example

S(x1y1x2y2) ← P(x1, x2), Q(y1, y2) P(ax1, bx2) ← P(x1, x2) P(ǫ, ǫ) ← Q(cx1, dx2) ← Q(x1, x2) Q(ǫ, ǫ) ← Q(ǫ, ǫ) Q(c, d) Q(cc, dd) P(ǫ, ǫ) P(a, b)

slide-37
SLIDE 37

An example

S(x1y1x2y2) ← P(x1, x2), Q(y1, y2) P(ax1, bx2) ← P(x1, x2) P(ǫ, ǫ) ← Q(cx1, dx2) ← Q(x1, x2) Q(ǫ, ǫ) ← Q(ǫ, ǫ) Q(c, d) Q(cc, dd) P(ǫ, ǫ) P(a, b) S(accbdd)

slide-38
SLIDE 38

An example

S(x1y1x2y2) ← P(x1, x2), Q(y1, y2) P(ax1, bx2) ← P(x1, x2) P(ǫ, ǫ) ← Q(cx1, dx2) ← Q(x1, x2) Q(ǫ, ǫ) ← Q(ǫ, ǫ) Q(c, d) Q(cc, dd) P(ǫ, ǫ) P(a, b) S(accbdd) S(ancmbndm) ← P(an, bn), Q(cm, dm) The language is: {ancmbndm | n ∈ N ∧ m ∈ N}

slide-39
SLIDE 39

The well-nestedness constraint

I(x1y1, y2x2) ← J(x1, x2), K(y1, y2) I(x1y1, x2y2) ← J(x1, x2), K(y1, y2) A(x1z1, z2x2y1, y2y3x3) ← B(x1, x2, x3) C(y1, y2, y3) D(z1, z2) A(z1x1, y1x2z2y2x3, y3) ← B(x1, x2, x3) C(y1, y2, y3) D(z1, z2)

slide-40
SLIDE 40

MCFLwn and MCFL

MCFLwn

{an

1 . . . an m | n ∈ N}

{wm+1 | w ∈ {a; b}∗, m ∈ N}

MCFL

slide-41
SLIDE 41

MCFLwn and MCFL

MCFLwn

{an

1 . . . an m | n ∈ N}

{wm+1 | w ∈ {a; b}∗, m ∈ N}

MCFL

{w1 . . . wnznwnzn−1 . . . z1w1z0wr

1 . . . wr n |

n ∈ N, wi ∈ {c; d}+, z0, . . . , zn ∈ D∗

1 }

Staudacher 1993 Michaelis 2005

slide-42
SLIDE 42

MCFLwn and MCFL

MCFLwn

{an

1 . . . an m | n ∈ N}

{wm+1 | w ∈ {a; b}∗, m ∈ N}

MCFL

{w1 . . . wnznwnzn−1 . . . z1w1z0wr

1 . . . wr n |

n ∈ N, wi ∈ {c; d}+, z0, . . . , zn ∈ D∗

1 }

Staudacher 1993 Michaelis 2005 {w#w#w | w ∈ D∗

1 }

Engelfriet, Skyum 1976

slide-43
SLIDE 43

MCFLwn and MCFL

MCFLwn

{an

1 . . . an m | n ∈ N}

{wm+1 | w ∈ {a; b}∗, m ∈ N}

MCFL

{w1 . . . wnznwnzn−1 . . . z1w1z0wr

1 . . . wr n |

n ∈ N, wi ∈ {c; d}+, z0, . . . , zn ∈ D∗

1 }

Staudacher 1993 Michaelis 2005 {w#w#w | w ∈ D∗

1 }

Engelfriet, Skyum 1976 {w#w | w ∈ D∗

1 }

Kanazawa, S. 2010

slide-44
SLIDE 44

Outline

The group language of Z2 A similar problem in computational linguistics Multiple Context Free Grammars (MCFGs) A grammar for O2 Proof of the Theorem A Theorem on Jordan curves Conjectures

slide-45
SLIDE 45

A 2-MCFG for O2

S(xy) ← Inv(x, y) Inv(x1y1, y2x2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1y1y2, x2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1, y1y2x2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1x2y1, y2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1, x2y1y2) ← Inv(x1, x2), Inv(y1, y2) Inv(αx1α, x2) ← Inv(x1, x2) Inv(αx1, αx2) ← Inv(x1, x2) Inv(αx1, x2α) ← Inv(x1, x2) Inv(x1α, αx2) ← Inv(x1, x2) Inv(x1α, x2α) ← Inv(x1, x2) Inv(x1, αx2α) ← Inv(x1, x2) Inv(x1y1x2, y2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1, y1x2y2) ← Inv(x1, x2), Inv(y1, y2) Inv(ǫ, ǫ) ← where α ∈ {a; b}

slide-46
SLIDE 46

A 2-MCFG for O2

S(xy) ← Inv(x, y) Inv(x1y1, y2x2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1y1y2, x2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1, y1y2x2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1x2y1, y2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1, x2y1y2) ← Inv(x1, x2), Inv(y1, y2) Inv(αx1α, x2) ← Inv(x1, x2) Inv(αx1, αx2) ← Inv(x1, x2) Inv(αx1, x2α) ← Inv(x1, x2) Inv(x1α, αx2) ← Inv(x1, x2) Inv(x1α, x2α) ← Inv(x1, x2) Inv(x1, αx2α) ← Inv(x1, x2) Inv(x1y1x2, y2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1, y1x2y2) ← Inv(x1, x2), Inv(y1, y2) Inv(ǫ, ǫ) ← well-nested binary rules non well-nested rules where α ∈ {a; b}

slide-47
SLIDE 47

A 2-MCFG for O2

S(xy) ← Inv(x, y) Inv(x1y1, y2x2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1y1y2, x2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1, y1y2x2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1x2y1, y2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1, x2y1y2) ← Inv(x1, x2), Inv(y1, y2) Inv(αx1α, x2) ← Inv(x1, x2) Inv(αx1, αx2) ← Inv(x1, x2) Inv(αx1, x2α) ← Inv(x1, x2) Inv(x1α, αx2) ← Inv(x1, x2) Inv(x1α, x2α) ← Inv(x1, x2) Inv(x1, αx2α) ← Inv(x1, x2) Inv(x1y1x2, y2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1, y1x2y2) ← Inv(x1, x2), Inv(y1, y2) Inv(ǫ, ǫ) ← well-nested binary rules rules for constants non well-nested rules where α ∈ {a; b}

slide-48
SLIDE 48

A 2-MCFG for O2

S(xy) ← Inv(x, y) Inv(x1y1, y2x2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1y1y2, x2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1, y1y2x2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1x2y1, y2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1, x2y1y2) ← Inv(x1, x2), Inv(y1, y2) Inv(αx1α, x2) ← Inv(x1, x2) Inv(αx1, αx2) ← Inv(x1, x2) Inv(αx1, x2α) ← Inv(x1, x2) Inv(x1α, αx2) ← Inv(x1, x2) Inv(x1α, x2α) ← Inv(x1, x2) Inv(x1, αx2α) ← Inv(x1, x2) Inv(x1y1x2, y2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1, y1x2y2) ← Inv(x1, x2), Inv(y1, y2) Inv(ǫ, ǫ) ← terminal rule well-nested binary rules rules for constants non well-nested rules initial rule where α ∈ {a; b} Theorem: Given w1 and w2 such that w1w2 ∈ O2, Inv(w1, w2) is derivable.

slide-49
SLIDE 49

A graphical interpretation of O2.

Graphical interpretation of the word aaabaabaabbbbbaabbabbbbaaaabbbbbbbbaaa:

slide-50
SLIDE 50

A graphical interpretation of O2.

Graphical interpretation of the word aaabaabaabbbbbaabbabbbbaaaabbbbbbbbaaa: The words in O2 are precisely the words that are represented as closed curves: babbababbabbabbababbaaabbbabbaaaabbabbbaba

slide-51
SLIDE 51

Parsing with the grammar

Rule Inv(ax1a, x2) ← Inv(x1, x2)

Inv(abaabaaababbbabaaabbbabbbbaaaba, babbbbaaaaaababbaab) Inv(baabaaababbbabaaabbbabbbbaaab, babbbbaaaaaababbaab)

slide-52
SLIDE 52

Parsing with the grammar

Rule: Inv(x1y1, y2x2) ← Inv(x1, x2), Inv(y1, y2)

Inv(baabaaababbbabaaabbbabbbbaaab, babbbbaaaaaababbaab) Inv(bbbabaaabbbabbbbaaab, babbbbaaaaaaba) Inv(baabaaaba, bbaab)

slide-53
SLIDE 53

Parsing with the grammar

Rule Inv(x1, y1x2y2) ← Inv(x1, x2), Inv(y1, y2)

Inv(bbbabaaabbbabbbbaaab, babbbbaaaaaaba) Inv(bbbabaaabbbabbbbaaab, bbaaaaaa) Inv(babb, ba)

slide-54
SLIDE 54

Parsing with the grammar

Rule: Inv(x1b, bx2) ← Inv(x1, x2)

Inv(bbbabaaabbbabbbbaaab, bbaaaaaa) Inv(bbbabaaabbbabbbbaaa, baaaaaa)

slide-55
SLIDE 55

Parsing with the grammar

Rule: Inv(bx1, bx2) ← Inv(x1, x2)

Inv(bbbabaaabbbabbbbaaa, baaaaaa) Inv(bbabaaabbbabbbbaaa, aaaaaa)

slide-56
SLIDE 56

Parsing with the grammar

Rule: Inv(x1y1, y2x2) ← Inv(x1, x2), Inv(y1, y2)

Inv(bbabaaabbbabbbbaaa, aaaaaa) Inv(bbabaaabbbabbbb, aaa) Inv(aaa, aaa)

slide-57
SLIDE 57

Parsing with the grammar

Rule: Inv(bx1b, x2) ← Inv(x1, x2)

Inv(bbabaaabbbabbbb, aaa) Inv(babaaabbbabbb, aaa)

slide-58
SLIDE 58

Outline

The group language of Z2 A similar problem in computational linguistics Multiple Context Free Grammars (MCFGs) A grammar for O2 Proof of the Theorem A Theorem on Jordan curves Conjectures

slide-59
SLIDE 59

The proof of the Theorem

Theorem: Given w1 and w2 such that w1w2 ∈ O2, Inv(w1, w2) is derivable. The proof is done by induction on the lexicographically ordered pairs (|w1w2|, max(|w1|, |w2|)) . There are five cases: Case 1: w1 or w2 equal ǫ:

slide-60
SLIDE 60

The proof of the Theorem

Theorem: Given w1 and w2 such that w1w2 ∈ O2, Inv(w1, w2) is derivable. The proof is done by induction on the lexicographically ordered pairs (|w1w2|, max(|w1|, |w2|)) . There are five cases: Case 1: w1 or w2 equal ǫ: w.l.o.g., w1 = ǫ , then by induction hypothesis, for any v1 and v2 different from ǫ such that w1 = v1v2, Inv(v1, v2) is derivable then: Inv(v1, v2) Inv(ǫ, ǫ) Inv(x1x2, y1y2) ← Inv(x1, x2), Inv(y1, y2) Inv(v1v2 = w1, ǫ)

slide-61
SLIDE 61

The proof of the Theorem

Theorem: Given w1 and w2 such that w1w2 ∈ O2, Inv(w1, w2) is derivable. The proof is done by induction on the lexicographically ordered pairs (|w1w2|, max(|w1|, |w2|)) . There are five cases: Case 2: w1 = s1w′

1s2 and w2 = s3w′ 2s4 and for i, j ∈ {1; 2; 3; 4}, s.t. i = j,

{si; sj} ∈ {{a; a}; {b; b}}:

slide-62
SLIDE 62

The proof of the Theorem

Theorem: Given w1 and w2 such that w1w2 ∈ O2, Inv(w1, w2) is derivable. The proof is done by induction on the lexicographically ordered pairs (|w1w2|, max(|w1|, |w2|)) . There are five cases: Case 2: w1 = s1w′

1s2 and w2 = s3w′ 2s4 and for i, j ∈ {1; 2; 3; 4}, s.t. i = j,

{si; sj} ∈ {{a; a}; {b; b}}: e.g., if i = 1, j = 2, s1 = a and s2 = a then by induction hypothesis Inv(w′

1, w2) is

derivable and: Inv(w′

1, w2)

Inv(ax1a, x2) ← Inv(x1, x2) Inv(aw′

1a, w2)

slide-63
SLIDE 63

The proof of the Theorem

Theorem: Given w1 and w2 such that w1w2 ∈ O2, Inv(w1, w2) is derivable. The proof is done by induction on the lexicographically ordered pairs (|w1w2|, max(|w1|, |w2|)) . There are five cases: Case 3: the curves representing w1 and w2 have a non-trivial intersection point:

slide-64
SLIDE 64

The proof of the Theorem

Theorem: Given w1 and w2 such that w1w2 ∈ O2, Inv(w1, w2) is derivable. The proof is done by induction on the lexicographically ordered pairs (|w1w2|, max(|w1|, |w2|)) . There are five cases: Case 3: the curves representing w1 and w2 have a non-trivial intersection point:

v1 v2 v3 v4 A B w1 w2

Inv(v1, v4) Inv(v2, v3) Inv(v1v2 = w1, v3v4 = w2)

slide-65
SLIDE 65

The proof of the Theorem

Theorem: Given w1 and w2 such that w1w2 ∈ O2, Inv(w1, w2) is derivable. The proof is done by induction on the lexicographically ordered pairs (|w1w2|, max(|w1|, |w2|)) . There are five cases: Case 4: the curve representing w1 or w2 starts or ends with a loop:

slide-66
SLIDE 66

The proof of the Theorem

Theorem: Given w1 and w2 such that w1w2 ∈ O2, Inv(w1, w2) is derivable. The proof is done by induction on the lexicographically ordered pairs (|w1w2|, max(|w1|, |w2|)) . There are five cases: Case 4: the curve representing w1 or w2 starts or ends with a loop:

v1 v2

Inv(v1, ǫ) Inv(v2, w2) Inv(v1v2 = w1, w2)

slide-67
SLIDE 67

The proof of the Theorem

Theorem: Given w1 and w2 such that w1w2 ∈ O2, Inv(w1, w2) is derivable. The proof is done by induction on the lexicographically ordered pairs (|w1w2|, max(|w1|, |w2|)) . There are five cases: Case 5: w1 and w2 do not start or end with compatible letters, the curve representing then do not intersect and do not start or end with a loop.

slide-68
SLIDE 68

Case 5

No rule other than Inv(x1y1x2, y2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1, y1x2y2) ← Inv(x1, x2), Inv(y1, y2) can be used.

slide-69
SLIDE 69

Case 5

No rule other than Inv(x1y1x2, y2) ← Inv(x1, x2), Inv(y1, y2) Inv(x1, y1x2y2) ← Inv(x1, x2), Inv(y1, y2) can be used.

slide-70
SLIDE 70

The relevance of case 5

The word abbaabaaabbbbaaaba is not in the language of the grammar only containing the well-nested rules.

slide-71
SLIDE 71
slide-72
SLIDE 72
slide-73
SLIDE 73

The relevance of case 5: a proof is now in hand

◮ Joshi (1985)

[MIX] represents the extreme case of the degree of free word order permitted in a language. This extreme case is linguistically not

  • relevant. [. . . ] TAGs also cannot generate this language although

for TAGs the proof is not in hand yet.

Theorem (Kanazawa, S. 12)

There is no 2-MCFLwn (or TAG) generating MIX or O2.

slide-74
SLIDE 74

Solving case 5: towards geometry

slide-75
SLIDE 75

Solving case 5: towards geometry

slide-76
SLIDE 76

Solving case 5: towards geometry

slide-77
SLIDE 77

Solving case 5: towards geometry

slide-78
SLIDE 78

Solving case 5: a geometrical invariant

slide-79
SLIDE 79

Solving case 5: a geometrical invariant

slide-80
SLIDE 80

Solving case 5: a geometrical invariant

An invariant on the Jordan curve representing w′

1w′ 2:

w′

1 = aw′′ 1 a and w′ 2 = aw′′ 2 a

w′

1 = aw′′ 1 a and w′ 2 = aw′′ 2 b

w′

1 = aw′′ 1 a and w′ 2 = bw′′ 2 a

w′

1 = aw′′ 1 a and w′ 2 = bw′′ 2 b

slide-81
SLIDE 81

Solving case 5: a geometrical invariant

An invariant on the Jordan curve representing w′

1w′ 2:

w′

1 = aw′′ 1 b and w′ 2 = aw′′ 2 b

w′

1 = aw′′ 1 b and w′ 2 = bw′′ 2 a

w′

1 = aw′′ 1 a and

w′

2 = a

w′

1 = aw′′ 1 a and

w′

2 = b

slide-82
SLIDE 82

Solving case 5: a geometrical invariant

An invariant on the Jordan curve representing w′

1w′ 2:

w′

1 = aw′′ 1 b and w′ 2 = a

w′

1 = aw′′ 1 b and w′ 2 = b

slide-83
SLIDE 83

Outline

The group language of Z2 A similar problem in computational linguistics Multiple Context Free Grammars (MCFGs) A grammar for O2 Proof of the Theorem A Theorem on Jordan curves Conjectures

slide-84
SLIDE 84

Jordan curves

illustration from: A combinatorial introduction to topology by Michael Henle (Dover Publications).

slide-85
SLIDE 85

A theorem on Jordan curves

Theorem: If A and D are two points on a Jordan curve J such that there are two points A′ and D′ inside J such that − → AD = − − → A′D′, then there are two points B and C pairwise distinct from A and D such that A, B, C, and D appear in that order on one

  • f the arcs going from A to D and −

→ AD = − → BC. A D A′ D′

slide-86
SLIDE 86

A theorem on Jordan curves

Theorem: If A and D are two points on a Jordan curve J such that there are two points A′ and D′ inside J such that − → AD = − − → A′D′, then there are two points B and C pairwise distinct from A and D such that A, B, C, and D appear in that order on one

  • f the arcs going from A to D and −

→ AD = − → BC. A D A′ D′

slide-87
SLIDE 87

A theorem on Jordan curves

Theorem: If A and D are two points on a Jordan curve J such that there are two points A′ and D′ inside J such that − → AD = − − → A′D′, then there are two points B and C pairwise distinct from A and D such that A, B, C, and D appear in that order on one

  • f the arcs going from A to D and −

→ AD = − → BC. A D B C E F A′ D′ G H

slide-88
SLIDE 88

A theorem on Jordan curves

Theorem: If A and D are two points on a Jordan curve J such that there are two points A′ and D′ inside J such that − → AD = − − → A′D′, then there are two points B and C pairwise distinct from A and D such that A, B, C, and D appear in that order on one

  • f the arcs going from A to D and −

→ AD = − → BC. Applying this Theorem solves case 5.

slide-89
SLIDE 89

A theorem on Jordan curves

Theorem: If A and D are two points on a Jordan curve J such that there are two points A′ and D′ inside J such that − → AD = − − → A′D′, then there are two points B and C pairwise distinct from A and D such that A, B, C, and D appear in that order on one

  • f the arcs going from A to D and −

→ AD = − → BC. Applying this Theorem solves case 5.

slide-90
SLIDE 90

A theorem on Jordan curves

Theorem: If A and D are two points on a Jordan curve J such that there are two points A′ and D′ inside J such that − → AD = − − → A′D′, then there are two points B and C pairwise distinct from A and D such that A, B, C, and D appear in that order on one

  • f the arcs going from A to D and −

→ AD = − → BC. Applying this Theorem solves case 5.

slide-91
SLIDE 91

Winding number

Let wn(J, z) be the winding number of a closed curve around z.

illustration from: A combinatorial introduction to topology by Michael Henle (Dover Publications).

slide-92
SLIDE 92

An interesting Lemma

Let exp : C → C − {0} z → e2iπz .

Lemma

Given an simple arc

  • AB such that −

→ AB = k ∈ N, we have: wn(exp(

  • AB), 0) = k
slide-93
SLIDE 93

Translation becomes rotation

exp : C → C − {0} z → e2iπz .

A D B C E F I J G H O A, D E, F B, C I, J G, H

slide-94
SLIDE 94

Translation becomes rotation

exp : C → C − {0} z → e2iπz .

A D B C E F I J O A, D E, F B, C I, J

slide-95
SLIDE 95

An interesting characterization

Lemma

Given an simple arc

  • AD such that −

→ AD = 1, we have:

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC iff exp(

  • AD) is not a Jordan curve.
slide-96
SLIDE 96

Jordan curves and winding numbers

illustration from: A combinatorial introduction to topology by Michael Henle (Dover Publications).

Theorem: There is k ∈ {−1; 1} such that the winding number of Jordan curve around a point in its interior is k, its winding number around a point in its exterior is 0.

slide-97
SLIDE 97

Proving the characterization

Lemma

Given an simple arc

  • AD such that −

→ AD = 1, we have: ◮

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC iff exp(

  • AD) is not a

Jordan curve. Proof

slide-98
SLIDE 98

Proving the characterization

Lemma

Given an simple arc

  • AD such that −

→ AD = 1, we have: ◮

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC iff exp(

  • AD) is not a

Jordan curve. Proof ◮ by 1-periodicity of exp, if

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC, then exp(

  • AD) is not a Jordan curve,
slide-99
SLIDE 99

Proving the characterization

Lemma

Given an simple arc

  • AD such that −

→ AD = 1, we have: ◮

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC iff exp(

  • AD) is not a

Jordan curve. Proof ◮ by 1-periodicity of exp, if

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC, then exp(

  • AD) is not a Jordan curve,

◮ if exp(

  • AD) is not a Jordan curve:

◮ take the closed curve C obtained by removing the closed subcurves of

exp(

  • AD) that have a negative winding number,
slide-100
SLIDE 100

Proving the characterization

Lemma

Given an simple arc

  • AD such that −

→ AD = 1, we have: ◮

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC iff exp(

  • AD) is not a

Jordan curve. Proof ◮ by 1-periodicity of exp, if

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC, then exp(

  • AD) is not a Jordan curve,

◮ if exp(

  • AD) is not a Jordan curve:

◮ take the closed curve C obtained by removing the closed subcurves of

exp(

  • AD) that have a negative winding number,

◮ take a proper closed subcurve D of C that is minimal for inclusion,

slide-101
SLIDE 101

Proving the characterization

Lemma

Given an simple arc

  • AD such that −

→ AD = 1, we have: ◮

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC iff exp(

  • AD) is not a

Jordan curve. Proof ◮ by 1-periodicity of exp, if

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC, then exp(

  • AD) is not a Jordan curve,

◮ if exp(

  • AD) is not a Jordan curve:

◮ take the closed curve C obtained by removing the closed subcurves of

exp(

  • AD) that have a negative winding number,

◮ take a proper closed subcurve D of C that is minimal for inclusion, ◮ D is a Jordan curve winding positively (i.e. once) around 0,

slide-102
SLIDE 102

Proving the characterization

Lemma

Given an simple arc

  • AD such that −

→ AD = 1, we have: ◮

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC iff exp(

  • AD) is not a

Jordan curve. Proof ◮ by 1-periodicity of exp, if

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC, then exp(

  • AD) is not a Jordan curve,

◮ if exp(

  • AD) is not a Jordan curve:

◮ take the closed curve C obtained by removing the closed subcurves of

exp(

  • AD) that have a negative winding number,

◮ take a proper closed subcurve D of C that is minimal for inclusion, ◮ D is a Jordan curve winding positively (i.e. once) around 0, ◮ D induces a proper subcurve J of exp(

  • AD) whose winding number is 1,
slide-103
SLIDE 103

Proving the characterization

Lemma

Given an simple arc

  • AD such that −

→ AD = 1, we have: ◮

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC iff exp(

  • AD) is not a

Jordan curve. Proof ◮ by 1-periodicity of exp, if

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC, then exp(

  • AD) is not a Jordan curve,

◮ if exp(

  • AD) is not a Jordan curve:

◮ take the closed curve C obtained by removing the closed subcurves of

exp(

  • AD) that have a negative winding number,

◮ take a proper closed subcurve D of C that is minimal for inclusion, ◮ D is a Jordan curve winding positively (i.e. once) around 0, ◮ D induces a proper subcurve J of exp(

  • AD) whose winding number is 1,

◮ J induces a proper subarc

  • BC of exp(
  • AD) such that −

→ AD = − → BC.

slide-104
SLIDE 104

The characterization on the example

exp : C → C − {0} z → e2iπz .

A D B C E F I J G H O A, D E, F B, C I, J G, H

slide-105
SLIDE 105

The characterization on the example

exp : C → C − {0} z → e2iπz .

A D B C E F I J O A, D E, F B, C I, J

slide-106
SLIDE 106

Yet another observation from algebraic topology

Let’s suppose that − → AD = 1 and that A0 = A′ = 0, A1 = D′ = 1,. . . , Ak = k let exp : C → C − {0} z → e2iπz .

A D B C E F I J G H A0 A1 A2 A3 A−1 A−2 O A, D E, F B, C I, J G, H A0, A1

exp sums up the winding number of a Jordan curve around the Ai’s as the winding number around exp(A0) = exp(0) = 1.

slide-107
SLIDE 107

Proving the Theorem

Let’s suppose that − → AD = 1,

Lemma

Given an simple arc

  • AD we have:

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC iff exp(

  • AD) is not a

Jordan curve.

slide-108
SLIDE 108

Proving the Theorem

Let’s suppose that − → AD = 1,

Lemma

Given an simple arc

  • AD we have:

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC iff exp(

  • AD) is not a

Jordan curve. Corollary: a simple path J from A to D (resp. D to A) does not contain B and C as required in the Theorem iff ϕ(J) is a Jordan curve of C − {1} that winding 0 or 1 (resp. or −1) time around 1.

slide-109
SLIDE 109

Proving the Theorem

Let’s suppose that − → AD = 1,

Lemma

Given an simple arc

  • AD we have:

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC iff exp(

  • AD) is not a

Jordan curve. Corollary: a simple path J from A to D (resp. D to A) does not contain B and C as required in the Theorem iff ϕ(J) is a Jordan curve of C − {1} that winding 0 or 1 (resp. or −1) time around 1. Corollary: if J is a simple closed curve of C composed with two curves J1 and J2 respectively going from A to D and D to A which do not contain points B and C as required in the Theorem then |wn(exp(J), 1)| = |wn(exp(J1), 1) + wn(ϕ(J2), 1)| ≤ 1.

slide-110
SLIDE 110

Proving the Theorem

Let’s suppose that − → AD = 1,

Lemma

Given an simple arc

  • AD we have:

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC iff exp(

  • AD) is not a

Jordan curve. Corollary: a simple path J from A to D (resp. D to A) does not contain B and C as required in the Theorem iff ϕ(J) is a Jordan curve of C − {1} that winding 0 or 1 (resp. or −1) time around 1. Corollary: if J is a simple closed curve of C composed with two curves J1 and J2 respectively going from A to D and D to A which do not contain points B and C as required in the Theorem then |wn(exp(J), 1)| = |wn(exp(J1), 1) + wn(ϕ(J2), 1)| ≤ 1. Lemma: if J is a simple closed curve of C composed with two curves J1 and J2 respectively going from A to D and D to A such that 0 and 1 are in the interior of J, then |wn(ϕ(J), 1)| ≥ 2.

slide-111
SLIDE 111

Proving the Theorem

Let’s suppose that − → AD = 1,

Lemma

Given an simple arc

  • AD we have:

  • AD contains a proper subarc
  • BC such that −

→ AD = − → BC iff exp(

  • AD) is not a

Jordan curve. Corollary: a simple path J from A to D (resp. D to A) does not contain B and C as required in the Theorem iff ϕ(J) is a Jordan curve of C − {1} that winding 0 or 1 (resp. or −1) time around 1. Corollary: if J is a simple closed curve of C composed with two curves J1 and J2 respectively going from A to D and D to A which do not contain points B and C as required in the Theorem then |wn(exp(J), 1)| = |wn(exp(J1), 1) + wn(ϕ(J2), 1)| ≤ 1. Lemma: if J is a simple closed curve of C composed with two curves J1 and J2 respectively going from A to D and D to A such that 0 and 1 are in the interior of J, then |wn(ϕ(J), 1)| ≥ 2. The Theorem follows by contradiction.

slide-112
SLIDE 112

Outline

The group language of Z2 A similar problem in computational linguistics Multiple Context Free Grammars (MCFGs) A grammar for O2 Proof of the Theorem A Theorem on Jordan curves Conjectures

slide-113
SLIDE 113

Nederhof’s conjecture

◮ Nederhof (2016)

Conjecture: for every k, Ok = {w ∈ {a1, a1 . . . , ak, ak}∗ | ∀1 ≤ i ≤ k, |w|ai = |w|ai} is generated by the grammar with rules of the form: S(x1 . . . xk) ← Inv(x1, . . . , xk) Inv(s1, . . . , sk) ← Inv(x1, . . . , xk), Inv(y1, . . . , yk) s1 . . . sk ∈ perm(x1 . . . xky1 . . . yk) Inv(x1, . . . , αxi, . . . , αxj, . . . , xk) ← Inv(x1, . . . , xk) . . . Inv(ǫ, . . . , ǫ) ←

slide-114
SLIDE 114

Status of the conjecture

Positive arguments

◮ The conjecture has been tested on millions of examples

slide-115
SLIDE 115

Status of the conjecture

Positive arguments

◮ The conjecture has been tested on millions of examples ◮ In the case of Z3, some cases can be solved using braiding

arguments Negative argument

slide-116
SLIDE 116

Status of the conjecture

Positive arguments

◮ The conjecture has been tested on millions of examples ◮ In the case of Z3, some cases can be solved using braiding

arguments Negative argument

◮ for the case of Z2 many arguments are strongly related to

planarity → no clear way of generalizing to higher dimensions