2 MPRI 4 Syntactic Formalisms 3 - - PowerPoint PPT Presentation

2 mpri 4 syntactic formalisms 3 mpri 4
SMART_READER_LITE
LIVE PREVIEW

2 MPRI 4 Syntactic Formalisms 3 - - PowerPoint PPT Presentation

1 MPRI 4 2 MPRI 4 Syntactic Formalisms 3 MPRI 4 I. Context-Free Grammars 4 MPRI 4 Definition G = ( N, T, P, S ) where: N is a finite set of


slide-1
SLIDE 1

MPRI 4

1

◭ ◮ ◮

slide-2
SLIDE 2

MPRI 4

2

Syntactic Formalisms

◭ ◮ ◮

slide-3
SLIDE 3

MPRI 4

3

  • I. Context-Free Grammars

◭ ◮ ◮

slide-4
SLIDE 4

MPRI 4

4

  • Definition

G = (N, T, P, S) where:

  • N is a finite set of non-terminal symbols;
  • T is a finite set of terminal symbols;
  • P is a finite set of production rules

A → α where A ∈ N and α ∈ (N ∪ T)∗

  • S ∈ N is the start symbol.

L(G) = {ω ∈ T ∗ | S →∗ ω}

◭ ◮ ◮

slide-5
SLIDE 5

MPRI 4

5

Example

  • Grammatical rules

S → NP VP VP → tV NP VP → stV Adj NP → Det N

  • Lexicon

tV → /mange/ stV → /est/ NP → /Pierre/ N → /fruit/ Adj → /intelligent/ Det → /un/

◭ ◮ ◮

slide-6
SLIDE 6

MPRI 4

6

CKY Algorithm

init <A → •α, i, i> scan <A → α • aβ, i, j> a = aj+1 <A → αa • β, i, j + 1> complete <A → α • Bβ, i, j> <B → γ•, j, k> <A → αB • β, i, k>

◭ ◮ ◮

slide-7
SLIDE 7

MPRI 4

6

CKY Algorithm

init <A → •α, i, i> scan <A → α • aβ, i, j> a = aj+1 <A → αa • β, i, j + 1> complete <A → α • Bβ, i, j> <B → γ•, j, k> <A → αB • β, i, k>

  • Good news/Bad news

◭ ◮ ◮

slide-8
SLIDE 8

MPRI 4

7

Earley Algorithm

init <S → •α, 0, 0> scan <A → α • aβ, i, j> a = aj+1 <A → αa • β, i, j + 1> complete <A → α • Bβ, i, j> <B → γ•, j, k> <A → αB • β, i, k> predict <A → α • Bβ, i, j> <B → •γ, j, j>

◭ ◮ ◮

slide-9
SLIDE 9

MPRI 4

7

Earley Algorithm

init <S → •α, 0, 0> scan <A → α • aβ, i, j> a = aj+1 <A → αa • β, i, j + 1> complete <A → α • Bβ, i, j> <B → γ•, j, k> <A → αB • β, i, k> predict <A → α • Bβ, i, j> <B → •γ, j, j>

  • Correct-prefix property

◭ ◮ ◮

slide-10
SLIDE 10

MPRI 4

8

  • II. Unification Grammars

◭ ◮ ◮

slide-11
SLIDE 11

MPRI 4

9 Suppose we extend our toy grammar with the following rules: NP → NP Conj NP Conj → /et/ NP → /Marie/ N → /pomme/ Det → /une/ Det → /des/

◭ ◮ ◮

slide-12
SLIDE 12

MPRI 4

9 Suppose we extend our toy grammar with the following rules: NP → NP Conj NP Conj → /et/ NP → /Marie/ N → /pomme/ Det → /une/ Det → /des/ We get:

∗Marie est intelligent

◭ ◮ ◮

slide-13
SLIDE 13

MPRI 4

9 Suppose we extend our toy grammar with the following rules: NP → NP Conj NP Conj → /et/ NP → /Marie/ N → /pomme/ Det → /une/ Det → /des/ We get:

∗Marie est intelligent ∗Marie mange un pomme

◭ ◮ ◮

slide-14
SLIDE 14

MPRI 4

9 Suppose we extend our toy grammar with the following rules: NP → NP Conj NP Conj → /et/ NP → /Marie/ N → /pomme/ Det → /une/ Det → /des/ We get:

∗Marie est intelligent ∗Marie mange un pomme ∗Pierre et Marie mange une pomme

◭ ◮ ◮

slide-15
SLIDE 15

MPRI 4

9 Suppose we extend our toy grammar with the following rules: NP → NP Conj NP Conj → /et/ NP → /Marie/ N → /pomme/ Det → /une/ Det → /des/ We get:

∗Marie est intelligent ∗Marie mange un pomme ∗Pierre et Marie mange une pomme ?Pierre mange Marie

◭ ◮ ◮

slide-16
SLIDE 16

MPRI 4

10 S → NP [X, Y ] VP [X, Y ] VP [X, Y ] → tV [Y ] NP [W, Z] VP [X, Y ] → stV [Y ] Adj [X, Y ] NP [X, Y ] → Det [X, Y ] N [X, Y ] NP [m, p] → NP [m, X] Conj NP [Y, Z] NP [m, p] → NP [X, Y ] Conj NP [m, Z] NP [f, p] → NP [f, X] Conj NP [f, Y ] tV [s] → /mange/ tV [p] → /mangent/ stV [s] → /est/ stV [p] → /sont/ NP [m, s] → /Pierre/ NP [f, s] → /Marie/ N [m, s] → /fruit/ N [m, p] → /fruits/ N [f, s] → /pomme/ N [f, p] → /pommes/ Adj [m, s] → /intelligent/ Adj [f, s] → /intelligente/ Adj [m, p] → /intelligents/ Adj [f, p] → /intelligentes/ Det [m, s] → /un/ Det [f, s] → /une/ Det [X, p] → /des/ Conj → /et/

◭ ◮ ◮

slide-17
SLIDE 17

MPRI 4

11

Earley Algorithm

init <S → •α, 0, 0> scan <A → α • aβ, i, j> a = aj+1 <A → αa • β, i, j + 1> complete <A → α • Bβ, i, j> <C → γ•, j, k> σ = mgu(B, C) <(A → αB • β)σ, i, k> predict <A → α • Bβ, i, j> σ = mgu(B, C) <(C → •γ)σ, j, j>

◭ ◮ ◮

slide-18
SLIDE 18

MPRI 4

11

Earley Algorithm

init <S → •α, 0, 0> scan <A → α • aβ, i, j> a = aj+1 <A → αa • β, i, j + 1> complete <A → α • Bβ, i, j> <C → γ•, j, k> σ = mgu(B, C) <(A → αB • β)σ, i, k> predict <A → α • Bβ, i, j> σ = mgu(B, C) <(C → •γ)σ, j, j>

  • equality up to variable renaming, subsumption, completeness ?

◭ ◮ ◮

slide-19
SLIDE 19

MPRI 4

12 Carl Pollard and Ivan A. Sag: Head-Driven Phrase Structure Grammar. Uni- versity of Chicago Press, 1994. Joan Bresnan: Lexical-Functional Syntax, Oxford: Blackwell Publishers Ltd, 2001. Anne Abeill´ e: Les Nouvelles syntaxes. Armand Colin, 1993.

◭ ◮ ◮

slide-20
SLIDE 20

MPRI 4

13

  • III. Tree Adjoining Grammars

◭ ◮ ◮

slide-21
SLIDE 21

MPRI 4

14

  • Lexicalization
  • Weak Equivalence versus Strong Equivalence
  • Definition

G = (N, T, I, A, S) where:

  • N is a finite set of non-terminal symbols;
  • T is a finite set of terminal symbols;
  • I is a finite set of trees called initial trees;
  • A is a finite set of trees called auxiliary trees;
  • S ∈ N is the start symbol.

The trees in I ∪ A are called elementary trees. The inner nodes of the el- ementary trees are labeled by non-terminals. Their leaves are labeled by non-terminals or by terminals. In each auxiliary tree, there is one distin- guished leaf (called the foot) whose label is the same non-terminal as the label of the root.

◭ ◮ ◮

slide-22
SLIDE 22

MPRI 4

15

  • Substitution

S

  • N↓

initial/derived tree

N

  • initial tree

⇒ S

  • N
  • derived tree
  • Adjunction

S

  • N
  • initial/derived tree

N

  • N ∗

auxiliary tree

⇒ S

  • N
  • N
  • derived tree

◭ ◮ ◮

slide-23
SLIDE 23

MPRI 4

16

Example

  • Initial trees

NP Peter NP Mary S

  • NP↓

VP

  • V

NP↓ kisses

  • Auxiliary tree

VP

  • VP ∗

Adv possionately

◭ ◮ ◮

slide-24
SLIDE 24

MPRI 4

17

CKY Algorithm

init <A → •α, i, , , i> scan <A → α • Bβ, i, j, k, l> <A → αB • β, i, j, k, l + 1> label(B) = al+1 complete <A → α • Bβ, i, j, k, l> <B → γ•, l, m, n, o> <A → αB • β, i, j ⊔ m, k ⊔ n, o> i ⊔ = i ⊔ i = i undefined otherwise

◭ ◮ ◮

slide-25
SLIDE 25

MPRI 4

18 foot <A → α • Bβ, i, , , j> <A → αB • β, i, j, k, k> B is the foot of an auxiliary tree adjoin <A → α•, i, j, k, l> <B → β•, m, i, l, n> <A → β•, m, j, k, n> B is the root of an auxiliary tree label(A) = label(B) substitute <A → α • Bβ, i, j, k, l> <C → γ•, l, , , m> <A → αB • β, i, j, k, m> C is the root of an initial tree label(B) = label(C)

◭ ◮ ◮

slide-26
SLIDE 26

MPRI 4

19

Example

  • Initial tree

S e

  • Auxiliary tree

S

  • a

S

  • d

b S∗ c

  • Let ω = 0 a 1 a 2 b 3 b 4 e 5 c 6 c 7 d 8 d 9

◭ ◮ ◮

slide-27
SLIDE 27

MPRI 4

20

<A2 → •bA3c, 3, , , 3> scan <A2 → b • A3c, 3, , , 4> foot <A2 → bA3 • c, 3, 4, 5, 5> scan <A2 → bA3c•, 3, 4, 5, 6> <A1 → •aA2d, 1, , , 1> scan <A1 → a • A2d, 1, , , 2> <A2 → •bA3c, 2, , , 2> scan <A2 → b • A3c, 2, , , 3> foot <A2 → bA3 • c, 2, 3, 6, 6> scan <A2 → bA3c•, 2, 3, 6, 7> complete <A1 → aA2 • d, 1, 3, 6, 7> scan <A1 → aA2d•, 1, 3, 6, 8> adjoin <A2 → aA2d•, 1, 4, 5, 8> <A0 → •e, 4, , , 4> scan <A0 → e•, 4, , , 5> <A1 → •aA2d, 0, , , 0> scan <A1 → a • A2d, 0, , , 1> · · · <A2 → aA2d•, 1, 4, 5, 8> complete <A1 → aA2 • d, 0, 4, 5, 8> scan <A1 → aA2d•, 0, 4, 5, 9> adjoin <A0 → aA2d•, 0, , , 9>

◭ ◮ ◮

slide-28
SLIDE 28

MPRI 4

21

  • Good news/Bad news
  • Earley algorithms (Schabes & Joshi, 1988; Schabes 1991; Nederhof 1997)
  • Expressive power
  • Adjoining constraints

◭ ◮ ◮

slide-29
SLIDE 29

MPRI 4

22 Aravind K. Joshi, K. Vijay-Shanker: Some Computational Properties of Tree Adjoining Grammars. ACL 1985: 82-93 Yves Schabes, Aravind K. Joshi: An Earley-Type Parsing Algorithm for Tree Adjoining Grammars. ACL 1988: 258-269 Yves Schabes: The valid prefix property and left to right parsing of tree- adjoining grammar. IWPT 1991: 2–30 Mark-Jan Nederhof: The Computational Complexity of the Correct-Prefix Property for TAGs. Computational Linguistics 25(3): 345-360 (1999) Eric Villemonte de la Clergerie: Tabulation et traitementde la langue na-

  • turelle. Tutorial 1999

http://pauillac.inria.fr/˜clerger/

◭ ◮ ◮

slide-30
SLIDE 30

MPRI 4

23

  • IV. Range Concatenation Grammars

◭ ◮ ◮

slide-31
SLIDE 31

MPRI 4

24

  • Expressive Power versus Tractability
  • Definition

G = (N, T, V, P, S) where:

  • N is a ranked alphabet of predicate names;
  • T is a finite set of terminal symbols;
  • V is a finite set of variable symbols;
  • P is a finite set of clauses

φ0 → φ1 . . . φn where φ0, φ1 . . . φn are predicates of the form A(α1, . . . , αp) whith A ∈ N and α1, . . . , αn ∈ (T ∪ V )∗;

  • S ∈ N is the start symbol.

◭ ◮ ◮

slide-32
SLIDE 32

MPRI 4

25

  • Notion of range
  • Given a word ω ∈ T ∗, an instantiated clause is such that the variables and

the predicate arguments are bound to ranges in ω ∈ T ∗

  • Example:

S(XY ) → S(X) E(X, Y ) S(a) → ǫ E(Xa, Y a) → E(X, Y ) E(ǫ, ǫ) → ǫ

◭ ◮ ◮

slide-33
SLIDE 33

MPRI 4

25

  • Notion of range
  • Given a word ω ∈ T ∗, an instantiated clause is such that the variables and

the predicate arguments are bound to ranges in ω ∈ T ∗

  • Example:

S(XY ) → S(X) E(X, Y ) S(a) → ǫ E(Xa, Y a) → E(X, Y ) E(ǫ, ǫ) → ǫ

  • Complete for PTIME.

◭ ◮ ◮

slide-34
SLIDE 34

MPRI 4

25

  • Notion of range
  • Given a word ω ∈ T ∗, an instantiated clause is such that the variables and

the predicate arguments are bound to ranges in ω ∈ T ∗

  • Example:

S(XY ) → S(X) E(X, Y ) S(a) → ǫ E(Xa, Y a) → E(X, Y ) E(ǫ, ǫ) → ǫ

  • Complete for PTIME.
  • Closed by Union, Intersection, Concatenation, Iteration, Complementation.

◭ ◮ ◮

slide-35
SLIDE 35

MPRI 4

26 See papers by Pierre Boullier http://atoll.inria.fr

◭ ◮ ◮

slide-36
SLIDE 36

MPRI 4

27

  • V. Categorial Grammars

◭ ◮ ◮

slide-37
SLIDE 37

MPRI 4

28

  • Radical approach to lexicalism
  • An algebra of syntactic categories
  • A finite set of grammatical composition rules

◭ ◮ ◮

slide-38
SLIDE 38

MPRI 4

29

A notion of syntactic category

Let A be a finite set of atomic syntactic categories. The set of syntactic categories is inductively defined as follows: TA ::= A | (TA • TA) | (TA \ TA) | (TA / TA)

◭ ◮ ◮

slide-39
SLIDE 39

MPRI 4

29

A notion of syntactic category

Let A be a finite set of atomic syntactic categories. The set of syntactic categories is inductively defined as follows: TA ::= A | (TA • TA) | (TA \ TA) | (TA / TA) Interpretation: (α • β) is the category of the phrases obtained by concatenating a phrase of category α with a phrase of category β. (α \ β) is the category of the phrases that yield a phrase of category β when appended to a phrase of category α. (β / α) is the category of the phrases that yield a phrase of category β when a phrase of category α is appended to them.

◭ ◮ ◮

slide-40
SLIDE 40

MPRI 4

30

An algebra of syntactic categories

The set of syntactic categories is provided with a preorder: α ≤ β

◭ ◮ ◮

slide-41
SLIDE 41

MPRI 4

30

An algebra of syntactic categories

The set of syntactic categories is provided with a preorder: α ≤ β Interpretation : Any phrase of category α is a phrase of category β.

◭ ◮ ◮

slide-42
SLIDE 42

MPRI 4

31 ≤ is a preorder: α ≤ α α ≤ β, β ≤ δ ⇒ α ≤ δ Associativity and monotonicity of •: (α • β) • γ ≤ α • (β • γ) α • (β • γ) ≤ (α • β) • γ α ≤ β ⇒ α • γ ≤ β • γ α ≤ β ⇒ γ • α ≤ γ • β Cancellation laws: α • (α \ β) ≤ β (β / α) • α ≤ β

◭ ◮ ◮

slide-43
SLIDE 43

MPRI 4

32

AB-Grammars

G = (A, Σ, L, s), where : A is a finite set of atomic categories; Σ is a finite vocabulary; L : Σ → 2TA is a lexicon that assigns a finite set of syntactic categories to any element of the vocabulary; s ∈ TA is a distinguished category.

◭ ◮ ◮

slide-44
SLIDE 44

MPRI 4

33 Let G = A, Σ, L, s be an AB-grammar. The language L(G) generated by G is the set of words a0 . . . an ∈ Σ∗ such that there exist α0 ∈ L(a0), . . . , αn ∈ L(an) with α0 • · · · • αn ≤ s

◭ ◮ ◮

slide-45
SLIDE 45

MPRI 4

34

Expressive power

The class of AB-languages is the class of context-free languages.

◭ ◮ ◮

slide-46
SLIDE 46

MPRI 4

35

Structural limitations

Pierre : SN une : SN / N pomme : N mange : (SN \ P) / SN qui : (SN \ SN ) /(SN \ P) rapidement : P \ P It is possible to generate: Pierre qui mange une pomme SN • ((SN \ SN ) /(SN \ P)) • ((SN \ P) / SN ) • (SN / N ) • N ≤ SN Pierre mange une pomme rapidement SN • ((SN \ P) / SN ) • (SN / N ) • N • (P \ P) ≤ P

◭ ◮ ◮

slide-47
SLIDE 47

MPRI 4

36 It is not possible to generate: Pierre qui mange une pomme rapidement because the following inequality: (SN \ P) • (P \ P) ≤ (SN \ P) cannot be derived

◭ ◮ ◮

slide-48
SLIDE 48

MPRI 4

37

Hypothetical reasoning

To generate: Pierre qui mange une pomme rapidement the following inequality is needed: (SN \ P) • (P \ P) ≤ (SN \ P) Similarly, to generate: une pomme que Pierre mange where que : (SN \ SN ) /(P / SN ) the following inequality is needed: SN • ((SN \ P) / SN ) ≤ (P / SN )

◭ ◮ ◮

slide-49
SLIDE 49

MPRI 4

38 Both inequalities are consistent with respect to the interpretation of the preorder but cannot be derived using the algebraic laws given so far. The algebra previously given does not capture completely the intuition we have of the connectives.

◭ ◮ ◮

slide-50
SLIDE 50

MPRI 4

38 Both inequalities are consistent with respect to the interpretation of the preorder but cannot be derived using the algebraic laws given so far. The algebra previously given does not capture completely the intuition we have of the connectives. Hypothetical reasoning: Assume: X : SN then: X mange rapidement : P hence: mange rapidement : SN \ P

◭ ◮ ◮

slide-51
SLIDE 51

MPRI 4

39

Enriching the algebra

Add the following adjunction laws: (α • β) ≤ γ ⇒ β ≤ (α \ γ) (α • β) ≤ γ ⇒ α ≤ (γ / β)

◭ ◮ ◮

slide-52
SLIDE 52

MPRI 4

40

Logical formalization

There exists a logical reading of: α0 • · · · • αn ≤ β where:

  • the αi’s are seen as hypotheses;
  • β plays the part of the conclusion;
  • ≤ is interpreted as a consequence relation.

According to this reading, the syntactic categories correspond to formulas.

  • is a conjonction, and both \ and / are implications.

◭ ◮ ◮

slide-53
SLIDE 53

MPRI 4

41 Lambek calculus A − A (ident) Γ − A ∆1, A, ∆2 − B (cut) ∆1, Γ, ∆2 − B Γ, A, B, ∆ − C (• left) Γ, A • B, ∆ − C Γ − A ∆ − B (• right) Γ, ∆ − A • B Γ − A ∆1, B, ∆2 − C (\ left) ∆1, Γ, A \ B, ∆2 − C A, Γ − B (\ right) Γ − A \ B Γ − A ∆1, B, ∆2 − C (/ left) ∆1, B / A, Γ, ∆2 − C Γ, A − B (/ right) Γ − B / A

◭ ◮ ◮

slide-54
SLIDE 54

MPRI 4

42

Lambek grammars

Same data as an AB-grammar. The langage L(G) generated by a Lambek-grammar G is the set of words a0 . . . an ∈ Σ∗ such that there exist α0 ∈ L(a0), . . . , αn ∈ L(an) with the following sequent α0, . . . , αn − s being derivable.

◭ ◮ ◮

slide-55
SLIDE 55

MPRI 4

43

Decidability

Cut elimination. Every derivable sequent is derivable without us- ing Rule (cut). Corollary. The Lambek calculus is decidable.

◭ ◮ ◮

slide-56
SLIDE 56

MPRI 4

44

Elements of model theory

Let Σ∗, +, ǫ be the free monoid generated by Σ. A valuation is defined to be a fonction ρ : A → 2Σ∗ that assigns a set of words to each atomic formula (syntactic category). Given such a valuation ρ, interpret the formulas as follows: [[α]]ρ = ρ(α) where α is atomic [[α • β]]ρ = {u ∈ Σ∗ | ∃a ∈ [[α]]ρ. ∃b ∈ [[β]]ρ. u = a + b} [[α \ β]]ρ = {u ∈ Σ∗ | ∀a ∈ [[α]]ρ. a + u ∈ [[β]]ρ} [[α / β]]ρ = {u ∈ Σ∗ | ∀b ∈ [[β]]ρ. u + b ∈ [[α]]ρ} Let Γ = γ0, . . . , γn, and define: [[Γ]]ρ = [[γ0 • · · · • γn]]ρ

◭ ◮ ◮

slide-57
SLIDE 57

MPRI 4

45 A valuation ρ satisfies a sequent Γ − α iff [[Γ]]ρ ⊂ [[α]]ρ A sequent Γ − α is valid iff it is satisfied by every valuation.

◭ ◮ ◮

slide-58
SLIDE 58

MPRI 4

46

Correctness. Every derivable sequent is valid. Completness (Pentus, 1993). Every valid sequent is derivable.

◭ ◮ ◮

slide-59
SLIDE 59

MPRI 4

47

Expressive Power

(Pentus, 1992) The class of languages generated by the Lambek grammars is the class of context-free languages.

◭ ◮ ◮

slide-60
SLIDE 60

MPRI 4

48

Structural limitations

The non-commutativity of the Lambek calculus is to “rigid”. Among others, it does not allow for medial wh-extraction. For instance: Une pomme que Pierre mange rapidement cannot be generated.

◭ ◮ ◮

slide-61
SLIDE 61

MPRI 4

49

  • Multimodal grammars (Michael Moortgat)
  • Type-logical grammars (Glyn Morrill)
  • Combinatory Categorial Grammars (Mark Steedman)
  • M. Pentus: Lambek Grammars are Context Free. LICS 1993: 429-433
  • M. Pentus: Language completeness of the Lambek calculus.

LICS 1994: 487-496

  • M. Moortgat: Categorial Type Logics. Chapter 2 of Handbook of Logic and

Language, Elsevier, 1997

◭ ◮ ◮