Abstract Categorial Grammar Parsing the general case in Honor of G - PowerPoint PPT Presentation

1 ACG parsing: the general case Abstract Categorial Grammar Parsing the general case in Honor of G´ erard Huet Philippe de Groote Inria-Lorraine

2 ACG parsing: the general case Content 1 Definition of ACG 2 Examples 3 Some Key Properties 4 Constructing a Parsing Algorithm

3 ACG parsing: the general case Definition

4 ACG parsing: the general case Motivations • To provide a type-theoretic notion of grammar, taking advantages of ideas by Curry and Lambek. • To provide a grammatical framework in which other existing grammatical models may be encoded. • To see the parse-structures as first-class citizen. • To allow the user to define grammatical composition combinators. • To base the formalism on a small set of mathematical primitives that combine via simple composition rules.

5 ACG parsing: the general case Types, signatures and λ -terms T ( A ) is the set of linear implicative types built on the set of atomic types A : T ( A ) ::= A | ( T ( A ) − ◦ T ( A ) ) A higher-order linear signature is a triple Σ = � A, C, τ � , where: A is a finite set of atomic types; C is a finite set of constants; τ : C → T ( A ) is a function that assigns each constant in C with a linear implicative type built on A . Λ(Σ) denotes the set of linear λ -terms built upon a higher-order linear signature Σ.

6 ACG parsing: the general case Vocabularies and Lexicons A vocabulary is simply defined to be a higher-order linear signature. Given two vocabularies Σ 1 = � A 1 , C 1 , τ 1 � and Σ 2 = � A 2 , C 2 , τ 2 � , a lexicon L = � η, θ � from Σ 1 to Σ 2 is made of two functions: η : A 1 → T ( A 2 ), θ : C 1 → Λ(Σ 2 ), such that − Σ 2 θ ( c ) : η ( τ 1 ( c )) .

7 ACG parsing: the general case Definition An abstract categorial grammar is a quadruple G = � Σ 1 , Σ 2 , L , s � where : Σ 1 = � A 1 , C 1 , τ 1 � and Σ 2 = � A 2 , C 2 , τ 2 � are two higher-order linear signatures; Σ 1 is called the abstract vocabulary and Σ 2 is called the object vocabulary; L : Σ 1 → Σ 2 is a lexicon from the abstract vocabulary to the object vocabulary; s ∈ T ( A 1 ) is a type of the abstract vocabulary; it is called the distin- guished type of the grammar.

8 ACG parsing: the general case Languages generated by an ACG The abstract language generated by G ( A ( G )) is defined as follows: A ( G ) = { t ∈ Λ(Σ 1 ) | − Σ 1 t : s is derivable } The object language generated by G ( O ( G )) is defined to be the image of the abstract language by the term homomorphism induced by the lexicon L : O ( G ) = { t ∈ Λ(Σ 2 ) | ∃ u ∈ A ( G ) . t = L ( u ) }

9 ACG parsing: the general case Some properties • Membership is decidable if and only if Multiplicative Exponential Linear Logic is decidable. • Membership for lexicalized ACGs is NP-complete. • Membership for second-order ACGs is polynomial.

10 ACG parsing: the general case Examples

11 ACG parsing: the general case Strings as linear λ -terms There is a canonical way of representing strings as linear λ -terms. It consists of representing strings as function composition: /abbac/ = λx. a ( b ( b ( a ( c x )))) In this setting: △ ǫ = λx. x △ α + β = λx. α ( β x )

12 ACG parsing: the general case Signatures Σ 0 : N , NP , S : type J : NP U : N A : N − ◦ (( NP − ◦ S ) − ◦ S ) S : (( NP − ◦ S ) − ◦ S ) − ◦ ( NP − ◦ S ) Σ 1 : a, John, seeks, unicorn : STRING Σ 2 : ι, o : type ∧ : o − ◦ ( o − ◦ o ) : ( ι → o ) − ∃ ◦ o : ι j : unicorn ι − ◦ o : ι − ◦ ( ι − ◦ o ) find : ι − ◦ (( ι − ◦ o ) − ◦ o ) try

13 ACG parsing: the general case Lexicons L 1 : Σ 0 → Σ 1 N , NP , S := STRING J := / John / U := / unicorn / A := λx. λp. p ( / a / + x ) S := λp. λx. p ( λy. x + / seeks / + y ) L 2 : Σ 0 → Σ 2 := i → o N := i NP S := o J := j U := λx. unicorn x A := λp. λq. ∃ x. p x ∧ q x S := λp. λx. try x ( λy. p ( λz. find y z ))

14 ACG parsing: the general case We have that: L 1 (S (A U) J) = / John / + / seeks / + / a / + / unicorn / L 2 (S (A U) J) = try j ( λx. ∃ y. unicorn y ∧ find x y ) L 1 (A U ( λx. S ( λk. k x ) J)) = / John / + / seeks / + / a / + / unicorn / L 2 (A U ( λx. S ( λk. k x ) J)) = ∃ y. unicorn y ∧ try j ( λx. find x y )

15 ACG parsing: the general case A language-theoretic example Abstract vocabulary: : type A, L, S H : ( A − ◦ A − ◦ A − ◦ S ) − ◦ S I : L − ◦ S E : L C : A − ◦ L − ◦ L Lexicon: A, L, S := string H := λf. f /a/ /b/ /c/ I := λf. λx. f x E := ǫ C := λx. λy. x + y Typically: H ( λx 11 x 12 x 13 . H ( λx 21 x 22 x 23 . . . . I ( C x ij ( C x kl . . . ( C x mn E ) . . . )) . . . )) : S

16 ACG parsing: the general case Some Key Properties

17 ACG parsing: the general case Curry-Howard isomorphism Coherence theorem Principal typing Subject reduction Subject expansion

18 ACG parsing: the general case Constructing a Parsing Algorithm

19 ACG parsing: the general case Back to the example H := λf. f ( λz. a z ) ( λz. b z ) ( λz. c z ) : ( A − ◦ A − ◦ A − ◦ S ) − ◦ S I := λf. λx. f x : L − ◦ S E := λx. x : L C := λx. λy. λz. x ( y z ) : A − ◦ L − ◦ L A, L, S := s − ◦ s λz. a ( c ( b ( a ( b ( c z ))))) ?

20 ACG parsing: the general case A first non deterministic algorithm 1. Try to prove S using the types of the abstract constants as proper axioms. I.e, prove S using ( A − ◦ A − ◦ A − ◦ S ) − ◦ S , L − ◦ S , L , and A − ◦ L − ◦ L . 2. By the Curry-Howard isomorphism, you have constructed a term of the abstract language. Apply the lexicon to this term. 3. Check whether the resulting object term is equal to the term you have to parse.

21 ACG parsing: the general case The Coherence Theorem comes in 1. Specialize the object signature by distinguishing between the different occurrences of a same object constant in the term to be parsed: a 1 : s 5 − ◦ s 6 a 2 : s 2 − ◦ s 3 b 1 : s 3 − ◦ s 4 b 2 : s 1 − ◦ s 2 c 1 : s 4 − ◦ s 5 : c 2 s 0 − ◦ s 1 λz. a 1 ( c 1 ( b 1 ( a 2 ( b 2 ( c 2 z ))))) : s 0 − ◦ s 6 2. Specialize the lexical entries accordingly: λf. f ( λz. a 1 z ) ( λz. b 1 z ) ( λz. c 1 z ) : · · · λf. f ( λz. a 1 z ) ( λz. b 1 z ) ( λz. c 2 z ) : · · · . . . : . . .

22 ACG parsing: the general case 3. Try to prove � S, s 0 − ◦ s 6 � using: � ( A − ◦ A − ◦ A − ◦ S ) − ◦ S, (( s 5 − ◦ s 6 ) − ◦ ( s 3 − ◦ s 4 ) − ◦ ( s 4 − ◦ s 5 ) − ◦ ( s 0 − ◦ s 0 )) − ◦ ( s 0 − ◦ s 0 ) � � ( A − ◦ S ) − ◦ A − ◦ A − ◦ S, (( s 5 − ◦ s 6 ) − ◦ ( s 3 − ◦ s 4 ) − ◦ ( s 4 − ◦ s 5 ) − ◦ ( s 0 − ◦ s 1 )) − ◦ ( s 0 − ◦ s 1 ) � . . . � ( A − ◦ A − ◦ A − ◦ S ) − ◦ S, (( s 5 − ◦ s 6 ) − ◦ ( s 3 − ◦ s 4 ) − ◦ ( s 0 − ◦ s 1 ) − ◦ ( s 0 − ◦ s 0 )) − ◦ ( s 0 − ◦ s 0 ) � � ( A − ◦ A − ◦ A − ◦ S ) − ◦ S, (( s 5 − ◦ s 6 ) − ◦ ( s 3 − ◦ s 4 ) − ◦ ( s 0 − ◦ s 1 ) − ◦ ( s 0 − ◦ s 1 )) − ◦ ( s 0 − ◦ s 1 ) � . . . � L − ◦ S, ( s 0 − ◦ s 0 ) − ◦ ( s 0 − ◦ s 0 ) � ◦ S, ( s 0 − ◦ s 1 ) − ◦ ( s 0 − ◦ s 1 ) � � L − . . .

23 ACG parsing: the general case Eliminating redundancies Consider the following pair: � ( A − ◦ A − ◦ A − ◦ S ) − ◦ S, (( s 5 − ◦ s 6 ) − ◦ ( s 3 − ◦ s 4 ) − ◦ ( s 4 − ◦ s 5 ) − ◦ ( s 0 − ◦ s 0 )) − ◦ ( s 0 − ◦ s 0 ) � The shape of the specialized object type is completely specified by the grammar. The only relevant information is given by the indices. Replace the above pair by the following formula: ( A [5 , 6] − ◦ A [3 , 4] − ◦ A [4 , 5] − ◦ S [0 , 0]) − ◦ S [0 , 0]

24 ACG parsing: the general case Principal typing Factorize the several formulas coming from a given lexical entry, ( A [5 , 6] − ◦ A [3 , 4] − ◦ A [4 , 5] − ◦ S [0 , 0]) − ◦ S [0 , 0] ( A [5 , 6] − ◦ A [3 , 4] − ◦ A [4 , 5] − ◦ S [0 , 1]) − ◦ S [0 , 1] . . . ( A [5 , 6] − ◦ A [3 , 4] − ◦ A [0 , 1] − ◦ S [0 , 0]) − ◦ S [0 , 0] ( A [5 , 6] − ◦ A [3 , 4] − ◦ A [0 , 1] − ◦ S [0 , 1]) − ◦ S [0 , 1] . . . as follows: a [ i, j ] , b [ k, l ] , c [ m, n ] ⊢ ( A [ i, j ] − ◦ A [ k, l ] − ◦ A [ m, n ] − ◦ S [ o, p ]) − ◦ S [ o, p ]

25 ACG parsing: the general case We end up with the following proof search problem: Formulas coming from the lexicon: a [ i, j ] , b [ k, l ] , c [ m, n ] ⊢ ( A [ i, j ] − ◦ A [ k, l ] − ◦ A [ m, n ] − ◦ S [ o, p ]) − ◦ S [ o, p ] ⊢ L [ i, j ] − ◦ S [ i, j ] ⊢ L [ i, i ] ⊢ A [ i, j ] − ◦ L [ k, i ] − ◦ L [ k, j ] Query (coming from the term to be parsed): a [5 , 6] , c [4 , 5] , b [3 , 4] , a [2 , 3] , b [1 , 2] , c [0 , 1] ⊢ S [0 , 6]

26 ACG parsing: the general case Correctness and Completeness Correctness : by subject reduction. Completeness : by subject expansion.

Abstract Categorial Grammar Parsing the general case in Honor of G - PowerPoint PPT Presentation

1 ACG parsing: the general case Abstract Categorial Grammar Parsing the general case in Honor of G erard Huet Philippe de Groote Inria-Lorraine 2 ACG parsing: the general case Content 1 Definition of ACG 2 Examples 3 Some Key

Categorial Grammar C a gr C oltekin c.coltekin@rug.nl November 18, 2008 1 / 28

Introduction to Bottom-Up Parsing Shift-reduce parsing The LR parsing algorithm

Working Together What does his future hold? Carres Grammar School Carres Grammar School

Combinatory Categorial Grammar The effort to develop natural language grammars and

Plan Introduction 1 On categorial grammars and learnability 2 Logical Information Systems

Prosodic Marking of Focus Domains German German Categorial or Gradient Categorial and

Surface Reasoning Lecture 2: Logic and Grammar Thomas Icard June 18-22, 2012 Thomas Icard:

Categorial Grammar Raffaella Bernardi Contents First Last Prev Next Contents 1

Parsing Jazz: Harmonic Analysis of Music Using Combinatory Categorial Grammar Mark

CSC 4181 Compiler Construction Parsing 1 1 Outline Top-down v.s. Bottom-up Top-down parsing

Grammar and word order Grammar and word order Grammar Grammar Includes morphology and syntax

Combinatory Categorial Grammar (CCG) 11-711 Algorithms for NLP 22 October 2019 (With thanks to

Lambek Calculus Extended with Subexponential and Bracket Modalities Max Kanovich, Stepan

Composing questions: A hybrid categorial approach Yimei Xiang Harvard University

General Context-Free Grammar Parsing: Application of grammar rewrite rules A phrase structure

General Context-Free Grammar Parsing Application of grammar rewrite rules A phrase structure

Grammar and graphical semiotics in early syntactic diagrams: Clark (1847) and Reed-Kellogg (1876)

CSC 452 File Systems Files Jonathan Misurda jmisurda@cs.arizona.edu File Naming File

GETTING STARTED AND BEST PRACTICES Jeff Goldsmith, PhD Department of Biostatistics 1

CS 2112 Lab 2: Javadoc and I/O September 9 / 11, 2019 Importing A2 Create a new Eclipse

For Thursday No new reading Homework: Chapter 23, exercise 15 Homework Instructions

Exercise 4.53. In each case, given the context-free grammar G , find an c. equivalent CFG with no

2. Lexical Analysis 2.1 Tasks of a Scanner 2.2 Regular Grammars and Finite Automata 2.3 Scanner

Gbor Csernyi Department of English Linguistics University of Debrecen