HOW BIG A SPOON SHOULD SYNTAX USE TO FEED SEMANTICS? Aravind K. - - PowerPoint PPT Presentation

how big a spoon should syntax use to feed semantics
SMART_READER_LITE
LIVE PREVIEW

HOW BIG A SPOON SHOULD SYNTAX USE TO FEED SEMANTICS? Aravind K. - - PowerPoint PPT Presentation

HOW BIG A SPOON SHOULD SYNTAX USE TO FEED SEMANTICS? Aravind K. Joshi University of Pennsylvania Philadelphia PA USA ESSLLI 2008 Workshop: What Syntax Feeds Semantics? Hamburg, August 13 2008 Outline Introduction Bigger spoon for


slide-1
SLIDE 1

HOW BIG A SPOON SHOULD SYNTAX USE TO FEED SEMANTICS?

Aravind K. Joshi University of Pennsylvania Philadelphia PA USA ESSLLI 2008 Workshop: What Syntax Feeds Semantics? Hamburg, August 13 2008

slide-2
SLIDE 2

esslli-08-syn-sem: 2

Outline

  • Introduction
  • Bigger spoon for CFG– LTAG
  • Derivation Tree and semantics computed from the

derivation tree

  • Flexible composition, Multicomponent LTAG, making

the spoon bigger

  • Some applications
  • Bigger spoon for a categorial grammar
  • Interaction with discourse
  • Summary
slide-3
SLIDE 3

esslli-08-syn-sem: 3

Introduction

  • Formal systems to specify a grammar formalism
  • Start with primitives (basic primitive structures or

building blocks) as simple as possible and then introduce various operations for constructing more complex structures

  • Conventional (mathematical) wisdom
  • Alternatively,
slide-4
SLIDE 4

esslli-08-syn-sem: 4

Introduction: CLSG

  • Start with complex primitives which directly

capture some crucial linguistic properties and then introduce some general operations for

  • perations for composing them
  • - Complicate Locally, Simplify Globally (CLSG)
  • CLSG approach is characterized by localizing

almost all complexity in the set of primitives, a key property

slide-5
SLIDE 5

esslli-08-syn-sem: 5

Introduction: CLSG – localization of complexity

  • Specification of the finite set of complex

primitives becomes the main task of a linguistic theory

  • CLSG pushes all dependencies to become

local, i. e. , they arise initially in the primitive structures to start with

slide-6
SLIDE 6

esslli-08-syn-sem: 6

CLSG approach

  • CLSG approach has led to several new insights into
  • Syntactic description
  • Semantic composition
  • Language generation
  • Statistical processing, Psycholinguistic properties
  • Discourse structure
slide-7
SLIDE 7

esslli-08-syn-sem: 7

Outline

  • Introduction
  • Bigger spoon for CFG– LTAG
  • Derivation Tree and semantics computed from the

derivation tree

  • Flexible composition, Multicomponent LTAG, making

the spoon bigger

  • Some applications
  • Bigger spoon for a categorial grammar
  • Interaction with discourse
  • Summary
slide-8
SLIDE 8

esslli-08-syn-sem: 8

  • agreement: person, number, gender
  • subcategorization: sleeps: null; eats: NP; gives:

NP NP; thinks: S

  • filler-gap: who did John ask Bill to invite e
  • word order: within and across clauses as in

scrambling and clitic movement

  • function – argument: all arguments of the

lexical anchor are localized

Localization of Dependencies

slide-9
SLIDE 9

esslli-08-syn-sem: 9

Localization of Dependencies

  • word-clusters (flexible idioms): non-compositional aspect
  • take a walk, give a cold shoulder to
  • word co-occurrences
  • lexical semantic aspects
  • statistical dependencies among heads
slide-10
SLIDE 10

esslli-08-syn-sem: 10

Strong lexicalization of CFG’s

Given a CFG, G, we want to construct a grammar G’ such that the elementary structures in G’ (each associated with a lexical item) (1) localize the dependencies (2) structures generated by G’ are the same as those generated by G then it can be shown that the composition operation of substitution alone is not sufficient. However, adding adjunction as another operation does the trick. Thus adjunction arises in the process of lexicalizing a CFG! Surprise: The resulting system is stronger than CFG’s both syntactically and semantically

slide-11
SLIDE 11

esslli-08-syn-sem: 11

Lexicalized TAG: LTAG

  • Finite set of elementary trees anchored on

lexical items

  • Elementary trees: Initial and Auxiliary
  • Operations: Substitution and Adjoining
  • Derivation:

– Derivation Tree

  • How elementary trees are put together.

– Derived tree

slide-12
SLIDE 12

esslli-08-syn-sem: 12

LTAG: Some Formal Properties

  • TAGs (more precisely, languages of TAGs) belong to

the class of languages called mildly context-sensitive languages (MCSL) characterized by

  • polynomial parsing complexity
  • grammars for the languages in this class can

characterize a limited set of patterns of nested and crossed dependencies and their combinations

  • languages in this class have the constant growth

property, i.e., sentences, if arranged in increasing

  • rder of length, grow only by a bounded amount
  • this class properly includes CFLs
slide-13
SLIDE 13

esslli-08-syn-sem: 13

α1:

S NP↓ V NP↓ likes

α2:

S NP↓ V NP↓ likes NP↓

e

S

transitive

  • bject extraction

some other trees for ‘likes’ subject extraction, topicalization, subject relative, object relative, passive, etc.

VP VP

LTAG: Examples

slide-14
SLIDE 14

esslli-08-syn-sem: 14

S NP↓ V NP↓ likes NP↓

e

S VP S NP↓ V S*

β1:

think VP

β2:

V S does S* NP↓ NP↓ NP↓ who Harry Bill

α3: α2: α4: α5:

LTAG: A derivation

slide-15
SLIDE 15

esslli-08-syn-sem: 15

S NP↓ V NP↓ likes NP↓

e

S VP S NP↓ V S*

β1:

think VP

β2:

V S does S* NP↓ NP↓ NP↓ who Harry Bill

α3: α2: α4: α5:

substitution adjoining

who does Bill think Harry likes

LTAG: A Derivation

slide-16
SLIDE 16

esslli-08-syn-sem: 16

LTAG: Derived Tree

S NP S V does S NP V think VP S NP V NP likes

e

VP who Harry Bill

who does Bill think Harry likes

slide-17
SLIDE 17

esslli-08-syn-sem: 17

who does Bill think Harry likes α2: likes α3: who β1:think α4:Harry β2: does α5: Bill

substitution adjoining

LTAG: Derivation Tree

  • Composition by lexical attachments (substitution and adjoining)
  • The derivation tree shows what attaches to what and where
  • Semantics to be defined on the derivation tree
  • - need for additional information?
  • Order of traversal of the nodes

1 2 2.1 1

slide-18
SLIDE 18

esslli-08-syn-sem: 18

Composition defined by the derivation tree

α2: S NP↓ VP V NP↓ hit

β1:

VP

VP* ADV repeatedly about: s2 John: x1 Bill: x2 hit( s1, x1 , x2 ) repeatedly(s2, s1) NP NP John

Bill

slide-19
SLIDE 19

esslli-08-syn-sem: 19

Attachments along the trunk (path from root to lexical anchor)

α3: S’ NPi↓ VP V NPi like NP↓

e

S β4: S NP↓ VP V S* think

β1:

VP VP*

seems V

α3(like) β4(think)

β1(seems)

2.2 2.2 2

( who do you think John seems to like) In the derivation tree seems and think are adjoined along the trunk

  • - uniform convention for scoping--lower

nodes before higher nodes along the trunk

slide-20
SLIDE 20

esslli-08-syn-sem: 20

Additional information on the derivation tree: Some alternatives

  • Additional links
  • Adding features
  • Extend the use of the addresses in the derivation tree

by adopting a uniform order of traversal of the tree

  • - post order traversal

Joshi and Vijayshanker, 1999, Frank and van Genbirth, 2001, Kallmeyer and Joshi, 2003, Joshi, Kallmeyer and Romero, 2003, Gardent and Kallmeyer 2004, Kallmeyer and Romero, 2004, Kallmeyer and Romero, 2008, …

slide-21
SLIDE 21

esslli-08-syn-sem: 21

Outline

  • Introduction
  • Bigger spoon for CFG– LTAG
  • Derivation Tree and semantics computed from the

derivation tree

  • Flexible composition, Multicomponent LTAG, making

the spoon bigger

  • Some applications
  • Bigger spoon for a categorial grammar
  • Interaction with discourse
  • Summary
slide-22
SLIDE 22

esslli-08-syn-sem: 22

Flexible Composition

α:

X

Split α at x

X X

α1: supertree of α at X α2: subtree of α at X Adjoining as Wrapping

slide-23
SLIDE 23

esslli-08-syn-sem: 23

α:

X

β:

X X

γ:

X X

β α wrapped around β i.e., the two components α1 1 and α2 are wrapped around β

α1: supertree of α at X α2: subtree of α at X

Flexible Composition

Adjoining as Wrapping

slide-24
SLIDE 24

esslli-08-syn-sem: 24

S V NP↓ likes NP(wh)↓

e

S VP S NP↓ V S*

β:

think VP

α:

substitution adjoining

Flexible Composition

Wrapping as substitutions and adjunctions

NP↓

  • We can also view this composition as

α wrapped around β

  • Flexible composition
slide-25
SLIDE 25

esslli-08-syn-sem: 25

S* V NP↓ likes NP(wh)↓

e

S VP S NP↓ V S*

β:

think VP

α:

substitution adjoining

Flexible Composition

Wrapping as adjunction and reverse adjunction

NP↓

α1: α2:

S

α1 and α2 are the two components of α α1 attached (adjoined) to the root node S of β α2 attached (reverse adjoined) at the foot node S of β Leads to multi-component TAG (MC-TAG)

slide-26
SLIDE 26

esslli-08-syn-sem: 26

α:

Multi-component LTAG (MC-LTAG)

α1: α2: β: β: The two components are used together in one composition

  • step. Both components attach to nodes in β,

, an elementary tree. This preserves locality. The representation can be used for both

  • - predicate-argument relationships
  • - non-p/a information such as scope, focus, etc.

(Making the spoon bigger)

slide-27
SLIDE 27

esslli-08-syn-sem: 27

Multicomponent LTAG (MC-LTAG) Generalizing on the adjoining as wrapping perspective leads to MC-LTAG.

  • A lexical item may be associated with a finite set of trees,

each tree in the set is a component

  • Set of components together provides an extended
  • The set of components together define one elementary
  • bject
  • The components are used together in one composition

step with the individual components being composed by attachments domain of locality

slide-28
SLIDE 28

esslli-08-syn-sem: 28

Multicomponent LTAG (MC-LTAG)

  • The representation can be used for both
  • - predicate argument relationships
  • - scoping information
  • The two pieces of information are together before the

single composition step

  • However, after the composition there may be

intervening material between the components

slide-29
SLIDE 29

esslli-08-syn-sem: 29

Tree-Local Multi-component LTAG (MC-LTAG)

  • How can the components of MC-LTAG compose

preserving locality of LTAG

  • Tree-Local MC-LTAG
  • - Components of a set compose only with an

elementary tree or an elementary component

  • Tree-Local MC-LTAGs are weakly equivalent

to LTAGs

  • However, Tree-Local MC-LTAGs provide structural

descriptions not obtainable by LTAGs

  • Increased strong generative power; hence supporting

more semantics

slide-30
SLIDE 30

esslli-08-syn-sem: 30

Scope ambiguities: Example

α1: α11: S* α12: NP DET N↓ every α2: α21: S* α22: NP DET N↓ some α3: S NP↓ VP V NP↓ hates α4: N student N course α5: ( every student hates some course)

slide-31
SLIDE 31

esslli-08-syn-sem: 31

Derivation with scope information: Example

α1: α11: S* α12: NP DET N↓ every α2: α21: S* α22: NP DET N↓ some α3: S NP↓ VP V NP↓ hates α4: N student N course α5: ( every student hates some course)

slide-32
SLIDE 32

esslli-08-syn-sem: 32

Derivation tree with scope information: Example

α3(hates) α11(E) α12(every) α22(some) α21(S) α4(student) α5(course)

1 2.2 2 2

( every student hates some course)

  • α11

11 and α21 21 are both adjoined at the root of α3(hates)

  • They can be adjoined in any order, thus representing the two

scope readings (underspecified representation)

  • The scope readings represented in the LTAG derivation itself
  • multiple adjunctions at the same node
slide-33
SLIDE 33

esslli-08-syn-sem: 33

  • Adding features

Kallmeyer and Romero, 2004, Kallmeyer and Romero, 2008, … Extend the use of the addresses in the derivation tree by adopting a uniform order of traversal of the tree

  • - post order traversal

Patterns of scope orderings

slide-34
SLIDE 34

esslli-08-syn-sem: 34

Other uses of tree-local MC-TAG

  • Misplaced adjectives
  • Parentheticals
  • Scrambling patterns
  • Clitic movement
slide-35
SLIDE 35

esslli-08-syn-sem: 35

Misplaced Adjectives

(1) An occasional sailor passed by

α1: α11: S* α12: N ADJ N∗

  • ccasional

NP DET N↓ an α2: S NP↓ VP passed by α4: N student α3:

slide-36
SLIDE 36

esslli-08-syn-sem: 36

Misplaced Adjectives

(1) An occasional sailor passed by

α1: α11: S* α12: N ADJ N∗

  • ccasional

NP DET N↓ a/an α2: S NP↓ VP passed by α4: N student α3:

slide-37
SLIDE 37

esslli-08-syn-sem: 37

Misplaced Adjectives

(1) An occasional sailor passed by

α1: α11: S* α12: N ADJ N∗

  • ccasional

NP DET N a/an α2: S NP↓ VP passed by sailor α3:

slide-38
SLIDE 38

esslli-08-syn-sem: 38

Misplaced Adjectives

(1) An occasional sailor passed by

α1: α11: S* α12: N ADJ N∗

  • ccasional

NP DET N a/an α2: S NP↓ VP passed by sailor α3:

slide-39
SLIDE 39

esslli-08-syn-sem: 39

Misplaced Adjectives

(1) An occasional sailor passed by

α1: α11: S* α12: N ADJ N

  • ccasional

NP DET a/an α2: S NP↓ VP passed by sailor

slide-40
SLIDE 40

esslli-08-syn-sem: 40

Parentheticals

(2) Hillary, Obama thinks, will win the primary An extension of tree local MC-TAG is required-- sister adjoining, which was developed by David Chiang (2000) for another purpose. With this extension we still have the weak equivalence with the standard TAG

slide-41
SLIDE 41

esslli-08-syn-sem: 41

Scrambling Patterns

  • Embedding of complement clauses in German

(1)Hans1 Peter2 Marie3 schwimmen3 lassen2 sah1

NP1 NP2 NP3 V3 V2 V1

(Hans saw Peter let/make Marie swim) Scrambled versions of (1) permuting the NP’s and keeping the order of V’s fixed as in (1) (Proper names, instead of full NPs are used for convenience)

slide-42
SLIDE 42

esslli-08-syn-sem: 42

VP VP NP e V VP NP VP* VP NP VP VP NP V e VP* Elementary Trees for a Scrambled Argument

Multi-component Tree (domination constraint) Standard single component tree

slide-43
SLIDE 43

esslli-08-syn-sem: 43

Scrambling: NP4 NP3 NP1 NP2 V4 V3 V2 V1 VP NP1 VP* VP VP NP4 e V4 VP NP2 VP VP NP2 V2 e VP* VP NP3 VP VP NP3 V3 e VP* VP NP4 VP* VP NP1 V1 e VP* VP

slide-44
SLIDE 44

esslli-08-syn-sem: 44

Clitic Climbing

  • Clitic placement can also be viewed as a word-
  • rder variation and described by using

MC-TAG as in scrambling Bleam 1998, 2002, Chen-Main. 2007

slide-45
SLIDE 45

esslli-08-syn-sem: 45

Outline

  • Introduction
  • Bigger spoon for CFG– LTAG
  • Derivation Tree and semantics computed from the

derivation tree

  • Flexible composition, Multicomponent LTAG, making

the spoon bigger

  • Some applications
  • Bigger spoon for a categorial grammar
  • Interaction with discourse
  • Summary
slide-46
SLIDE 46

esslli-08-syn-sem: 46

Bigger Spoon for Categorial Grammar

Partial proof trees as building blocks for a categorial grammar, Joshi and Kulick, Linguistics and Philosophy, 20, 1997 Partial proof trees, hybrid logic, and quantifier scope, Joshi, Kulick, and Kurtonina, ESSLLI 1999, Utrecht

slide-47
SLIDE 47

esslli-08-syn-sem: 47

Bigger Spoon for Categorial Grammar

likes (S\NP)/NP [NP] [NP] (S\NP) S

  • Each lexical item is associated with one or more (basic)

partial proof trees (PPT) obtained by unfolding arguments.

  • (PPT) is a finite set -- the set of basic types.
  • Informal description of the inference rule -- linking
slide-48
SLIDE 48

esslli-08-syn-sem: 48

Linking conclusion nodes to assumption nodes: an inference rule, stated informally the NP/N [N] NP man N apples NP likes (S\NP)/NP [NP] [NP] (S\NP) S the man likes apples

slide-49
SLIDE 49

esslli-08-syn-sem: 49

Bigger spoon

passionately [(S\NP)] (S\NP)\ (S\NP*) (S\NP*)

  • No unfolding past an argument marked by *
  • Thus unfolded arguments are only those which are the

arguments of the lexical item.

slide-50
SLIDE 50

esslli-08-syn-sem: 50

Stretching and linking – an inference rule

A proof tree can be stretched at any node. u v w X Y A proof tree to be stretched at the node X.

slide-51
SLIDE 51

esslli-08-syn-sem: 51

Stretching a proof tree at node X

u v w X Y u v w X Y [X] X is the conclusion from v Y is the conclusion from u [X] w i.e., from u, assumption X and w Linking X to [X] we have the original proof tree.

slide-52
SLIDE 52

esslli-08-syn-sem: 52

Stretching and linking -- an example

likes (S\NP)/NP [NP] [NP] (S\NP) S Stretching at the indicated node

slide-53
SLIDE 53

esslli-08-syn-sem: 53

Stretching and linking -- an example

likes (S\NP)/NP [NP] [NP] (S\NP) S [S\NP]

slide-54
SLIDE 54

esslli-08-syn-sem: 54

Stretching and linking -- an example

likes (S\NP)/NP [NP] [NP] (S\NP) S [S\NP)] passionately [(S\NP)] (S\NP)\ (S\NP*) (S\NP*) John likes apples passionately

slide-55
SLIDE 55

esslli-08-syn-sem: 55

Bigger Spoon for Categorial Grammar

likes (S\NP)/NP [NP] [NP] (S\NP) S NP/N N NP

every student

[S] S(every)

slide-56
SLIDE 56

esslli-08-syn-sem: 56

Bigger Spoon for Categorial Grammar

likes (S\NP)/NP [NP] (S\NP) S NP/N N NP

every student

S(every) some > every

slide-57
SLIDE 57

esslli-08-syn-sem: 57

Bigger Spoon for Categorial Grammar

likes (S\NP)/NP [NP] [NP] (S\NP) S NP/N N NP

some course

[S] S(some) every > some

slide-58
SLIDE 58

esslli-08-syn-sem: 58

Outline

  • Introduction
  • Bigger spoon for CFG– LTAG
  • Derivation Tree and semantics computed from the

derivation tree

  • Flexible composition, Multicomponent LTAG, making

the spoon bigger

  • Some applications
  • Bigger spoon for a categorial grammar
  • Interaction with discourse
  • Summary
slide-59
SLIDE 59

esslli-08-syn-sem: 59

Interaction with Discourse

  • Sometimes syntax should hold the spoon back

form semantics for a while

  • -Avoid delivering a complete structure even when

there is no ambiguity (1) John said Bill left

  • Role of attribution in discourse
  • - Illustrated with some examples from the

Penn Discourse Treebank (PDTB)

slide-60
SLIDE 60

esslli-08-syn-sem: 60

  • Attribution features are annotated for
  • Explicit connectives
  • Implicit connectives
  • AltLex (Lexical phrases behaving as connectives)

 34% of discourse relations are attributed to an agent other than the writer. PDTB Annotations: Attributions

slide-61
SLIDE 61

esslli-08-syn-sem: 61

  • Attribution captures the relation of “ownership” between agents and Abstract

Objects. But it is not a discourse relation!

  • Attribution is annotated in the PDTB to capture:

(1) How discourse relations and their arguments can be attributed to different individuals:

  • When Mr. Green won a $240,000 verdict in a land condemnation case

against the state in June 1983, [he says] [he says] Judge O’Kicki unexpectedly awarded him an additional $100,000. ⇒Relation and Arg2 are attributed to the Writer. ⇒Arg1 is attributed to another agent.

Attribution

slide-62
SLIDE 62

esslli-08-syn-sem: 62

There have been no orders for the Cray-3 so far, though the company says it is talking with several prospects. Discourse semantics: contrary-to-expectation relation between “there being no orders for the Cray-3” and “there being a possibility of some prospects”. Sentence semantics: contrary-to-expectation relation between “there being no orders for the Cray-3” and “the company saying something”. Mismatch between sentence level semantics and discourse level semantics

slide-63
SLIDE 63

esslli-08-syn-sem: 63

Although takeover experts said they doubted Mr. Steinberg will make a bid by himself, the application by his Reliance Group Holdings Inc. could signal his interest in helping revive a failed labor-management bid. Discourse semantics: contrary-to-expectation relation between “Mr. Steinberg not making a bid by himself” and “the RGH application signaling his bidding interest”. Sentence semantics: contrary-to-expectation relation between “experts saying something” and “the RGH application signaling Mr. Steinberg’s bidding interest”. Mismatch between sentence level semantics and discourse level semantics

slide-64
SLIDE 64

esslli-08-syn-sem: 64

  • Advocates said the 90-cent-an-hour rise, to $4.25 an hour by

April 1991, is too small for the working poor, while

  • pponents argued that the increase will still hurt small

business and cost many thousands of jobs. Attribution cannot always be excluded by default

Working with derivation trees can help as elementary trees corresponding to attributions may be easily included or left out as needed

slide-65
SLIDE 65

esslli-08-syn-sem: 65

Summary

  • Larger elementary structures as building blocks for

localizing dependencies

  • - adjunction (besides substitution) as a composition
  • peration arises naturally
  • Compositional semantics computed on the derivation tree
  • n an LTAG and not on the derived tree
  • - MRS type representation arises naturally
slide-66
SLIDE 66

esslli-08-syn-sem: 66

Summary

  • Multicomponent LTAGs arise naturally out of flexible

composition

  • - attachments for predicate argument composition

distinguished from “scope” type composition

  • Same game played out for categorial grammar
  • Interaction with discourse
  • - sometimes syntax has to hold back some spoonfuls

from semantics

  • - possible role for the derivation trees