Broad-coverage CCG Semantic Parsing with AMR Yoav Artzi Kenton Lee - - PowerPoint PPT Presentation

broad coverage ccg semantic parsing with amr
SMART_READER_LITE
LIVE PREVIEW

Broad-coverage CCG Semantic Parsing with AMR Yoav Artzi Kenton Lee - - PowerPoint PPT Presentation

Combinatory Abstract Categorial Meaning Grammar Representation Broad-coverage CCG Semantic Parsing with AMR Yoav Artzi Kenton Lee Luke Zettlemoyer Cornell University University of Washington Cornell Tech Semantic Parsing Show


slide-1
SLIDE 1

Broad-coverage CCG Semantic Parsing with AMR

Yoav Artzi Cornell University Kenton Lee Luke Zettlemoyer University of Washington

Cornell Tech

Combinatory Categorial Grammar Abstract Meaning Representation

slide-2
SLIDE 2

Semantic Parsing

Show me all papers about semantic parsing

Grammar

λx.paper(x) ∧ topic(x, SEMANTIC PARSING)

Less Supervision More Domains Situated Parsing

Answers Demonstrations Conversations Databases Large Knowledge-bases Instructions Web Tables Time Spatial Observations Linguistic Context Database Content

slide-3
SLIDE 3

Semantic Parsing

Show me all papers about semantic parsing

Grammar

λx.paper(x) ∧ topic(x, SEMANTIC PARSING)

Less Supervision More Domains Situated Parsing Non-compositional Semantics Broad-coverage Grammar Induction

slide-4
SLIDE 4

Abstract Meaning Representation

Pyongyang officials denied their involvement

deny-01

instance ARG0 ARG1 instance

involve-01

instance

person

ARG0-of

have-org-role-91

instance ARG1

city

instance name instance

name

ARG2 instance

  • fficial
  • p1

“Pyongyang”

ARG1

slide-5
SLIDE 5

AMR: Instances

Pyongyang officials denied their involvement

deny-01

instance ARG0 ARG1 instance

involve-01

instance

person

ARG0-of

have-org-role-91

instance ARG1

city

instance name instance

name

ARG2 instance

  • fficial
  • p1

“Pyongyang”

ARG1

slide-6
SLIDE 6

AMR: Instances

Pyongyang officials denied their involvement

deny-01

instance ARG0 ARG1 instance

involve-01

instance

person

ARG0-of

have-org-role-91

instance ARG1

city

instance name instance

name

ARG2 instance

  • fficial
  • p1

“Pyongyang”

ARG1

slide-7
SLIDE 7

AMR: Relations

Pyongyang officials denied their involvement

deny-01

instance ARG0 ARG1 instance

involve-01

instance

person

ARG0-of

have-org-role-91

instance ARG1

city

instance name instance

name

ARG2 instance

  • fficial
  • p1

“Pyongyang”

ARG1

slide-8
SLIDE 8

Pyongyang officials denied their involvement

AMR: Relations

deny-01

instance ARG0 ARG1 instance

involve-01

instance

person

ARG0-of

have-org-role-91

instance ARG1

city

instance name instance

name

ARG2 instance

  • fficial
  • p1

“Pyongyang”

ARG1

slide-9
SLIDE 9

AMR: Relations

Pyongyang officials denied their involvement

deny-01

instance ARG0 ARG1 instance

involve-01

instance

person

ARG0-of

have-org-role-91

instance ARG1

city

instance name instance

name

ARG2 instance

  • fficial
  • p1

“Pyongyang”

ARG1

slide-10
SLIDE 10

AMR and Combinatory Categorial Grammar

Challenges:

  • Distant non-compositional dependencies
  • Longer sentences
  • Higher syntactic variability

Great opportunity study CCG semantic parsing at scale

slide-11
SLIDE 11

Parsing Approach

  • Use CCG to recover compositional parse structure
  • Second stage to resolve non-compositional

phenomena, such as co-reference resolution

slide-12
SLIDE 12

Combinatory Categorial Grammar

Category Combinators Lexicon

Assign category to words

S\NP/NP : λx.λy.λd.deny-01(d) ∧ ARG0(d, y) ∧ ARG1(d, x)

Unary and binary operators to combine categories denied

S\NP/NP : λx.λy.λd.deny-01(d)∧ ARG0(d, y) ∧ ARG1(d, x)

Syntax Semantics

slide-13
SLIDE 13

CCG

Logical Form Entries from Lexicon Parse Steps

Lexicon Combinators

Learned

CCG is fun NP S\NP/ADJ ADJ CCG λf.λx.f(x) λx.fun(x)

>

S\NP λx.fun(x)

<

S fun(CCG)

slide-14
SLIDE 14

AMR to Lambda Calculus

Pyongyang officials denied their involvement

deny-01

instance ARG0 ARG1 instance

involve-01

instance

person

ARG0-of

have-org-role-91

instance ARG1

city

instance name instance

name

ARG2 instance

  • fficial
  • p1

“Pyongyang”

ARG1

slide-15
SLIDE 15

AMR to Lambda Calculus

Pyongyang officials denied their involvement

deny-01 instance ARG0 ARG1 instance involve-01 instance person ARG0-of have-org-role-91 instance ARG1 city instance name instance name ARG2 instance
  • fficial
  • p1
“Pyongyang” ARG1

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ ARG0-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ ARG2(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(2))))))

AMR Lambda Calculus

Deterministic Conversion

slide-16
SLIDE 16

AMR Lambda Calculus: Instances

Pyongyang officials denied their involvement

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ ARG0-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ ARG2(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(2))))))

slide-17
SLIDE 17

AMR Lambda Calculus: Relations

Pyongyang officials denied their involvement

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ ARG0-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ ARG2(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(2))))))

slide-18
SLIDE 18

AMR Lambda Calculus: Instances

Pyongyang officials denied their involvement

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ ARG0-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ ARG2(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(2))))))

d, A2(

Skolem ID Instance Quantifier

slide-19
SLIDE 19

AMR Lambda Calculus: References

Pyongyang officials denied their involvement

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ ARG0-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ ARG2(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(2))))))

d, A2(

i, R(2))

Skolem ID Instance Quantifier Reference Predicate

slide-20
SLIDE 20

Model

Pyongyang officials denied their involvement

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ ARG0-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ ARG2(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(2))))))

CCG?

slide-21
SLIDE 21

Model

Pyongyang officials denied their involvement

Underspecified Logical Form

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ ARG0-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ ARG2(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(2))))))

CCG Parse Constant Mapping

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ REL-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ REL(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(ID))))))

slide-22
SLIDE 22

REL(

A REL-of(

Model

Pyongyang officials denied their involvement

Underspecified Logical Form

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ ARG0-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ ARG2(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(2))))))

1(i, R(ID))))))

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ REL-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ REL(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(ID))))))

Reference placeholder Passive relation placeholder Active relation placeholder

slide-23
SLIDE 23

REL(

A REL-of(

Model

Pyongyang officials denied their involvement

Underspecified Logical Form

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ ARG0-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ ARG2(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(2))))))

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ REL-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ REL(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(ID))))))

A ARG0-of( A cause-of(

ARG2(

unit(

frequency(

slide-24
SLIDE 24

REL(

A REL-of(

Model

Pyongyang officials denied their involvement

Underspecified Logical Form

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ ARG0-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ ARG2(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(2))))))

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ REL-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ REL(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(ID))))))

A ARG0-of( A cause-of(

ARG2(

unit(

frequency(

slide-25
SLIDE 25

Model

Pyongyang officials denied their involvement

Underspecified Logical Form

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ ARG0-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ ARG2(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(2))))))

1(i, R(ID))))))

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ REL-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ REL(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(ID))))))

i, R(2))

i, R(1))

i, R(7))

slide-26
SLIDE 26

Model

Pyongyang officials denied their involvement

Underspecified Logical Form

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ ARG0-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ ARG2(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(2))))))

1(i, R(ID))))))

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ REL-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op1(n, PYONGYANG)))))∧ REL(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(ID))))))

i, R(2))

i, R(1))

i, R(7))

slide-27
SLIDE 27

Model Advantages

  • Reason about non-compositional distant

references, including:

  • Co-reference
  • Control structures (often compositional, but not

distinguished)

  • Defer certain compositional decisions from the

difficult CCG parsing problem

slide-28
SLIDE 28

CCG Parse

Derivation

Constant Mapping

Pyongyang
  • fficials
denied their involvement NP[sg] N[pl]\(N[pl]/N[pl]) S\NP/NP NP[pl] N[nb] A1(λc.city(c)∧ λf.λp.person(p)∧ λx.λy.λd.deny-01(d)∧ R(ID) λi.involve-01(i) name(c, A2(λn.name(n)∧ REL-of(p, A3(f(λh.have-org-role-91(h)∧ ARG0(d, y)∧
  • p(n, PYONGYANG))))
REL(h, A4(λo.official(o)))))) ARG1(d, x) < > > < A

A1(λd.deny-01(d) ∧ ARG0(d, A2(λp.person(p) ∧ ARG0-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c) ∧ name(c, A5(λn.name(n) ∧ op(n, PYONGYANG)))))∧ ARG2(h, A6(λo.official(o)))))) ∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(2)))))) A1(λd.deny-01(d) ∧ ARG0(d, A2(λp.person(p) ∧ REL-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c) ∧ name(c, A5(λn.name(n) ∧ op(n, PYONGYANG)))))∧ REL(h, A6(λo.official(o)))))) ∧ ARG1(d, A7(λi.involve-01(i) ∧ ARG1(i, R(ID))))))

slide-29
SLIDE 29

Log-linear Model

  • Given a sentence :
  • The probability of a logical form is:
  • The probability of a derivation is:

p(z | x; θ, Λ) = X

d∈D(z)

p(d | x; θ, Λ)

p(d | x; θ, Λ) = eθ·φ(x,d) P

d0∈D eθ·φ(x,d0)

ω ∈ Rm

φ : X × D → Rm

Weights CCG Lexicon Feature Function

Λ

z

x ∈ X d ∈ D

slide-30
SLIDE 30

CCG Parse

Inference

Constant

CKY parsing Factor graph

Pyongyang
  • fficials
NP[sg] N[pl]\(N[pl]/N[pl]) A1(λc.city(c)∧ λf.λp.person(p)∧ name(c, A2(λn.name(n)∧ REL-of(p, A3(f(λh.have-org-role-91(h)∧
  • p(n, PYONGYANG))))
REL(h, A4(λo.official(o)))))) < A

A1(λd.deny-01(d) ∧ ARG0(d, A2(λp.person(p) ∧ AR ARG1(h, A4(λc.city(c) ∧ name(c, A5(λn.name(n ARG2(h, A6(λo.official(o)))))) ∧ ARG1(d, A7(λi.in A1(λd.deny-01(d) ∧ ARG0(d, A2(λp.person(p) ∧ RE ARG1(h, A4(λc.city(c) ∧ name(c, A5(λn.name(n REL(h, A6(λo.official(o)))))) ∧ ARG1(d, A7(λi.in

Joint scoring

slide-31
SLIDE 31

Constant Mapping with a Factor Graph

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ REL-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op(n, PYONGYANG)))))∧ REL(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i)∧ ARG1(i, R(ID))))))

Build a factor graph for each underspecified logical form

slide-32
SLIDE 32

Factor Graph

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ REL-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op(n, PYONGYANG)))))∧ REL(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i)∧ ARG1(i, R(ID))))))

Each constant is a random variable

slide-33
SLIDE 33

Factor Graph

unit, prep-with, frequency, prep-against, compared-to, employed-by, ARG2, . . . unit-of, prep-with-of, frequency-of, prep-against-of, compared-to-of, employed-by-of, ARG0-of, . . .

1, 2, 3, 4, 5, 6, 7

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ REL-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op(n, PYONGYANG)))))∧ REL(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i)∧ ARG1(i, R(ID))))))

Potential mapping of placeholders defines assignments

slide-34
SLIDE 34

Factor Graph

A B

A1(λd.deny-01(d)∧ ARG0(d, A2(λp.person(p)∧ REL-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op(n, PYONGYANG)))))∧ REL(h, A6(λo.official(o))))))∧ ARG1(d, A7(λi.involve-01(i)∧ ARG1(i, R(ID))))))

Selectional preference features to specify REL to one of 67 active relations Features for resolving ID to 3

Features define factors to resolve underspecified constants

slide-35
SLIDE 35

Approach

  • Model:
  • Two-stage model for compositional semantics

and non-compositional distant references

  • Learning:
  • Lexicon induction
  • Parameter estimation
slide-36
SLIDE 36

Learning Algorithm Sketch

For T iterations:

  • For each training sample:
  • Two-pass generation of new lexical entries
  • Update the model lexicon
  • For each mini-batch of size M
  • Compute gradient with early updates
  • Apply update with AdaGrad

Lexicon Induction Parameter Estimation

slide-37
SLIDE 37

Learning Algorithm Sketch

For T iterations:

  • For each training sample:
  • Two-pass generation of new lexical entries
  • Update the model lexicon
  • For each mini-batch of size M
  • Compute gradient with early updates
  • Apply update with AdaGrad
slide-38
SLIDE 38

Two-pass Lexical Generation

  • Bottom-up: over-generate new entries and parse
  • Top-down: recursive splitting to complete partial

derivations

slide-39
SLIDE 39

Bottom-up Pass

Pyongyang officials denied their involvement

Generated Entries Templates Underspecified Logical Form

slide-40
SLIDE 40

Bottom-up Pass

Pyongyang officials denied their involvement

Generated Entries

CCG Parsing

Underspecified logical form

slide-41
SLIDE 41

Bottom-up Pass

Pyongyang officials denied their involvement

Generated Entries

Select lexical entries from max scoring correct derivation

Underspecified logical form

slide-42
SLIDE 42

Common Failure

Pyongyang officials denied their involvement

Generated Entries

  • High syntactic variation
  • Missing templates
  • No complete correct

derivation created Need to learn new templates

slide-43
SLIDE 43

Splitting CCG Categories

  • Introduced by Kwiatkowski et al. 2010
  • Approximately reverses CCG parsing operations
  • Explore new syntactic structures, learn new

templates

slide-44
SLIDE 44

Splitting CCG Categories

Given a CCG category C : h:

  • 1. Split logical form h to f and g s.t.:
  • r

f(g) = h λx.f(g(x)) = h

NP[nb] : λi.involve-01(i)∧ ARG1(i, R(ID)) R(ID) λx.λi.involve-01(i) ∧ ARG1(i, x) λf.λi.f(i) ∧ ARG1(i, R(ID)) λi.involve-01(i)

slide-45
SLIDE 45

Splitting CCG Categories

Given a CCG category C : h:

  • 1. Split logical form h to f and g s.t.:
  • r
  • 2. Infer syntax from logical form type

f(g) = h λx.f(g(x)) = h

NP[nb] : λi.involve-01(i)∧ ARG1(i, R(ID)) R(ID) λx.λi.involve-01(i) ∧ ARG1(i, x)

NP[x]/N[x] : N[nb] :

λf.λi.f(i) ∧ ARG1(i, R(ID)) λi.involve-01(i) NP[pl] : NP[nb]\NP :

slide-46
SLIDE 46

Top-down Pass

Pyongyang officials denied their involvement

Generated Entries

  • Given a packed chart

without a correct parse

slide-47
SLIDE 47

Top-down Pass

Pyongyang officials denied their involvement

Generated Entries

  • Starting from correct

logical form

  • Recursively split to

create a complete tree

Underspecified logical form

slide-48
SLIDE 48

Top-down Pass

Pyongyang officials denied their involvement

Generated Entries

  • Each split combines a

new category with an existing partial derivation

Underspecified logical form

slide-49
SLIDE 49

Splitting for CCG Induction

  • Kwiatkowski et al. 2010:
  • No restriction on result categories
  • Applied up to depth one
  • Our approach:
  • Combined with bottom-up template approach
  • Must connect to an existing partial derivation
  • Applied recursively
slide-50
SLIDE 50

Learning Algorithm Sketch

For T iterations:

  • For each training sample:
  • Two-pass generation of new lexical entries
  • Update the model lexicon
  • For each mini-batch of size M
  • Compute gradient with early updates
  • Apply update with AdaGrad
slide-51
SLIDE 51

Gradient Computation

Pyongyang officials denied their

Underspecified logical form

  • If a correct derivation exists:
  • Compute gradient with

inside-outside

  • Re-normalize with

constant mapping features

slide-52
SLIDE 52

Common Failure

Pyongyang officials denied their

  • No correct derivation exists,

~40% of training data

  • Previous work assumed that

all (or at least most) corpus can be parsed

  • Instead: early updates
slide-53
SLIDE 53

Early Updates

  • Collins and Roark (2004):
  • Given fully labeled parse trees
  • Update with partial derivations
  • Challenge: derivation is latent
slide-54
SLIDE 54

Early Update with Latent Structures

Pyongyang officials denied their

  • Extract sub-expression from

underspecified logical form

  • For each sub-expression:
  • Identify largest max-

scoring partial derivation

  • Compute gradient
slide-55
SLIDE 55

Early Update with Latent Structures

Pyongyang officials denied their

Underspecified Logical Form

A2(λp.person(p)∧ REL-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op(n, PYONGYANG))))) REL(h, A6(λo.official(o)))))) A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op(n, PYONGYANG))))

slide-56
SLIDE 56

Early Update with Latent Structures

Pyongyang officials denied their

Underspecified Logical Form

A2(λp.person(p)∧ REL-of(p, A3(λh.have-org-role-91(h)∧ ARG1(h, A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op(n, PYONGYANG))))) REL(h, A6(λo.official(o)))))) A4(λc.city(c)∧ name(c, A5(λn.name(n) ∧ op(n, PYONGYANG))))

slide-57
SLIDE 57

Related Work

CCG Semantic Parsing

[Zettlemoyer and Collins 2005, 2007; Kwiatkowski et al. 2010, 2011; Artzi and Zettlemoyer 2013]

Skolem Terms for CCG

[Steedman 2011]

AMR Evaluation

[Cai and Knight 2013]

Graph-based Parsing for AMR

[Flanigan et al. 2014]

Dependency Structure Transformation for AMR

[Wang et al. 2015a, 2015b]

Syntax-based MT for AMR

[Pust et al. 2015]

Rule-based Parsing for AMR

[Vanderwende et al. 2015]

AMR Applications

[Pan et al. 2015; Lin et al. 2015]

slide-58
SLIDE 58

Experimental Setup

  • AMR Bank release 1.0, proxy report portion
  • Evaluation metric: SMATCH [Cai and Knight 2013]
  • Features: lexical features, parsing operations, parsing

attachment, selectional preferences, control structures

  • Seed lexicon and templates:
  • 50 annotated sentences
  • Heuristic alignment from JAMR [Flanigan et al. 2014]
slide-59
SLIDE 59

Ablation Results

45.6 51.6 62.6 66.1

Full system w/o unrestricted lexical generation w/o early updates w/o surface-form similarity

  • Without early updates we fail to

learn effectively from much of the data

  • Poor performance without

heuristics demonstrates need for future work SMATCH F1

slide-60
SLIDE 60

Results

SMATCH F1

70 66.3 65.8 62.2 63.2

JAMR (fixed) Werling et al. 2015 Pust et al. 2015 Our Approach Wang et al. 2015b

  • AMR is getting a lot of attention!

… and will: SemEval 2016

  • Using solutions sub-problem

solution is a promising complimentary direction

slide-61
SLIDE 61

Contributions

  • Joint model for compositional and non-compositional

semantics

  • Scalable CCG induction for semantic parsing
  • First CCG approach to AMR
  • Code and models available in Cornell SPF: 


http://yoavarzti.com/spf

Cornell Tech

slide-62
SLIDE 62

[fin]