Global Neural CCG Parsing with Optimality Guarantees
Kenton Lee Mike Lewis† Luke Zettlemoyer University of Washington
† Now at Facebook AI Research
1
UWNLP
Global Neural CCG Parsing with Optimality Guarantees Kenton Lee Mike - - PowerPoint PPT Presentation
Global Neural CCG Parsing with Optimality Guarantees Kenton Lee Mike Lewis Luke Zettlemoyer University of Washington UWNLP Now at Facebook AI Research 1 This Talk Challenge : Global models (e.g. Recursive NNs) break dynamic programs
Kenton Lee Mike Lewis† Luke Zettlemoyer University of Washington
† Now at Facebook AI Research
1
UWNLP
Fruit
NP/NP
flies
NP
like
(S\NP)/NP
bananas
NP NP S\NP S
2
Challenge: Global models (e.g. Recursive NNs) break dynamic programs
Fruit
NP/NP
flies
NP
like
(S\NP)/NP
bananas
NP NP S\NP S
Challenge: Global models (e.g. Recursive NNs) break dynamic programs Our approach: Combine local and global models in A* parser Result: Global model with exact inference
3
Input Output
Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >
NP S\NP
<
S
4
Klein and Manning, 2001
Input Output
Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >
NP S\NP
<
S
∅ Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>
NP like bananas (S\NP)/NP NP
>
S\NP Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >
NP S\NP
<
S
Klein and Manning, 2001
5
∅ Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>
NP like bananas (S\NP)/NP NP
>
S\NP Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >
NP S\NP
<
S
Input Output
Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >
NP S\NP
<
S
Nodes represent partial parses
Klein and Manning, 2001
6
Input Output
Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >
NP S\NP
<
S
Hyperedges represent rule productions
∅ Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>
NP like bananas (S\NP)/NP NP
>
S\NP Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >
NP S\NP
<
S
7
Klein and Manning, 2001
∅ Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>
NP like bananas (S\NP)/NP NP
>
S\NP Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >
NP S\NP
<
S
Input Output
Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >
NP S\NP
<
S
Path represents a parse derivation
y = {e1, . . . , em}
8
Klein and Manning, 2001
Input Output
∅ Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP like bananas (S\NP)/NP NP
>S\NP Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S
9
Input Output
∅ Fruit NP flies NP\NP like (S\NP)/NP bananas NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S ∅ Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP like bananas (S\NP)/NP NP
>S\NP Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S
10
∅ Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP like bananas (S\NP)/NP NP
>S\NP Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S ∅ Fruit NP flies NP\NP like (S\NP)/NP bananas NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S
Input Output
Each hyperedge is weighted with a score
e g(e)
11
Input Output
∅ Fruit NP flies NP\NP like (S\NP)/NP bananas NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S ∅ Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP like bananas (S\NP)/NP NP
>S\NP Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S
Score of parse derivation:
g(y) = X
e∈y
g(e)
12
13
Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >
NP S\NP
<
S Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >
NP S\NP
<
S
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP > NP Fruit flies NP NP\NP < NP like bananas (S\NP)/NP NP > S\NP like bananas (S\S)/NP NP > S\S Fruit flies NP S\NP < S Fruit flies like bananas NP/NP NP (S\NP)/NP NP > > NP S\NP < S Fruit flies like bananas NP NP\NP (S\NP)/NP NP < > NP S\NP < S Fruit flies like bananas NP S\NP (S\S)/NP NP < > S S\S < S14
❖ Predicted parse: ❖ Exponential number of nodes
Intractable inference
y∗ = argmax
y∈Y
g(y)
15
Approximate inference with global expressivity, e.g.
16
❖ Greedy / beam search: ❖ Nivre, 2008 ❖ Chen and Manning, 2014 ❖ Andor et al., 2016 ❖ Reranking: ❖ Charniak and Johnson, 2005 ❖ Huang, 2008 ❖ Socher et al., 2013
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies
?NP like bananas
?S\NP like bananas
?S\S Fruit flies
?S Fruit flies like bananas
?S
Scores condition on local structures
❖ Make locality assumptions: ❖ e.g. features are local to CFG
productions
❖ Polynomial number of nodes ❖ Dynamic programs enable
tractable inference
17
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies
?NP like bananas
?S\NP like bananas
?S\S Fruit flies
?S Fruit flies like bananas
?S
Scores condition on local structures
18
Dynamic programs with locally factored models, e.g.
❖ CKY: ❖ Collins, 1997 ❖ Durrett and Klein, 2015 ❖ Minimum spanning tree: ❖ McDonald et al., 2005 ❖ Kiperwasser and Goldberg, 2016
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies
?NP like bananas
?S\NP like bananas
?S\S Fruit flies
?S Fruit flies like bananas
?S
Scores condition on local structures
19
Dynamic programs with locally factored models, e.g.
❖ CKY: ❖ Collins, 1997 ❖ Durrett and Klein, 2015 ❖ Minimum spanning tree: ❖ McDonald et al., 2005 ❖ Kiperwasser and Goldberg, 2016
y∗ = argmax
y∈Y
y∈Y
Global model: Local model:
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP > NP Fruit flies NP NP\NP < NP like bananas (S\NP)/NP NP > S\NP like bananas (S\S)/NP NP > S\S Fruit flies NP S\NP < S Fruit flies like bananas NP/NP NP (S\NP)/NP NP > > NP S\NP < S Fruit flies like bananas NP NP\NP (S\NP)/NP NP < > NP S\NP < S Fruit flies like bananas NP S\NP (S\S)/NP NP < > S S\S < S ∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies ? NP like bananas ? S\NP like bananas ? S\S Fruit flies ? S Fruit flies like bananas ? SEfficient Expressive Inexpressive Intractable
20
Combined model:
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP > NP Fruit flies NP NP\NP < NP like bananas (S\NP)/NP NP > S\NP like bananas (S\S)/NP NP > S\S Fruit flies NP S\NP < S Fruit flies like bananas NP/NP NP (S\NP)/NP NP > > NP S\NP < S Fruit flies like bananas NP NP\NP (S\NP)/NP NP < > NP S\NP < S Fruit flies like bananas NP S\NP (S\S)/NP NP < > S S\S < SEfficient Expressive
y∗ = argmax
y∈Y
❖ Background: A* parsing ❖ Combined global and local parsing model ❖ Learning to search accurately and efficiently ❖ Experiments on CCGBank 22
y∈Y
❖ Search in the space of partial parses ❖ First explored full parse guaranteed to be optimal
23
Klein and Manning, 2003
Fruit flies like bananas (S\NP)/NP NP
>
S\NP
Partial parse
24
Partial parse
Fruit flies like bananas (S\NP)/NP NP
>
S\NP
25
Exploration priority
Partial parse
Fruit flies like bananas (S\NP)/NP NP
>
S\NP
26
Inside score
Exploration priority Admissible A* heuristic
Fruit flies like bananas (S\NP)/NP NP > S\NP?
Fruit flies like bananas (S\NP)/NP NP
>
S\NP
Fruit flies like bananas (S\NP)/NP NP
>
S\NP
?
Fruit flies like bananas (S\NP)/NP NP > S\NP?
27
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP like bananas (S\S)/NP NP
>S\S Fruit flies NP S\NP
<S Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S Fruit flies like bananas NP S\NP (S\S)/NP NP
< >S S\S
<S
explored agenda unexplored
28
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP like bananas (S\S)/NP NP
>S\S Fruit flies NP S\NP
<S Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S Fruit flies like bananas NP S\NP (S\S)/NP NP
< >S S\S
<S
Agenda position
1 4.5 2 3.1 3 1.9 4
bananas NP
f(y)
like (S\NP)/NP
Fruit NP
Fruit NP/NP
explored agenda unexplored
29
Agenda position
1 4.5 2 3.1 3 1.9 4
like (S\NP)/NP
Fruit NP
Fruit NP/NP
f(y)
bananas NP
bananas NP ∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP like bananas (S\S)/NP NP
>S\S Fruit flies NP S\NP
<S Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S Fruit flies like bananas NP S\NP (S\S)/NP NP
< >S S\S
<S
explored agenda unexplored
30
Agenda position
2 3.1 3 1.9 4
f(y)
bananas NP ∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP like bananas (S\S)/NP NP
>S\S Fruit flies NP S\NP
<S Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S Fruit flies like bananas NP S\NP (S\S)/NP NP
< >S S\S
<S
like (S\NP)/NP
Fruit NP
Fruit NP/NP
explored agenda unexplored
31
Agenda position
1 3.1 2 1.9 3
4
f(y)
bananas NP ∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP like bananas (S\S)/NP NP
>S\S Fruit flies NP S\NP
<S Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S Fruit flies like bananas NP S\NP (S\S)/NP NP
< >S S\S
<S
flies NP
like (S\NP)/NP
Fruit NP
Fruit NP/NP
explored agenda unexplored
32
Agenda position
1 3.1 2 1.9 3
4
f(y)
like (S\NP)/NP
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP like bananas (S\S)/NP NP
>S\S Fruit flies NP S\NP
<S Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S Fruit flies like bananas NP S\NP (S\S)/NP NP
< >S S\S
<S
flies NP
Fruit NP
Fruit NP/NP
explored agenda unexplored
33
Agenda position
2 1.9 3
4
f(y)
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP like bananas (S\S)/NP NP
>S\S Fruit flies NP S\NP
<S Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S Fruit flies like bananas NP S\NP (S\S)/NP NP
< >S S\S
<S
flies NP
Fruit NP
Fruit NP/NP
explored agenda unexplored
34
Agenda position
1 2.1 2 1.9 3
4
f(y)
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP like bananas (S\S)/NP NP
>S\S Fruit flies NP S\NP
<S Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S Fruit flies like bananas NP S\NP (S\S)/NP NP
< >S S\S
<S
like bananas (S\NP)/NP NP
>
S\NP
flies NP
Fruit NP
Fruit NP/NP
explored agenda unexplored
35
Agenda position
1 2.1 2 1.9 3
4
f(y)
like bananas (S\NP)/NP NP
>
S\NP
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP like bananas (S\S)/NP NP
>S\S Fruit flies NP S\NP
<S Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S Fruit flies like bananas NP S\NP (S\S)/NP NP
< >S S\S
<S
flies NP
Fruit NP
Fruit NP/NP
explored agenda unexplored
36
Agenda position
1 1.9 2
3 … … 4 … …
f(y)
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP like bananas (S\S)/NP NP
>S\S Fruit flies NP S\NP
<S Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S Fruit flies like bananas NP S\NP (S\S)/NP NP
< >S S\S
<S
like (S\S)/NP
Fruit NP
explored agenda unexplored
37
Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >
NP S\NP
<
S
38
Supertag-factored A* CCG Parser (Lewis et al, 2016):
Supertag-factored A* CCG Parser (Lewis et al, 2016):
like (S\NP)/NP
bananas NP
Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >
NP S\NP
<
S
Fruit flies like bananas NP/NP NP (S\NP)/NP NP > > NP S\NP < Sflies NP
Fruit NP/NP
39
Fruit flies like bananas (S\NP)/NP NP
>
S\NP
40
Supertag-factored A* CCG Parser (Lewis et al, 2016):
Fruit flies like bananas (S\NP)/NP NP
>
S\NP
like (S\NP)/NP
bananas NP
?
41
Supertag-factored A* CCG Parser (Lewis et al, 2016):
Fruit flies like bananas (S\NP)/NP NP
>
S\NP
like (S\NP)/NP
bananas NP
?
Fruit flies like bananas (S\NP)/NP NP > S\NP?
Fruit tag
flies tag
tag g(
tag g(
42
Supertag-factored A* CCG Parser (Lewis et al, 2016):
❖ Background: A* parsing ❖ Combined global and local parsing model ❖ Learning to search accurately and efficiently ❖ Experiments on CCGBank 43
y∈Y
❖ First explored full parse guaranteed to be optimal ❖ Global search graph is exponential in sentence length ❖ Open question: Can we still learn to search efficiently?
44
Fruit
NP/NP
flies
NP
like
(S\NP)/NP
bananas
NP NP S\NP S
Fruit flies like bananas (S\NP)/NP NP
>
S\NP
45
Non-positive global model
46
Non-positive global model
47
Any locally factored model with an admissible A* heuristic Non-positive global model
48
❖ Global expressivity ❖ Discriminative only
when necessary
❖ Limited expressivity ❖ Provides guidance with
an A* heuristic
49
Word embeddings Bidirectional LSTM Tree-LSTM Parse Scores
Fruit
NP/NP
flies
NP
like
(S\NP)/NP
bananas
NP NP S\NP S
50
Log-probability of a logistic regression layer
Fruit
NP/NP
flies
NP
like
(S\NP)/NP
bananas
NP NP S\NP S
51
❖ Global expressivity ❖ Discriminative only
when necessary
❖ Limited expressivity ❖ Provides guidance with
an A* heuristic
52
❖ Background: A* parsing ❖ Combined global and local parsing model ❖ Learning to search accurately and efficiently ❖ Experiments on CCGBank 53
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP like bananas (S\S)/NP NP
>S\S Fruit flies NP S\NP
<S Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S Fruit flies like bananas NP S\NP (S\S)/NP NP
< >S S\S
<S
explored agenda unexplored
54
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP like bananas (S\S)/NP NP
>S\S Fruit flies NP S\NP
<S Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S Fruit flies like bananas NP S\NP (S\S)/NP NP
< >S S\S
<S Agenda position
Is correct?
1 4.5 2 3.1 3 1.9 4
bananas NP
like (S\NP)/NP
Fruit NP Fruit NP/NP
f(y)
y
explored agenda unexplored
55
Agenda position
Is correct?
1 1.9 2
3 … … … 4 … … …
Fruit NP Fruit NP/NP
f(y)
y
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP like bananas (S\S)/NP NP
>S\S Fruit flies NP S\NP
<S Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S Fruit flies like bananas NP S\NP (S\S)/NP NP
< >S S\S
<S
explored agenda unexplored
56
explored agenda unexplored
Agenda position
Is correct?
1 1.9 2
3 … … … 4 … … …
Fruit NP Fruit NP/NP
f(y)
y
∅ Fruit NP flies S\NP like (S\S)/NP flies NP\NP Fruit NP/NP flies NP like (S\NP)/NP bananas NP Fruit flies NP/NP NP
>NP Fruit flies NP NP\NP
<NP like bananas (S\NP)/NP NP
>S\NP like bananas (S\S)/NP NP
>S\S Fruit flies NP S\NP
<S Fruit flies like bananas NP/NP NP (S\NP)/NP NP
> >NP S\NP
<S Fruit flies like bananas NP NP\NP (S\NP)/NP NP
< >NP S\NP
<S Fruit flies like bananas NP S\NP (S\S)/NP NP
< >S S\S
<S
Agenda violation: incorrect partial parse explored
57
58
Top of agenda Best gold partial parse
59
T
t=1
y∈At f(y) −
y∈gold(At) f(y)
Correct partial parse can still be predicted via backtracking
Agenda position
Is correct?
1 1.9 2
3 … … … 4 … … …
Fruit NP Fruit NP/NP
f(y)
y
60
Correct partial parse can still be predicted via backtracking
Agenda position
Is correct?
1 1.9 2
3 … … … 4 … … …
Fruit NP Fruit NP/NP
f(y)
y
61
❖ Background: A* parsing ❖ Combined global and local parsing model ❖ Learning to search accurately and efficiently ❖ Experiments on CCGBank 62
❖ : supertag-factored model from Lewis et al. (2016) ❖ Evaluate on CCGBank (Hockenmaier & Steedman, 2007) ❖ Comparisons:
glocal(y)
63
Clark & Curran (2007) Xu et al. (2015) Lewis et al. (2016) Vaswani et al. (2016)
Is global?
Is exact?
❖ : supertag-factored model from Lewis et al. (2016) ❖ Evaluate on CCGBank (Hockenmaier & Steedman, 2007) ❖ Comparisons:
glocal(y)
64
Clark & Curran (2007) Xu et al. (2015) Lewis et al. (2016) Vaswani et al. (2016)
Global A*
Is global?
Is exact?
Test F1 (%) 84.0 85.0 86.0 87.0 88.0 89.0
88.7 88.3 88.1 87.0 85.2
65
Clark & Curran (2007) Xu et al. (2015) Lewis et al. (2016) Vaswani et al. (2016)
Global A*
Is global?
Is exact?
Test F1 (%) 84.0 85.0 86.0 87.0 88.0 89.0
88.7 88.3 88.1 87.0 85.2
66
Clark & Curran (2007) Xu et al. (2015) Lewis et al. (2016) Vaswani et al. (2016)
Global A*
Is global?
Is exact?
❖ Optimal parse found for 99.9% of sentences ❖ Explores only 190 partial parses on average
10 20 30 87.0 87.4 87.8 88.2 88.6 89.0 10-best Reranking 100-best Reranking 4-best Beam Search Global A*
Development F1 (%) Speed (sentences / second)
27.1 4.0 0.4 3.2 88.4 88.3 88.2 87.9
67
150 300 450 600 750 87.0 87.5 88.0 88.5 89.0 Global A* Global A* without context
Development F1 (%) Number of explorations (lower is better)
610.5 309.6 88.1 88.4
flies like
(S\NP)/NP
bananas
NP S\NP
flies like
(S\NP)/NP
bananas
NP S\NP
68
The favorite U.S. small business is one whose research and development can be milked for future Japanese use. Incorrect partial parse (syntactically plausible in isolation): Input sentence:
U.S. small business is
N/N (N/N)\(N/N) N (S\NP)/NP N
< > > <
S
Heavily penalized by the global model
69
❖ Combining local and global models enables exact inference
with global features
❖ Efficient decoding by learning to search ❖ State of the art for CCG parsing ❖ Applicable to other structured prediction tasks 70