An HDP Model for Inducing Combinatory Categorial Grammars
Yonatan Bisk & Julia Hockenmaier University of Illinois at Urbana-Champaign
TACL Vol 1(2013):75−88
1 Thursday, June 13, 13
An HDP Model for Inducing Combinatory Categorial Grammars Yonatan - - PowerPoint PPT Presentation
An HDP Model for Inducing Combinatory Categorial Grammars Yonatan Bisk & Julia Hockenmaier University of Illinois at Urbana-Champaign TACL Vol 1(2013):75 88 1 Thursday, June 13, 13 PRP VBD ADJ NN She ate crunchy granola 2
Yonatan Bisk & Julia Hockenmaier University of Illinois at Urbana-Champaign
TACL Vol 1(2013):75−88
1 Thursday, June 13, 13PRP VBD ADJ NN She ate crunchy granola
Thursday, June 13, 13Dependency Grammar Induction
3
PRP VBD ADJ NN She ate crunchy granola
Thursday, June 13, 13Dependency Grammar Induction
3
PRP VBD ADJ NN She ate crunchy granola
Thursday, June 13, 13Dependency Grammar Induction
3
PRP VBD ADJ NN She ate crunchy granola
Problem for unsupervised Dependency Grammar learner: Unlabeled dependencies provide no explicit structure
Thursday, June 13, 134
PRP VBD ADJ NN She ate crunchy granola
Thursday, June 13, 134
PRP VBD ADJ NN She ate crunchy granola
NP VP S A V N N
Thursday, June 13, 134
PRP VBD ADJ NN She ate crunchy granola
NP VP S A V N N
Problem for unsupervised CFG learner: CFG symbols and rewrite rules are arbitrary
Thursday, June 13, 135
PRP VBD ADJ NN She ate crunchy granola
X6 X2 X0 X32 X4 X5 X5
Thursday, June 13, 135
PRP VBD ADJ NN She ate crunchy granola
X6 X2 X0 X32 X4 X5 X5
What kind of grammatical representation is suitable for unsupervised induction?
Thursday, June 13, 136
PRP VBD ADJ NN She ate crunchy granola
N S\N S N/N (S\N)/N N N
Thursday, June 13, 137
PRP VBD ADJ NN She ate crunchy granola
N S\N S N/N (S\N)/N N N
Thursday, June 13, 138
PRP VBD ADJ NN She ate crunchy granola
N/N (S\N)/N N N
Thursday, June 13, 139
Thursday, June 13, 13symbolic representation:
9
Thursday, June 13, 13symbolic representation:
CCG captures core dependencies CCG captures basic word order
9
Thursday, June 13, 13symbolic representation:
CCG captures core dependencies CCG captures basic word order
are heavily constrained:
9
Thursday, June 13, 13symbolic representation:
CCG captures core dependencies CCG captures basic word order
are heavily constrained:
CCG categories are functions CCG rules = function application & composition
9
Thursday, June 13, 13symbolic representation:
Makes CCG more robust than DGs
are heavily constrained:
Gives CCG a simpler probability model than CFGs
10
Thursday, June 13, 13symbolic representation:
CCG is more robust than DG on longer sentences CCG returns linguistically interpretable parses
are heavily constrained:
11
Thursday, June 13, 13symbolic representation:
CCG is more robust than DG on longer sentences CCG returns linguistically interpretable parses
are heavily constrained:
CCG has a simpler probability model than CFGs CCG allows fast variational inference
12
Thursday, June 13, 1314
Thursday, June 13, 13CCG has two atomic categories:
14
Thursday, June 13, 13CCG has two atomic categories:
14
Thursday, June 13, 13CCG has two atomic categories:
All other CCG categories are functions:
14
Thursday, June 13, 13CCG has two atomic categories:
All other CCG categories are functions:
14
Thursday, June 13, 13CCG has two atomic categories:
All other CCG categories are functions:
14
CCG has two atomic categories:
All other CCG categories are functions:
14
CCG has two atomic categories:
All other CCG categories are functions:
14
15
Thursday, June 13, 1315
Thursday, June 13, 1315
Thursday, June 13, 1315
16
Thursday, June 13, 1316
Thursday, June 13, 1316
Thursday, June 13, 1316
Bisk & Hockenmaier, AAAI 2012
Thursday, June 13, 1318
Thursday, June 13, 1318
Atomic CCG category Part-of-speech tag class
Thursday, June 13, 1318
Atomic CCG category Part-of-speech tag class S Verb Det, Noun,
Thursday, June 13, 1318
Atomic CCG category Part-of-speech tag class S Verb N Det, Noun, Pron, Num
Thursday, June 13, 1318
Atomic CCG category Part-of-speech tag class S Verb N Det, Noun, Pron, Num conj Conj
Thursday, June 13, 1319
The man ate quickly N S
Thursday, June 13, 1319
The man ate quickly N S S\N
Thursday, June 13, 1319
The man ate quickly N S ? S\N
Thursday, June 13, 1319
The man ate quickly N S ? ? S\N
Thursday, June 13, 1319
The man ate quickly N S ? S\N
Thursday, June 13, 1319
The man ate quickly N S ? N/N S\N
Thursday, June 13, 1319
The man ate quickly N S N/N S\N
Thursday, June 13, 1319
The man ate quickly N S N/N S\S S\N
Thursday, June 13, 1319
The man ate quickly N S N/N S\S S/S N\N S\N ...
Thursday, June 13, 1321
Thursday, June 13, 1321
Nonparametric Bayesian model
Thursday, June 13, 1321
Nonparametric Bayesian model
We do not need to fix the category inventory in advance
Thursday, June 13, 1321
Nonparametric Bayesian model
We do not need to fix the category inventory in advance
Hierarchical model
Thursday, June 13, 1321
Nonparametric Bayesian model
We do not need to fix the category inventory in advance
Hierarchical model
All distributions share a common base
Thursday, June 13, 1321
Nonparametric Bayesian model
We do not need to fix the category inventory in advance
Hierarchical model
All distributions share a common base Parameter tying (smoothing)
Thursday, June 13, 1322
Liang et al. 2009
Thursday, June 13, 1322
X0
Liang et al. 2009
Thursday, June 13, 1322
X0 X2 X5
Liang et al. 2009
Thursday, June 13, 1322
X0 X2 X5 X6 X4
Liang et al. 2009
Thursday, June 13, 1322
X0 X2 X5 X6 X4 X32 X5
Liang et al. 2009
Thursday, June 13, 1323
Thursday, June 13, 13X1 X2 X3 X4 X5 X6 X7 X8 X9 ... X1 X2 X3 X4 X5 X6 X7 X8 X9 ...
23
Thursday, June 13, 13X1 X2 X3 X4 X5 X6 X7 X8 X9 ... X1 X2 X3 X4 X5 X6 X7 X8 X9 ...
? ? ? ? ? ? ? ? ? ?
23
Thursday, June 13, 13X1 X2 X3 X4 X5 X6 X7 X8 X9 ... X1 X2 X3 X4 X5 X6 X7 X8 X9 ...
? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
23
Thursday, June 13, 13X1 X2 X3 X4 X5 X6 X7 X8 X9 ... X1 X2 X3 X4 X5 X6 X7 X8 X9 ...
? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
23
Thursday, June 13, 13X1 X2 X3 X4 X5 X6 X7 X8 X9 ... X1 X2 X3 X4 X5 X6 X7 X8 X9 ...
? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
23
Problem for nonparametric PCFG models: Each LHS nonterminal Xi is allowed a doubly infinite cross-product
S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ... S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ...
? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
24
Thursday, June 13, 13S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ... S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ...
? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
24
Thursday, June 13, 13S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ... S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ...
? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
24
Thursday, June 13, 13S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ... S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ...
? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
24
Thursday, June 13, 13S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ... S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ...
? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
24
Thursday, June 13, 13S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ... S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ...
? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
24
Thursday, June 13, 13S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ... S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ...
? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
24
Thursday, June 13, 13S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ... S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ...
? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
24
Thursday, June 13, 13S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ... S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ...
? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
24
Thursday, June 13, 1325
Thursday, June 13, 13Parent Combinator Left Right
(S\N)/N N >B ((S\N)/N)/Y Y (S\N)/N /N (S\N)/N \N)/N (S\N)/N ((S\N)/N)\Y (S\N)/N (S\N)\Y (S\N)/N S\Y
25
Thursday, June 13, 13Parent Combinator Left Right
(S\N)/N N >B0 ((S\N)/N)/Y Y (S\N)/N >B (S\N)/Y Y/N (S\N)/N \N)/N (S\N)/N ((S\N)/N)\Y (S\N)/N (S\N)\Y (S\N)/N S\Y
25
Thursday, June 13, 13Parent Combinator Left Right
(S\N)/N N >B0 ((S\N)/N)/Y Y (S\N)/N >B1 (S\N)/Y Y/N (S\N)/N >B S\Y (Y\N)/N (S\N)/N ((S\N)/N)\Y (S\N)/N (S\N)\Y (S\N)/N S\Y
25
Thursday, June 13, 13Parent Combinator Left Right
(S\N)/N N >B0 ((S\N)/N)/Y Y (S\N)/N >B1 (S\N)/Y Y/N (S\N)/N >B2 S\Y (Y\N)/N (S\N)/N <B Y ((S\N)/N)\Y (S\N)/N (S\N)\Y (S\N)/N S\Y
25
Thursday, June 13, 13Parent Combinator Left Right
(S\N)/N N >B0 ((S\N)/N)/Y Y (S\N)/N >B1 (S\N)/Y Y/N (S\N)/N >B2 S\Y (Y\N)/N (S\N)/N <B0 Y ((S\N)/N)\Y (S\N)/N <B Y/N (S\N)\Y (S\N)/N S\Y
25
Thursday, June 13, 13Parent Combinator Left Right
(S\N)/N N >B0 ((S\N)/N)/Y Y (S\N)/N >B1 (S\N)/Y Y/N (S\N)/N >B2 S\Y (Y\N)/N (S\N)/N <B0 Y ((S\N)/N)\Y (S\N)/N <B1 Y/N (S\N)\Y (S\N)/N <B (Y\N)/N S\Y
25
Thursday, June 13, 13Parent Combinator Left Right
(S\N)/N N >B0 ((S\N)/N)/Y Y (S\N)/N >B1 (S\N)/Y Y/N (S\N)/N >B2 S\Y (Y\N)/N (S\N)/N <B0 Y ((S\N)/N)\Y (S\N)/N <B1 Y/N (S\N)\Y (S\N)/N <B2 (Y\N)/N S\Y
25
Thursday, June 13, 1326
Thursday, June 13, 13Parent Y
(S\N)/N S (S\N)/N S (S\N)/N S (S\N)/N S (S\N)/N S (S\N)/N S
26
Thursday, June 13, 13Parent Y Combinator
(S\N)/N S (S\N)/N S (S\N)/N S (S\N)/N S (S\N)/N S (S\N)/N S
26
Thursday, June 13, 13Parent Y Combinator Left
(S\N)/N S >B0 ((S\N)/N)/ (S\N)/N S >B1 (S\N)/ (S\N)/N S >B2 S\ (S\N)/N S <B0 S (S\N)/N S <B1 S/N (S\N)/N S <B2 (S\N)/N
26
Thursday, June 13, 13Parent Y Combinator Left Right
(S\N)/N S >B0 ((S\N)/N)/S S (S\N)/N S >B1 (S\N)/S S/N (S\N)/N S >B2 S\S (S\N)/N (S\N)/N S <B0 S ((S\N)/N)\S (S\N)/N S <B1 S/N (S\N)\S (S\N)/N S <B2 (S\N)/N S\S
26
Thursday, June 13, 13Parent Y Combinator Left Right
(S\N)/N S >B0 ((S\N)/N)/S S (S\N)/N S >B1 (S\N)/S S/N (S\N)/N S >B2 S\S (S\N)/N (S\N)/N S <B0 S ((S\N)/N)\S (S\N)/N S <B1 S/N (S\N)\S (S\N)/N S <B2 (S\N)/N S\S
26
CCG rules are heavily constrained:
For a given parent category, the Y category and combinator determine both children
Thursday, June 13, 1327
Thursday, June 13, 1327
S
Thursday, June 13, 1327
S
Y = N Combinator = <B0
Thursday, June 13, 1327
S S\N N
Y = N Combinator = <B0
Thursday, June 13, 1327
S S\N N
Thursday, June 13, 1327
S N (S\N)/N S\N N S\N
Y = N Combinator = >B0
Thursday, June 13, 1327
S N (S\N)/N S\N N S\N
Thursday, June 13, 1327
S N (S\N)/N S\N N S\N N/N N N (S\N)/N
Y = N Combinator = >B0
Thursday, June 13, 1327
S N (S\N)/N S\N N S\N N/N N N (S\N)/N
Thursday, June 13, 1328
Thursday, June 13, 13CFG: doubly infinite P(Xi →Xj Xk| Xi )
28
Thursday, June 13, 13CFG: doubly infinite P(Xi →Xj Xk| Xi )
X1 X2 X3 X4 X5 X6 X7 X8 X9 ... X1 ? ? ? ? ? ? ? ? ? ? X2 ? ? ? ? ? ? ? ? ? ? X3 ? ? ? ? ? ? ? ? ? ? X4 ? ? ? ? ? ? ? ? ? ? X5 ? ? ? ? ? ? ? ? ? ? X6 ? ? ? ? ? ? ? ? ? ? X7 ? ? ? ? ? ? ? ? ? ? X8 ? ? ? ? ? ? ? ? ? ? X9 ? ? ? ? ? ? ? ? ? ? ... ? ? ? ? ? ? ? ? ? ?28
Thursday, June 13, 13CFG: doubly infinite P(Xi →Xj Xk| Xi ) CCG: infinite P( Y | Xi ) and finite P( c | Y, Xi)
X1 X2 X3 X4 X5 X6 X7 X8 X9 ... X1 ? ? ? ? ? ? ? ? ? ? X2 ? ? ? ? ? ? ? ? ? ? X3 ? ? ? ? ? ? ? ? ? ? X4 ? ? ? ? ? ? ? ? ? ? X5 ? ? ? ? ? ? ? ? ? ? X6 ? ? ? ? ? ? ? ? ? ? X7 ? ? ? ? ? ? ? ? ? ? X8 ? ? ? ? ? ? ? ? ? ? X9 ? ? ? ? ? ? ? ? ? ? ... ? ? ? ? ? ? ? ? ? ?28
Thursday, June 13, 13CFG: doubly infinite P(Xi →Xj Xk| Xi ) CCG: infinite P( Y | Xi ) and finite P( c | Y, Xi)
S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ... X1 X2 X3 X4 X5 X6 X7 X8 X9 ... X1 ? ? ? ? ? ? ? ? ? ? X2 ? ? ? ? ? ? ? ? ? ? X3 ? ? ? ? ? ? ? ? ? ? X4 ? ? ? ? ? ? ? ? ? ? X5 ? ? ? ? ? ? ? ? ? ? X6 ? ? ? ? ? ? ? ? ? ? X7 ? ? ? ? ? ? ? ? ? ? X8 ? ? ? ? ? ? ? ? ? ? X9 ? ? ? ? ? ? ? ? ? ? ... ? ? ? ? ? ? ? ? ? ?28
Thursday, June 13, 13CFG: doubly infinite P(Xi →Xj Xk| Xi ) CCG: infinite P( Y | Xi ) and finite P( c | Y, Xi)
S N S/S S\S S/N S\N (S\N)/N (S\N)\S (S\N)\N ... X1 X2 X3 X4 X5 X6 X7 X8 X9 ... X1 ? ? ? ? ? ? ? ? ? ? X2 ? ? ? ? ? ? ? ? ? ? X3 ? ? ? ? ? ? ? ? ? ? X4 ? ? ? ? ? ? ? ? ? ? X5 ? ? ? ? ? ? ? ? ? ? X6 ? ? ? ? ? ? ? ? ? ? X7 ? ? ? ? ? ? ? ? ? ? X8 ? ? ? ? ? ? ? ? ? ? X9 ? ? ? ? ? ? ? ? ? ? ... ? ? ? ? ? ? ? ? ? ?28
The HDP-CFG base measure requires ββT The HDP-CCG base measure is the standard β ~ GEM(α) (akin to e.g. HDP-HMMs)
Thursday, June 13, 1329
Thursday, June 13, 13Computation parallels Inside-Outside:
29
Thursday, June 13, 13Computation parallels Inside-Outside:
29
WP(Y) = Ψ(C(P,Y)+αPβY)−Ψ(C(P,∗)+αP)
Thursday, June 13, 13Computation parallels Inside-Outside: Trivially parallelizeable; efficient
29
WP(Y) = Ψ(C(P,Y)+αPβY)−Ψ(C(P,∗)+αP)
Thursday, June 13, 13Computation parallels Inside-Outside: Trivially parallelizeable; efficient
1 min – 4 hrs
29
WP(Y) = Ψ(C(P,Y)+αPβY)−Ψ(C(P,∗)+αP)
Thursday, June 13, 1331
Thursday, June 13, 1331
WSJ comparison with Naseem et al. 2010’s Universal dependency grammar
Thursday, June 13, 1331
WSJ comparison with Naseem et al. 2010’s Universal dependency grammar
Trained and tested on rained and tested on ≤ 10 ≤ 20
Thursday, June 13, 1331
WSJ comparison with Naseem et al. 2010’s Universal dependency grammar
Trained and tested on rained and tested on ≤ 10 ≤ 20 Naseem et al. 71.9 50.4
Thursday, June 13, 1331
WSJ comparison with Naseem et al. 2010’s Universal dependency grammar
Trained and tested on rained and tested on ≤ 10 ≤ 20 Naseem et al. 71.9 50.4 HDP-CCG 68.2 64.2
Thursday, June 13, 1332
Trained and tested on rained and tested on ≤ 10 ≤ 20 Naseem et al. 71.9 50.4 HDP-CCG 68.2 64.2
Thursday, June 13, 1332
Can long sentences help performance
Trained and tested on rained and tested on ≤ 10 ≤ 20 Naseem et al. 71.9 50.4 HDP-CCG 68.2 64.2
Thursday, June 13, 1332
Can long sentences help performance
Yes! HDP-CCG achieves 71.9 on ≤10 if trained on ≤20
Trained and tested on rained and tested on ≤ 10 ≤ 20 Naseem et al. 71.9 50.4 HDP-CCG 68.2 64.2
Thursday, June 13, 1332
Can long sentences help performance
Yes! HDP-CCG achieves 71.9 on ≤10 if trained on ≤20
Trained and tested on rained and tested on ≤ 10 ≤ 20 Naseem et al. 71.9 50.4 HDP-CCG 68.2 64.2
Thursday, June 13, 1333
* Max over all best performing systems (extra data, tuning, etc.)
Thursday, June 13, 13NAACL WILS Shared Task 2012
33
* Max over all best performing systems (extra data, tuning, etc.)
Thursday, June 13, 13NAACL WILS Shared Task 2012 Average ≤10 accuracy on 10 languages
33
* Max over all best performing systems (extra data, tuning, etc.)
Thursday, June 13, 13NAACL WILS Shared Task 2012 Average ≤10 accuracy on 10 languages
(Arabic, Danish, Slovene, Swedish, Dutch, Basque, Portuguese, WSJ, CHILDES, Czech)
33
* Max over all best performing systems (extra data, tuning, etc.)
Thursday, June 13, 13NAACL WILS Shared Task 2012 Average ≤10 accuracy on 10 languages
(Arabic, Danish, Slovene, Swedish, Dutch, Basque, Portuguese, WSJ, CHILDES, Czech)
33
Dependencies Dependencies CCG CCG: new model Dependencies Dependencies
Bisk & Blunsom & Cohn 2010
State of the Art*
Bisk & Hockenmaier 2012
55.2 62.3 54.2
* Max over all best performing systems (extra data, tuning, etc.)
Thursday, June 13, 13NAACL WILS Shared Task 2012 Average ≤10 accuracy on 10 languages
(Arabic, Danish, Slovene, Swedish, Dutch, Basque, Portuguese, WSJ, CHILDES, Czech)
33
Dependencies Dependencies CCG CCG: new model CCG: new model Dependencies Dependencies
Bisk & Blunsom & Cohn 2010
State of the Art*
Bisk & Hockenmaier 2012
MLE
55.2 62.3 54.2 50.9
* Max over all best performing systems (extra data, tuning, etc.)
Thursday, June 13, 13NAACL WILS Shared Task 2012 Average ≤10 accuracy on 10 languages
(Arabic, Danish, Slovene, Swedish, Dutch, Basque, Portuguese, WSJ, CHILDES, Czech)
33
Dependencies Dependencies CCG CCG: new model CCG: new model Dependencies Dependencies
Bisk &
HDP-
Blunsom & Cohn 2010
State of the Art*
Bisk & Hockenmaier 2012
MLE HDP- CCG
55.2 62.3 54.2 50.9 64.5
* Max over all best performing systems (extra data, tuning, etc.)
Thursday, June 13, 13English Big Ball N/N N
34
Obj Adj
Thursday, June 13, 13English Arabic Big Ball N/N N ةركةريبك N N\N (ball) (big)
Obj Adj
34
Obj Adj
Thursday, June 13, 13The man wrote a letter N (S\N)/N N
35
English
O V S
Thursday, June 13, 13Child Directed Speech
The man wrote a letter N (S\N)/N N
∅ write a letter S/N
N
35
English
O V ∅ O V S
Thursday, June 13, 13Child Directed Speech Arabic
The man wrote a letter N (S\N)/N N
∅ write a letter S/N
N
بتكلاجرلاةلاسر (S/N)/N N N
(wrote) (the man) (a letter)
O V S
35
English
O V ∅ O V S
Thursday, June 13, 13Induced Lexicons: Adpositions
English ran
beach (S\N)/N (S\S)/N N
V O ADP
36
Thursday, June 13, 13Induced Lexicons: Adpositions
English Japanese ran
beach (S\N)/N (S\S)/N N 浜 を 走った N (S/S)\N (S\N)/N
(beach) (on) (ran)
V O ADP V O ADP
36
Thursday, June 13, 1337
Thursday, June 13, 13A new probability model for CCG
37
Thursday, June 13, 13A new probability model for CCG
37
Thursday, June 13, 13A new probability model for CCG
37
Thursday, June 13, 13A new probability model for CCG
State-of-the-Art accuracy
37
Thursday, June 13, 13A new probability model for CCG
State-of-the-Art accuracy
37
Thursday, June 13, 13A new probability model for CCG
State-of-the-Art accuracy
37
Thursday, June 13, 13A new probability model for CCG
State-of-the-Art accuracy
37
Thursday, June 13, 1338
Thursday, June 13, 13beyond context-free CCG fragment
38
Thursday, June 13, 13beyond context-free CCG fragment
generating words (not just POS tags)
38
Thursday, June 13, 13beyond context-free CCG fragment
generating words (not just POS tags)
38
Thursday, June 13, 13beyond context-free CCG fragment
generating words (not just POS tags)
38
Thank you!
Thursday, June 13, 13