Inversion Transduction Grammars
Wilker Aziz 3/5/17
Word-based Translation
Mary did not slap the green witch
Mary no dió una bofetada a la bruja verde
Every French word is generated by an English word (or null)
Mary did not slap the green witch
Mary did not slap slap slap the green witch
Mary did not slap slap slap the green witch NULL
Mary no dió una bofetada a la verde bruja
Mary no dió una bofetada a la bruja verde
respect to word order
all the m! permutations
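To give a sense of how quickly the space of word orders grows, here is a small sketch that counts the m! permutations of an m-word sentence. The function name `num_permutations` is only illustrative and does not come from the slides.

```python
from math import factorial


def num_permutations(m: int) -> int:
    """Number of distinct orderings of m word positions (m!)."""
    return factorial(m)


# Print the count for a few sentence lengths to show the growth.
for m in (5, 10, 20):
    print(m, num_permutations(m))
```

Even for a 20-word sentence the count is far beyond what can be enumerated, which is why unconstrained reordering is a problem.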
If we do not impose structural constraints (yet they do exist)
how not to violate them
Estimation
are not plausible (unlikely to be observed)
Generation
NP-complete problem
Let Σn = {a1, ..., an}
Regular grammar (there is an equivalent FSA)
Asveld (2006, 2008)
Note that nonterminals are indexed by subsets of Σn, i.e. by elements of the power set of Σn, so the number of nonterminals grows exponentially in n
S ➝ A123
A123 ➝ a1 A23 | a2 A13 | a3 A12
A12 ➝ a1 A2 | a2 A1
A13 ➝ a1 A3 | a3 A1
A23 ➝ a2 A3 | a3 A2
A1 ➝ a1
A2 ➝ a2
A3 ➝ a3
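The grammar for n = 3 can be built mechanically for any n. The sketch below is one possible construction, assuming nonterminals are represented as frozensets of the symbol indices still to be emitted; the names `permutation_grammar` and `generate` are hypothetical. The start symbol S is left implicit, since it only rewrites to the full set.

```python
from itertools import permutations


def permutation_grammar(n):
    """Build a right-linear grammar generating the permutations of a1..an.

    Each nonterminal is a frozenset of indices that remain to be emitted.
    Returns a dict: nonterminal -> list of (terminal_index, next_nonterminal),
    where next_nonterminal is None for terminating rules such as A1 -> a1.
    """
    rules = {}
    agenda = [frozenset(range(1, n + 1))]
    while agenda:
        subset = agenda.pop()
        if subset in rules:
            continue
        rules[subset] = []
        for i in sorted(subset):
            rest = subset - {i}
            rules[subset].append((i, rest if rest else None))
            if rest:
                agenda.append(rest)
    return rules


def generate(rules, subset):
    """Enumerate every string (tuple of indices) derivable from a nonterminal."""
    for i, rest in rules[subset]:
        if rest is None:
            yield (i,)
        else:
            for tail in generate(rules, rest):
                yield (i,) + tail


rules = permutation_grammar(3)
language = set(generate(rules, frozenset({1, 2, 3})))
print(len(rules), "nonterminals;", len(language), "strings")
print(language == set(permutations(range(1, 4))))
```

Because there is one nonterminal per non-empty subset, the grammar has 2^n - 1 nonterminals (excluding S), which makes the exponential size explicit.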
Distortion limit in generation but not in estimation
Inversion Transduction Grammars (ITGs) [Wu, 1995; 1997]
[Wu, 1997]
English        French
S ➝ X          X          copy
X ➝ X1 X2      X1 X2      copy
               X2 X1      invert
X ➝ e          f          transduce
X ➝ e          ε          delete
X ➝ ε          f          insert
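As an illustration of how these rules pair an English string with a French one, the sketch below computes both yields of a derivation tree. The tuple encoding of trees and the function name `yields` are assumptions made for this example, and `None` stands in for ε.

```python
def yields(node):
    """Return (english_words, french_words) for an ITG derivation tree.

    A node is either ("term", e, f), with None standing for ε,
    or ("copy" | "invert", left_child, right_child).
    """
    kind = node[0]
    if kind == "term":  # transduce, delete (f is None) or insert (e is None)
        _, e, f = node
        return ([e] if e is not None else [], [f] if f is not None else [])
    _, left, right = node
    e1, f1 = yields(left)
    e2, f2 = yields(right)
    if kind == "copy":  # children keep the same order in both languages
        return e1 + e2, f1 + f2
    if kind == "invert":  # children are swapped on the French side only
        return e1 + e2, f2 + f1
    raise ValueError(f"unknown node type: {kind}")


tree = (
    "copy",
    ("term", "the", "la"),
    ("invert", ("term", "green", "verde"), ("term", "witch", "bruja")),
)
english, french = yields(tree)
print(" ".join(english), "|", " ".join(french))
```

The single invert node is enough to turn the English order "green witch" into "bruja verde" while the determiner stays in place.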
[Figure: English "I really miss you" paired with Portuguese "Sinto tanto sua falta"]
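Random variables: A (alignment), B (bracketing), E (English), F (French)
Joint probability model $P(T) = P(A, B, E, F)$
$t = \langle r_1, \ldots, r_n \rangle$, $e = \mathrm{yield}_1(t)$, $f = \mathrm{yield}_2(t)$, $a = \mathrm{alignment}(t)$, $b = \mathrm{bracketing}(t)$
$$P(T = t) = P(A = a, B = b, E = e, F = f) = \prod_{i=1}^{n} \theta_{r_i}$$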
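The product over rule parameters can be sketched as follows. The rule strings and the parameter values in `theta` are made up for illustration (they are not estimates from any data), and the computation is done in log space to avoid underflow on longer derivations.

```python
import math


def derivation_log_prob(derivation, theta):
    """Log-probability of a derivation: sum of log θ_r over its rules."""
    return sum(math.log(theta[r]) for r in derivation)


# Hypothetical parameters; the rules with left-hand side X sum to 1.
theta = {
    "S -> X": 1.0,
    "X -> [X X]": 0.3,   # copy
    "X -> <X X>": 0.2,   # invert
    "X -> the/la": 0.2,
    "X -> green/verde": 0.1,
    "X -> witch/bruja": 0.2,
}

derivation = [
    "S -> X",
    "X -> [X X]",
    "X -> the/la",
    "X -> <X X>",
    "X -> green/verde",
    "X -> witch/bruja",
]

log_p = derivation_log_prob(derivation, theta)
print(math.exp(log_p))
```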
Multinomial: one parameter per rule
We do not typically construct treebanks of ITG trees
Expectations from parse forests
Typically initialised with IBM1
$$\theta_{X \to \alpha} = \frac{\langle n(X \to \alpha) \rangle_{P(A, B \mid F, E)}}{\sum_{\alpha'} \langle n(X \to \alpha') \rangle_{P(A, B \mid F, E)}}$$
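The update above is a normalisation of expected rule counts per left-hand side. A minimal sketch of that step, assuming the expected counts have already been collected from the parse forests (the counts below are invented, and `m_step` is a hypothetical name):

```python
from collections import defaultdict


def m_step(expected_counts):
    """Normalise expected counts per left-hand side.

    expected_counts maps (lhs, rhs) -> expected count under P(A, B | F, E).
    Assumes every left-hand side has a strictly positive total count.
    """
    totals = defaultdict(float)
    for (lhs, _rhs), count in expected_counts.items():
        totals[lhs] += count
    return {
        (lhs, rhs): count / totals[lhs]
        for (lhs, rhs), count in expected_counts.items()
    }


# Invented expected counts, for illustration only.
counts = {
    ("S", "X"): 2.0,
    ("X", "[X X]"): 3.0,
    ("X", "<X X>"): 1.0,
    ("X", "the/la"): 4.0,
}
theta = m_step(counts)
print(theta)
```

Computing the expectations themselves (the E-step) requires inside-outside over the bitext parse forest and is not shown here.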
Inference: complexity O(l^3 m^3)
Model: too few reordering parameters
Decisions: ambiguity
$$\arg\max_{A} P(A \mid F, E) = \arg\max_{A} \sum_{B} P(A, B \mid F, E) \approx \arg\max_{A, B} P(A, B \mid F, E)$$
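The approximation matters because one alignment can be supported by several bracketings. The toy example below, with an invented posterior over (alignment, bracketing) pairs, shows a case where the best single derivation picks a different alignment than the one obtained by summing over bracketings. All names and numbers are hypothetical.

```python
from collections import defaultdict


def best_alignment_marginal(joint):
    """arg max over A of the sum over B of P(A, B | F, E)."""
    marginal = defaultdict(float)
    for (a, _b), p in joint.items():
        marginal[a] += p
    return max(marginal, key=marginal.get)


def best_alignment_viterbi(joint):
    """Alignment of the single most probable (A, B) pair."""
    a, _b = max(joint, key=joint.get)
    return a


# Invented posterior: a2 has two bracketings that share its probability mass.
joint = {
    ("a1", "b1"): 0.4,
    ("a2", "b1"): 0.3,
    ("a2", "b2"): 0.3,
}

print(best_alignment_marginal(joint), best_alignment_viterbi(joint))
```

Here the marginal favours a2 (total mass 0.6) while the Viterbi derivation favours a1 (0.4), so the two decision rules disagree.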
Computational Linguistics. MIT Press.
statistical machine translation as a traveling salesman problem. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference
Proceedings of ACL-08: HLT.
Normal Form. In Theoretical Computer Science. Elsevier Science Publishers Ltd.
Normal Form. In Theoretical Computer Science. Elsevier Science Publishers Ltd.
Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics. ACL.
Parsing of Parallel Corpora. In Computational Linguistics. MIT Press.
Proceedings of the Spring Conference of the Acoustical Society of America.
free grammars using the inside--outside algorithm. In Computer Speech and Language.
disambiguation by means of tree-grammars. In Proceedings of the 16th conference on Computational linguistics - Volume 2.