Joint work with Kevin Knight (ISI), Aravind Joshi (Penn), Haitao Mi and Qun Liu (ICT)
UC Berkeley, Feb 6, 2009
Tree-based and Forest-Based Translation
Liang Huang
Tree-based and Forest-Based Translation Liang Huang Joint work - - PowerPoint PPT Presentation
Tree-based and Forest-Based Translation Liang Huang Joint work with Kevin Knight (ISI), Aravind Joshi (Penn), Haitao Mi and Qun Liu (ICT) UC Berkeley, Feb 6, 2009 Translation is hard! zi zhu zhong duan self help
Joint work with Kevin Knight (ISI), Aravind Joshi (Penn), Haitao Mi and Qun Liu (ICT)
UC Berkeley, Feb 6, 2009
Liang Huang
2
(ATM, “self-service terminal”)
help oneself terminating machine
3
3
3
3
4
4
clear evidence that MT is used in real life.
5
Bush hold
and/ with
meeting Sharon
[past.]
布什 举行
与
会谈 沙龙
了
Bùshí juxíng
yu
huìtán Shalóng
le
5
Bush hold
and/ with
meeting Sharon
[past.]
布什 举行
与
会谈 沙龙
了
Bùshí juxíng
yu
huìtán Shalóng
le
5
Bush hold
and/ with
meeting Sharon
[past.]
“Bush held a meeting with Sharon” 布什 举行
与
会谈 沙龙
了
Bùshí juxíng
yu
huìtán Shalóng
le
6
x3 = y + 3;
6
x3 = y + 3;
6
x3 = y + 3;
LD R1, id2 ADDF R1, R1, #3.0 // add float RTOI R2, R1 // real to int ST id1, R2
6
x3 = y + 3;
LD R1, id2 ADDF R1, R1, #3.0 // add float RTOI R2, R1 // real to int ST id1, R2
7
Bush hold
and/ with
meeting Sharon [past.]
(Irons 1961; Lewis, Stearns 1968; Aho, Ullman 1972) ==> (Huang, Knight, Joshi 2006)
7
Bush hold
and/ with
meeting Sharon [past.]
(Irons 1961; Lewis, Stearns 1968; Aho, Ullman 1972) ==> (Huang, Knight, Joshi 2006)
8
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
8
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
8
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
8
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
9
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
9
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
9
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
9
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
9
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
10
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
10
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
11
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
11
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
(beyond CFG)
12
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
(beyond CFG)
12
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
(beyond CFG)
12
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
(beyond CFG)
12
(Huang, Knight, Joshi 2006); rules from (Galley et al., 04)
13
13
emergency exit
13
emergency exit
mind your head
14
14
14
14
14
14
NLP == dealing with ambiguities.
15
15
15
15
16
17
(Earley 1970; Billot and Lang 1989)
18
(Klein and Manning, 2001; Huang and Chiang, 2005)
0 I 1 saw 2 him 3 with 4 a 5 mirror 6
18
(Klein and Manning, 2001; Huang and Chiang, 2005)
0 I 1 saw 2 him 3 with 4 a 5 mirror 6
nodes hyperedges
a hypergraph
19
“and” / “with”
20
“and” / “with”
布什 举行 与 会谈 沙龙 了
“and”
20
pattern-matching
(linear-time in forest size)
“and” / “with”
布什 举行 与 会谈 沙龙 了
与
“and”
20
pattern-matching
(linear-time in forest size)
“and” / “with”
布什 举行 与 会谈 沙龙 了
与
“and”
21
pattern-matching
(linear-time in forest size)
“and” / “with”
布什 举行 与 会谈 沙龙 了
与
“and”
21
pattern-matching
(linear-time in forest size)
“and” / “with”
布什 举行 与 会谈 沙龙 了
与
“and”
21
pattern-matching
(linear-time in forest size)
“and” / “with”
布什 举行 与 会谈 沙龙 了
与
“and”
21
pattern-matching
(linear-time in forest size)
“and” / “with”
布什 举行 与 会谈 沙龙 了
与
22
22
22
“held a meeting”
“Sharon” “Bush”
22
“held a meeting”
“Sharon” “Bush”
“Bush held a meeting with Sharon”
23
parse forest translation forest translation+LM forest
parser
packed forests
(Huang and Chiang, 2005; 2007; Chiang, 2007)
input sentence 1-best translation k-best translations
pattern-matching w/ translation rules (exact) integrating language models (cube pruning)
24
parse forest translation forest translation+LM forest
parser pattern-matching w/ translation rules (exact) integrating language models (cube pruning)
packed forests
(Huang and Chiang, 2005; 2007; Chiang, 2007)
input sentence 1-best translation k-best translations
pruned forest
forest pruning
αβ(e) - αβ(TOP) > p for some threshold p
25
...
v u w
e
α(v) β(u) inside β(w) inside
Jonathan Graehl: relatively useless pruning
26
27
1.7 Bleu improvement over 1-best, 0.8 over 30-best, and even faster!
k = ~6.1×108 trees ~2×104 trees
28
32% beyond 100-best 20% beyond 1000-best 1000
suggested by Mark Johnson
(~6.1×108 -best)
xiǎoxīn
小心 X <=> be careful not to X
xiǎoxīn
小心 X <=> be careful not to X
xiǎoxīn gǒu
小心 狗 <=> be aware of dog
xiǎoxīn
小心 X <=> be careful not to X 小心 VP <=> be careful not to VP 小心 NP <=> be careful of NP . . .
xiǎoxīn gǒu
小心 狗 <=> be aware of dog
30
31
GHKM - (Galley et al 2004; 2006)
32
GHKM - (Galley et al 2004; 2006)
32
GHKM - (Galley et al 2004; 2006)
admissible set
32
GHKM - (Galley et al 2004; 2006)
admissible set
33
GHKM - (Galley et al 2004; 2006)
admissible set
33
GHKM - (Galley et al 2004; 2006)
admissible set
34
also in (Wang, Knight, Marcu, 2007)
35
also in (Wang, Knight, Marcu, 2007)
35
also in (Wang, Knight, Marcu, 2007)
35
also in (Wang, Knight, Marcu, 2007)
36
36
36
36
37
37
37
source sentence
training time
target sentence
aligner
word alignment
rule extractor
translation ruleset
parser
1-best/ forest
source sentence
training time
target sentence
aligner
word alignment
rule extractor
translation ruleset
translation time
pattern- matcher
target sentence
parser
1-best/ forest
source sentence
parser
1-best/forest
39
1.0 Bleu improvement over 1-best, twice as fast as 30-best extraction
~108 trees
40
0.2560 0.2674 0.2634 0.2767 0.2679 0.2816 0.2738
rules from ... translating on ...
Bàowēir shūo yǔ Alāfǎtè huìtán hěn zhòngyào Powell say with Arafat talk very important
41
42
More “forest-based” algorithms in my thesis (this talk is about Chap. 6).
self-service terminals carefully slide http://translate.google.com
self-service terminals carefully slide http://translate.google.com
self-service terminals carefully slide http://translate.google.com
45
improvement
0.2666 0.2939 0.2755 0.3084 0.2839 0.3149 1.7
2.1