Statistical Phrase-Based Translation
Philipp Koehn, Franz Och, Daniel Marcu
koehn@isi.edu, och@isi.edu, marcu@isi.edu
Information Sciences Institute University of Southern California
– p.1
Statistical Phrase-Based Translation Philipp Koehn, Franz Och, - - PowerPoint PPT Presentation
Statistical Phrase-Based Translation Philipp Koehn, Franz Och, Daniel Marcu koehn@isi.edu, och@isi.edu, marcu@isi.edu Information Sciences Institute University of Southern California p.1 Statistical Phrase-Based Translation p
– p.1
– p.2
– p.3
– p.4
– p.5
e: Mary f: *-------- p: .534 e: witch f: -------*- p: .182 e: f: ---------- p: 1 e: ... did f: *-------- p: .122 e: ... slap f: *-***---- p: .043
– p.6
– p.7
– p.8
– p.9
Maria no daba una bofetada a la bruja verde Mary witch green the slap not did
– p.10
– p.11
– p.12
– p.13
10k 20k 40k 80k 160k 320k .18 .19 .20 .21 .22 .23 .24 .25 .26 .27 Training Corpus Size BLEU WAIPh
✂ ✂ ✂ ✂ ✂ ✂Joint
✂ ✂ ✂ ✂ ✂ ✂Syn
✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂M4
– p.14
– p.15
10k 20k 40k 80k 160k 320k .21 .22 .23 .24 .25 .26 .27 Training Corpus Size BLEU
✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂max2 max3 max4 max5 max7
– p.16
– p.17
10k 20k 40k 80k 160k 320k .21 .22 .23 .24 .25 .26 .27 .28 Training Corpus Size BLEU
✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂no-lex lex
– p.18
Maria no daba una bofetada a la bruja verde Mary witch green the slap not did
– p.19
– p.20
10k 20k 40k 80k 160k 320k .20 .21 .22 .23 .24 .25 .26 .27 .28 Training Corpus Size BLEU
✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂diag-and diag base e2f f2e union
– p.21
10k 20k 40k 80k 160k 320k .20 .21 .22 .23 .24 .25 .26 .27 .28 Training Corpus Size BLEU
✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂m4 m3 m2 m1
– p.22
– p.23
– p.24