Decoding in SMT
Nitin Madnani February 8, 2006
The Decoding Problem
Search
Inputs:
  Input string
  Bunch of statistical models
  A function to assign a score to any translation
Output:
  Best-scoring translation
ê = “Best” Translation
Search operation over the search space (all possible translations), using Score (models, candidate, input string)
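To make the search picture concrete, here is a minimal sketch of exhaustive search over a tiny, explicitly listed search space. Everything in it is an assumption for illustration: the names `toy_score` and `search`, the `"lexicon"` model, and the convention that a higher score is better. Real decoders cannot enumerate all possible translations this way.

```python
def toy_score(models, candidate, source):
    """Hypothetical scorer: product of word-pair probabilities, aligned by position."""
    if len(candidate) != len(source):
        return 0.0
    score = 1.0
    for f, e in zip(source, candidate):
        # Unknown word pairs get probability 0 (an assumption of this toy model).
        score *= models["lexicon"].get((f, e), 0.0)
    return score


def search(models, source, search_space, score=toy_score):
    """Score every candidate in the search space and return the highest-scoring one."""
    return max(search_space, key=lambda candidate: score(models, candidate, source))


if __name__ == "__main__":
    models = {"lexicon": {("la", "the"): 0.9, ("casa", "house"): 0.8, ("casa", "home"): 0.2}}
    source = ("la", "casa")
    search_space = [("the", "home"), ("the", "house"), ("house", "the")]
    print(search(models, source, search_space))
```

The scoring function is passed in as a parameter so that the search operation stays separate from the models, mirroring the split between "Score" and "Search operation" on the slide.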
Examples:
f1 f2 f3 f4 fm
e1 e2 e3 e4 em
e1 e2 e3 e4 e5 ... em
and search time
Hypothesis Stack (finite size and sorted by cost)
(1) Pop
(2) Extend by translating every possible word
(3) Push
Repeat (1)-(3) until a complete hypothesis is encountered
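The pop / extend / push loop above can be sketched as follows. This is a toy illustration under stated assumptions, not the decoder of any particular system: the function name `stack_decode`, the word-for-word translation table, the integer costs (lower is better), and the simple distortion penalty for translating words out of order are all hypothetical.

```python
import heapq


def stack_decode(source, table, distortion_cost=3, max_stack_size=100):
    """Toy stack decoder.

    table maps each source word to a list of (target word, cost) pairs.
    A hypothesis is (cost, covered source positions, last position, target words).
    """
    stack = [(0, (), -1, ())]  # start with the empty hypothesis
    while stack:
        # (1) Pop the cheapest hypothesis.
        cost, covered, last, output = heapq.heappop(stack)
        if len(covered) == len(source):
            return cost, list(output)  # first complete hypothesis encountered
        # (2) Extend by translating every source word not yet covered.
        for i, word in enumerate(source):
            if i in covered:
                continue
            jump = abs(i - (last + 1)) * distortion_cost  # penalty for reordering
            for target, word_cost in table[word]:
                new_hyp = (cost + word_cost + jump,
                           tuple(sorted(covered + (i,))),
                           i,
                           output + (target,))
                # (3) Push the extended hypothesis.
                heapq.heappush(stack, new_hyp)
        # Keep the stack finite: retain only the cheapest hypotheses.
        if len(stack) > max_stack_size:
            stack = heapq.nsmallest(max_stack_size, stack)
            heapq.heapify(stack)
    return None


if __name__ == "__main__":
    table = {"la": [("the", 1), ("it", 10)], "casa": [("house", 2), ("home", 5)]}
    print(stack_decode(["la", "casa"], table))
```

A binary heap stands in for the sorted stack, so popping the cheapest hypothesis is cheap. Because the stack is truncated to `max_stack_size`, a small limit can discard the hypothesis that would have led to the cheapest complete translation.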
prune the rest
sentence length
sources
imperfect models
* All decoders are Model 4 and tested on the same set
Examples: Pharaoh, Rewrite
(Hiero)