Machine Translation
12: (Non-neural) Statistical Machine Translation Rico Sennrich
University of Edinburgh
- R. Sennrich
MT – 2018 – 12 1 / 27
Machine Translation 12: (Non-neural) Statistical Machine Translation - - PowerPoint PPT Presentation
Machine Translation 12: (Non-neural) Statistical Machine Translation Rico Sennrich University of Edinburgh R. Sennrich MT 2018 12 1 / 27 Todays Lecture So far, main focus of lecture was on: neural machine translation research
MT – 2018 – 12 1 / 27
MT – 2018 – 12 1 / 27
MT – 2018 – 12 2 / 27
Bayes’ theorem
MT – 2018 – 12 3 / 27
MT – 2018 – 12 4 / 27
MT – 2018 – 12 5 / 27
MT – 2018 – 12 5 / 27
MT – 2018 – 12 5 / 27
MT – 2018 – 12 5 / 27
❅ ❅
✱ ✱
❅ ❅ ✱ ✱ ✱
MT – 2018 – 12 5 / 27
Bayes’ theorem
[Och, 2003]
MT – 2018 – 12 6 / 27
MT – 2018 – 12 7 / 27
MT – 2018 – 12 8 / 27
MT – 2018 – 12 9 / 27
I shall be passing some
to you comments Ich werde Ihnen die entsprechenden Anmerkungen aushändigen shall be = werde
MT – 2018 – 12 10 / 27
I shall be passing some
to you comments Ich werde Ihnen die entsprechenden Anmerkungen aushändigen some comments = die entsprechenden Anmerkungen
MT – 2018 – 12 10 / 27
I shall be passing some
to you comments Ich werde Ihnen die entsprechenden Anmerkungen aushändigen werde Ihnen die entsprechenden Anmerkungen aushändigen = shall be passing on to you some comments
MT – 2018 – 12 10 / 27
MT – 2018 – 12 11 / 27
he
it , it , he is are goes go yes is , of course not do not does not is not after to according to in house home chamber at home not is not does not do not home under house return home do not it is he will be it goes he goes is are is after all does to following not after not to not is not are not is not a
MT – 2018 – 12 12 / 27
er geht ja nicht nach hause
are
MT – 2018 – 12 13 / 27
er geht ja nicht nach hause
are it he
MT – 2018 – 12 13 / 27
er geht ja nicht nach hause
are it he goes does not yes go to home home
MT – 2018 – 12 13 / 27
er geht ja nicht nach hause
are it he goes does not yes go to home home
MT – 2018 – 12 13 / 27
MT – 2018 – 12 14 / 27
MT – 2018 – 12 15 / 27
Herr Steiger leider Köln ist nach gefahren Mr Steiger unfortunately Cologne , has gone to
Herr Steiger nicht leider Köln ist nach gefahren Herr Steiger does not unfortunately Cologne , has gone to
MT – 2018 – 12 16 / 27
I shall be passing some
to you comments Ich werde Ihnen die entsprechenden Anmerkungen aushändigen werde X aushändigen = shall be passing on X
MT – 2018 – 12 17 / 27
MT – 2018 – 12 18 / 27
MT – 2018 – 12 18 / 27
MT – 2018 – 12 18 / 27
MT – 2018 – 12 18 / 27
MT – 2018 – 12 18 / 27
MT – 2018 – 12 18 / 27
MT – 2018 – 12 18 / 27
MT – 2018 – 12 18 / 27
MT – 2018 – 12 18 / 27
MT – 2018 – 12 19 / 27
Herr Steiger NN S NN NP VAFIN leider ADV APPR Köln NE ist nach gefahren VVPP PP VP Herr Steiger NNP S NNP NP , unfortunately ADV VBZ Cologne NP , has gone VBN VP VP NNP to TO PP
MT – 2018 – 12 20 / 27
PRP I MD shall VB be VBG passing DT some RP on TO to PRP you NNS comments
Ich PPER werde VAFIN Ihnen PPER die ART
aushänd. VVFIN
NP PP VP VP VP S NP VP VP S
pro Ihnen
pp prp you to to
MT – 2018 – 12 21 / 27
⇒ r1: np → Josef K. | Josef K. 0.90 ⇒ r2: vbn → verleumdet | slandered 0.40 ⇒ r3: vbn → verleumdet | defamed 0.20 ⇒ r4: vp → mußte x1 x2 haben | must have vbn2 np1 0.10 ⇒ r5: s → jemand x1 | someone vp1 0.60 ⇒ r6: s → jemand mußte x1 x2 haben | someone must have vbn2 np1 0.80 ⇒ r7: s → jemand mußte x1 x2 haben | np1 must have been vbn1 by someone 0.05
jemand X someone S Source Target verleumdet X Josef haben X X mußte slandered have VBN must VP K. NP Josef K.
MT – 2018 – 12 22 / 27
“eine” → cat ART infl case nom declension mixed agr
f num sg
“Welt” → cat NN infl case nom agr
f num sg
n
root
attr subj det
MT – 2018 – 12 23 / 27
*NMT 2015 from U. Montréal: ❤tt♣s✿✴✴s✐t❡s✳❣♦♦❣❧❡✳❝♦♠✴s✐t❡✴❛❝❧✶✻♥♠t✴
MT – 2018 – 12 24 / 27
MT – 2018 – 12 25 / 27
MT – 2018 – 12 26 / 27
MT – 2018 – 12 27 / 27
Brown, P . F., Della Pietra, V. J., Della Pietra, S. A., and Mercer, R. L. (1993). The Mathematics of Statistical Machine Translation: Parameter Estimation. Computational Linguistics, 19(2):263–311. Koehn, P . and Knowles, R. (2017). Six Challenges for Neural Machine Translation. In Proceedings of the First Workshop on Neural Machine Translation, pages 28–39, Vancouver. Association for Computational Linguistics. Och, F. J. (2003). Minimum Error Rate Training in Statistical Machine Translation. In ACL ’03: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, pages 160–167, Sapporo,
Sennrich, R. (2015). Modelling and Optimizing on Syntactic N-Grams for Statistical Machine Translation. Transactions of the Association for Computational Linguistics, 3:169–182. Williams, P . (2009). Towards Statistical Machine Translation with Unification Grammars. Master’s thesis, University of Edinburgh, Edinburgh, UK.
MT – 2018 – 12 28 / 27