BREAKTHROUGHS IN NEURAL MACHINE TRANSLATION
Olof Mogren
Chalmers University of Technology
2016-09-29
BREAKTHROUGHS IN NEURAL MACHINE TRANSLATION Olof Mogren Chalmers - - PowerPoint PPT Presentation
BREAKTHROUGHS IN NEURAL MACHINE TRANSLATION Olof Mogren Chalmers University of Technology 2016-09-29 COMING SEMINARS T oday: Olof Mogren Neural Machine Translation October 6: John Wiedenhoeft Fast Bayesian inference in Hidden Markov
Olof Mogren
Chalmers University of Technology
2016-09-29
Neural Machine Translation
Fast Bayesian inference in Hidden Markov Models using Dynamic Wavelet Compression
Linguistic, signal processing, and machine learning approaches in eliciting information form speech ❤tt♣✿✴✴✇✇✇✳❝s❡✳❝❤❛❧♠❡rs✳s❡✴r❡s❡❛r❝❤✴❧❛❜✴s❡♠✐♥❛rs✴
7
[Edinburgh En-De WMT newstest2013 Cased BLEU; NMT 2015 from U. Montréal]
5 10 15 20 25 2013 2014 2015 2016 Phrase-based SMT Syntax-based SMT Neural MT
From [Sennrich 2016, http://www.meta-net.eu/events/meta-forum-2016/slides/09_sennrich.pdf]
A marvelous use of big data but … it’s mined out?!?
In 1519, six hundred Spaniards landed in Mexico to conquer the Aztec Empire with a population of a few million. They lost two thirds of their soldiers in the first clash. translate.google.com (2009): 1519 600 Spaniards landed in Mexico, millions of people to conquer the Aztec empire, the first two-thirds of soldiers against their loss. translate.google.com (2013): 1519 600 Spaniards landed in Mexico to conquer the Aztec empire, hundreds of millions of people, the initial confrontation loss of soldiers two-thirds. translate.google.com (2014): 1519 600 Spaniards landed in Mexico, millions of people to conquer the Aztec empire, the first two-thirds of the loss of soldiers they clash. translate.google.com (2015): 1519 600 Spaniards landed in Mexico, millions of people to conquer the Aztec empire, the first two-thirds of the loss of soldiers they clash. translate.google.com (2016): 1519 600 Spaniards landed in Mexico, millions of people to conquer the Aztec empire, the first two-thirds of the loss of soldiers they clash. }
The approach of modelling the entire MT process via one big artificial neural network.
x1 x2 x3 y2 y1 y3
LSTM (and variants) details
x3 x2 x1 y3 y2 y1
encoder
decoder
Ilya Sutskever, Oriol Vinyals, Quoc V. Le, NIPS 2014
x3 x2 x1 y3 y2 y1
encoder
decoder
Ilya Sutskever, Oriol Vinyals, Quoc V. Le, NIPS 2014
x3 x2 x1 y3 y2 y1
encoder decoder
Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio - ICLR 2015
x3 x2 x1 y3 y2 y1
encoder decoder attention
Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio - ICLR 2015
x3 x2 x1 y3 y2 y1
encoder decoder attention
Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio - ICLR 2015
x3 x2 x1 y3 y2 y1
encoder decoder attention
Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio - ICLR 2015
The agreement
the European Economic Area was signed in August 1992 . <end> L' accord sur la zone économique européenne a été signé en août 1992 . <end>
It should be noted that the marine environment is the least known
environments . <end> Il convient de noter que l' environment marin est le moins connu de l' environment . <end>
What’s not on that list?
Rico Sennrich and Barry Haddow and Alexandra Birch
machine translation Junyoung Chung, Kyunghyun Cho, and Yoshua Bengio, ACL 2016 Byte-pair encoding (BPE): ❛❛❛❜❞❛❛❛❜❛❝ ❩❛❜❞❩❛❜❛❝ ❩❂❛❛ ❩❨❞❩❨❛❝ ❨❂❛❜ ❩❂❛❛ ❳❞❳❛❝ ❳❂❩❨ ❨❂❛❜ ❩❂❛❛
word-character models Thang Luong and Chris Manning, ACL 2016.
W
(4 layers)
2 4 6 8 10 12 14 16 18 20 1K 10K 20K 50K
BLEU Vocabulary Size
Word Word + copy mechanism Hybrid
More than +2.0 BLEU over copy mechanism!
+11.4 +4.5 +3.5 +2.1
177
Rico Sennrich, Barry Haddow, Alexandra Birch, ACL 2016.