Sparse and Constrained Attention for Neural Machine Translation
Chaitanya Malaviya1, Pedro Ferreira2, André F.T. Martins2,3
1Carnegie Mellon University, 2Instituto Superior Técnico, 3Unbabel
1
Sparse and Constrained Attention for Neural Machine Translation - - PowerPoint PPT Presentation
Sparse and Constrained Attention for Neural Machine Translation Chaitanya Malaviya 1 , Pedro Ferreira 2 , Andr F.T. Martins 2,3 1 Carnegie Mellon University, 2 Instituto Superior Tcnico, 3 Unbabel 1 Adequacy in Neural Machine
1Carnegie Mellon University, 2Instituto Superior Técnico, 3Unbabel
1
Source: und wir benutzen dieses wort mit solcher verachtung . Translation: and we use this word with such contempt contempt . Reference: and we say that word with such contempt . Ein 28-jähriger Koch, der kürzlich nach Pittsburgh gezogen war, wurde diese Woche im Treppenhaus eines örtlichen Einkaufszentrums tot aufgefunden . A 28-year-old chef who recently moved to Pittsburgh was found dead in the staircase this week . A 28-year-old chef who recently moved to Pittsburgh was found dead in the staircase of a local shopping mall this week . Source: Reference: Translation:
2
3
4
5
h1 h2 h3 h4
attn_score attn_transform
attn_score:
attn_transform:
g1 c1 g2 c2 g3 c3 g4 c4
6
7
8
9
10
11
12
13
14
15
16
17
15 18.2 21.4 24.6 27.8 31 De-En Ja-En Ro-En 29.77 21.31 29.85 30.08 21.53 29.63 29.81 20.7 29.69 29.67 20.36 29.51
softmax softmax+CovPenalty softmax+CovVector csparsemax
18
0.0 3.2 6.4 9.6 12.8 16.0 De-En Ja-En Ro-En 1.98 11.4 2.67 2.42 11.07 2.93 2.48 14.12 3.47 2.45 13.48 3.37
softmax softmax+CovPenalty softmax+CovVector csparsemax
0.0 4.8 9.6 14.4 19.2 24.0 De-En Ja-En Ro-En 5.44 21.59 5.23 5.47 22.18 5.65 5.49 22.79 5.74 5.59 23.3 5.89
19
20
21
22