Search Aware Tuning for Machine Translation
Lemao Liu Liang Huang City University of New York
EMNLP 2014. Presented by Taro Watanabe.
1 2 3 4
Search Aware Tuning for Machine Translation 0 1 2 3 4 Lemao - - PowerPoint PPT Presentation
Search Aware Tuning for Machine Translation 0 1 2 3 4 Lemao Liu Liang Huang City University of New York EMNLP 2014. Presented by Taro Watanabe. Search Aware Tuning for Machine Translation Lemao Liu Liang Huang City University
Lemao Liu Liang Huang City University of New York
EMNLP 2014. Presented by Taro Watanabe.
1 2 3 4
Lemao Liu Liang Huang City University of New York
EMNLP 2014. Presented by Taro Watanabe.
Search-Aware Tuning - Liu & Huang (CUNY)
2
y
eval & update w
x
decoder
1 2 3 4
Search-Aware Tuning - Liu & Huang (CUNY)
2
y
eval & update w
x
decoder
1 2 3 4
Search-Aware Tuning - Liu & Huang (CUNY)
2
y
eval & update w
x
decoder
cf.: Y-chromosome Adam Mitochondria Eva
1 2 3 4
Search-Aware Tuning - Liu & Huang (CUNY)
3
Search-Aware Tuning - Liu & Huang (CUNY)
4
y
eval & update w
x
decoder
1 2 3 4
Search-Aware Tuning - Liu & Huang (CUNY)
4
y
eval & update w
x
decoder
1 2 3 4
Search-Aware Tuning - Liu & Huang (CUNY)
4
y
eval & update w
x
decoder
Search-Aware Tuning - Liu & Huang (CUNY)
4
y
eval & update w
x
decoder
Search-Aware Tuning - Liu & Huang (CUNY)
5
Search-Aware Tuning - Liu & Huang (CUNY)
the same bin may cover different source words
6
1 2 3 4
Search-Aware Tuning - Liu & Huang (CUNY)
the same bin may cover different source words
6
source: 我 从 上海 ⻜食 到 北京
1 2 3 4
Search-Aware Tuning - Liu & Huang (CUNY)
the same bin may cover different source words
6
source: 我 从 上海 ⻜食 到 北京 gloss: I from Shanghai fly to Beijing
1 2 3 4
Search-Aware Tuning - Liu & Huang (CUNY)
the same bin may cover different source words
6
source: 我 从 上海 ⻜食 到 北京 gloss: I from Shanghai fly to Beijing reference: I flew from Shanghai to Beijing
1 2 3 4
Search-Aware Tuning - Liu & Huang (CUNY)
the same bin may cover different source words
6
source: 我 从 上海 ⻜食 到 北京 gloss: I from Shanghai fly to Beijing reference: I flew from Shanghai to Beijing partial 1: I from
1 2 3 4
Search-Aware Tuning - Liu & Huang (CUNY)
the same bin may cover different source words
6
source: 我 从 上海 ⻜食 到 北京 gloss: I from Shanghai fly to Beijing reference: I flew from Shanghai to Beijing partial 1: I from partial 2: I fly
1 2 3 4
Search-Aware Tuning - Liu & Huang (CUNY)
7
source: 我 从 上海 ⻜食 到 北京 gloss: I from Shanghai fly to Beijing reference: I flew from Shanghai to Beijing partial 1: I from partial 2: I fly
unigram=2 unigram=1
Search-Aware Tuning - Liu & Huang (CUNY)
7
source: 我 从 上海 ⻜食 到 北京 gloss: I from Shanghai fly to Beijing reference: I flew from Shanghai to Beijing partial 1: I from partial 2: I fly
unigram=2 unigram=1 ✔︎
Search-Aware Tuning - Liu & Huang (CUNY)
8
worst
current state start state
Search-Aware Tuning - Liu & Huang (CUNY)
8
worst “most likely” potential
current state start state
Search-Aware Tuning - Liu & Huang (CUNY)
9
source: 我 从 上海 ⻜食 到 北京 gloss: I from Shanghai fly to Beijing reference: I flew from Shanghai to Beijing partial 1: I from partial 2: I fly
e(d) future(d, x) x = ¯ ex(d) =
monotonic reordering
Search-Aware Tuning - Liu & Huang (CUNY)
9
source: 我 从 上海 ⻜食 到 北京 gloss: I from Shanghai fly to Beijing reference: I flew from Shanghai to Beijing partial 1: I from partial 2: I fly
Shanghai fly to Beijing
e(d) future(d, x) x = ¯ ex(d) =
monotonic reordering
Search-Aware Tuning - Liu & Huang (CUNY)
9
source: 我 从 上海 ⻜食 到 北京 gloss: I from Shanghai fly to Beijing reference: I flew from Shanghai to Beijing partial 1: I from partial 2: I fly
Shanghai fly to Beijing from Shanghai to Beijing
e(d) future(d, x) x = ¯ ex(d) =
monotonic reordering
Search-Aware Tuning - Liu & Huang (CUNY)
9
source: 我 从 上海 ⻜食 到 北京 gloss: I from Shanghai fly to Beijing reference: I flew from Shanghai to Beijing partial 1: I from partial 2: I fly
Shanghai fly to Beijing from Shanghai to Beijing unigram=5, bi=2
e(d) future(d, x) x = ¯ ex(d) =
monotonic reordering
Search-Aware Tuning - Liu & Huang (CUNY)
9
source: 我 从 上海 ⻜食 到 北京 gloss: I from Shanghai fly to Beijing reference: I flew from Shanghai to Beijing partial 1: I from partial 2: I fly
Shanghai fly to Beijing from Shanghai to Beijing unigram=5, bi=2 unigram=5, bi=3, tri=2, 4gram=1
e(d) future(d, x) x = ¯ ex(d) =
monotonic reordering
Search-Aware Tuning - Liu & Huang (CUNY)
9
source: 我 从 上海 ⻜食 到 北京 gloss: I from Shanghai fly to Beijing reference: I flew from Shanghai to Beijing partial 1: I from partial 2: I fly
Shanghai fly to Beijing from Shanghai to Beijing unigram=5, bi=2 unigram=5, bi=3, tri=2, 4gram=1
e(d) future(d, x) x = ¯ ex(d) =
monotonic reordering
Search-Aware Tuning - Liu & Huang (CUNY)
10
Search-Aware Tuning - Liu & Huang (CUNY)
10
Traditional tuning MERT/MIRA/PRO
Search-Aware Tuning - Liu & Huang (CUNY)
10
Traditional tuning MERT/MIRA/PRO Search-aware tuning
Search-Aware Tuning - Liu & Huang (CUNY)
Yu et al 13)
11
Search-Aware Tuning - Liu & Huang (CUNY)
12
30 31 32 33 34 35 1 2 4 8 16 32 64 BLEU Beam Size Traditional MERT Tuning Search-aware MERT Tuning
100
Search-Aware Tuning - Liu & Huang (CUNY)
13
tuning test
1 2 3 4
Search-Aware Tuning - Liu & Huang (CUNY)
14
cf.: Y-chromosome Adam Mitochondria Eva
Search-Aware Tuning - Liu & Huang (CUNY)
15
decoding time: 20 min. on single CPU
Search-Aware Tuning - Liu & Huang (CUNY)
16
1 2 3 4