Translation Model Interpolation for Domain Adaptation in TectoMT - - PowerPoint PPT Presentation
Translation Model Interpolation for Domain Adaptation in TectoMT - - PowerPoint PPT Presentation
Rudolf Rosa , Ondej Duek, Michal Novk, Martin Popel {rosa,odusek,mnovak,popel}@ufal.mff.cuni.cz Translation Model Interpolation for Domain Adaptation in TectoMT Charles University in Prague Faculty of Mathematics and Physics Institute
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
2/29
Why domain adaptation?
not to do MT as “Europarl science”
reasonably good for e.g. economic news not particularly good for IT helpdesk
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
3/29
Why domain adaptation?
not to do MT as “Europarl science”
reasonably good for e.g. economic news not particularly good for IT helpdesk
Jak pošlu přílohu mailem? How do I send an attachment by e-mail? Click on the paperclip icon and select the file. Klikněte na ikonu sponky a vyberte soubor.
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
4/29
Why model interpolation?
available parallel data
out-of-domain (Europarl): 2 000 000 sentences in-domain (IT helpdesk): 1 000 sentences
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
5/29
Why model interpolation?
available parallel data
out-of-domain (Europarl): 2 000 000 sentences in-domain (IT helpdesk): 1 000 sentences
train on out+in concatenation
effect of in negligible (overpowered by out)
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
6/29
Why model interpolation?
available parallel data
out-of-domain (Europarl): 2 000 000 sentences in-domain (IT helpdesk): 1 000 sentences
train on out+in concatenation
effect of in negligible (overpowered by out)
train on in only
too weak, low coverage, worse than out alone
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
7/29
Why model interpolation?
available parallel data
out-of-domain (Europarl): 2 000 000 sentences in-domain (IT helpdesk): 1 000 sentences
train on out+in concatenation
effect of in negligible (overpowered by out)
train on in only
too weak, low coverage, worse than out alone
train 2 translation models, one on in, one on out
final TM = (weighted) interpolation of inTM and outTM
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
8/29
Why TectoMT (deep transfer MT)?
✔ better meaning preservation ✔ better generalization from small data ✗ limited support for TM interpolation ✗ no support for tuning of TM weights
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
9/29
Why TectoMT (deep transfer MT)?
✔ better meaning preservation ✔ better generalization from small data ✗ limited support for TM interpolation – solved ✗ no support for tuning of TM weights – TODO
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
10/29
Outline
Motivation TectoMT (deep transfer) Translation model interpolation in TectoMT Evaluation Conclusion
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
11/29
TectoMT deep transfer
Jak pošlu přílohu mailem?
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
12/29
TectoMT deep transfer
#PersPron drop poslat v:fin příloha n:4 Jak adv mail n:7
Jak pošlu přílohu mailem?
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
13/29
TectoMT deep transfer
#PersPron drop poslat v:fin příloha n:4 Jak adv mail n:7
Jak pošlu přílohu mailem?
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
14/29
TectoMT deep transfer
#PersPron drop poslat v:fin příloha n:4 Jak adv mail n:7
Jak pošlu přílohu mailem?
send v:fin
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
15/29
TectoMT deep transfer
#PersPron drop poslat v:fin příloha n:4 Jak adv mail n:7
Jak pošlu přílohu mailem?
send v:fin How adv
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
16/29
TectoMT deep transfer
#PersPron drop poslat v:fin příloha n:4 Jak adv mail n:7
Jak pošlu přílohu mailem?
#PersPron n:subj send v:fin How adv
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
17/29
TectoMT deep transfer
#PersPron drop poslat v:fin příloha n:4 Jak adv mail n:7
Jak pošlu přílohu mailem?
#PersPron n:subj send v:fin attachment n:obj How adv
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
18/29
TectoMT deep transfer
#PersPron drop poslat v:fin příloha n:4 Jak adv mail n:7
Jak pošlu přílohu mailem?
#PersPron n:subj send v:fin attachment n:obj How adv e-mail n:by+X
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
19/29
TectoMT deep transfer
#PersPron drop poslat v:fin příloha n:4 Jak adv mail n:7
Jak pošlu přílohu mailem? How do I send an attachment by e-mail?
#PersPron n:subj send v:fin attachment n:obj How adv e-mail n:by+X
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
20/29
T-lemma and formeme transfer
poslat v:fin mail n:7 ?
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
21/29
T-lemma and formeme transfer
poslat v:fin mail n:7 e-mail (n) 0.32
- nline (adv) 0.25
address (n) 0.11
- nline (adj)
0.05 ... n:by+X 0.23 n:with+X 0.22 adv 0.13 n:via+X 0.04 ...
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
22/29
T-lemma and formeme transfer
poslat v:fin mail n:7 e-mail (n) 0.32
- nline (adv) 0.25
address (n) 0.11
- nline (adj)
0.05 ... n:by+X 0.23 n:with+X 0.22 adv 0.13 n:via+X 0.04 ... e-mail n:by+X
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
23/29
Translation model interpolation
P(e-mail∣mail)= 2 3 MaxEnt mail(e-mail ,context )+ 1 3 Static(e-mail ,mail)
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
24/29
Translation model interpolation
P(e-mail∣mail)= 2 3 MaxEnt mail(e-mail ,context )+ 1 3 Static(e-mail ,mail) P(e-mail∣mail)= 2 6 OutMaxEntmail (e-mail, context)+ 1 6 OutStatic(e-mail , mail) + 2 6 InMaxEnt mail(e-mail , context) + 1 6 InStatic (e-mail , mail)
past: present:
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
25/29
Translation model interpolation
P(e-mail∣mail)=w1⋅ OutMaxEnt mail(e-mail ,context )+w2⋅OutStatic(e-mail , mail) +w3⋅InMaxEnt mail(e-mail ,context ) +w4⋅InStatic(e-mail , mail)
future: interpolation weights tuned on development set
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
26/29
Translation model interpolation
P(e-mail∣mail)=w1⋅ OutMaxEnt mail(e-mail ,context )+w2⋅OutStatic(e-mail , mail) +w3⋅InMaxEnt mail(e-mail ,context ) +w4⋅InStatic(e-mail , mail)
future: interpolation weights tuned on development set not available at the moment:
tuning in TectoMT development set
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
27/29
Evaluation
EN→CS CS→EN EN→ES ES→EN EN→NL NL→EN 16 18 20 22 24 26 28 30 32 34 36 38 40
30.6 27.1 20.4 18.5 23.0 37.0 28.4 21.5 23.3 18.5 21.4 33.7 31.3 28.3 26.5 20.4 24.3 38.9
- ut
in interpol
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
28/29
Conclusion
TectoMT deep transfer system
IT helpdesk domain (1000 parallel sentences) domain adaptation by TM interpolation no tuning of interpolation weights consistently positive results, up to +3 BLEU
future work
tuning of interpolation weights other domain adaptation methods
Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT
29/29