generative and discriminative methods for online
play

Generative and Discriminative Methods for Online Adaptation in SMT - PowerPoint PPT Presentation

Generative and Discriminative Methods for Online Adaptation in SMT aschle , P. Simianer , N. Bertoldi , S. Riezler , K. W M. Federico Department of Computational Linguistics, Heidelberg University, Germany FBK, Trento,


  1. Generative and Discriminative Methods for Online Adaptation in SMT aschle † , P. Simianer † , N. Bertoldi ‡ , S. Riezler † , K. W¨ M. Federico ‡ Department of Computational Linguistics, Heidelberg University, Germany † FBK, Trento, Italy ‡

  2. Outline 1 Introduction 2 Exploiting Feedback 3 Online Adaptation 4 Experiments and Results 5 Conclusions

  3. Outline 1 Introduction 2 Exploiting Feedback 3 Online Adaptation 4 Experiments and Results 5 Conclusions

  4. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Motivation • SMT systems usually translate each sentence in a document in isolation → context information is lost, translations might be inconsistent • MT systems in a Computer-Assisted Translation (CAT) framework can benefit from user feedback from the same document → confirmed translations should be integrated into the MT engine as soon as they become available 1 / 22

  5. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Online learning protocol Train global model M g for all documents d of | d | sentences do Reset local model M d = ∅ for all examples t = 1 , . . . , | d | do Combine M g and M d into M g + d Receive input sentence x t Output translation ˆ y t from M g + d M d has only knowledge of the previous t − 1 sentences! Receive user translation y t Refine M d on pair ( x t , y t ) end for end for 2 / 22

  6. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Example id source sentence translation 7 Annex to the Technical Offer 8 Sistemi Informativi SpA This document is Sistemi 21 Informativi SpA ’s Technical Offer • MT hypothesis • user translation 3 / 22

  7. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Example id source sentence translation Annex all’ Tecnica Offri 7 Annex to the Technical Offer 8 Sistemi Informativi SpA This document is Sistemi 21 Informativi SpA ’s Technical Offer • MT hypothesis • user translation 3 / 22

  8. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Example id source sentence translation Annex all’ Tecnica Offri 7 Annex to the Technical Offer Allegato all’ Offerta Tecnica 8 Sistemi Informativi SpA This document is Sistemi 21 Informativi SpA ’s Technical Offer • MT hypothesis • user translation 3 / 22

  9. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Example id source sentence translation Annex all’ Tecnica Offri 7 Annex to the Technical Offer Allegato all’ Offerta Tecnica 8 Sistemi Informativi SpA This document is Sistemi 21 Informativi SpA ’s Technical Offer • MT hypothesis • user translation 3 / 22

  10. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Example id source sentence translation Annex all’ Tecnica Offri 7 Annex to the Technical Offer Allegato all’ Offerta Tecnica Sistemi Informativi SpA 8 Sistemi Informativi SpA This document is Sistemi 21 Informativi SpA ’s Technical Offer • MT hypothesis • user translation 3 / 22

  11. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Example id source sentence translation Annex all’ Tecnica Offri 7 Annex to the Technical Offer Allegato all’ Offerta Tecnica Sistemi Informativi SpA 8 Sistemi Informativi SpA Sistemi Informativi S.p.A. This document is Sistemi 21 Informativi SpA ’s Technical Offer • MT hypothesis • user translation 3 / 22

  12. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Example id source sentence translation Annex all’ Tecnica Offri 7 Annex to the Technical Offer Allegato all’ Offerta Tecnica Sistemi Informativi SpA 8 Sistemi Informativi SpA Sistemi Informativi S.p.A. This document is Sistemi 21 Informativi SpA ’s Technical Offer • MT hypothesis • user translation 3 / 22

  13. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Example id source sentence translation Annex all’ Tecnica Offri 7 Annex to the Technical Offer Allegato all’ Offerta Tecnica Sistemi Informativi SpA 8 Sistemi Informativi SpA Sistemi Informativi S.p.A. This document is Sistemi Questo documento ` e Sistemi In- 21 Informativi SpA ’s Technical formativi SpA di Tecnica Offri Offer • MT hypothesis • user translation 3 / 22

  14. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Example id source sentence translation Annex all’ Tecnica Offri 7 Annex to the Technical Offer Allegato all’ Offerta Tecnica Sistemi Informativi SpA 8 Sistemi Informativi SpA Sistemi Informativi S.p.A. This document is Sistemi Questo documento ` e Sistemi In- 21 Informativi SpA ’s Technical formativi SpA di Tecnica Offri Offer Il presente documento rappre- senta l’ Offerta Tecnica di Sis- temi Informativi S.p.A. • MT hypothesis • user translation 3 / 22

  15. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Goals • integrate user feedback into an SMT system on a per-sentence basis • enable translation consistency, learn new, document-specific translations • focus on simple, easily integrable solutions as proof of concept that can serve as a baseline for enhanced approaches 4 / 22

  16. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Approaches Generative: Interacting with the decoder Adapt language and translation models locally by passing information to the Moses decoder through XML markup and a cache feature. 5 / 22

  17. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Approaches Discriminative: Reranking decoder output Train an external reranking model of sparse phrase pair and target n -gram features on the k -best output of the decoder; let reranker determine 1best translations. 6 / 22

  18. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Related work • incremental learning for domain adaptation (Koehn and Schroeder, 2007; Bisazza et al., 2011; Liu et al., 2012) • translation consistency (Carpuat and Simard, 2012) • online learning for interactive machine translation (Nepveu et al., 2004; Ortiz-Mart´ ınez et al., 2010; Cesa-Bianchi et al., 2008) 7 / 22

  19. Outline 1 Introduction 2 Exploiting Feedback 3 Online Adaptation 4 Experiments and Results 5 Conclusions

  20. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Exploiting user feedback • align source and user translation • extract phrase table (generative approach) features (reranking approach) from the alignment 8 / 22

  21. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Constrained search for phrase alignment Tool by Cettolo et al. (2010) • produces an alignment at phrase level • given a set of translation options, constrained search optimizes the coverage of both source and target sentences • search produces exactly one phrase segmentation and alignment • target does not have to be reachable, i.e. gaps are allowed 9 / 22

  22. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Phrase extraction Annex to the Technical Offer Allegato all’ Offerta Tecnica 10 / 22

  23. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Phrase extraction Annex to the Technical Offer Allegato all’ Offerta Tecnica known phrase pairs Annex → Allegato , to the → all’ 10 / 22

  24. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Phrase extraction Annex to the Technical Offer Allegato all’ Offerta Tecnica known phrase pairs Annex → Allegato , to the → all’ 10 / 22

  25. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Phrase extraction Annex to the Technical Offer Allegato all’ Offerta Tecnica known phrase pairs new phrase pairs Annex → Allegato , Technical Offer → Offerta Tecnica to the → all’ 10 / 22

  26. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Phrase extraction Annex to the Technical Offer Allegato all’ Offerta Tecnica known phrase pairs new phrase pairs Annex → Allegato , Technical Offer → Offerta Tecnica to the → all’ full sentence Annex to the Technical Offer → Allegato all’ Offerta Tecnica 10 / 22

  27. Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions Reranking features two sparse feature templates are used: 1 phrase pairs used by the decoder (hypotheses); phrase pair features on the user translation given by the alignment output of the constrained search 2 target n -gram features ( n upto 4) these are indicator features, but we use source side token count (phrase pairs) or n (target n -grams) as feature values 11 / 22

  28. Outline 1 Introduction 2 Exploiting Feedback 3 Online Adaptation 4 Experiments and Results 5 Conclusions

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend