Generative and Discriminative Methods for Online Adaptation in SMT


SLIDE 1

Generative and Discriminative Methods for Online Adaptation in SMT

K. Wäschle†, P. Simianer†, N. Bertoldi‡, S. Riezler†, M. Federico‡

† Department of Computational Linguistics, Heidelberg University, Germany
‡ FBK, Trento, Italy

SLIDE 2

Outline

1 Introduction
2 Exploiting Feedback
3 Online Adaptation
4 Experiments and Results
5 Conclusions

SLIDE 3

Outline

1 Introduction
2 Exploiting Feedback
3 Online Adaptation
4 Experiments and Results
5 Conclusions

SLIDE 4

Introduction Exploiting Feedback Online Adaptation Experiments and Results Conclusions

Motivation

  • SMT systems usually translate each sentence in a document in isolation → context information is lost, and translations might be inconsistent
  • MT systems in a Computer-Assisted Translation (CAT) framework can benefit from user feedback from the same document → confirmed translations should be integrated into the MT engine as soon as they become available

SLIDE 5

Online learning protocol

Train global model Mg
for all documents d of |d| sentences do
    Reset local model Md = ∅
    for all examples t = 1, ..., |d| do
        Combine Mg and Md into Mg+d
        Receive input sentence xt
        Output translation ŷt from Mg+d
        Receive user translation yt
        Refine Md on pair (xt, yt)
    end for
end for

Md has knowledge of only the previous t − 1 sentences!
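The protocol above can be sketched in a few lines of Python. The dictionary-based models and the word-by-word "refinement" are toy stand-ins for illustration only, not the actual Moses machinery:

```python
def online_adaptation(global_model, documents):
    """Sketch of the online protocol: global_model is Mg, local_model is Md.
    Models are toy word-for-word dictionaries (an assumption; the real
    systems combine full translation and language models)."""
    outputs = []
    for doc in documents:                    # for all documents d
        local_model = {}                     # reset Md = {}
        for x, y in doc:                     # t = 1, ..., |d|; y arrives as feedback
            combined = {**global_model, **local_model}   # Mg+d (Md wins on overlap)
            y_hat = " ".join(combined.get(w, w) for w in x.split())
            outputs.append(y_hat)            # output translation from Mg+d
            for sw, tw in zip(x.split(), y.split()):     # refine Md on (x_t, y_t)
                local_model[sw] = tw
    return outputs

doc = [("annex technical offer", "allegato offerta tecnica"),
       ("technical offer", "offerta tecnica")]
print(online_adaptation({"technical": "tecnica", "offer": "offri"}, [doc]))
# → ['annex tecnica offri', 'offerta tecnica']
```

Note how the second sentence already benefits from the feedback on the first one, while Md is discarded when the document ends.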

SLIDE 6

Example

id | source sentence | MT hypothesis | user translation
7 | Annex to the Technical Offer | |
8 | Sistemi Informativi SpA | |
21 | This document is Sistemi Informativi SpA’s Technical Offer | |

SLIDE 7

Example

id | source sentence | MT hypothesis | user translation
7 | Annex to the Technical Offer | Annex all’Tecnica Offri |
8 | Sistemi Informativi SpA | |
21 | This document is Sistemi Informativi SpA’s Technical Offer | |

SLIDE 8

Example

id | source sentence | MT hypothesis | user translation
7 | Annex to the Technical Offer | Annex all’Tecnica Offri | Allegato all’Offerta Tecnica
8 | Sistemi Informativi SpA | |
21 | This document is Sistemi Informativi SpA’s Technical Offer | |


SLIDE 10

Example

id | source sentence | MT hypothesis | user translation
7 | Annex to the Technical Offer | Annex all’Tecnica Offri | Allegato all’Offerta Tecnica
8 | Sistemi Informativi SpA | Sistemi Informativi SpA |
21 | This document is Sistemi Informativi SpA’s Technical Offer | |

SLIDE 11

Example

id | source sentence | MT hypothesis | user translation
7 | Annex to the Technical Offer | Annex all’Tecnica Offri | Allegato all’Offerta Tecnica
8 | Sistemi Informativi SpA | Sistemi Informativi SpA | Sistemi Informativi S.p.A.
21 | This document is Sistemi Informativi SpA’s Technical Offer | |


SLIDE 13

Example

id | source sentence | MT hypothesis | user translation
7 | Annex to the Technical Offer | Annex all’Tecnica Offri | Allegato all’Offerta Tecnica
8 | Sistemi Informativi SpA | Sistemi Informativi SpA | Sistemi Informativi S.p.A.
21 | This document is Sistemi Informativi SpA’s Technical Offer | Questo documento è Sistemi Informativi SpA di Tecnica Offri |

SLIDE 14

Example

id | source sentence | MT hypothesis | user translation
7 | Annex to the Technical Offer | Annex all’Tecnica Offri | Allegato all’Offerta Tecnica
8 | Sistemi Informativi SpA | Sistemi Informativi SpA | Sistemi Informativi S.p.A.
21 | This document is Sistemi Informativi SpA’s Technical Offer | Questo documento è Sistemi Informativi SpA di Tecnica Offri | Il presente documento rappresenta l’Offerta Tecnica di Sistemi Informativi S.p.A.

SLIDE 15

Goals

  • integrate user feedback into an SMT system on a per-sentence basis
  • enable translation consistency; learn new, document-specific translations
  • focus on simple, easily integrable solutions as a proof of concept that can serve as a baseline for enhanced approaches

SLIDE 16

Approaches

Generative: Interacting with the decoder

Adapt language and translation models locally by passing information to the Moses decoder through XML markup and a cache feature.

SLIDE 17

Approaches

Discriminative: Reranking decoder output

Train an external reranking model with sparse phrase-pair and target n-gram features on the k-best output of the decoder; the reranker then determines the 1-best translation.

SLIDE 18

Related work

  • incremental learning for domain adaptation (Koehn and Schroeder, 2007; Bisazza et al., 2011; Liu et al., 2012)
  • translation consistency (Carpuat and Simard, 2012)
  • online learning for interactive machine translation (Nepveu et al., 2004; Ortiz-Martínez et al., 2010; Cesa-Bianchi et al., 2008)

SLIDE 19

Outline

1 Introduction
2 Exploiting Feedback
3 Online Adaptation
4 Experiments and Results
5 Conclusions

SLIDE 20

Exploiting user feedback

  • align source and user translation
  • extract a phrase table (generative approach) or features (reranking approach) from the alignment

SLIDE 21

Constrained search for phrase alignment

Tool by Cettolo et al. (2010):

  • produces an alignment at phrase level
  • given a set of translation options, the constrained search optimizes the coverage of both source and target sentences
  • the search produces exactly one phrase segmentation and alignment
  • the target does not have to be reachable, i.e. gaps are allowed

SLIDE 22

Phrase extraction

Annex to the Technical Offer
Allegato all’ Offerta Tecnica

SLIDE 23

Phrase extraction

Annex to the Technical Offer
Allegato all’ Offerta Tecnica

known phrase pairs: Annex → Allegato, to the → all’


SLIDE 25

Phrase extraction

Annex to the Technical Offer
Allegato all’ Offerta Tecnica

known phrase pairs: Annex → Allegato, to the → all’
new phrase pairs: Technical Offer → Offerta Tecnica

SLIDE 26

Phrase extraction

Annex to the Technical Offer
Allegato all’ Offerta Tecnica

known phrase pairs: Annex → Allegato, to the → all’
new phrase pairs: Technical Offer → Offerta Tecnica
full sentence: Annex to the Technical Offer → Allegato all’ Offerta Tecnica
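Once the constrained search has produced a phrase-level alignment, splitting it into known, new, and full-sentence pairs is straightforward. A small sketch, assuming the alignment arrives as a list of (source, target) phrase pairs (the data format is an assumption):

```python
def classify_phrase_pairs(alignment, phrase_table):
    """Split a phrase-level alignment into 'known' pairs (already in the
    global phrase table), 'new' pairs, and the 'full' sentence pair."""
    known, new = [], []
    for src, tgt in alignment:
        (known if (src, tgt) in phrase_table else new).append((src, tgt))
    full = (" ".join(s for s, _ in alignment),
            " ".join(t for _, t in alignment))
    return {"known": known, "new": new, "full": [full]}

alignment = [("Annex", "Allegato"), ("to the", "all'"),
             ("Technical Offer", "Offerta Tecnica")]
table = {("Annex", "Allegato"), ("to the", "all'")}
pairs = classify_phrase_pairs(alignment, table)
print(pairs["new"])   # → [('Technical Offer', 'Offerta Tecnica')]
```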

SLIDE 27

Reranking features

Two sparse feature templates are used:

1 phrase-pair features: phrase pairs used by the decoder (hypotheses); phrase-pair features on the user translation are given by the alignment output of the constrained search
2 target n-gram features (n up to 4)

These are indicator features, but we use the source-side token count (phrase pairs) or n (target n-grams) as feature values.
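A minimal sketch of the two templates; the tuple-based feature names are hypothetical, only the feature values (source token count and n) follow the slide:

```python
def sparse_features(phrase_pairs, target, max_n=4):
    """Two sparse templates: phrase-pair features valued by source-side
    token count, and target n-gram features valued by n."""
    feats = {}
    for src, tgt in phrase_pairs:
        feats[("pp", src, tgt)] = len(src.split())        # source token count
    tokens = target.split()
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            feats[("ng", " ".join(tokens[i:i + n]))] = n  # value = n
    return feats

f = sparse_features([("to the", "all'")], "Allegato all' Offerta Tecnica")
print(f[("pp", "to the", "all'")], f[("ng", "Offerta Tecnica")])  # → 2 2
```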

SLIDE 28

Outline

1 Introduction
2 Exploiting Feedback
3 Online Adaptation
4 Experiments and Results
5 Conclusions

SLIDE 29

Discriminative reranking

We do reranking using a structured perceptron algorithm:

1 the decoder produces a k-best list of hypotheses
2 for each hypothesis x we build a feature vector φ(x) and calculate model scores using the current reranking model w
3 output the 1-best according to the current model:

    1best = argmax_{x ∈ kbest} w · φ(x), with w · φ(x) = Σ_{i=0}^{d} w_i φ(x)_i

4 update the model if the reranking prediction is not equal to the user translation (by string comparison):

    w ← w + (φ(user translation) − φ(1best))
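Steps 1 to 4 are the classic structured perceptron over a k-best list. A compact sketch with sparse dict weights; the toy unigram features stand in for the real sparse templates:

```python
def rerank_step(kbest, features, w, user_translation):
    """One structured-perceptron step: pick the 1-best under the sparse
    weights w (a dict, updated in place), update on string mismatch.
    `features` maps a translation string to a sparse feature dict."""
    def score(hyp):
        return sum(w.get(name, 0.0) * val for name, val in features(hyp).items())
    best = max(kbest, key=score)                 # 1-best under current model
    if best != user_translation:                 # mismatch by string comparison
        for name, val in features(user_translation).items():
            w[name] = w.get(name, 0.0) + val     # + φ(user translation)
        for name, val in features(best).items():
            w[name] = w.get(name, 0.0) - val     # − φ(1best)
    return best

unigrams = lambda s: {tok: 1.0 for tok in s.split()}
w = {}
rerank_step(["DLI e IBM", "DLI ed IBM"], unigrams, w, "DLI ed IBM")
print(rerank_step(["DLI e IBM", "DLI ed IBM"], unigrams, w, "DLI ed IBM"))
# → DLI ed IBM
```

After a single update the model already prefers the user's wording on the next occurrence.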

SLIDE 30

Generative: Adding local models in Moses

TM adaptation: suggest phrase pairs from the feedback exploitation step to the decoder at run time using XML input.
LM adaptation: use an n-gram cache feature in Moses that rewards n-grams seen in user translations.
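The LM cache idea can be illustrated with a plain set of cached n-grams; the actual Moses feature is more elaborate, so this is only a schematic and both function names are hypothetical:

```python
def cache_reward(hypothesis, cache, max_n=4):
    """Count hypothesis n-grams that appeared in earlier user translations."""
    toks = hypothesis.split()
    return sum(1 for n in range(1, max_n + 1)
                 for i in range(len(toks) - n + 1)
                 if " ".join(toks[i:i + n]) in cache)

def cache_update(user_translation, cache, max_n=4):
    """Add all n-grams of a confirmed user translation to the cache."""
    toks = user_translation.split()
    for n in range(1, max_n + 1):
        for i in range(len(toks) - n + 1):
            cache.add(" ".join(toks[i:i + n]))

cache = set()
cache_update("Allegato all' Offerta Tecnica", cache)
print(cache_reward("Offerta Tecnica", cache))
# → 3: 'Offerta', 'Tecnica', and 'Offerta Tecnica' were all seen before
```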

SLIDE 31

Moses XML input option

  • Moses allows input to be annotated with translation options for phrases:

    Annex to the <p translation="Offerta Tecnica||Proposta Tecnica" prob="0.75||0.25">Technical Offer</p>

  • inclusive mode: given phrase translations compete with existing phrase table entries
  • exclusive mode: the decoder is forced to choose from the given translations
  • probabilities are estimated based on the relative frequency of the target phrase given the source phrase within the local phrase table
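Building the annotation with its relative-frequency probabilities can be sketched as follows; the helper name is hypothetical, and the attribute syntax follows the example above:

```python
def xml_annotate(sentence, span, target_counts):
    """Wrap one source span in Moses-style XML markup. Probabilities are
    relative frequencies of each target phrase in the local phrase table."""
    total = sum(target_counts.values())
    translations = "||".join(target_counts)
    probs = "||".join(f"{c / total:g}" for c in target_counts.values())
    tag = f'<p translation="{translations}" prob="{probs}">{span}</p>'
    return sentence.replace(span, tag, 1)

print(xml_annotate("Annex to the Technical Offer", "Technical Offer",
                   {"Offerta Tecnica": 3, "Proposta Tecnica": 1}))
# → Annex to the <p translation="Offerta Tecnica||Proposta Tecnica"
#   prob="0.75||0.25">Technical Offer</p>
```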

SLIDE 32

Outline

1 Introduction
2 Exploiting Feedback
3 Online Adaptation
4 Experiments and Results
5 Conclusions

SLIDE 33

Baseline system

  • Moses decoder and tools
  • 5-gram Kneser-Ney smoothed LM learned with IRSTLM
  • case-sensitive models
  • default log-linear weights optimized with MERT

SLIDE 34

Data

IT domain (sentences per document):
  train: 1,167 K
  dev1: prj1 (420), prj2 (931), prj3 (375)
  dev2: prj4 (289), prj5 (1,183), prj6 (864)
  test: prj7A (176), prj7B (176), prj7C (176)

patent domain (sentences per document):
  train: 4,199 K
  dev1: pat1 (300), pat2 (227), pat3 (239)
  test: pat4 (232), pat5 (230), pat6 (225), pat7 (231)

SLIDE 35

Development: TM adaptation

                 IT dev1                 IT dev2
                 Bleu    Δ[σ]            Bleu    Δ[σ]
baseline         22.59                   21.49
new              23.11   +0.52 [±0.57]   21.64   +0.15 [±0.06]
known            23.73   +1.14 [±0.70]   22.24   +0.75 [±0.15]
full             24.22   +1.63 [±1.73]   23.07   +1.58 [±0.91]
new&known&full   25.49   +2.90 [±2.18]   23.91   +2.42 [±0.83]

new&known&full = tm

SLIDE 36

Development: Reranking & comparison

           IT dev1                 IT dev2
           Bleu    Δ[σ]            Bleu    Δ[σ]
baseline   22.59                   21.49
rerank     23.74   +1.15 [±0.82]   22.85   +1.36 [±0.65]
known&lm   25.78   +2.69 [±1.68]   23.43   +1.94 [±1.41]

SLIDE 37

Reranking: Top features

  • the baseline translates DLI and IBM → DLI e IBM; in consequence, the reranker learns that "and" should be translated with "ed" in this case (before a following vowel)
  • the IT data contains a lot of title-cased text that is incorrectly translated to lowercase by the baseline system → top phrase pairs include corrections for this

SLIDE 38

Test set results: IT domain

           prj7A             prj7B             prj7C
           Bleu    Δ         Bleu    Δ         Bleu    Δ
baseline   41.10             39.68             30.68
tm+lm      42.97   +1.87     39.72   +0.04     33.76   +3.08

SLIDE 39

Test set results: patent domain

               patent test
               Bleu    Δ[σ]
baseline       30.26
rerank         32.54   +2.28 [±1.47]
tm+lm          33.24   +2.98 [±2.03]
tm+lm+rerank   34.02   +3.76 [±2.08]

SLIDE 40

Conclusions

  • simple approaches to integrate new information into an SMT system on each sentence
  • significant improvements over the baseline
  • a starting point for advanced incremental approaches

SLIDE 41

Bibliography

Bisazza, A., Ruiz, N., and Federico, M. (2011). Fill-up versus interpolation methods for phrase-based SMT adaptation. In IWSLT’11.
Carpuat, M. and Simard, M. (2012). The trouble with SMT consistency. In WMT’12.
Cesa-Bianchi, N., Reverberi, G., and Szedmak, S. (2008). Online learning algorithms for computer-assisted translation. Technical report, SMART (www.smart-project.eu).
Cettolo, M., Federico, M., and Bertoldi, N. (2010). Mining parallel fragments from comparable texts. In IWSLT’10.
Koehn, P. and Schroeder, J. (2007). Experiments in domain adaptation for statistical machine translation. In WMT’07.
Liu, L., Cao, H., Watanabe, T., Zhao, T., Yu, M., and Zhu, C. (2012). Locally training the log-linear model for SMT. In EMNLP’12.
Nepveu, L., Lapalme, G., Langlais, P., and Foster, G. (2004). Adaptive language and translation models for interactive machine translation. In EMNLP’04.
Ortiz-Martínez, D., García-Varea, I., and Casacuberta, F. (2010). Online learning for interactive statistical machine translation. In HLT-NAACL’10.