

SLIDE 1

Document Context Neural Machine Translation with Memory Networks

Sameen Maruf, Gholamreza Haffari

Faculty of Information Technology Monash University

July 17, 2017


SLIDE 2

Overview

1. Introduction
2. Document MT as Structured Prediction
3. Document NMT with MemNets
4. Experiments and Analysis
5. Conclusion
6. References


SLIDE 8

Why document-level machine translation?

- Most MT models translate sentences independently
- Discourse phenomena, e.g. pronominal anaphora and lexical consistency, are therefore ignored, even though they may involve long-range dependencies

SLIDE 12

Why document-level machine translation?

- Attempts at document-level statistical MT did not yield significant empirical improvements [Hardmeier and Federico, 2010, Gong et al., 2011, Garcia et al., 2014]
- Previous context-aware NMT models use only local context, and report degraded performance when using target-side context [Jean et al., 2017, Wang et al., 2017, Bawden et al., 2018]
- We incorporate global source and target document contexts



SLIDE 20

Document MT as Structured Prediction

Two types of factors: f_θ(y_t; x_t, x_{-t}) and g_θ(y_t; y_{-t})

SLIDE 24

Document MT as Structured Prediction

Training objective: maximise P(y_1, ..., y_{|d|} | x_1, ..., x_{|d|})
⇒ maximise the pseudo-likelihood:

    argmax_θ  Π_{t=1}^{|d|}  P_θ(y_t | x_t, y_{-t}, x_{-t})    (1)

where f_θ and g_θ are subsumed in P_θ(y_t | x_t, y_{-t}, x_{-t})
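In log space, objective (1) is a sum of per-sentence conditional log-probabilities, each conditioned on the rest of the document on both sides. A minimal sketch, where `log_prob` and the toy scorer below are hypothetical stand-ins for the NMT model's conditional log P_θ(y_t | x_t, y_{-t}, x_{-t}):

```python
def pseudo_log_likelihood(doc_src, doc_trg, log_prob):
    """Log pseudo-likelihood of a document pair: the sum over sentences t
    of log P(y_t | x_t, y_{-t}, x_{-t})."""
    total = 0.0
    for t in range(len(doc_trg)):
        x_rest = doc_src[:t] + doc_src[t + 1:]   # x_{-t}
        y_rest = doc_trg[:t] + doc_trg[t + 1:]   # y_{-t}
        total += log_prob(doc_trg[t], doc_src[t], y_rest, x_rest)
    return total

# hypothetical toy scorer: longer target sentences get lower log-probability
def toy_log_prob(y_t, x_t, y_rest, x_rest):
    return -float(len(y_t.split()))

print(pseudo_log_likelihood(["a b", "c"], ["A B", "C"], toy_log_prob))  # → -3.0
```

At training time the true target context y_{-t} is available, which is what makes the per-sentence factors straightforward to optimise.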

SLIDE 30

Document MT as Structured Prediction

Challenge: at test time, the target document is not given
Solution: coordinate ascent (i.e. iterative decoding)

SLIDE 36

Document MT as Structured Prediction

Iterative Decoding

(figure only)
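The coordinate-ascent decoding above can be written as a simple loop: first produce a context-free draft of the whole target document, then repeatedly re-translate each sentence conditioned on the current draft of the rest. A sketch in which the `translate` callback is a hypothetical stand-in for the sentence-level decoder:

```python
def iterative_decode(doc_src, translate, n_iters=3):
    """Coordinate ascent over target sentences: each pass re-translates
    sentence t conditioned on the current draft of y_{-t}."""
    # pass 0: initial draft with no target-side context
    doc_trg = [translate(x, trg_context=None) for x in doc_src]
    for _ in range(n_iters):
        for t, x_t in enumerate(doc_src):
            context = doc_trg[:t] + doc_trg[t + 1:]   # y_{-t}
            doc_trg[t] = translate(x_t, trg_context=context)
    return doc_trg

# hypothetical toy decoder: tags its output with how much context it saw
def toy_translate(x, trg_context=None):
    n = 0 if trg_context is None else len(trg_context)
    return f"{x.upper()}|ctx={n}"

print(iterative_decode(["tere", "maailm"], toy_translate, n_iters=1))
# → ['TERE|ctx=1', 'MAAILM|ctx=1']
```

Each pass re-scores sentences against an increasingly complete target context, mimicking coordinate ascent on the document-level objective.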


SLIDE 38

Document NMT with MemNets

⇒ P_θ(y_t | x_t, y_{-t}, x_{-t})


SLIDE 42

Document NMT with MemNets

Memory-to-Context: s_{t,j} = GRU(s_{t,j-1}, E_T[y_{t,j-1}], c_{t,j}, c^src_t, c^trg_t)
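In Memory-to-Context, the document memory reads enter the decoder as extra inputs to the GRU state update. A numpy sketch under hypothetical toy dimensions (the GRU parameterisation here is the textbook one, not necessarily the paper's exact implementation):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_cell(prev_state, x, p):
    """A standard GRU update; p holds the gate weight matrices."""
    z = sigmoid(p["Wz"] @ x + p["Uz"] @ prev_state)        # update gate
    r = sigmoid(p["Wr"] @ x + p["Ur"] @ prev_state)        # reset gate
    h = np.tanh(p["Wh"] @ x + p["Uh"] @ (r * prev_state))  # candidate state
    return (1 - z) * prev_state + z * h

def memory_to_context_step(s_prev, y_prev_emb, c_tj, c_src, c_trg, p):
    """Memory-to-Context: the document memory reads c_src, c_trg are fed
    to the decoder GRU alongside the usual inputs."""
    x = np.concatenate([y_prev_emb, c_tj, c_src, c_trg])
    return gru_cell(s_prev, x, p)

# hypothetical toy dimensions: state dim 4, each conditioning vector dim 3
rng = np.random.default_rng(0)
p = {k: rng.normal(size=(4, 12)) for k in ("Wz", "Wr", "Wh")}
p.update({k: rng.normal(size=(4, 4)) for k in ("Uz", "Ur", "Uh")})
s = memory_to_context_step(np.zeros(4), *(rng.normal(size=3) for _ in range(4)), p)
print(s.shape)  # → (4,)
```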

SLIDE 43

Document NMT with MemNets

Memory-to-Output: y_{t,j} ~ softmax(W_y · r_{t,j} + W_ym · c^src_t + W_yt · c^trg_t + b_y)
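In Memory-to-Output, the memory reads instead enter just before the softmax, as additional linear terms in the output logits. A numpy sketch with hypothetical toy dimensions:

```python
import numpy as np

def softmax(v):
    e = np.exp(v - v.max())
    return e / e.sum()

def memory_to_output_dist(r_tj, c_src, c_trg, p):
    """Memory-to-Output: the memory reads contribute extra linear terms
    to the output logits, just before the softmax."""
    logits = p["Wy"] @ r_tj + p["Wym"] @ c_src + p["Wyt"] @ c_trg + p["by"]
    return softmax(logits)

# hypothetical toy dimensions: vocabulary 5, hidden/context dim 3
rng = np.random.default_rng(1)
vocab, dim = 5, 3
p = {name: rng.normal(size=(vocab, dim)) for name in ("Wy", "Wym", "Wyt")}
p["by"] = np.zeros(vocab)
dist = memory_to_output_dist(*(rng.normal(size=dim) for _ in range(3)), p)
print(dist.shape)  # → (5,)
```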

SLIDE 44

Document NMT with MemNets

- Use only the source, only the target, or both external memories
- Use the Memory-to-Context or Memory-to-Output architecture to incorporate the different contexts
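The document context vectors c^src_t and c^trg_t used by both architectures are reads from external memories holding one cell per sentence of the document. The deck's MemNet read operation itself is in figures missing from this transcript; the sketch below is a generic soft-attention read, shown only to illustrate the idea:

```python
import numpy as np

def softmax(v):
    e = np.exp(v - v.max())
    return e / e.sum()

def memory_read(query, memory):
    """Soft read over memory cells (one per sentence): dot-product
    addressing, then an attention-weighted average of the cells."""
    alpha = softmax(memory @ query)   # attention over sentences
    return alpha @ memory             # document context vector

# hypothetical toy setup: a 6-sentence document with 4-dim memory cells
rng = np.random.default_rng(2)
memory = rng.normal(size=(6, 4))
c = memory_read(rng.normal(size=4), memory)
print(c.shape)  # → (4,)
```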


SLIDE 49

Experimental Setup

Training/dev/test corpora statistics:

            corpus            #docs (H)    #sents (K)   avg doc len
  Fr→En     Ted-Talks         10/1.2/1.5   123/15/19    123/128/124
  Et→En     Europarl v7       150/10/18    209/14/25    14/14/14
  De→En     News-Commentary   49/.9/1.6    191/2/3      39/23/19

Evaluation metrics: BLEU, METEOR

Baselines:
- Context-free baseline (S-NMT)
- Local source context baselines: [Jean et al., 2017] and [Wang et al., 2017]

SLIDE 54

Memory-to-Context Results

BLEU scores:

               Fr→En    De→En    Et→En
  S-NMT        20.85    9.18     20.42
  S-NMT+src    21.91    10.2     22.1
  S-NMT+trg    21.74    9.97     21.94
  S-NMT+both   22       10.54    22.32

SLIDE 59

Memory-to-Output Results

BLEU scores:

               Fr→En    De→En    Et→En
  S-NMT        20.85    9.18     20.42
  S-NMT+src    21.8     9.98     21.5
  S-NMT+trg    21.76    10.04    21.82
  S-NMT+both   21.77    10.23    22.2

SLIDE 63

Main Results

BLEU scores:

                       Fr→En    De→En    Et→En
  [Jean et al., 2017]  21.95    10.26    21.67
  [Wang et al., 2017]  21.87    10.14    22.06
  S-NMT+src            21.91    10.2     22.1
  S-NMT+both           22       10.54    22.32

SLIDE 67

Example translation

  Source              qimonda täidab lissaboni strateegia eesmärke.
  Target              qimonda meets the objectives of the lisbon strategy.
  S-NMT               <UNK> is the objectives of the lisbon strategy.
  +Src Mem            the millennium development goals are fulfilling the millennium goals of the lisbon strategy.
  +Trg Mem            in writing. - (ro) the lisbon strategy is fulfilling the objectives of the lisbon strategy.
  +Both Mems          qimonda fulfils the aims of the lisbon strategy.
  [Wang et al., 2017] <UNK> fulfils the objectives of the lisbon strategy.

SLIDE 70

Example translation (contd.)

  Source              ... et riigis kehtib endiselt lukašenka diktatuur, mis rikub inim- ning etnilise vähemuse õigusi.
  Target              ... this country is still under the dictatorship of lukashenko, breaching human rights and the rights of ethnic minorities.
  S-NMT               ... the country still remains in a position of lukashenko to violate human rights and ethnic minorities.
  +Src Mem            ... the country still applies to the brutal dictatorship of human and ethnic minority rights.
  +Trg Mem            ... the country still keeps the <UNK> dictatorship that violates human rights and ethnic rights.
  +Both Mems          ... the country still persists in lukashenko's dictatorship that violate human rights and ethnic minority rights.
  [Wang et al., 2017] ... there is still a regime in the country that is violating the rights of human and ethnic minority in the country.


SLIDE 76

Conclusion

- Proposed a model which incorporates global source and target document contexts
- Proposed effective training and decoding methodologies for our model
- Future work: investigate document-context NMT models which incorporate specific discourse-level phenomena


SLIDE 78

References

Hardmeier, C. and Federico, M. (2010). Modelling pronominal anaphora in statistical machine translation. International Workshop on Spoken Language Translation.

Gong, Z. and Zhang, M. and Zhou, G. (2011). Cache-based document-level statistical machine translation. Proceedings of the Conference on Empirical Methods in Natural Language Processing.

Garcia, E. M. and España-Bonet, C. and Màrquez, L. (2014). Document-level machine translation as a re-translation process. Procesamiento del Lenguaje Natural, 53:103-110.

Jean, S. and Lauly, S. and Firat, O. and Cho, K. (2017). Does Neural Machine Translation Benefit from Larger Context? arXiv:1704.05135.

Wang, L. and Tu, Z. and Way, A. and Liu, Q. (2017). Exploiting Cross-Sentence Context for Neural Machine Translation. Proceedings of the Conference on Empirical Methods in Natural Language Processing.

Bawden, R. and Sennrich, R. and Birch, A. and Haddow, B. (2018). Evaluating Discourse Phenomena in Neural Machine Translation. Proceedings of NAACL-HLT 2018.