Semi-supervised Learning for Neural Machine Translation Yong Cheng - - PowerPoint PPT Presentation



SLIDE 1

Semi-supervised Learning for Neural Machine Translation

Yong Cheng

joint work with Wei Xu, Zhongjun He, Wei He, Hua Wu, Maosong Sun, Yang Liu

SLIDE 2

Machine Translation


Automated translation using computer software

SLIDE 3

Machine Translation

Rule-based Machine Translation (1970s)
Example-based Machine Translation (1984)
Statistical Machine Translation (SMT) (1993)
Neural Machine Translation (NMT) (2014)

Trend: learning to translate from DATA

SLIDE 4

Machine Translation

Parallel Corpora vs. Monolingual Corpora

Parallel corpora are usually limited in quantity, quality, and coverage.

SLIDE 5

Monolingual Corpora Used in SMT and NMT

N-gram language model in SMT. Koehn et al. [2007]
Monolingual corpora as decipherment. Ravi and Knight [2011]
Integrate a neural language model into NMT. Gulcehre et al. [2015]
Additional pseudo-parallel corpus. Sennrich et al. [2016]

SLIDE 6

Supervised Training

Parallel Corpus → Objective: maximize the likelihood of target sentences given their sources.
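Concretely, supervised training is standard maximum-likelihood estimation on the parallel corpus; a sketch in the usual notation (the symbols are assumed here, the slide itself only shows the diagram):

```latex
% Supervised MLE objective on a parallel corpus {(x^{(n)}, y^{(n)})}_{n=1}^{N}
J(\theta) = \sum_{n=1}^{N} \log P\bigl(y^{(n)} \mid x^{(n)}; \theta\bigr)
```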

SLIDE 7

Unsupervised Training

Monolingual Corpus


SLIDE 8

Our Approach: Autoencoders

x: bushi yu shalong juxing le huitan

SLIDE 9

Our Approach: Autoencoders

x: bushi yu shalong juxing le huitan
P(y | x; θ→)

SLIDE 10

Our Approach: Autoencoders

x: bushi yu shalong juxing le huitan
y (latent): Bush held a talk with Sharon
P(y | x; θ→)

SLIDE 11

Our Approach: Autoencoders

x: bushi yu shalong juxing le huitan
y (latent): Bush held a talk with Sharon
P(y | x; θ→)   P(x | y; θ←)
(θ→: source-to-target model, θ←: target-to-source model)

SLIDE 12

Our Approach: Autoencoders

x: bushi yu shalong juxing le huitan
y (latent): Bush held a talk with Sharon
x′: bushi yu shalong juxing le huitan
P(y | x; θ→)   P(x | y; θ←)

SLIDE 13

Our Approach: Autoencoders

source autoencoder: x → y → x′
target autoencoder: y → x → y′
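The source autoencoder treats the target-language sentence as a latent variable: translate x into y with the source-to-target model, then reconstruct x from y with the target-to-source model. A sketch of the reconstruction probability, with θ→ and θ← for the two model directions (the marginalization over y is assumed from the latent-variable view on the slides):

```latex
P\bigl(x' \mid x; \overrightarrow{\theta}, \overleftarrow{\theta}\bigr)
  = \sum_{y} P\bigl(y \mid x; \overrightarrow{\theta}\bigr)\,
             P\bigl(x' \mid y; \overleftarrow{\theta}\bigr)
```

The target autoencoder is symmetric: translate y into a latent x with the target-to-source model, then reconstruct y with the source-to-target model.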

SLIDE 14

Unsupervised Training (Autoencoders)

Monolingual Corpus


target autoencoder
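The sum over all latent translations y is intractable, so it can be approximated with a few translations sampled from the source-to-target model. A minimal pure-Python sketch; the function names and the toy two-sample scheme are illustrative assumptions, not the authors' exact implementation:

```python
import math

def reconstruction_logprob(x, sample_translations, reconstruct_logprob):
    """Approximate log P(x' = x | x) = log sum_y P(y|x) P(x|y)
    using a small set of sampled latent translations y."""
    # sample_translations(x) -> list of (y, log P(y|x)) pairs
    samples = sample_translations(x)
    # each term: log [ P(y|x) * P(x|y) ]
    terms = [lp_y + reconstruct_logprob(x, y) for y, lp_y in samples]
    # numerically stable log-sum-exp over the sampled terms
    m = max(terms)
    return m + math.log(sum(math.exp(t - m) for t in terms))

# Toy models: two candidate latent translations with probabilities
# 0.6 and 0.4, and reconstruction probability 0.5 for either one.
def toy_sampler(x):
    return [("y1", math.log(0.6)), ("y2", math.log(0.4))]

def toy_reconstructor(x, y):
    return math.log(0.5)

lp = reconstruction_logprob("bushi yu shalong juxing le huitan",
                            toy_sampler, toy_reconstructor)
# reconstruction probability = 0.6*0.5 + 0.4*0.5 = 0.5
```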

SLIDE 15

Semi-supervised Training


Training Objective
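The semi-supervised objective combines the supervised likelihood on the parallel corpus with the two autoencoder terms on target and source monolingual data. A sketch, following the notation of the autoencoder slides (λ1 and λ2 are interpolation weights, an assumption about how the terms are balanced):

```latex
J(\overrightarrow{\theta}, \overleftarrow{\theta})
  = \sum_{n} \log P\bigl(y^{(n)} \mid x^{(n)}; \overrightarrow{\theta}\bigr)
  + \sum_{n} \log P\bigl(x^{(n)} \mid y^{(n)}; \overleftarrow{\theta}\bigr)
  + \lambda_{1} \sum_{t} \log P\bigl(x^{(t)} \mid x^{(t)};
      \overrightarrow{\theta}, \overleftarrow{\theta}\bigr)
  + \lambda_{2} \sum_{s} \log P\bigl(y^{(s)} \mid y^{(s)};
      \overrightarrow{\theta}, \overleftarrow{\theta}\bigr)
```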

SLIDE 16

Translation Results

Compared with Moses (SMT) and RNNSearch (NMT)



SLIDE 21

Translation Results

Compared with Sennrich et al. [2015a]


SLIDE 22

Example Translation of Monolingual Corpus


SLIDE 23

Conclusion

Monolingual corpora are an important resource for neural machine translation.

We have proposed a semi-supervised approach to training bidirectional neural machine translation models that exploits monolingual corpora.

As our method is sensitive to the OOVs present in monolingual corpora, we plan to integrate Jean et al. (2015)'s technique for using a very large vocabulary into our approach.

SLIDE 24

Thank You!

SLIDE 25

Effect of Sample Size

ZH-EN EN-ZH


SLIDE 26

Effect of OOV ratio

ZH-EN EN-ZH
