Simultaneous German-English Lecture Translation

Muntsin Kolss, Matthias Wölfel, Florian Kraft, Jan Niehues, Matthias Paulik, Alex Waibel

IWSLT 2008, October 21, 2008


SLIDE 1

Simultaneous German-English Lecture Translation

Muntsin Kolss, Matthias Wölfel, Florian Kraft, Jan Niehues, Matthias Paulik, Alex Waibel IWSLT 2008, October 21, 2008

SLIDE 2

Simultaneous Lecture Translation: Challenges (for German-English)

  • Unlimited Domain:

    - Wide variety of topics
    - Lectures often go deeply into detail: specialized vocabulary and expressions

  • Spoken Language:

    - Most lecturers are not professionally trained speakers
    - Conversational speech, more informal than prepared speeches
    - Long monologues, often not easily separable into utterances with sentence boundaries

  • Strict real-time and latency requirements
  • German-English specific:

    - English words embedded in German, especially technical terms
    - German compounds
    - Long-distance word reordering
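The German-compound challenge listed above is commonly attacked by splitting compounds into known parts before translation. A minimal sketch of that idea follows; the vocabulary, the greedy strategy, and the handling of the linking "s" (Fugen-s) are illustrative assumptions, not the system's actual splitter:

```python
def split_compound(word, vocab, min_len=3):
    """Greedily split a German compound into known vocabulary parts.

    Tries split points left to right, allowing a linking "s" (Fugen-s)
    at the end of a part; returns the parts, or [word] if no split into
    known parts exists. Illustrative sketch only.
    """
    if word in vocab:
        return [word]
    for i in range(min_len, len(word) - min_len + 1):
        head, tail = word[:i], word[i:]
        # Try the surface head and the head with a trailing linking "s" removed.
        for h in (head, head.rstrip("s")):
            if h in vocab and len(h) >= min_len:
                rest = split_compound(tail, vocab, min_len)
                if rest != [tail] or tail in vocab:
                    return [h] + rest
    return [word]

vocab = {"vorlesung", "übersetzung"}
print(split_compound("vorlesungsübersetzung", vocab))
# → ['vorlesung', 'übersetzung']
```

A real splitter would score alternative splits (e.g. by part frequency) instead of taking the first greedy match.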

SLIDE 3

System Overview

SLIDE 4

English Words in German Lectures

| Language               | German | English | Both  | Unknown |
| Total Words            | 4195   | 110     | 1397  | 887     |
| Deletions              | 52     | 1       | 44    | 0       |
| Insertions             | 58     | 9       | 37    | 2       |
| Substitutions: German  | 258    | 37      | 91    | 113     |
| Substitutions: English | 7      | 6       | 8     | 7       |
| Substitutions: Both    | 68     | 10      | 33    | 56      |
| Substitutions: Unknown | 5      | 3       | 2     | 4       |
| WER                    | 10.7%  | 60.0%   | 15.4% | 20.5%   |
| Total Error            | 448    | 66      | 215   | 182     |

SLIDE 5

English Words in German Lectures

  • Two Approaches:

    - Use two phoneme sets in parallel, one each for German and English (parallel)
    - Map the English pronunciation dictionary to German phonemes (mapping)

| WER      | All   | German | English | Both  | Unknown |
| Baseline | 13.8% | 10.7%  | 60.0%   | 15.4% | 20.5%   |
| Mapping  | 12.7% | 11.1%  | 34.6%   | 13.8% | 16.1%   |
| Parallel | 13.4% | 11.4%  | 26.4%   | 14.7% | 18.9%   |
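The mapping approach can be pictured as rewriting each entry of the English pronunciation dictionary using only German phonemes, so a single German acoustic model covers the embedded English words. A toy sketch; the phoneme symbols and the mapping table are invented for illustration:

```python
# Illustrative "mapping" approach: replace English phonemes that have no
# German counterpart with their closest German phoneme, leaving shared
# phonemes unchanged. Inventory and mappings below are assumptions.
EN_TO_DE = {
    "TH": "S",   # English /θ/ has no German counterpart
    "DH": "Z",   # /ð/ approximated by a voiced German fricative
    "W":  "V",   # /w/ approximated by /v/
    "AE": "E",   # /æ/ approximated by an open vowel
}

def map_pronunciation(english_phonemes):
    """Map each English phoneme to its closest German phoneme
    (identity for phonemes assumed to be shared by both inventories)."""
    return [EN_TO_DE.get(p, p) for p in english_phonemes]

# Hypothetical phonemization of "weather":
print(map_pronunciation(["W", "EH", "DH", "ER"]))  # → ['V', 'EH', 'Z', 'ER']
```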

SLIDE 6

Machine Translation: Adaptation to Lectures

  • Training data: German-English EPPS, News Commentary, Travel Expression Corpus
  • 100K corpus of German lectures held at Universität Karlsruhe, transcribed and translated into English

| System                            | Dev   | Test  |
| Baseline                          | 31.54 | 27.18 |
| Language Model (LM) Adaptation    | 33.11 | 29.17 |
| Translation Model (TM) Adaptation | 33.09 | 30.46 |
| LM and TM adaptation              | 34.00 | 30.94 |
| + Rule-based word reordering      | 34.59 | 31.38 |
| + Discriminative Word Alignment   | 35.24 | 31.40 |

SLIDE 7

Automatic Simultaneous Translation: Input Segmentation

  • Text Translation:

    source sentence → MT Decoder → target sentence

  • Speech Translation (turn-based, "push-to-talk" dialog systems):

    source utterance → MT Decoder → target utterance

  • Simultaneous Translation:

    continuous ASR input → Segmentation → MT Decoder → target segment

SLIDE 8

Low latency translation is easy?

[Figure: BLEU [%] as a function of fixed segment length, from 1 to 10K words]
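The fixed-segment-length baseline in this plot simply cuts the recognizer output every N words. A minimal sketch of that chunking (the function name is ours; the sample sentence is the one quoted on the later lattice slides):

```python
def fixed_length_segments(token_stream, segment_length):
    """Cut an unbounded token stream into fixed-length chunks, each of
    which would be translated independently (the naive baseline): short
    segments destroy context, long segments add latency."""
    buf = []
    for token in token_stream:
        buf.append(token)
        if len(buf) == segment_length:
            yield buf
            buf = []
    if buf:                      # flush the final partial segment
        yield buf

words = "and the inspiration for the exact motivation of the stimuli".split()
print(list(fixed_length_segments(words, 4)))
# → [['and', 'the', 'inspiration', 'for'], ['the', 'exact', 'motivation', 'of'], ['the', 'stimuli']]
```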

SLIDE 9

Disadvantages of Input Segmentation

  • Choosing meaningful segment boundaries is difficult and error-prone
  • No recovery from segmentation errors; input segmentation makes hard decisions
  • Phrases which would match across the segment boundaries can no longer be used
  • No word reordering across segment boundaries is possible
  • Language model context is lost across the segment boundaries
  • If the language model is trained on sentence-segmented data, there will often be a mismatch for the begin-of-sentence and end-of-sentence LM events

SLIDE 10

Phrase-based SMT decoder

“I have heard traditional values referred to” “he escuchado relacionarlo con valores tradicionales”

I have heard traditional values referred to
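The slide's example can be approximated with a minimal monotone phrase-based decoder: cover the source left to right with the longest matching phrase-table entry. The toy phrase table below, including its phrase segmentation, is our illustrative assumption; real decoders also score hypotheses with translation and language models and permit reordering:

```python
# Toy monotone phrase-based decoding over a hypothetical phrase table.
PHRASE_TABLE = {
    ("i", "have", "heard"): "he escuchado",
    ("traditional", "values"): "valores tradicionales",
    ("referred", "to"): "relacionarlo con",
}

def translate(source_words, table, max_phrase_len=3):
    output, i = [], 0
    while i < len(source_words):
        # Prefer the longest phrase starting at position i.
        for n in range(min(max_phrase_len, len(source_words) - i), 0, -1):
            phrase = tuple(source_words[i:i + n])
            if phrase in table:
                output.append(table[phrase])
                i += n
                break
        else:
            output.append(source_words[i])   # pass unknown words through
            i += 1
    return " ".join(output)

print(translate("i have heard traditional values referred to".split(), PHRASE_TABLE))
# → he escuchado valores tradicionales relacionarlo con
```

Note that the slide's actual Spanish output, "he escuchado relacionarlo con valores tradicionales", reorders the last two phrases; producing it requires the reordering that this monotone sketch omits.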

SLIDE 11

Stream Decoding: Continuous Translation Lattice

“ and the inspiration for the exact motivation of the stimuli was derived from experiments in which we use these networks for geometrical figures and we ask subjects to describe ...”

  • No input segmentation: process "infinite" input stream from speech recognizer, extending/truncating the translation lattice

in F

SLIDE 12

Stream Decoding: Continuous Translation Lattice

“ and the inspiration for the exact motivation of the stimuli was derived from experiments in which we use these networks for geometrical figures and we ask subjects to describe ...”

  • No input segmentation: process "infinite" input stream from speech recognizer, extending/truncating the translation lattice

in which F

SLIDE 13

Stream Decoding: Continuous Translation Lattice

“ and the inspiration for the exact motivation of the stimuli was derived from experiments in which we use these networks for geometrical figures and we ask subjects to describe ...”

  • No input segmentation: process "infinite" input stream from speech recognizer, extending/truncating the translation lattice

in which we F

SLIDE 14

Stream Decoding: Continuous Translation Lattice

“ and the inspiration for the exact motivation of the stimuli was derived from experiments in which we use these networks for geometrical figures and we ask subjects to describe ...”

  • No input segmentation: process "infinite" input stream from speech recognizer, extending/truncating the translation lattice

in which we use F

SLIDE 15

Stream Decoding: Continuous Translation Lattice

“ and the inspiration for the exact motivation of the stimuli was derived from experiments in which we use these networks for geometrical figures and we ask subjects to describe ...”

  • No input segmentation: process "infinite" input stream from speech recognizer, extending/truncating the translation lattice

in which we use these networks for F

SLIDE 16

Stream Decoding: Continuous Translation Lattice

“ and the inspiration for the exact motivation of the stimuli was derived from experiments in which we use these networks for geometrical figures and we ask subjects to describe ...”

  • No input segmentation: process "infinite" input stream from speech recognizer, extending/truncating the translation lattice

use these networks for F
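The extend/truncate behavior animated across these slides can be sketched as a sliding source-side window. This is a simplification we introduce for illustration: the real system extends and truncates a translation lattice, not just the word buffer shown here.

```python
class StreamWindow:
    """Sliding source-side window for stream decoding (simplified sketch)."""

    def __init__(self):
        self.window = []

    def extend(self, word):
        """A new word from the speech recognizer extends the window."""
        self.window.append(word)

    def truncate(self, n_translated):
        """Drop the prefix whose translation has been committed to output."""
        committed, self.window = self.window[:n_translated], self.window[n_translated:]
        return committed

w = StreamWindow()
for word in "in which we use these networks for".split():
    w.extend(word)
print(w.truncate(3))  # → ['in', 'which', 'we']
print(w.window)       # → ['use', 'these', 'networks', 'for']
```

The two prints mirror the transition between the last two slides: once the translation of "in which we" is committed, the window shrinks to "use these networks for".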

SLIDE 17

Stream Decoding: Asynchronous Input and Output

  • Each incoming source word from the recognizer triggers a new search through the current translation lattice
  • Output of the resulting best hypothesis is partially or completely delayed, until either a time-out occurs or new input arrives, which leads to lattice expansion and a new search
  • This creates a sliding window during which translation output lags the incoming source stream

use these networks for F
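The trigger-and-time-out behavior described above can be sketched as a loop over a word queue. The queue, the stubbed `search`, and the time-out value are our assumptions for illustration; the real search expands and rescores the translation lattice.

```python
import queue

def stream_decode(asr_queue, search, flush, timeout=1.0):
    """Each incoming source word triggers a new search; if no word arrives
    within `timeout` seconds, the delayed best hypothesis is flushed."""
    best = None
    while True:
        try:
            word = asr_queue.get(timeout=timeout)
        except queue.Empty:
            if best is not None:
                flush(best)              # time-out: emit pending output
                best = None
            continue
        if word is None:                 # end-of-stream sentinel
            break
        best = search(word)              # lattice expansion + new search
    if best is not None:
        flush(best)                      # emit whatever is still pending

# Demo with a stubbed search that just tags the latest word.
q, emitted = queue.Queue(), []
for w in ["use", "these", "networks"]:
    q.put(w)
q.put(None)
stream_decode(q, search=lambda w: f"best-through-{w}", flush=emitted.append,
              timeout=0.01)
print(emitted)  # → ['best-through-networks']
```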

SLIDE 18

Stream Decoding: Output Segmentation

  • Decide which part of the current best translation hypothesis to output, if any at all:

    - Minimum Latency Lmin: the translation covering the last Lmin untranslated source words received from the speech recognizer at any point is never output (except for time-outs)
    - Maximum Latency Lmax: when the latency reaches Lmax source words, translation output covering the source words exceeding this value is forced

[Diagram: Lmin and Lmax windows over the source stream "in which we use these networks for"]

SLIDE 19

Stream Decoding: Output Segmentation

  • Backtrace the hypothesis until Lmin source words have been passed
  • If the hypothesis reached contains reordering gaps, continue backtracing until a state with no open reorderings is found
  • If no such state can be found, perform a new restricted search that only expands hypotheses which have no open reorderings at the node where the maximum latency would be exceeded

[Diagram: Lmin and Lmax windows over the source stream]
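The backtrace rule above can be sketched over a simplified hypothesis chain. The state representation, field names, and example numbers are illustrative assumptions; a real decoder would walk back-pointers in the lattice.

```python
def choose_output_state(hypothesis_chain, total_source_words, l_min):
    """Backtrace the chain (newest state first): skip states inside the
    Lmin holdback window, then return the first state with no open
    reorderings. Returns None if none exists, in which case the decoder
    would fall back to the restricted search described on the slide.

    hypothesis_chain: list of states, oldest first, each a dict with
    'covered' (source words covered so far) and 'open_reorderings' (bool).
    """
    for state in reversed(hypothesis_chain):
        if state["covered"] > total_source_words - l_min:
            continue                  # still within the Lmin holdback
        if not state["open_reorderings"]:
            return state              # safe: no reordering gap spans it
    return None

chain = [
    {"covered": 2, "open_reorderings": False},
    {"covered": 4, "open_reorderings": True},
    {"covered": 5, "open_reorderings": False},
    {"covered": 7, "open_reorderings": True},
]
print(choose_output_state(chain, total_source_words=8, l_min=2)["covered"])  # → 5
```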

SLIDE 20

Stream Decoding Performance under Latency Constraint

[Figure: BLEU [%] as a function of segment length (1 to 10K words); curves: Fixed Segment Length, Keeping LM State, Acoustic Features, Stream Decoding]

Lmin and Lmax chosen to optimize translation quality

SLIDE 21

Choosing optimal parameter values for Lmin and Lmax

[Figure: BLEU [%] as a function of minimum latency (1 to 9) and maximum latency (1 to 10) in source words]

SLIDE 22

Summary

  • Current system for simultaneous translation of German lectures to English combines state-of-the-art ASR and SMT components
  • ASR system modified to handle German compounds, and English terms and expressions embedded in German lectures
  • SMT system uses additional compound splitting and model adaptation to the topic and style of lectures
  • Experiments with Stream Decoding to reduce latencies of the overall system
  • Generated translation output provides a good idea of what the German lecturer said
  • Major challenge for the future is better addressing long-range word reordering requirements between German and English