Bridging the Language Divide Alex Waibel and the InterACT Team - - PowerPoint PPT Presentation

bridging the language divide
SMART_READER_LITE
LIVE PREVIEW

Bridging the Language Divide Alex Waibel and the InterACT Team - - PowerPoint PPT Presentation

Bridging the Language Divide Alex Waibel and the InterACT Team Carnegie Mellon University Karlsruhe Institute of Technology alex@waibel.com waibel@cs.cmu.edu waibel@kit.edu Waibel, A. - Bridging the Language Divide Everyone Speaks


slide-1
SLIDE 1

Waibel, A. - Bridging the Language Divide

Bridging the Language Divide

Alex Waibel and the InterACT Team Carnegie Mellon University Karlsruhe Institute of Technology

alex@waibel.com waibel@cs.cmu.edu waibel@kit.edu

slide-2
SLIDE 2
slide-3
SLIDE 3

Waibel, A. - Bridging the Language Divide

“Everyone Speaks English”… ???

0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%

English language knowledge (not mother tongue)

In Europe:

slide-4
SLIDE 4

Human Effort

slide-5
SLIDE 5

Waibel, A. - Bridging the Language Divide

  • German is the most widely-spoken first language in the

EU (~100 million speakers)

  • Most Germans speak at least two languages (English,

French, and Russian are most common)

  • Recognized minority languages:

– Danish – Plattdeutsch – Sorbian – Romany – Frisian

Languages in Germany

5

slide-6
SLIDE 6

Waibel, A. - Bridging the Language Divide

  • Germany has one official language (German)
  • Real life is something else:

– Immigration – Tourism – Trade and commerce – Regional development, governance, and cooperation

  • Mobility and traffic
  • Energy and climate change
  • Environment and natural resources

– Cross-border legal issues (e.g. marriage, birth, contracts)

Isn’t Germany monolingual?

6

slide-7
SLIDE 7

Waibel, A. - Bridging the Language Divide

Neighboring languages

7 Polish Czech French Dutch Danish

slide-8
SLIDE 8

Refugee Crisis 2015

slide-9
SLIDE 9

Refugee Crisis 2015

Germany is the second-most popular immigration destination (after the US)

20% of residents in Germany have some roots outside Germany 6.4 million come from outside the EU

slide-10
SLIDE 10

New Challenges

slide-11
SLIDE 11

New Challenges

Major immigrant languages have included:

Turkish (>2 million speakers) Kurdish Polish Balkan languages Russian ….

slide-12
SLIDE 12

New Challenges

Major immigrant languages have included:

Turkish (>2 million speakers) Kurdish Polish Balkan languages Russian ….

slide-13
SLIDE 13

Refugee Registration

slide-14
SLIDE 14

Communication

Effective Communication is not only Text, But:

– Speech – Images – Ill-formed Text “lol-jah I want hr to be like dat…”, Hppyyyy BD, CU, LMK

….what is he saying?

你们的评估准则是什么

slide-15
SLIDE 15

The daunting challenge requires innovative solutions

slide-16
SLIDE 16

An Interpreting Machine

To Build a Language Communicator

– 6 Component-Engines: Automatic Speech Recognition, Machine Translation, and Text-to-Speech Synthesis – Each is in Principle Language Independent, but Requires Language Dependent Models – Models are Automatically Trained but Require Large Corpora – Certain Language Dependent Challenges still Persist

slide-17
SLIDE 17

First Speech Translation VideoCall ‘91-92

  • 1992 – C-STAR Consortium for Speech Translation Advanced Research
  • 1993 – Public C-STAR Demo, ATR-CMU-UKA-Siemens
slide-18
SLIDE 18

First Feasibility Demo

  • 1991 – First Public Demonstration of Speech

July 27, 1991 – UKA, CMU, ATR

slide-19
SLIDE 19

Mobile Consecutive Interpretation Technologies for Cross-Lingual Dialog

slide-20
SLIDE 20

2009

slide-21
SLIDE 21
slide-22
SLIDE 22
slide-23
SLIDE 23

Jibbigo on Apple Commercials

slide-24
SLIDE 24

Humanitarian Deployment

slide-25
SLIDE 25
slide-26
SLIDE 26

Cobra Gold’11

Thailand

slide-27
SLIDE 27
slide-28
SLIDE 28
slide-29
SLIDE 29

Cambodia

slide-30
SLIDE 30
slide-31
SLIDE 31

San Jose , Honduras

slide-32
SLIDE 32
slide-33
SLIDE 33

Simultaneous Interpretation Domain Unlimited Translation

  • f Monolingual Monologues
slide-34
SLIDE 34

Domain Unlimited

Domain Unlimited Translators for:

– TV/Radio Broadcast Translation – Translation of Lectures and Speeches – Parliamentary Speeches (UN, EU,..) – Telephone Conversations – Meeting Translation

你们的评估准则是什么

slide-35
SLIDE 35

End-to-End Speech Translation

slide-36
SLIDE 36

www.eu-bridge.eu

27.10.2015

Text für Fußzeile

Alex Waibel / EU-BRIDGE Overview

The work leading to these results has received funding from the European Union under grant agreement n° 287658

EU-BRIDGE – Bridges across the Language Divide

slide-37
SLIDE 37

www.eu-bridge.eu

27.10.2015

Text für Fußzeile

Alex Waibel / EU-BRIDGE Overview

EU-BRIDGE Partners

slide-38
SLIDE 38

www.eu-bridge.eu

27.10.2015

Text für Fußzeile

Alex Waibel / EU-BRIDGE Overview

ASR MT

Use Case 2

Engines Services Use Cases

Language Service

Customization, Adaptation Develop and Insert Improved Technology Language Services for User and Developer Communities

slide-39
SLIDE 39

Subtitling: BBC Weatherview

slide-40
SLIDE 40

Subtitling & Translation: Euro-News

Euronews Language ID + multilingual ASR + MT 8 Euronews languages

slide-41
SLIDE 41

University Lectures

êß*0vúbØi∫BA¬pysUêÍ}hÿ5 ≈ƒÄ<„y‡ëŒkû¢OFˇØ∏kô#å ¯«Zeû

slide-42
SLIDE 42
slide-43
SLIDE 43

Lecture Translation

slide-44
SLIDE 44
slide-45
SLIDE 45

Lecture Transcription/Translation at KIT

  • Speech more Spontaneous than TED
  • Real-Time Requirement
  • Specialist Vocabularies
slide-46
SLIDE 46

Lecture Translator in Karlsruhe

slide-47
SLIDE 47

Lecture Translation E->F

slide-48
SLIDE 48

Lecture Translation G->E

slide-49
SLIDE 49
  • Translation of Power Point Slides
  • Presentation by Sub-Titles

Tools for Students

slide-50
SLIDE 50

Can Tech Support Human Interpretation?

slide-51
SLIDE 51

EP Rectors’ Conferences Nov.’12-’14

slide-52
SLIDE 52

EP Rectors’ Conferences Nov.’12-’14

  • Demonstrating automatic real-time lecture interpretation
  • University Presidents; Interpretation Training & Services
  • Promising but Controversial
slide-53
SLIDE 53

Three Use Cases:

– Terminology Support – Named Entity Support – Interpreter’s ‘Cruise Control’

Human-Machine Symbiosis

slide-54
SLIDE 54

Voting Sessions

Observations: Interpreting Voting Sessions is…

– Boring and Repetitive – Still Stressful, and Demanding – Many Numbers and Named Entities

slide-55
SLIDE 55

Field Test at the EP (Dec.14)

slide-56
SLIDE 56

Interactive Systems Labs

Why is this so Hard ?

Language is Ambiguous at All Levels:

– Semantics:

  • The Spirit is Willing but the Flesh is Weak
  •  The Vodka is Good but the Meat is Rotten

– Syntax:

  • Time Flies Like an Arrow  6 Different Parses

– Phonetics:

  • This Machine Can Recognize Speech 

This Machine Can Wrack a Nice Beach

  • Give me a New Display  Give me a Nudist Play
slide-57
SLIDE 57

Why is German so Hard?

  • German has some particularly difficult peculiarities:

– Wordorder: Ich schlage Ihnen einen Termin für nächste Woche in meinem Büro am Adenauerring in Karlsruhe, in dem ….. vor.  I propose [hit?] a meeting for next week at my office in Karlsruhe on the Adenauerring… – Inflections and Agreement: Zu der nächsten wichtigen interessanten Vorlesung – Compounds: Worterkennungsfehlerrate  Word Recognition Error Rate

slide-58
SLIDE 58

Compounding

Die Fehlerstromschutzschalterprüfung Die Wirtschaftsdelegationsmitglieder Die Bankwirtschaftsfreigabeerklärung Die Lehrverpflichtungserklärungen Die Schiffskommunalschuldverschreibungen Die Vorkaufsrechtverzichtserklärung Das Mehrzweckkirschentkerngerät Die Gemeindegrundsteuerveranlagung Die Nummernschildbedruckungsmaschine Der Mehrkornroggenvollkornbrotmehlzulieferer Die Verkehrsinfrastrukturfinanzierungsgesellschaft Die Feuerwehrrettungshubschraubernotlandeplatzaufseherin Das Rindfleischetikettierungsüberwachungsaufgabenübertragungsgesetz

slide-59
SLIDE 59

Compounding

Zentraleuropa: Zentral-Europa  Central Europe Zentrale-Ur-Opa  Headquarter-Great-Grandpa Dramatisch: drama-t-isch  dramatic drama-tisch  drama table Asiatisch: asia-t-isch  asian asia-tich  asia table

slide-60
SLIDE 60

Interpreting Language

„Ich freue mich, dass Sie heute so zahlreich....“  you, she, they ? „If the baby does not like the milk, boil it“  es, sie ?

slide-61
SLIDE 61

Words, Words, Words….

  • Technical Terms & Special Usage

– epstral-Koeffizienten, Wälzlagerungen  Roller Bearings – Klausur  Final Exam (not Retreat), Vorzeichen  Sign (not Omen)

  • Formulas:

– Eff von Ix  f(x)

  • Foreign Words in a German Lecture

– Computer Science- English Expressions – “Cloud”, “iPhone”, “iPad”, “Laser”

  • Declinations and Compounding incl. foreign Words

– Web-ge-casted, down-ge-loaded – Cloudbasierter Webcastzugriff

slide-62
SLIDE 62

Scientific Challenge

Language Problems can only be Conquered, if Machines Embrace, Represent, Process:

– Ambiguity: Scores, Statistics, Neural Activations, .. – Learning: Build Models, Extract Knowledge from Human Data & Interaction, Automatically

 Performance Depends on Data & Computing

slide-63
SLIDE 63

Neural Nets: Bigger, Deeper, Faster (1987) (1989) (2013)

TDNN: Shift-Invariance, Waibel ‘87 Modular (deep) TDNN: Waibel ’87 Waibel et al. Babel, 2013

Weights: ~6,000 ~40,0000 ~33,000,000 TrnData[hrs]: ~0.1 ~1 ~1,000 Time[weeks] ~1 ~1 ~1

slide-64
SLIDE 64

English Text Copora

200 400 600 800 1000 1200 1400 1600 2007 2008 2009 2010 2011 2012

News Shuffle Size

Million Words

  • Computer MT or ASR systems train on >> 1GWords

– News Shuffle, GigaWord, Europarl, VideoLectures, …

  • Human speaks 0.5 GigaWords in a Lifetime!!
slide-65
SLIDE 65

The Data Challenge

  • Machine Learning + Massive Data

Lead to Better Performance

  • Is the problem too hard? Is it too easy?

Already done? Google Translate?

  • Effective Language Solutions

– Not only from/to English, but from/to German, … – Minority Languages and Regional Dialects – Need targeted solutions in domain/application – Privacy and Security – Dissemination, not only Assimilation

  • European Language Solutions

– Language (and technology) must be cultivated and treasured – Data Volume and Access Key Challenge

slide-66
SLIDE 66

Conclusion

Communication between the people of the world

– Bridging the Linguistic Divide – Technology can already make helpful contributions – Methods: Machine Learning from Data Adaptation, Error Recovery, Learning, Forgetting – User Interaction, Appropriate Interfaces – More Data, more Robust Performance – Better Language Portability – Integration into Services

slide-67
SLIDE 67