Simple, Lexicalized Choice of Translation Timing for Simultaneous - - PowerPoint PPT Presentation

simple lexicalized choice of translation timing for
SMART_READER_LITE
LIVE PREVIEW

Simple, Lexicalized Choice of Translation Timing for Simultaneous - - PowerPoint PPT Presentation

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation Tomoki Fujita, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi


slide-1
SLIDE 1

1

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Simple, Lexicalized Choice

  • f Translation Timing

for Simultaneous Speech Translation

Tomoki Fujita, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura Graduate School of Information Science, Nara Institute of Science and Technology

slide-2
SLIDE 2

2

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Background

slide-3
SLIDE 3

3

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Speech Translation Systems

  • Translate speech from source language to target

ASR こんにちは、駅はどこですか? MT Hello, where is the station? TTS

slide-4
SLIDE 4

4

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Problem: Delay

  • Wait for the whole utterance to end before translating

ASR こんにちは、駅はどこですか? MT Hello, where is the station? TTS

Delay

slide-5
SLIDE 5

5

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Solution: Divide into Smaller Chunks

ASR

こんにちは、

  • Choose appropriate timing to start translation

MT

駅は

MT

どこですか?

MT

Hello,

the station where is it? TTS TTS TTS

Delay: Reduced

slide-6
SLIDE 6

6

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Previous Work: Incremental Dependency Parsing/Manual Rules [Ryu+ 04]

  • Utilize knowledge of English/Japanese to derive rules

I went to the park with your brother

subj prep prep Translate after the first prepositional phrase completes!

私は公園に行きました あなたの弟と

MT MT

  • - Requires a bilingual linguist to design rules
  • - Requires an accurate incremental dependency parser
slide-7
SLIDE 7

7

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Previous Work: Division on Pauses [Fugen+ 08, Bangalore+ 12]

  • Simply divide on short pauses in the utterance

ASR

hello where is the station

  • - Cannot capture relationship between languages
  • - Result will greatly change with speech speed,

disfluencies

slide-8
SLIDE 8

8

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Proposed Method

  • Utilize the TM directly to choose translation timing
  • + Can be constructed automatically
  • + Uses information about the language pair
  • + Very simple to implement
  • Specifically:
  • Choose translation timing at the end of each phrase in

the phrase table

  • Utilize reordering probabilities to adjust granularity
  • Adapt the language model to the translation task
slide-9
SLIDE 9

9

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Preliminaries

slide-10
SLIDE 10

10

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Phrase Based Machine Translation

  • Divide the sentence into small phrases and translate

Today I will give a lecture on machine translation .

Today 今日は、 I will give を行います a lecture on の講義 machine translation 機械翻訳 . 。 Today 今日は、 I will give を行います a lecture on の講義 machine translation 機械翻訳 . 。

今日は、機械翻訳の講義を行います。

  • Score translations with translation model (TM),

reordering model (RM), and language model (LM)

slide-11
SLIDE 11

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Translation Model Creation

  • Perform automatic alignment of bitext
  • From aligned text, extract phrases for translation

the hotel front desk ホ テ 受 ルの付

ホテル の → hotel ホテル の → the hotel 受付 → front desk ホテルの受付 → hotel front desk ホテルの受付 → the hotel front desk

slide-12
SLIDE 12

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Lexicalized Reordering Model

  • Probabilistically models reorderings for increased

accuracy of translation

  • Given current phrase and next phrase:
  • “monotone” + “discontinuous right” = “right probability”

背 の 高い 男 the tall man

Monotone:

太郎 を 訪問 した visited Taro

Swap:

私 は 太郎 を 訪問した I visited Taro

Discontinuous Right: Discontinuous Left:

背 の 高い 男 を 訪問 した visited the tall man

slide-13
SLIDE 13

13

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Proposed Method

slide-14
SLIDE 14

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Method One: Choosing Translation Timing with Phrases

  • Input words one at a time from ASR
  • While words exist in phrase table, don't translate yet
slide-15
SLIDE 15

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Method One: Choosing Translation Timing with Phrases

  • Input words one at a time from ASR
  • While words exist in phrase table, don't translate yet

Input String hello where is the station Phrase Table hello→ こんにちは where→ どこ where is→ どこですか the→ その the station→ 駅

slide-16
SLIDE 16

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Method One: Choosing Translation Timing with Phrases

  • Input words one at a time from ASR
  • While words exist in phrase table, don't translate yet

Input String hello where is the station Phrase Table hello→ こんにちは where→ どこ where is→ どこですか the→ その the station→ 駅

slide-17
SLIDE 17

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Method One: Choosing Translation Timing with Phrases

  • Input words one at a time from ASR
  • While words exist in phrase table, don't translate yet

Input String hello where is the station

“hello” phrase exists ↓ wait

Phrase Table hello→ こんにちは where→ どこ where is→ どこですか the→ その the station→ 駅

slide-18
SLIDE 18

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Method One: Choosing Translation Timing with Phrases

  • Input words one at a time from ASR
  • While words exist in phrase table, don't translate yet

Input String hello where is the station

“hello” phrase exists ↓ wait

Phrase Table hello→ こんにちは where→ どこ where is→ どこですか the→ その the station→ 駅

slide-19
SLIDE 19

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Method One: Choosing Translation Timing with Phrases

  • Input words one at a time from ASR
  • While words exist in phrase table, don't translate yet

Input String hello where is the station

“hello” phrase exists ↓ wait “hello where” phrase missing ↓ translate “hello”

Phrase Table hello→ こんにちは where→ どこ where is→ どこですか the→ その the station→ 駅

slide-20
SLIDE 20

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Method One: Choosing Translation Timing with Phrases

  • Input words one at a time from ASR
  • While words exist in phrase table, don't translate yet

Input String hello where is the station

“hello” phrase exists ↓ wait “hello where” phrase missing ↓ translate “hello”

Phrase Table hello→ こんにちは where→ どこ where is→ どこですか the→ その the station→ 駅

slide-21
SLIDE 21

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Method One: Choosing Translation Timing with Phrases

  • Input words one at a time from ASR
  • While words exist in phrase table, don't translate yet

Input String hello where is the station

“hello” phrase exists ↓ wait “hello where” phrase missing ↓ translate “hello” “where is” phrase exists ↓ wait

Phrase Table hello→ こんにちは where→ どこ where is→ どこですか the→ その the station→ 駅

slide-22
SLIDE 22

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Method One: Choosing Translation Timing with Phrases

  • Input words one at a time from ASR
  • While words exist in phrase table, don't translate yet

Input String hello where is the station

“hello” phrase exists ↓ wait “hello where” phrase missing ↓ translate “hello” “where is” phrase exists ↓ wait

Phrase Table hello→ こんにちは where→ どこ where is→ どこですか the→ その the station→ 駅

slide-23
SLIDE 23

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Method One: Choosing Translation Timing with Phrases

  • Input words one at a time from ASR
  • While words exist in phrase table, don't translate yet

Input String hello where is the station

“hello” phrase exists ↓ wait “hello where” phrase missing ↓ translate “hello” “where is” phrase exists ↓ wait “where is the” phrase missing ↓ translate “where is”

Phrase Table hello→ こんにちは where→ どこ where is→ どこですか the→ その the station→ 駅

slide-24
SLIDE 24

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Method One: Choosing Translation Timing with Phrases

  • Input words one at a time from ASR
  • While words exist in phrase table, don't translate yet

Input String hello where is the station

“hello” phrase exists ↓ wait “hello where” phrase missing ↓ translate “hello” “where is” phrase exists ↓ wait “where is the” phrase missing ↓ translate “where is”

Phrase Table hello→ こんにちは where→ どこ where is→ どこですか the→ その the station→ 駅

slide-25
SLIDE 25

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Method One: Choosing Translation Timing with Phrases

  • Input words one at a time from ASR
  • While words exist in phrase table, don't translate yet

Input String hello where is the station

“hello” phrase exists ↓ wait “hello where” phrase missing ↓ translate “hello” “where is” phrase exists ↓ wait “where is the” phrase missing ↓ translate “where is” “the station” utterance ends ↓ translate “the station”

Phrase Table hello→ こんにちは where→ どこ where is→ どこですか the→ その the station→ 駅

slide-26
SLIDE 26

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Problem with Method One

  • Has the potential to degrade translation accuracy:

こんにちは 駅 は どこ ですか Hello, where is the station Normal phrase-based translation: こんにちは 駅 は どこ ですか Hello, where is it the station Translation with early timing:

slide-27
SLIDE 27

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Method Two: Adjusting Timing with Reordering Probabilities

  • First, temporarily choose strings according to method one
  • Next, if that phrase's right probability exceeds a

threshold, actually translate the words in the cache

  • Threshold 1.0 = traditional, 0.0 = method one

Example (threshold = 0.8): hello where is the station

“hello” phrase exists ↓ wait “hello where” phrase missing ↓ choose “hello” ↓ right probability is 0.9 > 0.8 ↓ translate “hello” “where is” phrase exists ↓ wait “where is the” phrase missing ↓ choose “where is” ↓ right probability is 0.6 < 0.8 ↓ do not translate yet “the station” utterance ends ↓ translate “where is the station”

slide-28
SLIDE 28

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Problem with Method Two

  • LMs are traditionally trained on sentences
  • This is not appropriate for translating shorter chunks
  • e.g.: The translator will try to “finish” sentences

こんにちは 駅 は どこ ですか Hello . Where is it . Where is the station ?

slide-29
SLIDE 29

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Method Three: Language Model Adaptation

  • Before learning the language model, split the training

data according to the same criterion

Traditional LM Training Hello, where is the station. My name is John. Hello, where is the station. My name is John. Proposed Method

LM Training Sentence Level LM Chunk Segmentation LM Training Chunk Level LM

slide-30
SLIDE 30

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Experiments

slide-31
SLIDE 31

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Experimental Setup

  • Four Types of Experiments:
  • Japanese-English BTEC Travel Conversation (ja-en)
  • Japanese-English BTEC with 11+ Words (ja-en 11+)
  • English-Japanese BTEC Travel Conversation (en-ja)
  • French-English WMT News (fr-en)
  • Evaluation Measures:
  • Accuracy

– 14-ref BLEU for BTEC, 1-ref BLEU for News – Manually-graded acceptability

  • Delay (Seconds)
slide-32
SLIDE 32

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Result One: Comparison Across Settings

2 4 6 8 10 12 14 10 20 30 40 50 60 70 80 en-ja ja-en ja-en (11+) fr-en Delay (Seconds) A c c u r a c y ( B L E U )

  • Delay decreases in all settings
  • Better delay/accuracy tradeoff for long sentences,

similar languages

t=0.0 t=1.0

slide-33
SLIDE 33

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

1 1.5 2 2.5 3 3.5 4 4.5 20 25 30 35 40 45 50 Proposed Pause Delay (Seconds) A c c u r a c y ( B L E U )

Result Two: Compare with Pause-based Segmentation

  • In faster settings proposed method best
  • In slower settings pause-based method best
slide-34
SLIDE 34

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Manual Evaluation

  • Decrease in manual evaluation as well, but less
  • bvious than evaluated by BLEU
slide-35
SLIDE 35

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Conclusion

slide-36
SLIDE 36

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Conclusion

  • Proposed method for choosing timing in speech

translation using phrase table and reordering model

  • Considers reordering tendencies across languages
  • Simple and language independent
  • Competitive accuracy
  • Future work:
  • Combination of prosodic, reordering, and syntactic cues
  • How do we evaluate translations?
slide-37
SLIDE 37

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Thank You!

slide-38
SLIDE 38

Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation

Example:

サーフィン に いい 場所 を 教え て ください サーフィン に | いい 場所 を 教え て ください please tell me a good surfing place for surfing | please tell me a good place