1
Simultaneous Speech Translation
Simultaneous Speech Translation Graham Neubig Nara Institute of - - PowerPoint PPT Presentation
Simultaneous Speech Translation Simultaneous Speech Translation Graham Neubig Nara Institute of Science and Technology (NAIST) 5/18/2015 Joint Work With: Satoshi Nakamura, Tomoki Toda, Sakriani Sakti, Tomoki Fujita, Hiroaki Shimizu,
1
Simultaneous Speech Translation
2
Simultaneous Speech Translation
3
Simultaneous Speech Translation
4
Simultaneous Speech Translation
5
Simultaneous Speech Translation
6
Simultaneous Speech Translation
7
Simultaneous Speech Translation
8
Simultaneous Speech Translation
subj prep prep Translate after the first prepositional phrase completes!
9
Simultaneous Speech Translation
10
Simultaneous Speech Translation
Classifier
Classifier
Classifier
11
Simultaneous Speech Translation
12
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
– 14-ref BLEU for BTEC, 1-ref BLEU for News – Manually-graded acceptability
Simultaneous Speech Translation
Simultaneous Speech Translation
32
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
Simultaneous Speech Translation
36
Simultaneous Speech Translation
37
Simultaneous Speech Translation
Pronoun + Verb Noun + Conjunction Determiner + Noun
38
Simultaneous Speech Translation
39
Simultaneous Speech Translation
40
Simultaneous Speech Translation
41
Simultaneous Speech Translation
Predict!
42
Simultaneous Speech Translation
43
Simultaneous Speech Translation
44
Simultaneous Speech Translation
45
Simultaneous Speech Translation
46
Simultaneous Speech Translation
47
Simultaneous Speech Translation
Verb Phrase
48
Simultaneous Speech Translation
for syntactically incorrect translation units
Translation unit Additional syntactic constituent
without additional constituent with it
49
Simultaneous Speech Translation
this is a pen this is a pen Segmentation これです ペン Retrieve words from speech Make groups
↓ Translation units Translate each translation unit respectively Translation Output ASR
50
Simultaneous Speech Translation
this is NP this is a pen this is a pen これは NP です これはペンです this is a pen Waiting this is Proposed method 2 Proposed method 1 Segmentation ASR Segmentation Translation Parsing Output
51
Simultaneous Speech Translation
Word:R1=I POS:R1=NN Word:R1-2=I,min. POS:R1-2=NN,NNS ... ROOT=PP ROOT-L=IN ROOT-R=NP ...
until 'nil' is generated
Translation unit
classification
constituents into the tail
52
Simultaneous Speech Translation
This is a pen DT VBZ NN NP VP NP S DT This is DT VBZ NP VP NP S is a pen VBZ NN NP VP DT is a VBZ NN NP VP DT
is a pen [nil] is a [NN] [nil] This is [NP] [nil]
53
Simultaneous Speech Translation
– e.g. English→Japanese
This is a pen DT VBZ NN NP VP NP S DT これ は ペン で す Parse MT
54
Simultaneous Speech Translation
– e.g. English→Japanese
This is DT VBZ
VP NP S これ は NP で す Parse MT
55
Simultaneous Speech Translation
56
Simultaneous Speech Translation
Correct Result!
57
Simultaneous Speech Translation
TED [WIT3]
English → 日本語
Stanford Tokenizer, KyTea
Ckylark [Oda+2015]
Moses(PBMT), Travatar(T2S)
BLEU ・ RIBES
n-words, GreedySeg[Oda+2014]
PBMT (Moses)
T2S-MT (Travatar) without constjtuent predictjon
T2S-MT (Travatar) with constjtuent predictjon
T2S-MT (Travatar) with constjtuent predictjon & waitjng
58
Simultaneous Speech Translation
2 4 6 8 10 12 14 16 18 0.07 0.08 0.09 0.1 0.11 0.12 0.13 0.14 0.15 T2S T2S-tag T2S-wait PBMT
59
Simultaneous Speech Translation
2 4 6 8 10 12 14 16 18 0.07 0.08 0.09 0.1 0.11 0.12 0.13 0.14 0.15 T2S T2S-tag T2S-wait PBMT
PBMT T2S Accuracies are reversed
mean #words
60
Simultaneous Speech Translation
2 4 6 8 10 12 14 16 18 0.07 0.08 0.09 0.1 0.11 0.12 0.13 0.14 0.15 T2S T2S-tag T2S-wait PBMT
T2S-wait T2S-tag Keep accuracy even many segmentation
mean #words
61
Simultaneous Speech Translation
62
Simultaneous Speech Translation
63
Simultaneous Speech Translation
64
Simultaneous Speech Translation
65
Simultaneous Speech Translation
66
Simultaneous Speech Translation
67
Simultaneous Speech Translation
68
Simultaneous Speech Translation
System Accuracy
more / reasonable / is there a hotel ? / more / reasonable / is there a hotel ? / do you have a more reasonable hotel ? / do you have a more reasonable hotel ? / 原文:「もっと 手頃 な ホテル は あり ませ ん か」 原文:「もっと 手頃 な ホテル は あり ませ ん か」 Don't split Split Finely
Low Delay Accuracy High Low Score
69
Simultaneous Speech Translation
High Low
Accuracy Accuracy Accuracy Delay Delay Delay
70
Simultaneous Speech Translation
Evaluated Data Evaluated Data Accuracy Accuracy Delay Delay
Movies with various delays and accuracies
71
Simultaneous Speech Translation
System A System B System C System D System E Output A Output B Output C Output D Output E
4 1 3 2 5
72
Simultaneous Speech Translation
Weight vector Features useful in evaluation (i.e., delay and accuracy)
73
Simultaneous Speech Translation
Video 20 Types 20-30 Seconds Delay 7 Types 0,1,2,3,5,7,10 Seconds Subjects 15 Japanese speakers Method Ranking 1-3
74
Simultaneous Speech Translation
Speech Output Subtitle Output
5 4 3 2 1 2 4 6 8 10
Delay (s)
5 4 3 2 1 2 4 6 8 10
Delay (s) 5 Level Acceptability 5 Level Acceptability
75
Simultaneous Speech Translation
76
Simultaneous Speech Translation