Pronunciation Extraction Through Cross-Lingual Word-to-Phoneme - PowerPoint PPT Presentation

Pronunciation Extraction Through Cross-Lingual Word-to-Phoneme Alignment Felix Stahlberg, Tim Schlippe , Stephan Vogel, Tanja Schultz SLSP 2013 – 1st International Conference on Statistical Language and Speech Processing Tarragona, Spain KIT – University of the State of Baden-Wuerttemberg and www.kit.edu National Research Center of the Helmholtz Association

Outline 1. Motivation 2. Word Segmentation 3. Word Pronunciation Extraction 4. Experiments 1. Corpus 2. Evaluation Measures 3. Which Translation Is Favorable? 4. Combining Multiple Translations 5. Analysis of the Results – Common errors 5. Conclusion and Future Work 2 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Scenario Say “I am sick.” in your mother tongue. /b/ /o/ /l/ /e/ /s/ /t/ /a/ /n/ /s/ /a/ /m/ /z/ /d/ /r/ /a/ /v/ /s/ /a/ /m/ Say “I am healthy.” in your mother tongue. • /s/ /a/ /m/ seems to be a word (meaning I am ) • /b/ /o/ /l/ /e/ /s/ /t/ /a/ /n/ seems to be a word (meaning sick ) • /z/ /d/ /r/ /a/ /v/ seems to be a word (meaning healthy ) 3 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Long Term Goal We obtain Transcribed audio data (in terms of IDs) /l/ /ae/ /ng/ /w/ /ah/ /jh/ /v/ /er/ /s/ /ae/ /n/ /d/ /th/ /ich/ /ng/ /k/ /s/ /f/ /er/ /y/ /uw/ 1 7 3 5 4 6 Pronunciation dictionary Train ASR System (future work) Language model 2 8 2 9 2 10 1 7 3 5 4 6 4 31-July-2013 Pronunciation Extraction Through Multilingual Word-to-Phoneme Alignment

Applications http://www.fotopedia.com/items/_avPIZmqM3w-6716j3F1J-U Dialects Speech processing for non- written and under-resourced languages 5 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Roadmap Need phonetic transcription of what is said Usually phoneme recognizer In this work: Perfect phonetic transcriptions Focus to define and evaluate steps for extracting a pronunciation dictionary from the phoneme sequences 6 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Roadmap How can we find word boundaries and segment phoneme sequences into word units? Inproved segmentation with cross-lingual information Alignment between word units in written translation and phoneme sequences of target language 7 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Word-Segmentation – Word-to-Phoneme Alignments Sentence: Sprache die für dich dichtet und denkt German (Source Language) English Phoneme (Target Language) sequence: l ae ng g w ah jh v er s ae n d th ih ng k s f er y uw Phoneme Recognizer Audio: (Besacier et. al., 2006) (Stüker and Waibel, 2008) (Stüker and Besacier, 2009) (Stahlberg et. al., 2012) 8 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Word-Segmentation – Results (Stahlberg et. al., 2012) http://code.google.com/p/pisa/ 9 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Roadmap 10 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Word-Pronunciation Extraction (Stahlberg et. al, 2013) 11 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Experiments – Corpus Parallel data from the Christian Bible (30.6k verses,14 written translations) Variety of linguistic approaches to Bible translation (dynamic equivalence, formal equivalence, and idiomatic translation) English as “under - resourced target language” (deeper insight in strengths and weaknesses of our algorithm)  ESV Bible “Perfect phoneme recognizer”: Replaced words in ESV Bible and removed word boundaries 12 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Evaluation Measures (1) Pronunciation Pronunciation 1 h e l o hello h e l o 2 f ih n ih sh t ih t world w o r l t 3 w o l t language l ae ng w ah jh 4 o r l t finished f ih n ih sh t 5 h a l o h w 13 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Evaluation Measures (2) Out-Of-Vocabulary Rate (OOV-Rate) Pronunciation Pronunciation 1 h e l o hello h e l o 2 f ih n ih sh t ih t world w o r l t 3 w o l t language l ae ng w ah jh 4 o r l t finished f ih n ih sh t 5 h a l o h w 14 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Evaluation Measures (3) Phoneme Error Rate (PER) Pronunciation Pronunciation 1 h e l o hello h e l o 2 f ih n ih sh t ih t world w o r l t 3 w o l t language l ae ng w ah jh 4 o r l t finished f ih n ih sh t 5 h a l o h w 15 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Evaluation Measures (4) Hypo/Ref Ratio Pronunciation Pronunciation 1 h e l o hello h e l o 2 f ih n ih sh t ih t world w o r l t 3 w o l t language l ae ng w ah jh 4 o r l t finished f ih n ih sh t 5 h a l o h w 16 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Which Translation Is Favorable? – Distribution of edit distances Number of # entries Phoneme Error Rate (PER) extracted vocabulary entries Distribution of the edit Edit distances of distances between the Number of extracted extracted vocabulary extracted pronunciations and vocabulary entries close to the nearest entry entries to the next real target language words in the reference dictionary for reference vocabulary all 14 source translations (<0.1 edit distance) entry 17 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Which Translation Is Favorable? – Impact of 4 factors to our evaluation measures ∆ vocabulary size: Difference between vocabulary size of the source translation and size of the ESV Bible ∆ average number of words per verse: Difference between average verse length in the source translation and in the ESV Bible ∆ average word frequency: Difference between the average number of word repetitions in the source translation and in the ESV Bible IBM-4 PPL: To measure the general correspondence of the translation to IBM- Model based alignment models, we run GIZA++ with default configuration at the word level and use the final perplexity of IBM- Model 4 18 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Which Translation Is Favorable? – Correlation of evaluation measures 19 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Combining multiple translations Concatenate pronunciations and remove homophones Evaluation measures over the number of combined source translations Combining all 14 translations results in a dictionary with only 7.9% OOV rate, But more than 9 of 10 dictionary entries are extracted unnecessarily (Hypo/Ref ratio 10.7:1) 20 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Common Errors (1) Off-by-one alignment errors Extracted (incorrectly) Correct f ih s t s (fists) z f ih s t s ih k s t f ih k s t (fixed) ih z r ey l (israel) ih z r ey l ah Context information may be helpful 21 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Common Errors (2) Different words with the same stem are merged together Extracted (incorrectly) Correct s ih d uw s ih t s ih d uw s t (seduced) or s ih d uw s i ng (seducing) ih k n aa l ih jh m ih k n aa l ih jh (acknowledge) or ih k n aa l ih jh m ah n t (acknowledgement) Clustering issue 22 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Common Errors (3) Missing word boundaries between words often occurring in the same context Extracted (incorrectly) Correct w er ih n d ih g n ah n t were indignant f ih n ih sh t ih t finished it Cross-lingual information of multiple languages may help 23 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Summary Speech processing in non-written and under-resourced languages or dialects Cross-lingual information helps to find word boundaries Proposed steps for extracting a pronunciation dictionary with word IDs from these segmentations and alignments Pronunciation quality is still not good enough for productive use Need better compensation for alignment and phoneme recognition errors when extracting pronunciations Initial approach for combining dictionaries from multiple translations drops OOV rate, but increases number of unnecessary entries 24 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Possible Next Steps Iterative extraction Better clustering Analysis for different cluster algorithms Add contextual information Use information from multiple source languages Integrate monolingual word and syllable segmentation Real phoneme recognizer How to bootstrap the phoneme recognizer? – maybe multilingual voting and adaptation techniques based on confidence score 25 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

¡Muchas gracias! ¡Moltes gràcies! 26 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

References 27 31-July-2013 Pronunciation Extraction Through Cross-lingual Word-to-Phoneme Alignment

Pronunciation Extraction Through Cross-Lingual Word-to-Phoneme - PowerPoint PPT Presentation

Pronunciation Extraction Through Cross-Lingual Word-to-Phoneme Alignment Felix Stahlberg, Tim Schlippe , Stephan Vogel, Tanja Schultz SLSP 2013 1st International Conference on Statistical Language and Speech Processing Tarragona, Spain KIT

uf: Minimizing the Coq Extraction TCB Eric Mullen , Stuart Pernsteiner, James Wilcox, Zachary

Books by Mark Hancock English Pronunciation in Use Intermediate (CUP) Pronunciation Games (CUP)

Using weekly group political presentation to enhance pronunciation - Tran Hong Le - Questions to

Pronunciation of Nouns in Pronunciation of Nouns in Text to Speech systems Text to Speech

Pronunciation Lexicon Background Outline Brief Introduction on Pronunciation Lexicon

Memory Memory Decoders M bits M bits RWM NVRWM ROM S 0 S 0 Word 0 Word 0 S 1 Word 1 Word

02 | 27 SOUTHERN CROSS 23.04 03 | 27 SOUTHERN CROSS 23.04 04 | 27 SOUTHERN CROSS 23.04 06

Bleaching Text: Abstract Features for Cross-lingual Gender Prediction. Rob van der Goot, Nikola

Cross-lingual Information Retrieval Pavel Pecina Institute of Formal and Applied Linguistics

Cross-Lingual Information Retrieval Language Technology I Language Technology I Crosslingual

WMT 2016 Shared Task on Cross-lingual Pronoun Prediction . Liane Guillou, Christian Hardmeier,

Cross-lingual NLP Sara Stymne Uppsala University Department of Linguistics and Philology

Probabilistic Pronunciation + N-gram Models CMSC 35100 Natural Language Processing April 15,

Soil Extraction Cell: An Alternative Soil Extraction Cell: An Alternative Method of Soil

Cross-lingual Predicate Cluster Acquisition to Improve Bilingual Event Extraction by Inductive

The Shadow of the Cross The Cross of Jesus part 1B The Shadow of the Cross Hebrews 10:1-14 The

Cross-Lingual Semantic Mapping of Authority Files Nadine Steinmetz, and Harald Sack November,

Fall Product Training _ _

Cross linguality and machine translation without bilingual data ith t bili l d t Enek

W ITH the widespread use of hands-free electronic gad- are mapped to a multilingual set using a

The Moment of Meaning The Moment of Meaning

Stacking With Auxiliary Features: Improved Ensembling for Natural Language and Vision Nazneen

Cross-Lingual Word Sense Disambiguation using WordNets and Context Mapping Priyank Jaini Ankit

Automatic Machine Translation Evaluation using Source Language Inputs and Cross-lingual Language

Sambuz

Useful Links

Newsletter

Mail Us

Pronunciation Extraction Through Cross-Lingual Word-to-Phoneme - PowerPoint PPT Presentation

Pronunciation Extraction Through Cross-Lingual Word-to-Phoneme Alignment Felix Stahlberg, Tim Schlippe , Stephan Vogel, Tanja Schultz SLSP 2013 1st International Conference on Statistical Language and Speech Processing Tarragona, Spain KIT

uf: Minimizing the Coq Extraction TCB Eric Mullen , Stuart Pernsteiner, James Wilcox, Zachary

Books by Mark Hancock English Pronunciation in Use Intermediate (CUP) Pronunciation Games (CUP)

Using weekly group political presentation to enhance pronunciation - Tran Hong Le - Questions to

Pronunciation of Nouns in Pronunciation of Nouns in Text to Speech systems Text to Speech

Pronunciation Lexicon Background Outline Brief Introduction on Pronunciation Lexicon

Memory Memory Decoders M bits M bits RWM NVRWM ROM S 0 S 0 Word 0 Word 0 S 1 Word 1 Word

02 | 27 SOUTHERN CROSS 23.04 03 | 27 SOUTHERN CROSS 23.04 04 | 27 SOUTHERN CROSS 23.04 06

Bleaching Text: Abstract Features for Cross-lingual Gender Prediction. Rob van der Goot, Nikola

Cross-lingual Information Retrieval Pavel Pecina Institute of Formal and Applied Linguistics

Cross-Lingual Information Retrieval Language Technology I Language Technology I Crosslingual

WMT 2016 Shared Task on Cross-lingual Pronoun Prediction . Liane Guillou, Christian Hardmeier,

Cross-lingual NLP Sara Stymne Uppsala University Department of Linguistics and Philology

Probabilistic Pronunciation + N-gram Models CMSC 35100 Natural Language Processing April 15,

Soil Extraction Cell: An Alternative Soil Extraction Cell: An Alternative Method of Soil

Cross-lingual Predicate Cluster Acquisition to Improve Bilingual Event Extraction by Inductive

The Shadow of the Cross The Cross of Jesus part 1B The Shadow of the Cross Hebrews 10:1-14 The

Cross-Lingual Semantic Mapping of Authority Files Nadine Steinmetz, and Harald Sack November,

Fall Product Training ___________________________________ ___________________________________

Cross linguality and machine translation without bilingual data ith t bili l d t Enek

W ITH the widespread use of hands-free electronic gad- are mapped to a multilingual set using a

The Moment of Meaning The Moment of Meaning

Stacking With Auxiliary Features: Improved Ensembling for Natural Language and Vision Nazneen

Cross-Lingual Word Sense Disambiguation using WordNets and Context Mapping Priyank Jaini Ankit

Automatic Machine Translation Evaluation using Source Language Inputs and Cross-lingual Language

Sambuz

Useful Links

Newsletter

Mail Us

Fall Product Training _ _