Bi-directional talker-listener Source Environmental / Receiver - PowerPoint PPT Presentation

Speech communication in real-world settings typically involves several sources of adverse conditions Speaker Environment Listener Bi-directional talker-listener Source Environmental / Receiver limitations: adaptation across a language barrier degradation: transmission degradation: • Peripheral deficiency • Conversational speech • Incomplete language model • Accented speech • Primarily energetic Ann Bradlow • Impaired language model • Disordered speech masking (e.g. broadband access /use Department of Linguistics noise) • Energetic & informational • Cognitive load Northwestern University masking (e.g. background speech) Speech communication across a language barrier (Mattys, Davis, Bradlow and Scott, Language & Cognitive Processing , SI on Speech Recognition in Adverse conditions, 2012.) Speech communication in real-world settings typically Why is foreign-accented speech hard to understand? involves several sources of adverse conditions Deviation of the signal from the native talker norm/target Speaker Environment Listener The children dropped the bag. Source Environmental / Receiver limitations: Native- degradation: transmission accented degradation: • Conversational speech • Peripheral deficiency • Incomplete language • Primarily energetic • Accented speech masking (e.g. broadband model • Disordered speech noise) • Impaired language Chinese- • Energetic & informational model access /use accented masking (e.g. background • Cognitive load speech) Speech communication across a language barrier • a challenge The foreign-accented sentence: • an opportunity for innovation ‣ is ~30% longer overall (lots of pauses, less fluent) ‣ exhibits different segmental/sub-segmental timing relations ‣ etc. (Mattys, Davis, Bradlow and Scott, Language & Cognitive Processing , SI on Speech Recognition in Adverse conditions, 2012.) 1

Systematicity of foreign-accented speech ”deviations„ Systematicity of foreign-accented speech ”deviations„ (2) L2 typological peculiarities => Accent-independent adaptation (1) L1-L2 interactions => Talker-independent adaptation Adaptation to foreign-accented speech by talkers from a Adaptation to an accent as it extends across a group of variety of native language backgrounds foreign-accented talkers from the same native language background Slovak Slovak English English Mandarin Mandarin Adaptation to foreign-accented speech L1-L2 interactions => Talker-independent adaptation Adaptation to an accent as it extends across a group of foreign- Study 1: Adaptation to systematic deviations of foreign-accented accented talkers from the same native language background speech following exposure to stimuli that vary along the to-be- learned dimension Test Training 1 Training 2 • Talker-independent adaptation Adaptation to an accent as it extends across a group of foreign-accented talkers from the same native language background • Accent-independent adaptation A. Chinese-accented test talker Adaptation to foreign-accented speech by talkers from a variety of native B. Multiple Chinese-accented talkers language backgrounds C. Single Chinese-accented talker Chinese-accented Study 2 : Adaptation to foreign-accented speech in response to D. Multiple native-accented talkers (in white noise, +5 dB SNR) variation in the training task Slovakian-accented E. Untrained controls • Does perceptual learning for foreign-accented speech require (in white noise, +5 dB SNR) active performance of a sentence recognition task? Bradlow and Bent, 2008 . See also Clarke & Garrett, 2004; Sidaras, Alexander & Nygaard, 2009. 2

L2 typological peculiarities => Accent-independent adaptation Talker-independent adaptation to a foreign-accent Adaptation to foreign-accented speech by talkers from a variety of native language backgrounds Test Training 1 Training 2 A. Chinese-accented test talker B. Multiple Chinese-accented talkers C. Single Chinese-accented talker Chinese-accented D. Multiple native-accented talkers (in white noise, +5 dB SNR) E. Multiple accents (Chinese, Romanian, Thai, Hindi, Korean) Slovakian-accented (in white noise, +5 dB SNR) F. Untrained controls Adaptation to foreign-accented speech Accent-independent adaptation to a foreign accent Post test 1: Chinese-accented talker Multiple accent training : 100 Study 1: Adaptation to systematic deviations of foreign-accented ‣ Chinese ( � test talker) 90 speech following exposure to stimuli that vary along the to-be- ‣ Romanian Percent Correct learned dimension ‣ Thai 80 ‣ Hindi 70 • Talker-independent adaptation ‣ Korean 60 Adaptation to an accent as it extends across a group of foreign-accented talkers from the same native language background 50 • Accent-independent adaptation Multi- Multi- Test talker Single Native Untrained Accent talker talker talker Adaptation to foreign-accented speech by talkers from a variety of native 100 Post test 2: Slovakian-accented talker language backgrounds 90 Percent Correct Study 2 : Adaptation to foreign-accented speech in response to 80 variation in the training task 70 60 • Does perceptual learning for foreign-accented speech require active performance of a sentence recognition task? 50 Multi- Multi- Test talker Single Native Untrained Accent talker talker talker Baese-Berk, Bradlow & Wright, 2013. 3

Auditory perceptual learning with a combination of Auditory perceptual learning with a combination of active task performance and passive stimulus exposure active task performance and passive stimulus exposure worse worse � Pretest � Pretest 16 16 Frequency-discrimination Frequency-discrimination � Post-test � Post-test threshold (Hz) threshold (Hz) 10 10 6 6 better better Frequency Training Frequency Training Frequency Training Control Control Frequency Training + (untrained) (untrained) + Stimulus Exposure Stimulus Exposure Wright, B.A., Sabin, A.T., Zhang, Y., Marrone, N., & Fitzgerald, M.B. (2010), J. Neuroscience. Wright, B.A., Sabin, A.T., Zhang, Y., Marrone, N., & Fitzgerald, M.B. (2010), J. Neuroscience. Adaptation to foreign-accented speech with a combination of Post-test 1: Chinese-accented Talker Training: Multi-talker 100 active task performance and passive stimulus exposure 90 Test: Mandarin-accented Talker 80 70 60 50 Multi-Accent Multi-talker Test talker Single talker Native talker Untrained All Active Active Training Active+passive Passive Exposure All Passive Passive Task (Silence) Short Active Post-Test “Passive” task: Active+Passive training results in as much learning as All-Active training. 4

Adaptation to foreign-accented speech Talker-listener interaction: Spontaneous conversational patterns across a language barrier • Systematic deviations of foreign-accented speech allow highly The Diapix task (dialogue-based picture matching) generalized perceptual learning with exposure to appropriately ‣ A ”spot-the-difference„ game with 2 pictures and 2 participants. variable training stimuli. ‣ Without seeing each other‚s picture, participants work together to find differences. ‣ Elicits a wide range of utterance types (questions, declaratives, exclamations etc.). • Adaptation to foreign accents can occur in response to a ‣ Elicits connected speech from both participants without predetermined roles. combination of active performance of a sentence recognition task and passive listening situations. Picture B Picture A • Perceptual flexibility underlying perceptual adaptation to foreign- accented speech may eventually lead to parallel adaptations in speech production. => A link between individual-level adaptation to variable speech input and population-level, contact-induced sound change. Communicative efficiency, phonetic convergence, Communicative efficiency and language distance and language distance Task accuracy Time to complete the diapix task Communicative efficiency Experimenter imposed time limit • Task completion time • Type-to-token ratio Phonetic convergence • Talker similarity judgments at the beginning versus at the end of a conversation Language distance Close Far Close Far NN 1 -NN 1 N 1 -N 2 N-NN NN 1 -NN 2 N 1 -N 1 N 1 -N 2 N-NN NN 1 -NN 1 NN 1 -NN 2 N 1 -N 1 Efficiency decreases with increasing language distance. Van Engen, Baese-Berk, Baker, Choi, Kim & Bradlow, , 2010. See also Baker & Hazan, 2011; Hazan and Baker, 2011. 5

Bi-directional talker-listener Source Environmental / Receiver - PowerPoint PPT Presentation

Speech communication in real-world settings typically involves several sources of adverse conditions Speaker Environment Listener Bi-directional talker-listener Source Environmental / Receiver limitations: adaptation across a language

Seminar: Search and Optimization Directional Consistency Gabi R oger Universit at Basel

Directional recurrence, ergodicity, and weak mixing Ay se S ahin DePaul University June

Revamping the CallMonitor Listener EECS 4315 Franck van Breugel March 29, 2020 1/13

Blue A Sketch Model Review Blue A Blue A Smooth Passenger Bag Be Gone Talker Blue A

Socket Programming Instruction Sheet In this lab, our goal is to make two machines talk to each

The Listener as Speaker: Implications for Teaching Listening Henry D. Schlinger., Jr. California

? sync ref chosen as sync source by Listener Stream B: Presentation Stream C: timestamps

The meanings of indexical words What does a listener understand sustainable meanings of the

speaker change detection using fundamental frequency with application to multi-talker segmentation

Covariation of Stop Consonant Acoustics: Corpus Evidence and Implications for Talker Adaptation

A talker on Docker: How containers can make your work more reproducible, accessible, and ready

Written-out Talk Scripted Talker University of Careful Planning May 1, 1894 But vertical motion

Every Child a Mover! Every Child a Talker! Early Years & Childcare Service Being Active....

Daily maps of the Bi-directional Reflectance Distribution Function (BRDF) over the Siberian region

Constrained Directional Enhancement Filter Jean-Marc Valin, Steinar Midtskogen March 28, 2017

2.6 Gradients and Directional Derivatives Prof. Tesler Math 20C Fall 2018 Prof. Tesler 2.6

The State of Hooking into Drupal Track: Symfony The State of Hooking into Drupal who am I?

Welcome to your home church! The he V Val alue of e of Su Sufferi fering ng Ja Jame mes

Lecture 6: Vector Semantics and Word Embeddings Julia Hockenmaier juliahmr@illinois.edu 3324

GENERAL PERSPECTIVES ON GENERAL PERSPECTIVES ON LONG- -TERM SURVEY RESEARCH TERM SURVEY

CSE 7/5337: Information Retrieval and Web Search Web crawling and indexes (IIR 20) Michael

Neural representation of linguistic feature Neural representation of linguistic feature hierarchy

ADOBE CONNECT CHAT BOX If you have a question, please type it in the Chat Box at any time.

Application of iRODS to NIEHS Data Management Mike Conway, Deep Patel Office of Data Science

Sambuz

Useful Links

Newsletter

Mail Us

Bi-directional talker-listener Source Environmental / Receiver - PowerPoint PPT Presentation

Speech communication in real-world settings typically involves several sources of adverse conditions Speaker Environment Listener Bi-directional talker-listener Source Environmental / Receiver limitations: adaptation across a language

Seminar: Search and Optimization Directional Consistency Gabi R oger Universit at Basel

Directional recurrence, ergodicity, and weak mixing Ay se S ahin DePaul University June

Revamping the CallMonitor Listener EECS 4315 Franck van Breugel March 29, 2020 1/13

Blue A Sketch Model Review Blue A Blue A Smooth Passenger Bag Be Gone Talker Blue A

Socket Programming Instruction Sheet In this lab, our goal is to make two machines talk to each

The Listener as Speaker: Implications for Teaching Listening Henry D. Schlinger., Jr. California

? sync ref chosen as sync source by Listener Stream B: Presentation Stream C: timestamps

The meanings of indexical words What does a listener understand sustainable meanings of the

speaker change detection using fundamental frequency with application to multi-talker segmentation

Covariation of Stop Consonant Acoustics: Corpus Evidence and Implications for Talker Adaptation

A talker on Docker: How containers can make your work more reproducible, accessible, and ready

Written-out Talk Scripted Talker University of Careful Planning May 1, 1894 But vertical motion

Every Child a Mover! Every Child a Talker! Early Years &amp; Childcare Service Being Active....

Daily maps of the Bi-directional Reflectance Distribution Function (BRDF) over the Siberian region

Constrained Directional Enhancement Filter Jean-Marc Valin, Steinar Midtskogen March 28, 2017

2.6 Gradients and Directional Derivatives Prof. Tesler Math 20C Fall 2018 Prof. Tesler 2.6

The State of Hooking into Drupal Track: Symfony The State of Hooking into Drupal who am I?

Welcome to your home church! The he V Val alue of e of Su Sufferi fering ng Ja Jame mes

Lecture 6: Vector Semantics and Word Embeddings Julia Hockenmaier juliahmr@illinois.edu 3324

GENERAL PERSPECTIVES ON GENERAL PERSPECTIVES ON LONG- -TERM SURVEY RESEARCH TERM SURVEY

CSE 7/5337: Information Retrieval and Web Search Web crawling and indexes (IIR 20) Michael

Neural representation of linguistic feature Neural representation of linguistic feature hierarchy

ADOBE CONNECT CHAT BOX If you have a question, please type it in the Chat Box at any time.

Application of iRODS to NIEHS Data Management Mike Conway, Deep Patel Office of Data Science

Sambuz

Useful Links

Newsletter

Mail Us

Every Child a Mover! Every Child a Talker! Early Years & Childcare Service Being Active....