
Speech Synthesis and Perception with Envelope Cue



  1. Signals and Systems: Speech Synthesis and Perception with Envelope Cue

  2. Contents • Background • Implementation • Results and Discussion • Improvement

  3. Background | Part 1

  4. History of the Artificial Cochlea • 1748: First extra-auricular electrical stimulation. • 1905: Invention of an electrical stimulating system. • 1930: An electrode placed in the acoustic nerve produced a copy of the speech waveform.

  5. • 1961: The first true cochlear implant was implanted by the American otologist William "Bill" House. • 1984: The FDA allowed cochlear implants to be implanted in adults. • 2000: The implants were approved for infants over 12 months old.

  6. Implementation | Part 2

  7. Figure 1. The operation of a four-channel cochlear implant. Reprinted from "Introduction to cochlear implants," by P. C. Loizou, 1999, IEEE Engineering in Medicine and Biology Magazine, vol. 18, no. 1.

  8. synthesize.m: modulation (band = 8; order = 4)
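The slides do not reproduce `synthesize.m`, so the modulation step can only be sketched here. The following Python illustration (not the deck's MATLAB code) assumes that each band's envelope multiplies a sine carrier at that band's center frequency and that the modulated bands are then summed. The function name `tone_vocode`, the example envelopes, and the center frequencies are all hypothetical.

```python
import math

def tone_vocode(envelopes, center_freqs, fs):
    """Sum sine carriers, each amplitude-modulated by its band envelope.

    envelopes    : list of per-band envelope sequences (equal length)
    center_freqs : assumed carrier frequency in Hz for each band
    fs           : sampling rate in Hz
    """
    n_samples = len(envelopes[0])
    out = [0.0] * n_samples
    for env, fc in zip(envelopes, center_freqs):
        for i in range(n_samples):
            # Envelope scales the carrier sample by sample.
            out[i] += env[i] * math.sin(2.0 * math.pi * fc * i / fs)
    return out

# Hypothetical two-band example with constant envelopes.
fs = 16000
envelopes = [[1.0] * 160, [0.5] * 160]
center_freqs = [500.0, 2000.0]
y = tone_vocode(envelopes, center_freqs, fs)
```

In a full vocoder the envelopes would come from the band-pass and low-pass stages rather than being constants.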

  9. synthesize.m: 8 band-pass filters (order = 4)
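The slides give the number of band-pass filters but not their edge frequencies. As a rough illustration, the Python sketch below splits an assumed analysis range into logarithmically spaced bands. The 200 Hz to 7000 Hz range, the log spacing, and the name `band_edges` are assumptions for this example, not values taken from the deck.

```python
def band_edges(n_bands, f_lo=200.0, f_hi=7000.0):
    """Return n_bands + 1 logarithmically spaced band edges in Hz.

    f_lo and f_hi are assumed limits of the analysis range.
    """
    if n_bands < 1 or not 0.0 < f_lo < f_hi:
        raise ValueError("need n_bands >= 1 and 0 < f_lo < f_hi")
    ratio = (f_hi / f_lo) ** (1.0 / n_bands)  # constant edge-to-edge ratio
    return [f_lo * ratio ** k for k in range(n_bands + 1)]

edges = band_edges(8)
# (low, high) pass-band for each of the 8 band-pass filters
bands = list(zip(edges[:-1], edges[1:]))
```

Each `(low, high)` pair would then parameterize one order-4 band-pass filter. Other spacings, such as a cochlear frequency-position map, are equally plausible choices.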

  10. add_ssn.m: SNR = -5 dB
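The contents of `add_ssn.m` are not shown, but mixing noise at a target SNR usually comes down to scaling the noise so that the speech-to-noise power ratio matches the requested value. The Python sketch below shows that scaling under stated assumptions: a sine wave stands in for speech, white Gaussian noise stands in for speech-shaped noise (the spectral shaping is not modeled), and the names `power` and `mix_at_snr` are hypothetical.

```python
import math
import random

def power(x):
    """Mean-square power of a sequence."""
    return sum(v * v for v in x) / len(x)

def mix_at_snr(speech, noise, snr_db):
    """Scale noise so 10*log10(P_speech / P_noise) equals snr_db, then add it."""
    gain = math.sqrt(power(speech) / (power(noise) * 10.0 ** (snr_db / 10.0)))
    scaled = [gain * v for v in noise]
    return [s + n for s, n in zip(speech, scaled)], scaled

random.seed(0)
fs = 16000
speech = [math.sin(2.0 * math.pi * 440.0 * n / fs) for n in range(fs)]  # stand-in for speech
noise = [random.gauss(0.0, 1.0) for _ in range(fs)]  # white noise, not speech-shaped
noisy, scaled_noise = mix_at_snr(speech, noise, -5.0)
achieved_snr_db = 10.0 * math.log10(power(speech) / power(scaled_noise))
```

At -5 dB the noise carries roughly three times the power of the speech, which is why this condition is a demanding test of intelligibility.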

  11. GUI.m

  12. Results and Discussion | Part 3

  13. Task 1: Variation in Channel Number • Butterworth filters: order = 4 • f_cutoff = 50 Hz • N = 1, 2, 4, 8, 16, 20, 32

  14. Why is N limited? • Instability of filters • Interference between electrodes • Continuous interleaved sampling

  15. Task 2: Variation in Cut-off Frequency • Set the number of bands to N = 4. • Implement the tone vocoder while varying the LPF cut-off frequency. • Describe how the LPF cut-off frequency affects the intelligibility of the synthesized sentence.
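To make the role of the LPF cut-off concrete, here is a minimal Python sketch of envelope extraction by full-wave rectification followed by low-pass filtering. It swaps in a simple one-pole low-pass filter for brevity rather than the order-4 Butterworth filter the slides mention, and the function name `envelope` and the 1 kHz test tone are assumptions for this example.

```python
import math

def envelope(x, fs, f_cutoff):
    """Full-wave rectify x, then smooth with a one-pole low-pass filter.

    The one-pole filter is a simplification, not a Butterworth design.
    """
    alpha = 1.0 - math.exp(-2.0 * math.pi * f_cutoff / fs)
    env, y = [], 0.0
    for v in x:
        y += alpha * (abs(v) - y)  # y[n] = y[n-1] + alpha * (|x[n]| - y[n-1])
        env.append(y)
    return env

fs = 16000
tone = [math.sin(2.0 * math.pi * 1000.0 * n / fs) for n in range(fs // 4)]
env_50 = envelope(tone, fs, 50.0)    # smooth envelope, little carrier ripple
env_400 = envelope(tone, fs, 400.0)  # follows faster changes, more ripple
```

A lower cut-off yields a smoother envelope that discards fast temporal detail, while a higher cut-off preserves more of it. That trade-off is what Task 2 explores.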

  16. Task 2: Results and Conclusion • N = 4 • f_cutoff = 20 Hz, 50 Hz, 100 Hz, 400 Hz

  17. Task 3: Noise & Variation in Band Number • Generate a noisy signal at SNR -5 dB. • Set the LPF cut-off frequency to 50 Hz. • Implement the tone vocoder while varying the number of bands. • Describe how the number of bands affects the intelligibility of the synthesized sentence, and compare the findings with those obtained in Task 1.

  18. Task 3: Results and Conclusion • N = 2, 4, 6, 8, 16

  19. Task 4: Noise & Variation in Cut-off Frequency • Generate a noisy signal at SNR -5 dB. • Set the number of bands to N = 6. • Implement the tone vocoder while varying the LPF cut-off frequency. • Describe how the LPF cut-off frequency affects the intelligibility of the synthesized sentence.

  20. Task 4: Noise & Variation in Cut-off Frequency

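  21. English & Chinese Comparison • Synthesized speech is likely to lose its tone. • Chinese is tonal; English is non-tonal.

  22. English & Chinese Comparison. Reprinted from Chen Yousheng (陈又圣) et al., "Spectral characteristics of cochlear implant speech processing strategies" (电子耳蜗言语处理策略的频谱特征研究), Journal of Biomedical Engineering (生物医学工程学杂志), vol. 34, no. 5, pp. 760-766, 2017.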

  23. How about music?

  24. Improvement | Part 4

  25. Noise Reduction. S. V. Vaseghi, Advanced Digital Signal Processing and Noise Reduction, 2008.

  26. Noise Reduction Using Wiener Filters • Original • Noisy • Noise reduced • Synthesized (noisy) • Synthesized (noise reduced)
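The slides do not say how the Wiener filter was implemented. One common frequency-domain form applies a gain of estimated speech power divided by noisy-signal power in each frequency bin, with the speech power estimated by subtracting a noise power estimate. The Python sketch below shows only that gain computation. The function name `wiener_gains` and the example power values are hypothetical, and speech and noise are assumed to be uncorrelated.

```python
def wiener_gains(noisy_power, noise_power):
    """Per-bin Wiener gain H = P_speech / (P_speech + P_noise).

    P_speech is estimated as max(P_noisy - P_noise, 0), which assumes
    speech and noise are uncorrelated so their powers add.
    """
    gains = []
    for p_y, p_n in zip(noisy_power, noise_power):
        p_s = max(p_y - p_n, 0.0)  # estimated clean-speech power
        gains.append(p_s / p_y if p_y > 0.0 else 0.0)
    return gains

# Hypothetical power spectra for three frequency bins.
gains = wiener_gains([4.0, 1.0, 2.0], [1.0, 1.0, 4.0])
```

Bins dominated by speech keep a gain near 1, while bins where the noise estimate meets or exceeds the observed power are suppressed to 0.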

  27. References:
      [1] A. Mudry and M. Mills, "The early history of the cochlear implant: a retrospective," JAMA Otolaryngol Head Neck Surg, vol. 139, no. 5, pp. 446-453, May 2013.
      [2] R. V. Shannon, F. G. Zeng, V. Kamath, J. Wygonski, and M. Ekelid, "Speech recognition with primarily temporal cues," Science, vol. 270, no. 5234, pp. 303-304, Oct. 1995.
      [3] S. V. Vaseghi, Advanced Digital Signal Processing and Noise Reduction. Wiley, 2008.
      [4] F. Chen et al., "Evaluation of noise reduction methods for sentence recognition by Mandarin-speaking cochlear implant listeners," Ear and Hearing, vol. 36, no. 1, pp. 61-71, 2015.
      [5] Chen Yousheng (陈又圣) et al., "Spectral characteristics of cochlear implant speech processing strategies" (电子耳蜗言语处理策略的频谱特征研究), Journal of Biomedical Engineering (生物医学工程学杂志), vol. 34, no. 5, pp. 760-766, 2017.
      [6] Gong Shusheng (龚树生) and Hao Jin (郝瑾), "Domestically produced cochlear implants: a long road ahead" (国产人工耳蜗, 任重道远), Chinese Medical Digest: Otorhinolaryngology (中国医学文摘: 耳鼻咽喉科学), vol. 28, no. 5, pp. 231-236, 2013.

  28. Thank You
