Temporal Code (Acoustic Front-end) - PowerPoint PPT Presentation


  1. Temporal Code

  2. Temporal Code

  3. Temporal Code (Acoustic Front-end)

  4. Human Recognition

  5. Machine Recognition • Speech Waveform → Acoustic Front End → Acoustic Representation → Statistical Sequence Recognition → Hypothesized Utterances → Language Modeling (Back-end) → Recognized Utterance

  6. Human vs. Machine

  7. “Top-down” Processing

  8. Machine Training • Aurora-4 Speech Database • Wall Street Journal (WSJ0) Corpus • Large Vocabulary Continuous Speech Recognition • 7,138 clean speech utterances, 16 kHz

  9. Human Training • Wernicke’s Area: Speech Understanding • Broca’s Area: Speech Production

  10. Acoustic Model: Hidden Markov Model (HMM) • Each triphone is characterized by an HMM with 3 states and 8 Gaussian mixture components per state • Diagram labels: Transition Probability, Emission Probability Density

  11. Acoustic Model • Maximum likelihood (ML) training applied to estimate a set of context-dependent triphone acoustic models

  12. Language Model • Standard 5k lexicon (CMU Pronouncing Dictionary) • Trigram language model

  13. Decoder • Single-pass Viterbi beam search-based decoder

  14. Human Recognition • Noise-Vocoder • Tone-Vocoder

  15. CI Recognition

  16. Normal Hearing vs. CI • Cochlear Implant range (hatched area) compared with average normal hearing scores (filled squares)

  17. CI vs. Machine Recognition • ASR provided most accurate simulation ever!

  18. Machine Recognition • ASR derived by world’s best auditory scientists

  19. Effects of Training

  20. Effects of Training

  21. Effects of Training

  22. Clinical Implications • Alter Frequency Allocation • Deactivate Interfering Electrodes • Alter Compression Curve • Modify Electric Pulse Width

  23. Summary

  24. Information Technology • 2014, HMM can now improve Hearing Science

  25. Future Work • Design improved signal processing to mimic: 1) Place code of neurons 2) Neural firing rates

  26. FAME Strategy • Frequency Amplitude Modulation Encoder

  27. SOUND • Spectral Or Undertone Normalization Decomposition
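
Slides 10 and 13 describe a 3-state HMM per triphone and a single-pass Viterbi beam-search decoder. The following is a minimal sketch of that idea, not the presenters' system. It makes several loud assumptions: each state emits from a single 1-D Gaussian (the slides specify 8 mixture components per state over acoustic feature vectors), the topology is left-to-right, and the function name `viterbi_beam`, the `beam` width, and all toy numbers are hypothetical.

```python
import math

NEG_INF = float("-inf")


def safe_log(p):
    """Log of a probability, mapping 0 to -inf instead of raising."""
    return math.log(p) if p > 0 else NEG_INF


def log_gauss(x, mean, var):
    """Log density of a 1-D Gaussian (simplifying assumption: one per state)."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def viterbi_beam(obs, log_trans, means, variances, beam=10.0):
    """Return (best state path, its log score) for one observation sequence.

    Before each new frame, states scoring more than `beam` below the
    current best state are pruned (treated as -inf).
    """
    n = len(means)
    # Left-to-right model: decoding must start in state 0.
    score = [NEG_INF] * n
    score[0] = log_gauss(obs[0], means[0], variances[0])
    backptr = []
    for x in obs[1:]:
        best = max(score)
        active = [s if s >= best - beam else NEG_INF for s in score]
        new_score = [NEG_INF] * n
        ptr = [0] * n
        for j in range(n):
            # Pick the best surviving predecessor i for state j.
            for i in range(n):
                cand = active[i] + log_trans[i][j]
                if cand > new_score[j]:
                    new_score[j] = cand
                    ptr[j] = i
            if new_score[j] > NEG_INF:
                new_score[j] += log_gauss(x, means[j], variances[j])
        score = new_score
        backptr.append(ptr)
    # Trace back from the best-scoring final state.
    state = max(range(n), key=lambda j: score[j])
    final_score = score[state]
    path = [state]
    for ptr in reversed(backptr):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path, final_score


# Toy 3-state left-to-right HMM (all values are illustrative assumptions).
trans = [[0.6, 0.4, 0.0],
         [0.0, 0.6, 0.4],
         [0.0, 0.0, 1.0]]
log_trans = [[safe_log(p) for p in row] for row in trans]
means = [0.0, 5.0, 10.0]
variances = [1.0, 1.0, 1.0]
obs = [0.1, -0.2, 4.9, 5.3, 9.8, 10.1]

path, score = viterbi_beam(obs, log_trans, means, variances)
print(path, score)
```

On this toy input the decoded path is `[0, 0, 1, 1, 2, 2]`: two frames in each state, matching the three clusters in `obs`. A real recognizer would apply the same recursion over a network of triphone HMMs combined with the lexicon and trigram language model from slide 12, with mixture emissions over feature vectors rather than scalars.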
