chive
play

CHiVE Varying Prosody in Speech Synthesis with a Linguistically - PowerPoint PPT Presentation

CHiVE Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network Vincent Wan, Chun-an Chan, Tom Kenter, Jakub Vit, Rob Clark Modelling intonation in prosody A conditional variational


  1. CHiVE Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network Vincent Wan, Chun-an Chan, Tom Kenter, Jakub Vit, Rob Clark

  2. Modelling intonation in prosody

  3. A conditional variational autoencoder captures the difgerent intonations

  4. Language has a hierarchical linguistic structure Sentence Words sil hello sil Syllables sil h+e l+ou sil Phonemes sil h e l ou sil Frames

  5. Add linguistic knowledge to the network

  6. The structured model is betuer Baseline (30.7%) CHiVE (46.1%) No preference (23.2%)

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend