SLIDE 21 References I
[Braunschweiler and Buchholz (2011)] Braunschweiler, N. & Buchholz, S. (2011) Automatic Sentence Selection from Speech Corpora Including Diverse Speech for Improved HMM-TTS Synthesis Quality. Interspeech 2011 [Braunschweiler et al (2010)] Braunschweiler, N., Gales, M.J.F. & Buchholz, S. (2010) Lightly supervised recognition for automatic alignment of large coherent speech recordings. Interspeech 2010 [Chen et al (1998)] Chen, S.H., Hwang, S.H. & Wang, Y.R. (1998) An RNN-based prosodic information synthesizer for Mandarin text-to-speech IEEE Transactions on Speech and Audio Processing [Lu et al (2013)] Lu, H., King, S. & Watts, O. (2013) Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis Speech Synthesis Workshop 8 (2013) [Mikolov et al (2013)] Mikolov, T., Chen, K., Corrado, G. & Dean, J. (2013) Efficient estimation of word representations in vector space arXiv preprint arXiv:1301.3781 [Ribeiro et al (2016)] Ribeiro, M.S., Watts, O. & Yamagishi, J. (2016) Parallel and cascaded deep neural networks for text-to-speech synthesis 9th ISCA Workshop on Speech Synthesis, Proceedings, Sunnyvale, 2016 [Wang et al (2015)] Wang, P, Qian, Y., Soong, F.K, He, L. & Zhao, H. (2015) Word Embedding for Recurrent Neural Network Based TTS Synthesis ICASSP 2015 21 / 22