artificial intelligence data science in the industrial
play

Artificial Intelligence, Data Science in the Industrial World, - PowerPoint PPT Presentation

Artificial Intelligence, Data Science in the Industrial World, Speech Synthesis matveeva.yulia@huawei.com Yulia MATVEEVA 23rd May 2019 1/59 Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal


  1. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is NOT (Usually) Data Science is NOT about . 11/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  2. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is NOT (Usually) Data Science is NOT about Complex program architecture : • designing an hierarchy of (OOP) classes ; • implementing patterns of complex inter-communication between program modules. 11/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  3. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is NOT (Usually) Data Science is NOT about Implementing classical algorithms from scratch... in C. 11/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  4. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is NOT (Usually) Data Science is NOT about Designing algorithms from scratch, proving theorems, ... 11/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  5. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist 12/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  6. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is 13/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  7. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is Translating business needs into math problems. 13/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  8. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is Translating business needs into math problems. Chosing appropriate models. 13/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  9. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is Translating business needs into math problems. Chosing appropriate models. Data processing : • Validating, cleaning, filtering, transforming, ... 13/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  10. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is 14/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  11. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is Playing lego : 14/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  12. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is Playing lego : • combining algorithms together ; 14/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  13. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is Playing lego : • combining algorithms together ; • constructing neural networks in NN frameworks (tensorflow, pytorch, ...). 14/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  14. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is Playing lego : • combining algorithms together ; • constructing neural networks in NN frameworks (tensorflow, pytorch, ...). Tuning hyper-parameters. 14/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  15. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is Setting up experiments + analyzing the results. 15/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  16. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is Setting up experiments + analyzing the results. Problem solving, learning quickly, adapting to a changing environment. 15/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  17. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is www.datanami.com/2018/ 09/17/ improving-your-odds-with- data-science-hiring 16/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  18. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist : what it is 17/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  19. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist Why You Are Good for It 18/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  20. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist Why You Are Good for It Understanding mathematics ! 18/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  21. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist Why You Are Good for It Understanding mathematics ! Knowing computer science. 18/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  22. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia The Job of a Data Scientist Why You Are Good for It Understanding mathematics ! Knowing computer science. Problem solving ! 18/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  23. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Machine Learning (Artificial Intelligence) Data Science in the Industrial World : some examples. 19/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  24. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Recommender Systems Problem Statement { q i } n { w j } m Users i =1 , items j =1 . History of user-item interaction. What items do we recommend to user u i in a particular setting ? 20/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  25. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Recommender Systems Matrix X (n x m) of user-item ratings. Large dimensionality. Zeros vs. missing values. 21/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  26. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Recommender Systems Simple Solution : Collaborative Filtering 22/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  27. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Recommender Systems Simple Solution : Collaborative Filtering Matrix Factorization (SVD). 22/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  28. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Recommender Systems : Collaborative Filtering Singular Value Decomposition (SVD) : ′ T , X = U Σ T V V ′ – orthonormal basis for span ( { X [1 , · ] , . . . , X [ n , · ] } ) , U – orthonormal basis for span ( { X [ · , 1] , . . . , X [ · , m ] } ) ˆ ′ T X k = U [ · , 1: k ] Σ T [1: k , 1: k ] V [ · , 1: k ] = = arg rank ( A )= k || X − A || . min 23/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  29. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Self-introduction 1 Data Science in the Industrial World 2 Huawei VoiceKit Project and Personal Assistant 3 Speech Synthesis 4 Job Opportunities at Huawei, Russia 5 24/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  30. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Huawei VoiceKit Project 25/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  31. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Huawei VoiceKit Project 26/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  32. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Huawei VoiceKit Project [ Drawing credits : www.researchgate.net/profile/Theodora_Koulouri ] 27/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  33. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Machine Learning Seminars [ Huawei ] Natural Language Processing and more : https://sites.google.com/view/nlp-seminars/main Talk on Speech Synthesis : 8th of June. 28/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  34. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Self-introduction 1 Data Science in the Industrial World 2 Huawei VoiceKit Project and Personal Assistant 3 Speech Synthesis 4 Job Opportunities at Huawei, Russia 5 29/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  35. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Text-To-Speech : problem statement Create a system that is able to transform arbitrary text in a given language to speech in the form of an audio waveform . 30/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  36. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia TTS : problem particularities and particular problems Essentially a sequence to sequence problem with a highly correlated output sequence : • strong sequential dependencies ; • each (output) point taken individually is meaningless (it’s a vibration that is encoded). 31/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  37. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia TTS : problem particularities and particular problems Essentially a sequence to sequence problem with a highly correlated output sequence : • strong sequential dependencies ; • each (output) point taken individually is meaningless (it’s a vibration that is encoded). Need to take particularities of human perception of sound into account : 31/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  38. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia TTS : problem particularities and particular problems Essentially a sequence to sequence problem with a highly correlated output sequence : • strong sequential dependencies ; • each (output) point taken individually is meaningless (it’s a vibration that is encoded). Need to take particularities of human perception of sound into account : • it is logarithmic ; 31/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  39. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia TTS : problem particularities and particular problems Essentially a sequence to sequence problem with a highly correlated output sequence : • strong sequential dependencies ; • each (output) point taken individually is meaningless (it’s a vibration that is encoded). Need to take particularities of human perception of sound into account : • it is logarithmic ; • what we percieve as pitch ? 31/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  40. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Human perception in speech synthesis Standard techniques 1 Human perception of sound is logarithmic : • Mu-law quantization, convert to dB. 2 High/low frequencies : • Pre-emphasis (high-pass filter) : y t − α y t − 1 . • De-emphasis (low-pass filter). 32/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  41. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Non-uniform quantization 33/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  42. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Text-To-Speech (TTS) : system architectures Families of Text-To-Speech Systems 34/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  43. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Text-To-Speech (TTS) : system architectures Families of Text-To-Speech Systems Concatenative unit-selection. 34/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  44. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Text-To-Speech (TTS) : system architectures Families of Text-To-Speech Systems Concatenative unit-selection. End-2-end speech synthesis (neural). 34/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  45. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Text-To-Speech (TTS) : system architectures Families of Text-To-Speech Systems Concatenative unit-selection. End-2-end speech synthesis (neural). Statistical Parametric Speech Synthesis (SPSS) (neural or non-neural). 34/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  46. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Speech synthesis : pre-processing of the training data 0 Big corpus of { text + speech } : 35/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  47. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Speech synthesis : pre-processing of the training data 0 Big corpus of { text + speech } : usually aligned by sentences. 35/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  48. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Speech synthesis : pre-processing of the training data 0 Big corpus of { text + speech } : usually aligned by sentences. 1 Split into units (segments) + align. 35/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  49. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Concatenative unit-selection : training 36/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  50. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Concatenative unit-selection : training Phoneme alignment : how ? 1 Phoneme-2-letter alignment : EM-like algorithm : • A ij : phoneme-to-letter associations • Start from A 0 ij sentence/word alignment : increment each a ij if this (phoneme, letter) pair occurs in the same sentence/word. • Given A k ij : find the phone-2-letter alignmemnt that maximizes the association (path-finding algotihm). 2 Waveform segmentation. 37/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  51. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Concatenative unit-selection : model Hidden Markov Model y 0 , ..., y n — units = speech segments = pieces of waveforms (taken from a database Y = { y ′ j } N j =1 ), x 0 , ..., x n — linguistic features corresponding to segments of text (letters, phonemes, duration, accentuation, left/right context, ...). P ( y t , y t − 1 , . . . , y 0 | x t , . . . , x 0 ) = P ( y 0 ) � t P ( x t | y t ) P ( y t | y t − 1 ) P ( x t , . . . , x 0 ) 38/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  52. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Concatenative unit-selection : training 4 Transition and emission cost estimation ( ≃ HMMs). � P ( y t , y t − 1 , . . . , y 0 | x t , . . . , x 0 ) ∝ P ( x t | y t ) P ( y t | y t − 1 ) . t (is proportional to) 39/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  53. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Concatenative unit-selection : synthesis 1 Viterbi search (over a pruned search space). 40/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  54. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Viterbi algorithm n ˆ P ( x t | y t )ˆ ˆ � P ( y 0 ) P ( y t | y t − 1 ) − { y 1 ,..., y n }∈Y n max , − − − − − − − → t =1 ˆ P ∗ k − 1 = max P ( y 0 , . . . , y k − 1 | x 0 , . . . , x k − 1 ) , y 0 ,..., y k ˆ { ˆ y 0 , . . . , ˆ y k − 1 } = arg max P ( y 0 , . . . , y k | x 0 , . . . , y k − 1 ) , y 0 ,..., y k { ˆ y 0 , . . . , ˆ y k } = ˆ = arg max P (ˆ y 0 , . . . , ˆ y k − 1 , y k | x 0 , . . . , x k ) = y k = arg max P ∗ k − 1 ˆ P ( x k | y k )ˆ P ( y k | ˆ y k − 1 ) . (1) 41/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  55. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Viterbi algorithm 42/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  56. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Concatenative unit-selection : pros and cons Pros Big representative corpus ⇒ outperforms all other approaches (intelligibility and naturalness). Generally easy (fast) training. Cons Large model size (data base), inadequate for offline mode. Low flexibility, ability to adapt to new contexts / new tasks. 43/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  57. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Concatenative unit-selection in our life Production examples Siri (Apple) (2016–2017) : 44/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  58. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Concatenative unit-selection in our life Production examples Siri (Apple) (2016–2017) : hybrid unit-selection approach with deep-learning based emission/transition cost estimation. 44/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  59. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Concatenative unit-selection in our life Production examples Siri (Apple) (2016–2017) : hybrid unit-selection approach with deep-learning based emission/transition cost estimation. See for yourself ! Find a pronunciaton dictionary. Open-source phonemizer ( type “python phonemizer” in Google ;) ). Festvox / Flite : open-source toolkit by the Carnegie Mellon University’s speech group. 44/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  60. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Text-To-Speech (TTS) : end-2-end speech synthesis 45/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  61. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Text-To-Speech (TTS) : end-2-end speech synthesis [ Photo credits : www.unsplash.com/search/photos/electricity ] 45/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  62. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Text-To-Speech (TTS) : end-2-end speech synthesis [ Photo credits : www.unsplash.com/search/photos/electricity ] 46/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  63. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia End-2-end speech synthesis : pros and cons Pros Saves feature-engineering effort. In theory very flexible : • can be embedded in a multi-tasking neural net ; • allows for efficient style transfer (voice conversion). 47/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  64. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia End-2-end speech synthesis : pros and cons Pros Saves feature-engineering effort. In theory very flexible : • can be embedded in a multi-tasking neural net ; • allows for efficient style transfer (voice conversion). Cons Time ! 47/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  65. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia End-2-end speech synthesis : pros and cons Pros Saves feature-engineering effort. In theory very flexible : • can be embedded in a multi-tasking neural net ; • allows for efficient style transfer (voice conversion). Cons Time ! Original WaveNet model : 1 hour to generate 1 second of audio. 47/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  66. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Text-To-Speech (TTS) : parametric speech synthesis Statistical Parametric Speech Synthesis : 1 Extract and model a parametric representation of the speech signal (spectrum, excitation, etc.). 2 Reconstruct the waveform from the parametric representation. 48/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  67. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Parametric speech synthesis : in production SPSS synthesis : production examples Google assistant , Amazon Alexa , Huawei assistant . 49/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  68. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Self-introduction 1 Data Science in the Industrial World 2 Huawei VoiceKit Project and Personal Assistant 3 Speech Synthesis 4 Job Opportunities at Huawei, Russia 5 50/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  69. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Huawei is Looking for Talents ! Two Types of Job Opportunities 51/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  70. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Huawei is Looking for Talents ! Two Types of Job Opportunities 1 Saint-Petersburg Research Center : Data Science Engineer. 51/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  71. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Huawei is Looking for Talents ! Two Types of Job Opportunities 1 Saint-Petersburg Research Center : Data Science Engineer. 2 Moscow Research Center : Research Engineer. 51/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  72. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Huawei : jobs at Saint-Petersburg Research Center Data Science Engineer : Speech Synthesis Team 52/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  73. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Huawei : jobs at Saint-Petersburg Research Center Data Science Engineer : Speech Synthesis Team Track the current state-of-the-art in academic research. 52/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  74. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Huawei : jobs at Saint-Petersburg Research Center Data Science Engineer : Speech Synthesis Team Track the current state-of-the-art in academic research. Experiment with existing implementations / implement missing components. 52/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  75. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Huawei : jobs at Saint-Petersburg Research Center Data Science Engineer : Speech Synthesis Team Track the current state-of-the-art in academic research. Experiment with existing implementations / implement missing components. Find ways to optimize : • model size (minimize) ; • generation speed (minimize). 52/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  76. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Huawei : jobs at Saint-Petersburg Research Center Data Science Engineer : Speech Synthesis Team Adapt to new tasks : • model emotions ; • mode for non-native speakers ; • voice conversion. 53/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  77. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Huawei : jobs at Saint-Petersburg Research Center Contacts Me (Yulia MATVEEVA) : matveeva.yulia@huawei.com , yu125@statmod.ru Saint-Petersburg Huawei R&D HR department : chernysheva.yuliya@huawei.com 54/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  78. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Huawei : jobs at Saint-Petersburg Research Center Digital Signal Processing and Speech Synthesis : References (links) Rabiner, Schafer, 2009, Theory and Applications of Digital Speech Processing. Zen et al., 2009, Statistical Parametric Speech Synthesis. Oord et al., 2016, WAVENET: A GENERATIVE MODEL FOR RAW AUDIO. Shen et al., 2018, Natural tts synthesis by conditioning wavenet on mel spectrogram predictions. Kalchbrenner et al., 2018, Efficient neural audio synthesis. Kim et al., 2018, FloWaveNet: A Generative Flow for Raw Audio. 55/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

  79. Self-introduction Data Science in the Industrial World Huawei VoiceKit Project and Personal Assistant Speech Synthesis Job Opportunities at Huawei, Russia Huawei : Saint-Petersburg Research Center Other Machine Learning teams in Saint Petersburg : • Automatic Speech Recognition ; • Natural Language Understanding ; • and others. 56/59 matveeva.yulia@huawei.com Yulia MATVEEVA Artificial Intelligence, Data Science, Speech Synthesis

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend