Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems1 Leda Sarı
January 31, 2019
1Belinkov and Glass, NIPS 2017 1 / 9
Analyzing Hidden Representations in End-to-End Automatic Speech - - PowerPoint PPT Presentation
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems 1 Leda Sar January 31, 2019 1 Belinkov and Glass, NIPS 2017 1 / 9 Introduction End-to-End (E2E) directly maps acoustic features to symbol (character or word)
1Belinkov and Glass, NIPS 2017 1 / 9
2 / 9
2Belinkov and Glass, NIPS 2017 3 / 9
4 / 9
5 / 9
6 / 9
7 / 9
1 semivowels/vowels 2 affricates/stops 3 affricates/fricatives 8 / 9
1 Empirically evaluate the quality of hidden representations with
2 First CNN better represents the phonetic information than the 2nd
3 After certain number of RNN layers, accuracy drops =
4 Relatively similar coarse classes are confused more 9 / 9