www.intelligentvoice.com
Deep Convolution Neural Networks for Dialect Classification of Spectrogram Images
Nigel Cannings
Chase Information Technology Services Limited
Convolution Networks: Brief
Fukushima, Kunihiko, ‘Neocognitron: A Self-organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position,’ Biological Cybernetics 36 (4): 193-202, 1980
LeNet 5 (1998), image source: http://yann.lecun.com/exdb/lenet/
Szegedy, ‘Going deeper with convolutions,’ arXiv, 2014
2015 NIST Language Recognition Evaluation, http://www.nist.gov/itl/iad/lre15.cfm
Database:
501248 spectrograms for training
24352 spectrograms for validation
51501 spectrograms for testing

Apply convolutions to extract primitives such as edges
Object parts extracted
Full Spectral Features, e.g. phones, words
Refinement
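The first stage named on this slide, applying convolutions to extract primitives such as edges, can be illustrated with a small sketch. Everything here is an assumption made for illustration: the helper `conv2d_valid`, the toy 4x6 "spectrogram", and the hand-picked Sobel kernel are not from the presentation, and a trained network learns its own kernels rather than using a fixed one.

```python
def conv2d_valid(image, kernel):
    """Slide `kernel` over `image` (no padding, stride 1) and return the response map.

    As in most CNN libraries this is cross-correlation: the kernel is not flipped.
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    out = []
    for r in range(out_h):
        row = []
        for c in range(out_w):
            acc = 0.0
            for i in range(kh):
                for j in range(kw):
                    acc += image[r + i][c + j] * kernel[i][j]
            row.append(acc)
        out.append(row)
    return out


# Hypothetical fixed kernel (Sobel) that responds to changes along the time axis.
SOBEL_X = [[-1, 0, 1],
           [-2, 0, 2],
           [-1, 0, 1]]

# Toy 4x6 "spectrogram": rows are frequency bins, columns are time frames.
# Energy jumps from 0 to 1 between frames 2 and 3, like a sound onset.
toy = [[0, 0, 0, 1, 1, 1] for _ in range(4)]

response = conv2d_valid(toy, SOBEL_X)
for row in response:
    print(row)
```

The response is non-zero only in the output columns whose 3x3 window straddles the onset, which is the sense in which an early convolution layer acts as an edge detector on a spectrogram image.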
Dialect Classification: Loss1, Loss2, Loss3
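The three loss labels resemble the GoogLeNet design cited earlier (Szegedy, 2014), which attaches two auxiliary classifiers to intermediate layers and adds their losses to the final one with a weight of 0.3 during training. The sketch below shows that combination under the assumption, not confirmed by the slides, that Loss1 and Loss2 are the auxiliary losses and Loss3 is the final classifier loss. The function names and the toy logits are hypothetical.

```python
import math


def softmax_cross_entropy(logits, target):
    """Cross-entropy of softmax(logits) against the integer class index `target`."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_sum - logits[target]


def combined_loss(aux1_logits, aux2_logits, main_logits, target, aux_weight=0.3):
    """Training loss: final classifier loss plus down-weighted auxiliary losses.

    aux_weight=0.3 is the value reported in the GoogLeNet paper; whether the
    presented system used the same value is an assumption.
    """
    loss1 = softmax_cross_entropy(aux1_logits, target)  # assumed auxiliary head 1
    loss2 = softmax_cross_entropy(aux2_logits, target)  # assumed auxiliary head 2
    loss3 = softmax_cross_entropy(main_logits, target)  # assumed final classifier
    return loss3 + aux_weight * (loss1 + loss2)


# Toy example with 4 classes (the evaluation itself covers more dialects).
total = combined_loss([0.1, 2.0, -1.0, 0.3],
                      [0.0, 1.5, -0.5, 0.2],
                      [0.2, 3.0, -2.0, 0.1],
                      target=1)
print(round(total, 4))
```

In the GoogLeNet paper the auxiliary heads are used only during training and are discarded at inference time, so only the final classifier output would drive the dialect decision in that setup.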
[Figure: chart with one entry per dialect, axis ticks 20 to 100. Dialects shown: Arabic-Levantine, French-Haitian, Slavic-Polish, Chinese-Wu, French-West_African, English-American, Arabic-Iraqi, Chinese-Mandarin, Arabic-Maghrebi, Slavic-Russian, Spanish-Caribbean, English-British, Arabic-Egyptian, Chinese-Cantonese, Arabic-Modern_Standard, Chinese-Min_Dong, Spanish-European, Spanish-…, Portuguese-Brazilian, English-South_Asian_(Indian)]