Convolutional Neural Network to Model Articulation Impairments in - PowerPoint PPT Presentation

Convolutional Neural Network to Model Articulation Impairments in Patients with Parkinson’s Disease asquez-Correa 1 , 2 Juan Camilo V´ Juan Rafael Orozco-Arroyave 1 , 2 , Elmar N¨ oth 2 1 GITA research group, University of Antioquia UdeA. 2 Pattern recognition Lab. Friedrich Alexander Universit¨ at. Erlangen-N¨ urnberg. jcamilo.vasquez@udea.edu.co 18th INTERSPEECH, 2017 1 / 32 November 9, 2017

Outline Introduction Methods Experimental framework Results Conclusion 2 / 32

Introduction: Parkinson’s Disease ◮ Second most prevalent neurologi- cal disorder worldwide. ◮ Patients develop several motor and non-motor impairments. (O. Hornykiewicz 1998). ◮ Speech impairments are one of the earliest manifestations. 4 / 32

Introduction: Speech impairments Speech impairments in PD patients: hypokinetic dysarthria Phonation Prosody Intelligibility pataka pataka Articulation 5 / 32

Introduction: Imprecise articulation ◮ One of the most deviant speech dimensions in PD. ◮ Reduced velocity of lip, tongue, and jaw movements. ◮ Strong indication of the literature statement: imprecise con- sonants caused by reduced range of movements of ar- ticulators pa ta ka 6 / 32

Introduction: Hypothesis PD patients have difficulties to begin and to stop the vocal fold vibration, and such difficulties can be observed on speech sig- nals by modeling the transitions between voiced and unvoiced sounds Onset transition Offset transition Unvoiced Voiced Voiced Unvoiced Voiced Unvoiced 7 / 32

Introduction: Aims ◮ To model the time-frequency (TF) information provided by the onset and offset transitions: short-time Fourier transform (STFT) and continuous wavelet transform (CWT). ◮ To “learn” features from time-frequency representations: convolutional neural network (CNN). ◮ Why TF and feature-learning? both have been successfully used in several paralinguistics tasks: emotion, deception, depression, and others. 8 / 32

Methods Transitions Time frequency Convolutional detection representations neural network 10 / 32

Methods: Transitions detection Transitions Time frequency Convolutional detection representations neural network Onset transition Offset transition Onset and offset are detected according to the presence of the fundamental frequency. 11 / 32

Methods: Time-frequency representation Transitions Time frequency Convolutional detection representations neural network 4000 3500 3000 Frequency (Hz) 2500 2000 1500 1000 500 0.00 0.02 0.04 0.06 0.08 0.10 0.12 0.14 0.00 0.02 0.04 0.06 0.08 0.10 0.12 0.14 Time (s) Time (s) STFT of onset for a PD patient (left) and a HC subject (right) Play PD Play HC 12 / 32

Methods: Time-frequency representation Transitions Time frequency Convolutional detection representations neural network 500 400 300 Scale 200 100 0.00 0.02 0.04 0.06 0.08 0.10 0.12 0.14 0.00 0.02 0.04 0.06 0.08 0.10 0.12 0.14 Time (s) Time (s) CWT of onset for a PD patient (left) and a HC subject (right) 13 / 32

Methods: Convolutional neural network Transitions Time frequency Convolutional detection representations neural network Feature maps 1 Input layer Feature maps 2 PD vs. HC Convolution layer I Max-pool. layer 1 Convolution layer II Max-pool layer 2 Fully conected MLP CNN learns high–level representations from the low–level raw data 14 / 32

Data ◮ Three databases with recordings in three languages: Span- ish, German, and Czech. ◮ Diadochokinetic exercises, isolated sentences, read texts, and monologues. 16 / 32

Data Language Description Spanish 50 Patients and 50 Healthy controls. Balanced in age (60 years old) and gender. Patients in middle state of the disease. German 88 Patients and 88 Healthy controls. Balanced in age (64 years old). patients in low and middle state of the disease. Czech 20 Patients and 15 Healthy controls. All male speakers. Patients diagnosed during recording session. Table: Databases 17 / 32

Experiments and validation ◮ Classification of PD patients vs. HC subjects in the same language. ◮ 10 fold cross-validation: 8 for training, 1 to optimize hyper-parameters, and 1 for test. ◮ Cross-language classification. ◮ One language used for train and validation and other language used for test. 18 / 32

Experiments and validation ◮ Results are compared respect to previous studies 1 . Support vector machine 1 Juan Camilo V´ asquez-Correa et al. “Effect of acoustic conditions on algorithms to detect Parkinson’s disease from speech”. In: International Conference on Acoustics, Speech and Signal Processing (ICASSP), . 2017, pp. 5065–5069. 19 / 32

Results: same language for train and test TFR Onset Offset Onset+Offset Spanish CNN-STFT 85.3 81.6 85.9 CNN-CWT 84.2 81.8 85.2 Baseline 69.3 69.6 71.6 German CNN-STFT 70.3 68.0 75.0 CNN-CWT 68.0 66.9 70.5 Baseline 72.7 70.9 74.0 Czech CNN-STFT 77.9 80.4 84.4 CNN-CWT 89.2 87.7 89.4 Baseline 75.3 74.4 78.8 21 / 32

Results: same language for train and test 4000 Frequency (Hz) 3000 2000 1000 0 50 100 150 50 100 150 Time (ms) Time (ms) Low Energy High Energy Figure: Output of the CNN after the last max–pool layer: PD patient (left) and a HC speaker (right) 24 / 32

Results: same language for train and test Speech tasks Spanish German Czech read text 85.0 70.3 88.5 monologue 85.6 70.3 89.1 /pa-ta-ka/ 85.4 70.7 89.2 25 / 32

Results: different language for train and test Test Lang. TFR onset offset onset+offset Train with Spanish German CNN-STFT 51.7 50.2 54.7 German Baseline 53.7 55.0 54.1 Czech CNN-CWT 55.2 55.4 57.9 Czech Baseline 60.3 57.4 60.4 Train with German Spanish CNN-STFT 58.0 55.7 55.8 Spanish Baseline 53.5 53.5 53.6 Czech CNN-STFT 53.0 52.4 53.0 Czech Baseline 50.9 51.7 52.6 Train with Czech Spanish CNN-CWT 53.8 56.3 56.7 Spanish Baseline 53.4 51.6 52.4 German CNN-STFT 54.0 51.8 54.0 German Baseline 51.2 51.0 50.7 26 / 32

Conclusion ◮ A deep learning approach is proposed to model articulation impairments of PD patients. ◮ Voiced-Unvoiced transitions are modeled with CNNs using STFT and CWT. 30 / 32

Conclusion ◮ The proposed method is able to classify PD patients and HC subjects and improves the baseline when the language used for train and test is the same. ◮ Additional approaches should be proposed when the train and test language are different. ◮ Recurrent neural networks and other architectures may be considered to assess co-articulation. ◮ Deep learning approaches trained with phonation, articulation, and prosody information may be addressed to evaluate specific speech impairments. 31 / 32

Convolutional Neural Network to Model Articulation Impairments in - PowerPoint PPT Presentation

Convolutional Neural Network to Model Articulation Impairments in Patients with Parkinsons Disease asquez-Correa 1 , 2 Juan Camilo V Juan Rafael Orozco-Arroyave 1 , 2 , Elmar N oth 2 1 GITA research group, University of Antioquia UdeA. 2

Finding Articulation Points and Bridges Articulation Points Articulation Point Articulation

Articulation Disorders What are articulation disorders? Articulation disorders are disorders of

Convolutional Neural Networks Convolutional neural networks One of the major kinds of ANNs in use

Convolutional Neural Networks ---- Off the shelf top notch performances Convolutional Neural

Convolutional Kuan-Ting Lai 2020/3/31 Neural Network Convolutional Neural Networks (CNN)

High School Barton Articulation Agreement Definition of Articulation:

Introduction CSCE 970 CSCE 970 Lecture 4: Lecture 4: Convolutional Convolutional Neural

Convolutional Neural Nets 4-25-16 Reading Quiz Convolutional neural networks are most commonly

ON TEGRA X1 ALAN WANG, NVIDIA Convolutional Neural Network optimization target Result

Outline Convolutional Neural Network Architectures for Matching Natural Language Sentences.

Neural Network Part 3: Convolutional Neural Networks CS 760@UW-Madison Goals for the lecture

Convolutional Neural Nets CS447 Natural Language Processing (J. Hockenmaier)

Convolutional Neural Networks for Sentence Classification Yoon Kim New York University 1 / 34

Convolutional Neural Networks 08, 10 & 17 Nov, 2016 J. Ezequiel Soto S. Image Processing

FUNCTIONAL ANATOMY OF SHOULDER JOINT ARTICULATION Articulation is between: The rounded

Neural Information Retrieval Wassila Lalouani 1 Plan Neural network architectures Neural

The webinar will start at 12:00 PM EST Topics to be covered What are patient considerations in

NIHs role NIH le in in the e fig ight ag again ainst Chief, Parkinsons Disease

PD is in the house: Impact on children/teens/young adults Elaine Book, MSW Pacific Parkinsons

Deeply-Supervised Nets AISTATS, 2015 Deep Learning Workshop, NIPS 2014 Zhuowen Tu Department of

PARKINSONS DISEASE PHARMACOLOGY University of Hawaii Hilo Pre -Nursing Program NURS 203

April is AUTISM AWARENESS MONTH. Help us Celebrate Special Needs at BAC! We see the ability and

Ontologising the GWAS Catalog A picture paints a thousand traits Helen Parkinson, EBI 17

Acorda 4Q and Full Year 2015 Update February 11, 2016 Forward Looking Statement This

Convolutional Neural Network to Model Articulation Impairments in - PowerPoint PPT Presentation

Convolutional Neural Network to Model Articulation Impairments in Patients with Parkinsons Disease asquez-Correa 1 , 2 Juan Camilo V Juan Rafael Orozco-Arroyave 1 , 2 , Elmar N oth 2 1 GITA research group, University of Antioquia UdeA. 2

Finding Articulation Points and Bridges Articulation Points Articulation Point Articulation

Articulation Disorders What are articulation disorders? Articulation disorders are disorders of

Convolutional Neural Networks Convolutional neural networks One of the major kinds of ANNs in use

Convolutional Neural Networks ---- Off the shelf top notch performances Convolutional Neural

Convolutional Kuan-Ting Lai 2020/3/31 Neural Network Convolutional Neural Networks (CNN)

High School Barton Articulation Agreement Definition of Articulation:

Introduction CSCE 970 CSCE 970 Lecture 4: Lecture 4: Convolutional Convolutional Neural

Convolutional Neural Nets 4-25-16 Reading Quiz Convolutional neural networks are most commonly

ON TEGRA X1 ALAN WANG, NVIDIA Convolutional Neural Network optimization target Result

Outline Convolutional Neural Network Architectures for Matching Natural Language Sentences.

Neural Network Part 3: Convolutional Neural Networks CS 760@UW-Madison Goals for the lecture

Convolutional Neural Nets CS447 Natural Language Processing (J. Hockenmaier)

Convolutional Neural Networks for Sentence Classification Yoon Kim New York University 1 / 34

Convolutional Neural Networks 08, 10 &amp; 17 Nov, 2016 J. Ezequiel Soto S. Image Processing

FUNCTIONAL ANATOMY OF SHOULDER JOINT ARTICULATION Articulation is between: The rounded

Neural Information Retrieval Wassila Lalouani 1 Plan Neural network architectures Neural

The webinar will start at 12:00 PM EST Topics to be covered What are patient considerations in

NIHs role NIH le in in the e fig ight ag again ainst Chief, Parkinsons Disease

PD is in the house: Impact on children/teens/young adults Elaine Book, MSW Pacific Parkinsons

Deeply-Supervised Nets AISTATS, 2015 Deep Learning Workshop, NIPS 2014 Zhuowen Tu Department of

PARKINSONS DISEASE PHARMACOLOGY University of Hawaii Hilo Pre -Nursing Program NURS 203

April is AUTISM AWARENESS MONTH. Help us Celebrate Special Needs at BAC! We see the ability and

Ontologising the GWAS Catalog A picture paints a thousand traits Helen Parkinson, EBI 17

Acorda 4Q and Full Year 2015 Update February 11, 2016 Forward Looking Statement This

Convolutional Neural Networks 08, 10 & 17 Nov, 2016 J. Ezequiel Soto S. Image Processing