[PPT] - Automatic Detection of Parkinsons Disease from Continuous Speech PowerPoint Presentation

SLIDE 1

Automatic Detection of Parkinson’s Disease from Continuous Speech Recorded in Non- Controlled Noise Conditions

J.C. Vásquez-Correa1, T. Arias-Vergara1, J.R Orozco-Arroyave1,2, J.F Vargas-Bonilla1, J.D Arias-Londoño1, Elmar Nöth2

1 Faculty of Engineering, University of Antioquia UdeA 2 Pattern Recognition Lab, Friedrich Alexander Universität, Erlangen-Nürnberg

Interspeech 2015

SLIDE 2

2

1. Introduction
2. Methodology
3. Database
4. Device
5. Results
6. Conclusion

Outline

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 3

1. Introduction

3

Missarticulation Monotonic speech Reduced Loudness

 Voice impairments appear in about 90%

f

people with Parkinson’s diasease.  Only from 3% to 4% of patients recieve speech therapy

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 4

4

1. Introduction

 Intereset: Develop computer-aided tools to perform screenings from

speech. The main aims are

 Spare patients moving from home to the hospital  Raise early alerts to patient and doctors

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 5

5

1. Introduction

interspeech-2015

jcamilo.vasquez@udea.edu.co

Currently

 The devices are mainly focused on the analysis of sustained phonations.  The devices could be considered invasives.  Methodologies are not adapted for evaluating speech recorded in non- controlled noise conditions.

SLIDE 6

6

1. Introduction

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 7

7

1. Introduction

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 8

8

1. Introduction

 Portable devices and tools for the analysis of speech of people wirth Parkinson’s disease considering continuous speech signals recorded in non-controlled noise conditions.

interspeech-2015

jcamilo.vasquez@udea.edu.co

What is missing?

SLIDE 9

9

1. Introduction

 Portable device for recording and analysis of speech of patients  Evaluation of methodology in non-controlled noise conditions.

interspeech-2015

jcamilo.vasquez@udea.edu.co

Aims

SLIDE 10

10

1. Introduction
2. Methodology
3. Databases
4. Device
5. Results
6. Conclusion

Outline

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 11

11

2. Methodology

Speech signal Pre- processing V/U segmentation

Voiced

Unvoiced ΔF0, Jitter, Shimmer, log-Energy, MFCC MFCC BBE

SVM SVM

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 12

12

2. Methodology

Speech signal Pre- processing V/U segmentation

Voiced

Unvoiced ΔF0, Jitter, Shimmer, log-Energy, MFCC MFCC BBE

SVM SVM

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 13

13

2. Methodology

Pre-processing

 Speech enhancement  Mean cepstral subtraction

interspeech-2015

jcamilo.vasquez@udea.edu.co

0.5 1 1.5 2

1
0,5

1 Noisy Signal 0.5 1 1.5 2

1

0,1 Time [s] Enhanced Signal 0.5 1 1.5 2 1 2 x 10

4

Noisy Signal Frequency (Hz) 0.5 1 1.5 2 1 2 x 10

4

Time [s] Enhanced Signal Frequency (Hz)

SLIDE 14

14

2. Methodology

Speech signal Pre- processing V/U segmentation

Voiced

Unvoiced ΔF0, Jitter, Shimmer, log-Energy, MFCC MFCC BBE

SVM SVM

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 15

15

Two types of sound:  Voiced  Unvoiced Both kind

f

segments are processed independently

2. Methodology

Segmentation interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 16

16

2. Methodology

Speech signal Pre- processing V/U segmentation

Voiced

Unvoiced ΔF0, Jitter, Shimmer, log-Energy, MFCC MFCC BBE

SVM SVM

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 17

17

2. Methodology

Characterization

 ΔF0  Jitter  Shimmer  log-Energy  12 MFCC  12 MFCC  25 Energy coefficients acccording to Bark scale

Features Voiced Segments Features Unvoiced Segments interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 18

18

2. Methodology

Characterization interspeech-2015

jcamilo.vasquez@udea.edu.co

mean, standard deviation, skewness, kurtosis

2 4 6 8 10 12 x 104

0.8
0.6
0.4
0.2

0.2 0.4 0.6

500 1000 1500

0.4
0.3
0.2
0.1

0.1 0.2 0.3 0.4

20 ms length Time-shift of 10 ms

SLIDE 19

19

2. Methodology

Speech signal Pre- processing V/U segmentation

Voiced

Unvoiced ΔF0, Jitter, Shimmer, log-Energy, MFCC MFCC BBE

SVM SVM

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 20

20

Classification

2. Methodology

 Gaussian kernel SVM.  Parameters of the SVM are optimized in a range:  10-1 < C < 10 4  10-2 < γ < 10 2  Leave One Speaker Out (LOSO) Cross-validation

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 21

21

1. Introduction
2. Methodology
3. Database
4. Device
5. Results
6. Conclusion

Outline

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 22

22

3. Database

Patients with Parkinson’s disease Healthy controls Num recordings 14 14 Age Mean 61.64 ± 6.43 Mean 63.29 ± 10.43 Gender 7 male, 7 female 7 male, 7 female Sampling frequency 44100 Hz Quantization bits 16

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 23

23

3. Database

 Six different sentences  One read text with 36 words

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 24

24

1. Introduction
2. Methodology
3. Database
4. Device
5. Results
6. Conclusion

Outline

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 25

25

3. Database

Recording Device

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 26

26

1. Introduction
2. Methodology
3. Database
4. Device
5. Results
6. Conclusion

Outline

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 27

27

jcamilo.vasquez@udea.edu.co

4. Results

Voiced segments

Sentence Accuracy (%) Noisy Enhanced 1 71 ± 26 82 ± 25 2 75 ± 26 64 ± 36 3 71 ± 26 79 ± 25 4 86 ± 31 79 ± 25 5 79 ± 25 82 ± 25 6 86 ± 23 75 ± 26 Read text 79 ± 25 71 ± 26

interspeech-2015

SLIDE 28

28

Sentence Accuracy (%) Noisy Enhanced 1 92 ± 19 93 ± 17 2 94 ± 15 91 ± 20 3 86 ± 23 97 ± 12 4 93 ± 18 94 ± 16 5 78 ± 25 90 ± 20 6 86 ± 23 97 ± 12 Read text 99 ± 3 99 ± 1

4. Results

Unvoiced segments interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 29

29

Sentence Accuracy (%) Noisy Enhanced 1 92 ± 19 93 ± 17 2 94 ± 15 91 ± 20 3 86 ± 23 97 ± 12 4 93 ± 18 94 ± 16 5 78 ± 25 90 ± 20 6 86 ± 23 97 ± 12 Read text 99 ± 3 99 ± 1

4. Results

Unvoiced segments interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 30

30

0.5 1 0.2 0.4 0.6 0.8 1 False Positive Rate True Positive Rate

4. Results

0.5 1 0.2 0.4 0.6 0.8 1 False Positive Rate True Positive Rate Enhanced Noisy

Voiced frames Sentence 6 Unvoiced frames Sentence 6 interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 31

31

1. Introduction
2. Methodology
3. Databases
4. Device
5. Results
6. Conclusion

Outline

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 32

5. Conclusion
1. A computational tool is presented for the recording and analysis of

speech with Parkinson’s disease

2. The methodology evaluated considers

speech recorded in non- controlled noise conditions.

3. It is useful the speech enhancement technique?

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 33

5. Conclusion
4. The incorporation to the methodology of prosody features derived from

timing, duration, and speech rate is planned for the near future.

5. The use of the computer tool to follow the speech therapy of the patients,

and to asses their neurological state is also expected in the future.

interspeech-2015

jcamilo.vasquez@udea.edu.co

SLIDE 34

Thanks!

interspeech-2015

jcamilo.vasquez@udea.edu.co