em emotion recognition in in sound
play

EM EMOTION RECOGNITION IN IN SOUND ANASTASIYA S. POPOVA HSE NN - PowerPoint PPT Presentation

EM EMOTION RECOGNITION IN IN SOUND ANASTASIYA S. POPOVA HSE NN 2017 INTRODUCTION THE PROBLEM y : X Y y : R n Y THE DATASET (RA RAVDESS DA DATABASE) http://neuron.arts.ryerson.ca/ravdess/?f=3 PRETREATMENT Length equalization


  1. EM EMOTION RECOGNITION IN IN SOUND ANASTASIYA S. POPOVA HSE NN 2017

  2. INTRODUCTION

  3. THE PROBLEM y : X → Y y : R n → Y

  4. THE DATASET (RA RAVDESS DA DATABASE) http://neuron.arts.ryerson.ca/ravdess/?f=3

  5. PRETREATMENT Length equalization

  6. PRETREATMENT Loudness normalization

  7. PRETREATMENT Highpass&Lowpass filters, voice audio detection (VAD) algorithm

  8. SPECTROGRAM -> MELSPECTROGRAM

  9. THE DIFFERENCE BETWEEN CLASSES (HYPOTHESIS ) neutral calm happy sad surprised fearful angry disgust

  10. CONVOLUTION NETWORK

  11. Input RGB image VGG-11 à VGG-16 Conv3-64 Maxpool Input RGB image Conv3-128 Conv3-64 Maxpool Maxpool Conv3-256 Conv3-128 Conv3-256 Maxpool Conv3-256 Conv3-256 Conv3-256 Maxpool Conv3-512 Maxpool Conv3-512 Conv3-512 Conv3-512 Conv3-512 Maxpool Maxpool Conv3-512 Conv3-512 Conv3-512 Conv3-512 Conv3-512 Maxpool Maxpool FC-4096 FC-4096 FC-4096 FC-4096 FC-1000 FC-1000 Soft-max Soft-max

  12. CLASSIFICATION ON 8 CLASSES ACCURACY VGG-11 + spectrogram VGG-16 + melspectrogram

  13. CONFUSION MATRIX

  14. MEL FREQUENCY CEPSTRAL COEFFICIENTS (MFCC)

  15. stasysp.96@gmail.com

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend