EM EMOTION RECOGNITION IN IN SOUND
ANASTASIYA S. POPOVA
HSE NN 2017
EM EMOTION RECOGNITION IN IN SOUND ANASTASIYA S. POPOVA HSE NN - - PowerPoint PPT Presentation
EM EMOTION RECOGNITION IN IN SOUND ANASTASIYA S. POPOVA HSE NN 2017 INTRODUCTION THE PROBLEM y : X Y y : R n Y THE DATASET (RA RAVDESS DA DATABASE) http://neuron.arts.ryerson.ca/ravdess/?f=3 PRETREATMENT Length equalization
ANASTASIYA S. POPOVA
HSE NN 2017
http://neuron.arts.ryerson.ca/ravdess/?f=3
Length equalization
Loudness normalization
Highpass&Lowpass filters, voice audio detection (VAD) algorithm
neutral calm sad angry fearful happy surprised disgust
Input RGB image Conv3-64 Maxpool Conv3-128 Maxpool Conv3-256 Conv3-256 Maxpool Conv3-512 Conv3-512 Maxpool Conv3-512 Conv3-512 Maxpool FC-4096 FC-4096 FC-1000 Soft-max Input RGB image Conv3-64 Maxpool Conv3-128 Maxpool Conv3-256 Conv3-256 Conv3-256 Maxpool Conv3-512 Conv3-512 Conv3-512 Maxpool Conv3-512 Conv3-512 Conv3-512 Maxpool FC-4096 FC-4096 FC-1000 Soft-max
VGG-11 + spectrogram VGG-16 + melspectrogram
stasysp.96@gmail.com