SLIDE 1 E9 205 Machine Learning for Signal Processing
19-08-2019
Introduction to Machine Learning of Sensory Signals
Instructor - Sriram Ganapathy (sriramg@iisc.ac.in) Teaching Assistant - Prachi Singh (prachisingh@iisc.ac.in)
http://leap.ee.iisc.ac.in/sriram/teaching/MLSP_19/
SLIDE 2 Feature Extraction
Scope for this course
- I. Feature Extraction in Text.
- II. Feature Extraction in Speech and Audio signals.
- III. Processing of Images.
SLIDE 3
Speech and Audio Processing
SLIDE 4
Summary of STFT Properties
SLIDE 5 Narrowband versus Wideband
❖ Short windows - poor frequency resolution - wideband spectrogram ❖ Long windows - poor time resolution - narrowband spectrogram
SLIDE 6 Spectrogram of Real Sounds
Dan Ellis, “STFT Tutorial”
SLIDE 7 Narrowband versus Wideband
Dan Ellis, “STFT Tutorial”
SLIDE 8
Mel Frequency Cepstral Coefficients
SLIDE 9
Mel Frequency Cepstral Coefficients
SLIDE 10
Mel Frequency Cepstral Coefficients
SLIDE 11
Mel Frequency Cepstral Coefficients
SLIDE 12
Image Processing
SLIDE 13
Image Capture and Representation
SLIDE 14
Image Capture and Representation
SLIDE 15
Image Filtering
SLIDE 16
Image Filtering
SLIDE 17
Edge Detection Example
SLIDE 18
Convolution Operation in Images
SLIDE 19
Matrix Derivatives (Appendix C, PRML, Bishop)
SLIDE 20
Dimensionality Reduction - PCA (Chapter 12.1, PRML, Bishop)
SLIDE 21 Principal Component Analysis
❖ Reducing the data of dimension to lower
dimension
❖ Projecting the data into subspace which
preserves maximum data variance
❖ Maximize variance in projected space ❖ Equivalent formulated as minimizing the error
between the original and projected data points.