E9 205 Machine Learning for Signal Processing Introduction to - - PowerPoint PPT Presentation

e9 205 machine learning for signal processing
SMART_READER_LITE
LIVE PREVIEW

E9 205 Machine Learning for Signal Processing Introduction to - - PowerPoint PPT Presentation

E9 205 Machine Learning for Signal Processing Introduction to Machine Learning of Sensory Signals 08-08-2018 Instructor - Sriram Ganapathy (sriramg@iisc.ac.in) Teaching Assistant - Akshara Soman (aksharas@iisc.ac.in). Feature Extraction


slide-1
SLIDE 1

E9 205 Machine Learning for Signal Processing

08-08-2018

Introduction to Machine Learning of Sensory Signals

Instructor - Sriram Ganapathy (sriramg@iisc.ac.in) Teaching Assistant - Akshara Soman (aksharas@iisc.ac.in).

slide-2
SLIDE 2

Feature Extraction

❖ Feature Extraction ❖ Using measured data to build desirable values. ❖ Attributes of the data that are informative and non-

redundant.

❖ Resilience to noise/artifacts. ❖ Facilitating subsequent learning algorithm.

slide-3
SLIDE 3

Feature Extraction

❖ Representation Problem

Cartesian Coordinates Polar Coordinates

slide-4
SLIDE 4

Feature Extraction

Scope for this course

  • I. Feature Extraction in Text.
  • II. Feature Extraction in Speech and Audio signals.
  • III. Feature Extraction for Images.
slide-5
SLIDE 5

Text Modeling - Introduction to NLP

❖ Definitions ❖ Documents, Corpora, Tokens (Terms) ❖ Term Frequency (TF) ❖ Collection Frequency (CF) ❖ Document Frequency (DF) ❖ TF-IDF ❖ Bag of words model

slide-6
SLIDE 6

Text Processing

slide-7
SLIDE 7

Example [Manning and Schutze, 2006]

https://nlp.stanford.edu/IR-book/pdf/irbookonlinereading.pdf

slide-8
SLIDE 8

Perplexity

❖ Measuring the goodness of language modeling ❖

https://web.stanford.edu/~jurafsky/slp3/4.pdf On a Wall-street Journal Corpus

slide-9
SLIDE 9

Speech and Audio Processing

slide-10
SLIDE 10

Speech and Audio

❖ Speech/Audio - 1D signals ❖ Generated by pressure variations producing regions

  • f high pressure and low pressure.

❖ Travels through a medium of propagation (like air,

water etc).

❖ Human sensory organ - eardrum. ❖ Converting pressure variations to electrical signals. ❖ Action mimicked by a microphone.

slide-11
SLIDE 11

Sound waves in a computer

❖ Analog continuous signal from the microphone ❖ Discretized in time - sampling. ❖ Digitized in values - quantization.

http://mlsp.cs.cmu.edu/courses/fall2014/lectures/slides/Class1.Introduction.pdf

slide-12
SLIDE 12

Why do we need time varying Fourier Transform

❖ When the signal properties change in time ❖ DFT will only capture the average spectral character ❖ Short-window analysis can indicate the change in

spectrum.

slide-13
SLIDE 13

Summary of STFT Properties

slide-14
SLIDE 14

Narrowband versus Wideband

❖ Short windows - poor frequency resolution - wideband spectrogram ❖ Long windows - poor time resolution - narrowband spectrogram

slide-15
SLIDE 15

Narrowband versus Wideband

Dan Ellis, “STFT Tutorial”

slide-16
SLIDE 16

Spectrogram of Real Sounds

Dan Ellis, “STFT Tutorial”

slide-17
SLIDE 17

Narrowband versus Wideband

Dan Ellis, “STFT Tutorial”

slide-18
SLIDE 18

Mel Frequency Cepstral Coefficients

slide-19
SLIDE 19

Mel Frequency Cepstral Coefficients

slide-20
SLIDE 20

Mel Frequency Cepstral Coefficients

slide-21
SLIDE 21

Mel Frequency Cepstral Coefficients

slide-22
SLIDE 22

Image Processing

slide-23
SLIDE 23

Image Capture and Representation

slide-24
SLIDE 24

Image Capture and Representation

slide-25
SLIDE 25

Image Filtering

slide-26
SLIDE 26

Image Filtering

slide-27
SLIDE 27

Edge Detection Example

slide-28
SLIDE 28

Convolution Operation in Images