E9 205 Machine Learning for Signal Processing ML, MAP, MMSE and - - PowerPoint PPT Presentation

e9 205 machine learning for signal processing
SMART_READER_LITE
LIVE PREVIEW

E9 205 Machine Learning for Signal Processing ML, MAP, MMSE and - - PowerPoint PPT Presentation

E9 205 Machine Learning for Signal Processing ML, MAP, MMSE and Gaussian 28-08-2019 Modeling Instructor - Sriram Ganapathy (sriram@ee.iisc.ernet.in) Teaching Assistant - Prachi Singh (prachisingh@iisc.ac.in). Decision Theory (PRML Chap. 1.5)


slide-1
SLIDE 1

E9 205 Machine Learning for Signal Processing

28-08-2019

ML, MAP, MMSE and Gaussian Modeling

Instructor - Sriram Ganapathy (sriram@ee.iisc.ernet.in) Teaching Assistant - Prachi Singh (prachisingh@iisc.ac.in).

slide-2
SLIDE 2

Decision Theory (PRML Chap. 1.5)

❖ Decision Theory ❖ Inference problem ❖ Finding the joint density ❖ Decision problem ❖ Using the inference to make the

classification or regression decision

slide-3
SLIDE 3

Decision Problem - Classification

❖ Minimizing the mis-classification error ❖ Decision based on maximum posteriors ❖ Loss matrix ❖ Minimizing the expected loss

slide-4
SLIDE 4

Visualizing the Max. Posterior Classifier

slide-5
SLIDE 5

Approaches for Inference and Decision

  • I. Finding the joint density from the data.
  • II. Finding the posteriors directly.
  • III. Using discriminant functions for classification.
slide-6
SLIDE 6

Approaches for Inference and Decision

  • I. Finding the joint density from the data.
  • II. Finding the posteriors directly.
  • III. Using discriminant functions for classification.
slide-7
SLIDE 7

Advantage of Posteriors

slide-8
SLIDE 8

Decision Rule for Regression

❖ Minimum mean square error loss ❖ Solution is conditional expectation.

slide-9
SLIDE 9

Generative Modeling

Classifiers Generative Parametric Non- parametric

slide-10
SLIDE 10

Non-parametric Modeling

  • Non-parametric models do not specify an apriori set of

parameters to model the distribution. Example - Histogram

  • 0.25
  • 0.2
  • 0.15
  • 0.1
  • 0.05

0.05 0.1 0.15 0.2 0.25

Sample value

0.5 1 1.5 2 2.5 3 Bin Count #104

The density is not smooth and has block like shape.

slide-11
SLIDE 11

Non-parametric Modeling

  • Non-parametric models do not specify an apriori set of parameters to model the

distribution.

  • Example - Kernel Density Estimators

Histogram Kernel Density

Kernel is a smooth function which obeys certain properties

slide-12
SLIDE 12

Non-parametric Modeling

  • Non-parametric methods are dependent on number
  • f data points
  • Estimation is difficult for large datasets.
  • Likelihood computation and model comparisons

are hard.

  • Limited use in classifiers
slide-13
SLIDE 13

❖ Collection of probability distributions which are described by a

finite dimensional parameter set

  • Examples -
  • Poisson Distribution
  • Bernoulli Distribution
  • Gaussian Distribution

Parametric Models (Chap 2 PRML)

slide-14
SLIDE 14

Gaussian Distribution

One of most widely used and well studied model

Points of equal probability lie on on contour Diagonal Gaussian with Identical Variance

slide-15
SLIDE 15

Gaussian Distribution

Insights into two dimensional Gaussian distribution

Diagonal Gaussian with different variance

slide-16
SLIDE 16

Gaussian Distribution

Insights into two dimensional Gaussian Distribution

Full covariance Gaussian distribution

slide-17
SLIDE 17

Gaussian Distribution

Fitting the data with a Gaussian Model

slide-18
SLIDE 18

Finding the parameters of the Model

❖ The Gaussian model has the following parameters ❖ Total number of parameters to be learned for D dimensional

data is

❖ Given N data points how do we estimate the

parameters of model.

❖ Several criteria can be used ❖ The most popular method is the maximum likelihood

estimation (MLE).

slide-19
SLIDE 19

MLE

Define the likelihood function as The maximum likelihood estimator (MLE) is The MLE satisfies nice properties like

  • Consistency (covergence to true value)
  • Efficiency (has the least Mean squared error).
slide-20
SLIDE 20

MLE

For the Gaussian distribution To estimate the parameters