Estimating Gaussian Mixture Models from Data with Missing Features


  1. Estimating Gaussian Mixture Models from Data with Missing Features, by Daniel McMichael (CSSIP)

  2. Missing Data
  In classification we frequently seek to classify objects using vectors of measured features. Sometimes these features are missing:

  x = [0.5, 0.7, 0.4, ×, 0.2, ×, 0.8]^T, where × marks a missing feature.
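As a minimal illustration (not part of the original slides), a datum with missing features can be encoded in NumPy as a vector with NaN placeholders plus a boolean mask of observed entries:

```python
import numpy as np

# The slide's example vector, with NaN standing in for the missing features.
x = np.array([0.5, 0.7, 0.4, np.nan, 0.2, np.nan, 0.8])

observed = ~np.isnan(x)   # mask of observed features
print(observed)           # [ True  True  True False  True False  True]
print(x[observed])        # the measured sub-vector: [0.5 0.7 0.4 0.2 0.8]
```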

  3. Gaussian Mixture Models (GMMs)
  A probability density model (a weighted sum of Gaussians):

  p(y \mid \{\alpha_i, \mu_i, \Sigma_i\}_{i=1}^{n}) = \sum_{i=1}^{n} \frac{\alpha_i}{\sqrt{|2\pi\Sigma_i|}} \exp\left\{ -\tfrac{1}{2} (y - \mu_i)^T \Sigma_i^{-1} (y - \mu_i) \right\}    (1)

  Qualities of GMMs:
  - Can model any density (given enough components)
  - Can be applied to classification
  - Widely used
  - "Easy" to analyse
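To make equation (1) concrete, here is a small sketch that evaluates a GMM density directly from the weights, means and covariances (function and variable names are mine, not from the slides):

```python
import numpy as np

def gmm_density(y, alphas, mus, Sigmas):
    """Evaluate the GMM density of equation (1) at a point y.

    alphas: (n,) mixture weights, summing to 1
    mus:    (n, p) component means
    Sigmas: (n, p, p) component covariances
    """
    total = 0.0
    for alpha, mu, Sigma in zip(alphas, mus, Sigmas):
        diff = y - mu
        norm = np.sqrt(np.linalg.det(2.0 * np.pi * Sigma))  # sqrt|2*pi*Sigma|
        quad = diff @ np.linalg.solve(Sigma, diff)          # Mahalanobis term
        total += alpha * np.exp(-0.5 * quad) / norm
    return total
```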

  4. Heteroscedastic GMMs (HGMMs)
  Conventionally, GMMs are homoscedastic: all data are modelled with the same Gaussian distributions:

  p(y_j \mid \alpha, \mu, \Sigma) = \sum_{i=1}^{n} \alpha_i \, p(y_j \mid \mu_i, \Sigma_i)    (2)

  Introduce a heteroscedastic variant, where the response to each datum is different:

  p(y_j \mid \bar{y}_j, M_j, \alpha, \mu, \Sigma) = \sum_{i=1}^{n} \alpha_i \, p(y_j \mid \bar{y}_j + M_j \mu_i,\ M_j \Sigma_i M_j^T + \Sigma_{\bar{y}_j})    (3)

  5. Uses for HGMMs
  - estimation of GMM parameters from data with missing features;
  - estimation and prediction of indirectly observed mixture processes;
  - modelling heteroscedastic data.

  Need only a simplified HGMM:

  p(y_j \mid \Theta) = \sum_{i=1}^{n} \alpha_i \, p(y_j \mid M_j \mu_i,\ M_j \Sigma_i M_j^T)    (4)

  The gain matrices \{M_j\}_{j=1}^{N} of the N data contain only 1s and 0s, and are formed by deleting from an identity matrix the rows corresponding to the missing features of each datum. The result is the marginal distribution of the remaining features.
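A minimal sketch of the gain-matrix construction described above, under the simplified HGMM of equation (4) (names are illustrative):

```python
import numpy as np

def gain_matrix(observed):
    """Form M_j by deleting from an identity matrix the rows that
    correspond to the missing features of the datum (slide 5)."""
    return np.eye(observed.shape[0])[observed]

# Example: features 4 and 6 of a 7-dimensional datum are missing.
observed = np.array([True, True, True, False, True, False, True])
M = gain_matrix(observed)            # shape (5, 7), entries only 0s and 1s

# Marginal parameters of component i for this datum, per equation (4):
mu_i, Sigma_i = np.zeros(7), np.eye(7)
marginal_mean = M @ mu_i             # M_j mu_i
marginal_cov = M @ Sigma_i @ M.T     # M_j Sigma_i M_j^T
```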

  6. The EM Algorithm [Refer: Dempster, Laird and Rubin, 1977]
  Aim: to maximise a likelihood or posterior p(Y \mid \Theta) over the parameters \Theta, whilst integrating out the nuisance parameters Z. Start with the guess \Theta = \Theta^*.

  E-step: Q(\Theta \mid \Theta^*) = \int \log p(Y, Z \mid \Theta) \, p(Z \mid Y, \Theta^*) \, dZ

  M-step: \Theta^{**} = \arg\max_{\Theta} Q(\Theta \mid \Theta^*)
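A schematic of the EM loop on slide 6, with the E- and M-steps passed in as callables; the convergence test is a placeholder assumption, not from the slides:

```python
import numpy as np

def em(theta, e_step, m_step, n_iter=100, tol=1e-8):
    """Generic EM skeleton: alternate expectations under p(Z | Y, theta*)
    with maximisation of Q(theta | theta*) until the parameters settle."""
    for _ in range(n_iter):
        expectations = e_step(theta)        # E-step
        new_theta = m_step(expectations)    # M-step: argmax_theta Q
        if np.max(np.abs(new_theta - theta)) < tol:  # placeholder test
            return new_theta
        theta = new_theta
    return theta
```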

  7. The E-step for HGMMs
  Assume conditionally independent data Y = \{y_j\}_{j=1}^{N}, and group the heteroscedastic parameters \{M_j\}_{j=1}^{N} together into a set M. The E-step is the calculation of P(i \mid y_j, \Theta^*, M) for all pairs of data and HGMM components:

  P(i \mid y_j, \Theta^*, M) = \frac{p(y_j \mid i, \Theta^*, M) \, P(i \mid \Theta^*, M)}{\sum_{l=1}^{n} p(y_j \mid l, \Theta^*, M) \, P(l \mid \Theta^*, M)}

  i.e.

  P(i \mid y_j, \Theta^*, M) = \frac{\alpha_i^* \, p(y_j \mid i, \Theta^*, M)}{\sum_{l=1}^{n} \alpha_l^* \, p(y_j \mid l, \Theta^*, M)}
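A sketch of this E-step for the simplified HGMM of equation (4), computing the responsibility of every component for every datum (a direct, unoptimised implementation; names are mine):

```python
import numpy as np
from scipy.stats import multivariate_normal

def e_step(Y_obs, masks, alphas, mus, Sigmas):
    """P(i | y_j, theta*, M) for all datum/component pairs.

    Y_obs: list of N observed sub-vectors y_j (lengths may differ)
    masks: list of N boolean arrays marking the observed features
    """
    N, n = len(Y_obs), len(alphas)
    P = np.zeros((N, n))
    for j, (y, obs) in enumerate(zip(Y_obs, masks)):
        M = np.eye(obs.shape[0])[obs]           # gain matrix M_j
        for i in range(n):
            # Marginal density of the observed features under component i.
            P[j, i] = alphas[i] * multivariate_normal.pdf(
                y, mean=M @ mus[i], cov=M @ Sigmas[i] @ M.T)
        P[j] /= P[j].sum()                      # normalise over components
    return P
```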

  8. The M-step for HGMMs
  Maximise Q(\Theta \mid \Theta^*) with respect to \Theta:

  \alpha_i^{**} = \frac{1}{N} \sum_{j=1}^{N} P(i \mid y_j, \Theta^*, M)

  \mu_i^{**} = \left[ \sum_{j=1}^{N} P(i \mid y_j, \Theta^*, M) \, H_j M_j \right]^{-1} \sum_{j=1}^{N} P(i \mid y_j, \Theta^*, M) \, H_j y_j

  Iterate to find \Sigma_i:

  \Sigma_i \leftarrow \Sigma_i + \lambda \, \delta\Sigma_i, \qquad \delta\Sigma_i = \frac{1}{2} \sum_{j=1}^{N} P(i \mid y_j, \Theta^*, M) \, H_j \left[ (y_j - M_j \mu_i)(y_j - M_j \mu_i)^T H_j^T - I \right]

  If \lambda < 2 then, for all i, \Sigma_i will never become non-positive definite.
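A sketch of the closed-form part of this M-step (the weight and mean updates). The captured slides do not define H_j; the code below assumes H_ij = M_j^T (M_j Σ_i M_j^T)^{-1}, which turns the mean update into the usual weighted least-squares normal equations. The iterative covariance step is omitted:

```python
import numpy as np

def m_step_alpha_mu(Y_obs, masks, P, Sigmas):
    """Update the weights and means given responsibilities P (N x n).

    Assumes H_ij = M_j^T (M_j Sigma_i M_j^T)^{-1}; this definition is an
    assumption, since H_j is not defined in the captured slides.
    """
    N, n = P.shape
    p = masks[0].shape[0]
    alphas = P.mean(axis=0)               # alpha_i** = (1/N) sum_j P_ij
    mus = np.zeros((n, p))
    for i in range(n):
        A, b = np.zeros((p, p)), np.zeros(p)
        for j, (y, obs) in enumerate(zip(Y_obs, masks)):
            M = np.eye(p)[obs]                            # gain matrix M_j
            H = M.T @ np.linalg.inv(M @ Sigmas[i] @ M.T)  # assumed H_ij
            A += P[j, i] * (H @ M)
            b += P[j, i] * (H @ y)
        mus[i] = np.linalg.solve(A, b)    # mu_i** from the normal equations
    return alphas, mus
```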

  9. Results
  Figure 1 (left): each feature missing 10% of the time.
  Figure 2 (right): each feature missing 60% of the time.

  10. Conclusions
  - Method for ML or MAP estimation of GMMs for data with missing features.
  - An EM algorithm: fast.
  - Able to withstand very high levels of missing data.
  - Other applications.
