FlexMix: Flexible fitting of finite mixtures with the EM algorithm

Bettina Grün (WU Wien), Friedrich Leisch (LMU München)

useR! 2008, August 12-14, 2008
Finite mixture models
[Figures: scatterplots of example data illustrating finite mixtures, first with fitted component densities and cluster labels, then a regression example plotting yn against x]
Finite mixture models
The finite mixture density is given by

h(y \mid x, w, \psi) = \sum_{k=1}^{K} \pi_k(w, \alpha) f_k(y \mid x, \theta_k) = \sum_{k=1}^{K} \pi_k(w, \alpha) \prod_{d=1}^{D} f_{kd}(y_d \mid x_d, \theta_{kd}),

with \sum_{k=1}^{K} \pi_k(w, \alpha) = 1 and \pi_k(w, \alpha) > 0 for all k and all w.

The posterior probabilities are given by

\tau_k(y \mid x, \psi) = \frac{\pi_k(w, \alpha) f_k(y \mid x, \theta_k)}{\sum_{l=1}^{K} \pi_l(w, \alpha) f_l(y \mid x, \theta_l)}.
EM algorithm
- General method for ML estimation in a missing data setting → here the missing data are the component memberships
- Iterates between two steps (illustrated in the sketch below):
– E-step: determines the a-posteriori probabilities of the component memberships
– M-step: maximizes the complete-data likelihood with the missing component memberships replaced by these probabilities → weighted ML problems for the component-specific models and the concomitant variable model
- The likelihood increases in each iteration → the algorithm converges to a local optimum if the likelihood is bounded
- Variants insert an additional step between the E- and M-step:
– Stochastic EM (SEM): assigns each observation to one component by drawing from the multinomial distribution induced by the a-posteriori probabilities
– Classification EM (CEM): assigns each observation to the component with the maximum a-posteriori probability
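To make the two steps concrete, here is a minimal EM loop for a two-component univariate Gaussian mixture in plain R. This is only an illustrative sketch, not flexmix internals; the simulated data and the starting values are assumptions.

set.seed(1)
y <- c(rnorm(100, mean = 0), rnorm(100, mean = 4))    # simulated mixture data
pi_k <- c(0.5, 0.5); mu <- c(-1, 1); sigma <- c(1, 1) # crude starting values
for (iter in 1:200) {
  ## E-step: a-posteriori probabilities of component membership
  dens <- cbind(pi_k[1] * dnorm(y, mu[1], sigma[1]),
                pi_k[2] * dnorm(y, mu[2], sigma[2]))
  tau <- dens / rowSums(dens)
  ## M-step: weighted ML estimates of weights, means and standard deviations
  nk    <- colSums(tau)
  pi_k  <- nk / length(y)
  mu    <- colSums(tau * y) / nk
  sigma <- sqrt(colSums(tau * outer(y, mu, "-")^2) / nk)
}
round(rbind(pi_k, mu, sigma), 2)

The CEM and SEM variants would replace tau by a hard or randomly drawn 0/1 assignment matrix before the M-step.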
FlexMix Design
- Primary goal is extensibility: ideal for trying out new mixture models
- Not a replacement for specialized mixture packages like mclust, but a complement
- Usage of S4 classes and methods
- Formula-based interface
- Multivariate responses:
– Combination of univariate families: assumption of independence (given x); each response may have its own model formula, i.e., a different set of regressors
– Multivariate families: if the family handles a multivariate response directly, arbitrary multivariate response distributions are possible
Fit function flexmix()
- flexmix() takes the following arguments:
formula: A symbolic description of the model to be fit. The general form is y ~ x | g, where y is the response, x the set of predictors and g an optional grouping factor for repeated measurements.
data: An optional data frame containing the variables in the model.
k: Number of clusters (not needed if cluster is specified).
cluster: Either a matrix with k columns of initial cluster membership probabilities for each observation, or a factor or integer vector with the initial cluster assignments of the observations.
model: Object of class "FLXM" or a list of such objects.
concomitant: Object of class "FLXP".
control: Object of class "FLXcontrol" or a named list.
- flexmix() can be called repeatedly via stepFlexmix()
- flexmix() returns an object of class "flexmix"
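The grouping factor g is the only argument not exercised in the examples below, so here is a small sketch under assumed data; the data frame, the variable names and the data-generating process are all hypothetical.

library("flexmix")
set.seed(1)
## Hypothetical repeated-measures data: 50 subjects, 4 measurements each,
## generated from two latent groups.
df <- data.frame(id = factor(rep(1:50, each = 4)), x = rep(1:4, times = 50))
grp <- rep(sample(1:2, 50, replace = TRUE), each = 4)
df$y <- ifelse(grp == 1, 1 + 2 * df$x, 5 - df$x) + rnorm(200)
## The grouping factor after "|" keeps all observations of a subject
## in the same component.
m_grouped <- flexmix(y ~ x | id, data = df, k = 2)
table(clusters(m_grouped), grp)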
Controlling the EM algorithm
- "FLXcontrol": for the overall behaviour of the EM algorithm:
iter.max: Maximum number of iterations.
minprior: Minimum prior probability for components.
verbose: If larger than zero, flexmix() gives status messages every verbose iterations.
classify: One of "auto", "weighted", "CEM" (or "hard"), "SEM" (or "random").
For convenience flexmix() also accepts a named list of control parameters with argument name completion, e.g. flexmix(..., control = list(class = "r")).
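For illustration, a call that sets several control parameters explicitly; the chosen values are arbitrary and NPreg is the artificial data set introduced later.

library("flexmix")
data("NPreg")
## Hard (CEM) classification with tighter iteration and prior limits
m_cem <- flexmix(yn ~ x + I(x^2), data = NPreg, k = 2,
                 control = list(iter.max = 500, minprior = 0.05,
                                verbose = 5, classify = "CEM"))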
Variants of mixture models
Component-specific models: FLXMxxx()
- Model-based clustering: FLXMCxxx()
– FLXMCmvnorm()
– FLXMCmvbinary()
– FLXMCmvpois()
– ...
- Clusterwise regression: FLXMRxxx()
– FLXMRglm()
– FLXMRglmfix()
– FLXMRziglm()
– ...
Concomitant variable models: FLXPxxx()
- FLXPconstant()
- FLXPmultinom()
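As a quick illustration of combining a model driver with a concomitant variable model, the following sketch clusters the four iris measurements with multivariate normal components; the choice of data and of k = 3 is an assumption for this example, not taken from the slides.

library("flexmix")
set.seed(1)
Y <- as.matrix(iris[, 1:4])
## FLXMCmvnorm() provides multivariate normal components (diagonal
## covariance matrices by default); FLXPconstant() is the default
## concomitant variable model.
m_mc <- flexmix(Y ~ 1, k = 3, model = FLXMCmvnorm(),
                concomitant = FLXPconstant())
table(clusters(m_mc), iris$Species)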
Methods for "flexmix" objects
- show(), summary(): some information on the fitted model
- plot(): rootogram of posterior probabilities
- refit(): refits an estimated mixture model to obtain additional information, such as the variance-covariance matrix
- logLik(), BIC(), ...: obtain the log-likelihood and model fit criteria
- parameters(), prior(): obtain component-specific or concomitant variable model parameters and the prior class probabilities/component weights
- posterior(), clusters(): obtain the a-posteriori probabilities and the assignments to the component with maximum a-posteriori probability
- fitted(), predict(): fitted and predicted (component-specific) values
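A small self-contained sketch of these accessors, using a single-response Gaussian regression mixture (a simplified variant of the artificial-data example that follows; output omitted):

library("flexmix")
data("NPreg")
set.seed(1802)
m <- flexmix(yn ~ x + I(x^2), data = NPreg, k = 2)
parameters(m, component = 1)  # component-specific coefficients and sigma
head(posterior(m))            # a-posteriori probabilities
table(clusters(m))            # maximum a-posteriori assignments
c(logLik(m), BIC(m))          # log-likelihood and a model fit criterion
head(fitted(m))               # component-specific fitted values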
Example: artificial data
- 200 observations from a mixture given by
h(y \mid x, \psi) = \tfrac{1}{2}\, \text{Normal}(y_n \mid 15 + 10x - x^2,\ 9)\, \text{Poi}(y_p \mid e^{1 + 0.1x}) + \tfrac{1}{2}\, \text{Normal}(y_n \mid 5x,\ 9)\, \text{Poi}(y_p \mid e^{2 - 0.2x}),

where Normal(y | µ, σ²) is the Gaussian distribution and Poi(y | λ) the Poisson distribution.
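The package ships 200 such observations as the data set NPreg, which is used below. Purely for illustration, data of this form could be simulated along the following lines; the seed, the uniform design for x and the object names are assumptions.

set.seed(42)
n <- 200
x <- runif(n, 0, 10)
comp <- sample(1:2, n, replace = TRUE)   # equal component weights of 1/2
yn <- ifelse(comp == 1, 15 + 10 * x - x^2, 5 * x) + rnorm(n, sd = 3)
yp <- rpois(n, lambda = ifelse(comp == 1, exp(1 + 0.1 * x), exp(2 - 0.2 * x)))
sim <- data.frame(x = x, yn = yn, yp = yp)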
Example: artificial data
[Figures: scatterplots of the NPreg data, yn against x and yp against x]
Example: artificial data
> set.seed(1802)
> library("flexmix")
> data("NPreg")
> Model_n <- FLXMRglm(yn ~ . + I(x^2))
> Model_p <- FLXMRglm(yp ~ ., family = "poisson")
> m1 <- flexmix(. ~ x, data = NPreg, k = 2, model = list(Model_n, Model_p),
+   control = list(verbose = 10))

Classification: weighted
10 Log-likelihood : -1044.7688
11 Log-likelihood : -1044.7678
converged

> m1

Call:
flexmix(formula = . ~ x, data = NPreg, k = 2,
    model = list(Model_n, Model_p), control = list(verbose = 10))

Cluster sizes:
  1   2
 96 104

convergence after 11 iterations
Example: artificial data
> summary(m1)

Call:
flexmix(formula = . ~ x, data = NPreg, k = 2,
    model = list(Model_n, Model_p), control = list(verbose = 10))

       prior size post>0 ratio
Comp.1 0.493   96    139 0.691
Comp.2 0.507  104    137 0.759

'log Lik.' -1044.768 (df=13)
AIC: 2115.536   BIC: 2158.414

> plot(m1)
Example: artificial data
[Figure: rootogram of posterior probabilities > 1e-04 for Comp. 1 and Comp. 2]
Example: artificial data
> m1_refit <- refit(m1)
> summary(m1_refit, which = "model", model = 1)
$Comp.1
             Estimate Std. Error  z value  Pr(>|z|)    
(Intercept)  14.58965    1.24635  11.706  < 2.2e-16 ***
x             9.91572    0.55294  17.933  < 2.2e-16 ***
I(x^2)       -0.97578    0.05201 -18.762  < 2.2e-16 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

$Comp.2
              Estimate Std. Error z value Pr(>|z|)    
(Intercept) -0.140549   0.961868 -0.1461   0.8838    
x            4.732610   0.474428  9.9754   <2e-16 ***
I(x^2)       0.042722   0.046890  0.9111   0.3622    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

> plot(m1_refit, bycluster = FALSE)
Example: artificial data
[Figure: coefficient estimates with confidence intervals for (Intercept), x and I(x^2) in Comp. 1 and Comp. 2 of the Gaussian model]
Example: artificial data
> summary(m1_refit, which = "model", model = 2)
$Comp.1
            Estimate Std. Error z value  Pr(>|z|)    
(Intercept) 1.037805   0.113005  9.1837 < 2.2e-16 ***
x           0.091034   0.017994  5.0592  4.21e-07 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

$Comp.2
             Estimate Std. Error z value  Pr(>|z|)    
(Intercept)  1.939213   0.088046 22.0249 < 2.2e-16 ***
x           -0.180959   0.020856 -8.6767 < 2.2e-16 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

> plot(m1_refit, model = 2, bycluster = FALSE)
Example: artificial data
[Figure: coefficient estimates with confidence intervals for (Intercept) and x in Comp. 1 and Comp. 2 of the Poisson model]
Example: artificial data
> Model_n2 <- FLXMRglmfix(yn ~ . + 0, nested = list(k = c(1, 1),
+   formula = c(~ 1 + I(x^2), ~ 0)))
> m2 <- flexmix(. ~ x, data = NPreg, cluster = posterior(m1),
+   model = list(Model_n2, Model_p))
> m2

Call:
flexmix(formula = . ~ x, data = NPreg, cluster = posterior(m1),
    model = list(Model_n2, Model_p))

Cluster sizes:
  1   2
 96 104

convergence after 3 iterations

> c(BIC(m1), BIC(m2))
[1] 2158.414 2149.956
Example: artificial data
[Figures: NPreg data with the fitted mean curves of the two components, yn against x and yp against x]
Example: patent data
- Patent data given in Wang, Cockburn and Puterman (1998)
- 70 observations from pharmaceutical and biomedical companies in 1976, taken from the National Bureau of Economic Research R&D Masterfile
- Variables:
– number of patent applications
– R&D spending
– sales in millions

h(\text{Patents} \mid \text{lgRD}, \text{RDS}, \psi) = \sum_{s=1}^{S} \pi_s(\text{RDS}, \alpha)\, \text{Poi}(\text{Patents} \mid \lambda_s), \qquad \log(\lambda_s) = \beta_1^s + \text{lgRD} \cdot \beta_2^s
Example: patent data
[Figure: scatterplot of the patent data, Patents against lgRD]
Example: patent data
> data("patent") > Conc <- FLXPmultinom(~ RDS) > (m_step <- stepFlexmix(Patents ~ lgRD, k = 2:5, nrep = 5, + concomitant = Conc, data = patent, + model = FLXMRglm(family = "poisson"))) 2 : * * * * * 3 : * * * * * 4 : * * * * * 5 : * * * * * Call: stepFlexmix(Patents ~ lgRD, concomitant = Conc, data = patent, model = FLXMRglm(family = "poisson"), k = 2:5, nrep = 5) iter converged k k0 logLik AIC BIC ICL 2 26 TRUE 2 2 -218.4911 448.9822 462.4731 473.6855 3 29 TRUE 3 3 -197.6752 415.3504 437.8354 453.5647 4 39 TRUE 4 4 -193.8785 415.7571 447.2360 471.2140 5 37 TRUE 5 5 -192.6904 421.3808 461.8537 512.0378
Example: patent data
> (m1 <- getModel(m_step, "BIC"))

Call:
stepFlexmix(Patents ~ lgRD, concomitant = Conc, data = patent,
    model = FLXMRglm(family = "poisson"), k = 3, nrep = 5)

Cluster sizes:
 1  2  3
13 45 12

convergence after 29 iterations
Example: patent data
[Figure: patent data with each observation labelled by its assigned cluster (1-3), Patents against lgRD]
Example: patent data
> m1_refit <- refit(m1)
> summary(m1_refit, which = "concomitant")
$Comp.2
             Estimate Std. Error z value  Pr(>|z|)    
(Intercept)   3.10653    0.87491  3.5507 0.0003842 ***
RDS         -40.99625   16.09568 -2.5470 0.0108642 *  
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

$Comp.3
            Estimate Std. Error z value Pr(>|z|)
(Intercept)  0.21385    0.52411  0.4080   0.6833
RDS         -0.74566    1.01832 -0.7322   0.4640

> plot(m1_refit, which = "concomitant")
Example: patent data
[Figure: concomitant model coefficient estimates with confidence intervals for (Intercept) and RDS in Comp. 2 and Comp. 3]
Summary
- FlexMix offers an easy and extensible framework for EM-based estimation of finite mixture models in R ⇒ users can write their own model drivers to fit new variants of mixture models.
- FlexMix currently contains only interpreted code.