Nonparametric spectral-based estimation of latent structures - PowerPoint PPT Presentation

Nonparametric spectral-based estimation of latent structures Stéphane Bonhomme (Chicago), Koen Jochmans (Sciences Po) and J.-M. Robin (Sciences Po and UCL) May 27, 2014 1 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

Paper question Economist like unobserved heterogeneity and dynamic factor models. Usually discrete mixtures of parametric distributions (derived from theory) For identification and also estimation, it is useful to consider discrete mixtures of nonparametric models. This paper proposes a simple estimation procedure for discrete mixtures and hidden Markov models of nonparametric distribution components. 2 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

Identification The question of identification in latent structures is the topic of a very recent and active literature. Nonparametric identification from univariate/cross-sectional data typically fails. (Some exceptions for location models) Multivariate data (panel data) can present a powerful identification source. Finite mixtures/latent-class models: Hall and Zhou (2003); 1 Allman et al. (2009) (Dynamic) discrete-choice models: Magnac and Thesmar (2002); 2 Kasahara and Shimotsu (2009) Hidden Markov/regime-switching models: Allman et al. (2009); 3 Gassiat et al. (2013) Models for corrupted and misclassified data: Schennach (2004); 4 Hu and Schennach (2008) 3 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

Contribution We propose a new constructive identification argument... that delivers a least square-type estimator for mixture weights... allowing for asymptotic distributional theory. 4 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

Discrete mixtures of discrete distributions Let ( y 1 ,..., y q ) be q discrete variables with supp ( y i ) = { 1 ,..., κ i } . There exists a latent variable x ∈ { 1 ,..., r } with π j ≡ Pr { x = j } . Let p ij ∈ [ 0 , 1 ] κ i denote the vector of conditional probability masses of y i given x = j : p ij ( k ) ≡ Pr { y i = k | x = j } , k = 1 ,..., κ i 5 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

Unconditional distribution for DMs The unconditional joint PDF of ( y 1 ,..., y q ) is r ∑ P ( y 1 ,..., y q ) = π j p 1 j ( y 1 ) p 2 j ( y 2 ) ... p qj ( y q ) j = 1 The set of values P ( y 1 ,..., y q ) for all ( y 1 ,..., y q ) defines a q -dimensional array r ∑ P = π j p 1 j � p 2 j � ··· � p qj j = 1 � is the Kronecker product 6 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

Hidden Markov models There are q discrete latent variables ( x 1 ,..., x q ) for q measurements ( y 1 ,..., y q ) . Stationarity: Pr { x i = j } = π j , i = 1 ,..., q Pr { x i + 1 | x i } = K ( x i , x i + 1 ) , i = 1 ,..., q − 1 Pr { y i = k | x i = j } = p j ( k ) , k = 1 ,..., κ Conditional independence: measurements y 1 ,..., y q are independent conditional on ( x 1 ,..., x q ) . 7 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

Unconditional distribution for HHMs (1) The unconditional joint PDF of ( y 1 ,..., y 3 ) is P ( y 1 , y 2 , y 3 ) � � �� r r r ∑ ∑ ∑ = π j 1 p j 1 ( y 1 ) K ( j 1 , j 2 ) p j 2 ( y 2 ) K ( j 2 , j 3 ) p j 3 ( y 3 ) j 1 = 1 j 2 = 1 j 3 = 1 �� r r r ∑ ∑ ∑ = p j 1 ( y 1 ) π j 1 K ( j 1 , j 2 ) p j 2 ( y 2 ) K ( j 2 , j 3 ) p j 3 ( y 3 ) j 2 = 1 j 1 = 1 j 3 = 1 8 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

Unconditional distribution for HHMs (2) Let P = [ p 1 ,..., p r ] ∈ R κ × r and Π = diag ( π 1 ,..., π r ) . Hence the 3-dimensional array r � PK ⊤ � ∑ P = ( P Π K ) j � p j � j j = 1 where M j denotes the j th column of matrix M If q > 3 one can select all consecutive triples or regroup observations into 3 consecutive clusters: ( y 1 ,..., y k − 1 ) , y k , ( y k + 1 ,..., y q ) . 9 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

Identification of such latent array structures Kruskal (Psychometrica 1976, Linear Algebra Appl. 1977) Consider a κ 1 × κ 2 × κ 3 array P = ∑ r j = 1 p 1 j � p 2 j � p 3 j Let P i = [ p i 1 ,..., p ir ] ∈ R κ i × r , i = 1 , 2 , 3 Let r i = max { k : all collections of k columns of P i are independent} (the Kruskal-rank of P i ). Note that if P ∈ R κ × r has rank r it also has Kruskal-rank r . If r 1 + r 2 + r 3 ≥ 2 r + 2 then P uniquely determines the matrices P i up to simultaneous column-permutation and common column-scaling. 10 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

Application to statistics Allman, Matias and Rhodes (AoS, 2009) Allman et al. use Kruskal’s result to give conditions for the identification of discrete mixtures of discrete and continuous nonparametric distributions, hidden Markov models and some stochastic graphs. 11 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

Discrete mixtures Allman, Matias and Rhodes (AoS, 2009) Kruskal’s theorem applies with r ∑ P = π j p 1 j � p 2 j � p 3 j j = 1 P 1 = [ π 1 p 11 ,..., π r p 1 r ] , P i = [ p i 1 ,..., p ir ] , i > 1 Since sum ( P 1 , 1 ) = [ π 1 ,..., π r ] and (Corollary 2) sum ( P i , 1 ) = [ 1 ,..., 1 ] , i > 1 , then, if r 1 + r 2 + r 3 ≥ 2 r + 2 , group-probabilities π j and conditional probabilities p ij are identified up to labeling. (Theorem 8) Holds for continuous mixture components if the component densities are linearly independent ( r 1 = r 2 = r 3 = r ). 12 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

HMMs Allman, Matias and Rhodes (AoS, 2009), Theorem 6 The parameters of an HMM with r hidden states and κ observable states are generically identifiable from the marginal distribution of 2 k + 1 consecutive variables provided k satisfies � k + κ − 1 � ≥ r κ − 1 � k + κ − 1 � Note that = κ for k = 1 (3 measurements) and κ − 1 � k + κ − 1 � = k + 1 for κ = 2 (binary outcomes). κ − 1 13 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

Application to HMMs Gassiat, Cleynen, Robin (arXiv, 2013), Theorem 2.1 They use Allman et al.’s result to prove the following result. Assume that r is known, P = [ p 1 ,..., p r ] is full column rank, and K has full rank. Then K and P are identifiable from from the distribution of 3 consecutive observations ( y 1 , y 2 , y 3 ) up to label swapping of the hidden states. Estimation by penalized ML or EM algorithm. 14 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

Constructive identification procedures There exists few constructive identification procedures. De Lathauwer (SIAM, 2006) applies to the case where one outcome (say y 1 ) is such that P 1 is full column rank. However it provides identification only up to relabeling AND scaling. Group probabilities π j are thus not identified. We propose one such constructive identification that works both for DMs and HMMs, inspired from ICA or blind deconvolution. 15 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

DMs P = ∑ r j = 1 π j p 1 j � p 2 j � p 3 j Let Π = diag ( π 1 ,..., π r ) , and P i = [ p i 1 ,..., p ir ] ∈ R κ i × r , i = 1 , 2 , 3 . Assume rank ( P i ) = r and π j > 0 . 16 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

DMs P 1 Π P ⊤ 2 = ∑ r j = 1 π j p 1 j p ⊤ 2 j = ∑ r j = 1 π j p 1 j � p 2 j is the matrix containing probabilities P ( y 1 , y 2 ) . Observable. SVD on P 1 Π P ⊤ 2 , which has rank r , allows to construct U and V such that κ 2 × r = I r ⇒ ( VP 2 ) ⊤ = ( UP 1 Π) − 1 ≡ Q − 1 r × κ 1 P 1 Π P ⊤ 2 V ⊤ U r × r P (: , : , k ) = ∑ r j = 1 π j p 1 j � p 2 j � p 3 j ( k ) = P 1 Π D 3 k P ⊤ 2 , with D 3 k = diag [ p 31 ( k ) ,..., p 3 r ( k )] , is the matrix containing probabilities P ( y 1 , y 2 , k ) (for any y 1 , y 2 and y 3 = k ). Also observable. W k = U P (: , : , k ) V ⊤ = QD 3 k Q − 1 (whitening) P 3 identified by the eigenvalues of matrices W 1 ,..., W κ 3 Repeat for P 1 and P 2 . π = [ π 1 ; ... ; π r ] identified from P ( y i ) = ∑ r j = 1 π j p ij ( y i ) = P π 17 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation of latent structures

Nonparametric spectral-based estimation of latent structures - PowerPoint PPT Presentation

Nonparametric spectral-based estimation of latent structures Stphane Bonhomme (Chicago), Koen Jochmans (Sciences Po) and J.-M. Robin (Sciences Po and UCL) May 27, 2014 1 / 29 Bonhomme, Jochmans, Robin Nonparametric spectral-based estimation

Part III: Latent Tree Models Le Song ICML 2012 Tutorial on Spectral Algorithms for Latent

Spectral Clustering Spectral Clustering? Spectral methods Methods using eigenvectors of

Nonparametric Minimax Estimation of the Estimation of the Volatility in High- Volatility in

Part 3. Spectrum Estimation Part 3. Spectrum Estimation 3.2 Parametric Methods for Spectral

Lesson 9 Introduction Signal Spectral Analysis: Estimation of the power spectral density

Nonparametric Density Estimation October 1, 2018 Introduction If we cant fit a

Nonparametric density estimation Christopher F Baum EC 823: Applied Econometrics Boston College,

Nonparametric density estimation Christopher F Baum ECON 8823: Applied Econometrics Boston

Basic Definitions and The Spectral Estimation Problem Lecture 1 Lecture notes to accompany

An Introduction to Spectral Learning Hanxiao Liu November 8, 2013 An Introduction to Spectral

Nonparametric Regression Splines for Nonparametric Regression Splines for Regional Atmospheric

Nonparametric Sequential Change Detection for High-Dimensional Problems Yasin Ylmaz Electrical

The np package np : A Package for Nonparametric Kernel The np package implements a variety of

Nonparametric analysis of CMB Nonparametric analysis of CMB power spectrum data and consistency

1 Latent variable models In the next section we will discuss latent variable models for

Outline Density Estimation 1 Nonparametric Methods Bins Kernel Estimators k-Nearest Neighbor

Event Generation and Statistical Sampling with Deep Generative Models Rob Verheyen Introduction

Video Synthesis from the StyleGAN Latent Space Advisor Dr. Chris Pollett Committee Members By

Discriminating Languages in a Probabilistic Latent Subspace Aleksandr Sizov , Kong Aik Lee, Tomi

Representing Documents via Latent Keyphrase Inference April. 15 th , 2016 Document Representation

Latent Dimensions of Religion and Spirituality: A Longitudinal Correlated Topic Model Seong-Hyeon

Knowledge Transfer Using Latent Variable Models Ayan Acharya UT Austin, Department of ECE July

Learning Semantic Visual Codebook for Action Recognition by Embedding into Concept Space Behrouz

Distributed Variational Inference in Sparse Gaussian Process Regression and Latent Variable Models

Sambuz

Useful Links

Newsletter

Mail Us