Latent Factor Analysis of Gaussian Distributions under Graphical - PowerPoint PPT Presentation

Latent Factor Analysis of Gaussian Distributions under Graphical Constraints Md Mahmudul Hasan, Shuangqing Wei, Ali Moharrer School of Electrical Engineering and Computer Science, Louisiana State University Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 1 / 23

Outline Motivation 1 Introduction 2 Main results 3 Building a Gaussian Tree 4 Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 2 / 23

Motivation Outline Motivation 1 Introduction 2 Main results 3 Building a Gaussian Tree 4 Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 3 / 23

Motivation Motivation • Here the primary aim is to do dimension reduction using rank of a matrix as the object function, which is hard to achieve. Hence we use trace as the object function, which is effectively almost as good as rank minimization. • Unlike traditional ways to do factor analysis i.e. numerically, providing algorithms or providing convergence proof, we are trying to do factor analysis under graphical constrainsts. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 4 / 23

Introduction Outline Motivation 1 Introduction 2 Main results 3 Building a Gaussian Tree 4 Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 5 / 23

Introduction Definitions The well-known factor analytic decomposition of an nxn population covariance matrix Σ , Σ = (Σ − D ) + D (1) • MTFA- matrix Σ − D is Gramian and matrix D is diagonal • CMTFA- matrix Σ − D and matrix D is both are Gramian. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 6 / 23

Introduction A brief on CMTFA CMTFA seeks a minimum trace Σ t that solves the following decomposition problem. Σ x = Σ t + D (2) such that, Σ t is low rank (rank < n ), D is diagonal, both Σ t and D are Gramian matrices. It was shown in [1] that, the above decomposition problem is equivalent to solving the following convex optimization problem. min − tr ( D ) D s.t. − λ min ( D ) ≤ 0 and − D i,i ≤ 0 , i = 1 , . . . , n (3) where, λ min ( D ) is the minimum eigenvalue of the matrix D . 1 Giacomo Della Riccia and Alexander Shapiro, ”Minimum Rank and Minimum Trace of Covariance Matrices”, Psychometrika, vol. 47, No. 4, December, 1982. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 7 / 23

Introduction A brief on CMTFA (cont’d) The following Theorem given in the same paper, sets the ground rules for a matrix D ∗ to be the solution for the optimization problem given by (3). Theorem The matrix D ∗ is a solution of the CMTFA problem if and only if D ∗ i,i ≥ 0 , 1 ≤ i ≤ n , λ min (Σ x − D ∗ ) = 0 , and there exists n × r matrix T such that � t ∗ ,i ∈ N (Σ x − D ∗ ) , i = 1 , ...., r and the following holds, r � � t 2 � µ j � � 1 = ∗ ,i − ξ j (4) i =1 j ∈ I ( D ∗ ) where r ≤ n indicating the number of columns of the matrix T , I ( D ∗ ) = { i : D ∗ j ∈ I ( D ∗ ) } are non-negative numbers and i,i = 0 , 1 ≤ i ≤ n } , { µ j , ξ j , j ∈ I ( D ∗ ) } are column vectors in R n with all the components equal to 0 except for { � the j th component which is equal to 1 . Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 8 / 23

Introduction Problem Statement Our goal is to characterize the solution space of CMTFA, when CMTFA is applied to a specially generated Σ x i.e. generated from the following model.  x 1   α 1   z 1  . . . .  = . � � + . (5) y       . . .      x n α n z n where, y ∼ N (0 , 1) , 0 ≤ | α j | ≤ 1 , j = 1 , 2 , . . . , n . and { z i } are independent Gausian random varables with z i ∼ N (0 , 1 − α 2 i ) The above model makes the following star topology with y being the latent variable. Figure: A star topology generative model. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 9 / 23

Introduction State of the Art In [2], a necessary and sufficient condition was found on the subspace of Σ x for MTFA solution of Σ x to recover a star structure, when Σ x is equipped with a latent star graphical constraint. The main differences between their work and ours are, • We found the same condtion for CMTFA as they did for MTFA. • Even more importantly, we also characterized the solution for the case when CMTFA fails to recover a star structure. • We characterized the solution for all possible situations. 2 J. SAUNDERSON, V. CHANDRASEKARAN, P. A. PARRILO , AND A. S. WILLSKY, ”DIAGONAL AND LOW-RANK MATRIX DECOMPOSITIONS, CORRELATION MATRICES, AND ELLIPSOID FITTING” SIAM J. MATRIX ANAL. APPL., vol. 33, no. 4, pp. 1395-1416, 2015. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 10 / 23

Introduction Our contributions α = [ α 1 , α 2 , α 3 , ....., α n ] ′ , and without the loss of generality we have We define, vector � assumed | α 1 | ≥ | α 2 | ≥ · · · ≥ | α n | . We apply CMTFA to the following Σ x , aiming to characterize the solution space. 1  . . . α 1 α n  . . ... . . Σ x = (6)   . .   1 α n α 1 . . . Our contributions can be summarized by the following two theorems. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 11 / 23

Main results Outline Motivation 1 Introduction 2 Main results 3 Building a Gaussian Tree 4 Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 12 / 23

Main results Theorem α is non-dominant, i.e. | α 1 | ≤ � n i =2 | α i | . CMTFA solution of Σ x is Σ t,ND if and only if, � α 2  α 1 α 2 . . . α 1 α n  1 α 2 α 2 α 1 . . . α 2 α n 2   Σ t,ND = (7)  . . .  ... . . .   . . .   α 2 α n α 1 α n α 2 . . . n Theorem α is dominant, i.e. | α 1 | > � n CMTFA solution of Σ x is Σ t,DM if and only if, � i =2 | α i | . | α 1 | ( � n i =2 | α i | )  α 1 α 2 . . . α 1 α n  | α 2 | ( | α 1 | − � i � =1 , 2 | α i | ) α 2 α 1 . . . α 2 α n   Σ t,DM =  . . .  ... . . .   . . .   α n α 1 α 1 α 2 . . . | α n | ( | α 1 | − � i � =1 ,n | α i | ) (8) Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 13 / 23

Main results Proof of the non-dominant case The following Lemma will help us prove our first theorem. Lemma α given by (9) is a necessary condition for the existence of such Non-dominance of vector � t j, ∗ || 2 = 1 , n × r matrix T that � 1 ≤ i ≤ r and || � t ∗ ,i ∈ N (Σ t,ND ) , 1 ≤ j ≤ n . n � | α 1 | ≤ | α i | (9) i =2 Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 14 / 23

Main results Proof of the non-dominant case (cont’d) Referring to the necessary and sufficient condtion for CMTFA solution. • Σ t,ND is rank 1 , its minimum eigenvalue is 0 . • We only need to show the existance of such n × r matrix T that t j, ∗ || 2 = 1 , � 1 ≤ i ≤ r and || � t ∗ ,i ∈ N (Σ t,ND ) , 1 ≤ j ≤ n where 1 ≤ r ≤ n . • the Lemma has already stated that, for the existence of such T non-dominance of � α is a necessary condition. • We next show that for the existence of such a T , non-dominance of � α is also a sufficient condition. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 15 / 23

Main results Proof of the non-dominant case (cont’d) It is straightforward to find the following basis vectors for the null space of Σ t,ND , − α 2 − α 3 − α n       α 1 α 1 α 1 1 0 0        0   1   0  v 1 = v 2 = v n − 1 = (10) � ,� , . . . , �        .   .   .  . . .       . . .       0 0 1 v n − 1 , � n − 1 v i ] , where c i ∈ { 1 , − 1 } , We define, V = [ � v 1 , . . .� i =1 c i +1 � i = 2 , . . . , n . Let, T n × n = V n × n · B n × n , where B n × n is a diagonal matrix. We have, TT T = V BB T V T = V βV T (11) The diagonal matrix β = BB T can have only non-negative entries. Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 16 / 23

Main results Proof of non-dominant case (cont’d) t j, ∗ || 2 = 1 , Because of || � 1 ≤ j ≤ n condition on T , we get the following n equations, � n � 2 n − 1 α 2 c j α j � i +1 � β ii + β nn = 1 (12) α 2 α 1 1 i =1 j =2 β ii + c 2 i = 1 , . . . , n − 1 i +1 β nn = 1 , (13) If we select c i α i = | α i | , i = 2 , . . . , n and solve the above equations under such selections, we get β ii ≥ 0 , 1 ≤ i ≤ n . That completes the proof of our first Theorem . Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 17 / 23

Main results Proof of the dominant case The following two Lemmas play pivotal role in proving the second theorem we proposed. Lemma Σ t,DM is a rank n − 1 matrix. Lemma There exists a column vector Φ = [Φ 1 , Φ 2 , ...., Φ n ] ′ such that Σ t,DM Φ = 0 , where Φ i ∈ {− 1 , 1 } , 1 ≤ i ≤ n . Md Mahmudul Hasan, S Wei, Ali Moharrer (LSU) ISIT 2020 June 2020 18 / 23

Latent Factor Analysis of Gaussian Distributions under Graphical - PowerPoint PPT Presentation

Latent Factor Analysis of Gaussian Distributions under Graphical Constraints Md Mahmudul Hasan, Shuangqing Wei, Ali Moharrer School of Electrical Engineering and Computer Science, Louisiana State University Md Mahmudul Hasan, S Wei, Ali Moharrer

Triadic Factor Analysis Cynthia Glodeanu Institute of Algebra, TU Dresden October 19, 2010.

Factor Analysis Professor Patrick Sturgis Plan Measuring concepts using latent variables

Gaussian Filter The Gaussian filter 1 2 1 A Gaussian kernel gives less 1 2 4 2 weight to

Formal Modeling in Cognitive Science 1 Distributions Lecture 20: Joint, Marginal, and Conditional

Confirmatory Factor Analysis and Exploratory-Confirmatory Factor Analysis Maximum

Lecture 3 Capacity of Multiuser Gaussian Channels The Gaussian uplink: 6.1 The fading

? ? ? ? Basic Charts Outline - Distributions & Histograms - Mean, Mode, Average - Chart

Piecewise Bounds for Estimating Bernoulli- Logistic Latent Gaussian Models Mohammad Emtiyaz Khan

Case Study: Approximate Bayesian Inference for Latent Gaussian Models by Using Integrated Nested

Distributed Variational Inference in Sparse Gaussian Process Regression and Latent Variable Models

Latent Variable Models CS3750 Xiaoting Li 1 Out utli line Latent Variable Models

1 Latent variable models In the next section we will discuss latent variable models for

Faster Gaussian Lattice Sampling using Information Leakage Gaussian Sampling Our Work Lazy

Non-Gaussian likelihoods for Gaussian Processes Alan Saul Outline Motivation Non-Gaussian

CS70: Jean Walrand: Lecture 36. Gaussian and CLT CS70: Jean Walrand: Lecture 36. Gaussian and

Week 7 Video 5 Factor Analysis Factor Analysis You have a whole lot of variables Can

Object-based SSD (OSSD): Our Practice and Experience Jaesoo Lee jaesu.lee@samsung.com Flash

Outcome Delivery at Miki Renee Tsielepi ? Mobius Loop Workshop Reservations Purchasing

HVP contributions to anomalous magnetic moments of all leptons from first principle At Physical

Outline Background Venezia Hardware Architecture Venezia Software Architecture

Quickest Detection of a Dynamic Anomaly in a Heterogeneous Sensor Network Georgios Rovatsos, UIUC

Health Care Innovation Challenge November 17, 2011 The Innovation Center Mission Statement

Agenda What is the Connecticut Microgrid Program? CEFIA Overview Financing Challenge

Clean Energy Solutions Center and the ClimateWorks Foundation-International Council on Clean

Latent Factor Analysis of Gaussian Distributions under Graphical - PowerPoint PPT Presentation

Latent Factor Analysis of Gaussian Distributions under Graphical Constraints Md Mahmudul Hasan, Shuangqing Wei, Ali Moharrer School of Electrical Engineering and Computer Science, Louisiana State University Md Mahmudul Hasan, S Wei, Ali Moharrer

Triadic Factor Analysis Cynthia Glodeanu Institute of Algebra, TU Dresden October 19, 2010.

Factor Analysis Professor Patrick Sturgis Plan Measuring concepts using latent variables

Gaussian Filter The Gaussian filter 1 2 1 A Gaussian kernel gives less 1 2 4 2 weight to

Formal Modeling in Cognitive Science 1 Distributions Lecture 20: Joint, Marginal, and Conditional

Confirmatory Factor Analysis and Exploratory-Confirmatory Factor Analysis Maximum

Lecture 3 Capacity of Multiuser Gaussian Channels The Gaussian uplink: 6.1 The fading

? ? ? ? Basic Charts Outline - Distributions &amp; Histograms - Mean, Mode, Average - Chart

Piecewise Bounds for Estimating Bernoulli- Logistic Latent Gaussian Models Mohammad Emtiyaz Khan

Case Study: Approximate Bayesian Inference for Latent Gaussian Models by Using Integrated Nested

Distributed Variational Inference in Sparse Gaussian Process Regression and Latent Variable Models

Latent Variable Models CS3750 Xiaoting Li 1 Out utli line Latent Variable Models

1 Latent variable models In the next section we will discuss latent variable models for

Faster Gaussian Lattice Sampling using Information Leakage Gaussian Sampling Our Work Lazy

Non-Gaussian likelihoods for Gaussian Processes Alan Saul Outline Motivation Non-Gaussian

CS70: Jean Walrand: Lecture 36. Gaussian and CLT CS70: Jean Walrand: Lecture 36. Gaussian and

Week 7 Video 5 Factor Analysis Factor Analysis You have a whole lot of variables Can

Object-based SSD (OSSD): Our Practice and Experience Jaesoo Lee jaesu.lee@samsung.com Flash

Outcome Delivery at Miki Renee Tsielepi ? Mobius Loop Workshop Reservations Purchasing

HVP contributions to anomalous magnetic moments of all leptons from first principle At Physical

Outline Background Venezia Hardware Architecture Venezia Software Architecture

Quickest Detection of a Dynamic Anomaly in a Heterogeneous Sensor Network Georgios Rovatsos, UIUC

Health Care Innovation Challenge November 17, 2011 The Innovation Center Mission Statement

Agenda What is the Connecticut Microgrid Program? CEFIA Overview Financing Challenge

Clean Energy Solutions Center and the ClimateWorks Foundation-International Council on Clean

? ? ? ? Basic Charts Outline - Distributions & Histograms - Mean, Mode, Average - Chart