Approximate Factor Analysis Models Lorenzo Finesso, Peter Spreij - PowerPoint PPT Presentation

Approximate Factor Analysis Models Lorenzo Finesso, Peter Spreij Brixen – July 19, 2007

  1 0 0 1 1   0 P 1 =   2 2   1 1   0 2 2 1 1   2 0 2   1 1 P 2 = 2 0   2     0 0 1 1 1 1   2 4 4 1 1 1   P 2 P 1 =   2 4 4   1 1   0 2 2 1

Factor Analysis models Y = HX + ε where X ∈ R k and ε ∈ R n , independent zero mean normals, ( k < n ) C ov( X ) = I , and C ov( ε ) = D > 0, diagonal therefore C ov( Y ) := Σ 0 = HH ⊤ + D C ov( Y | X ) = D diagonal 2

Exact (weak) realization of FA models Problem Given the positive covariance matrix Σ 0 ∈ R n × n and the integer k < n find ( H, D ) such that H ∈ R n × k D > 0 diagonal n × n Σ 0 = HH ⊤ + D 3

Informational divergence between normal measures Given probability measures P 1 ≪ P 2 , on the same space D ( P 1 || P 2 ) = E P 1 log d P 1 d P 2 normal case on R n P 1 = N (0 , Σ 1 ) , P 2 = N (0 , Σ 2 ) D ( P 1 || P 2 ) := D (Σ 1 || Σ 2 ) = 1 2 log | Σ 2 | | Σ 1 | + 1 2 Σ 1 ) − n 2 tr(Σ − 1 2 4

Approximate FA models Problem Given Σ 0 ∈ R n × n positive and the integer k < n minimize 2 log | HH ⊤ + D | D (Σ 0 || HH ⊤ + D ) = 1 + 1 2 tr(( HH ⊤ + D ) − 1 Σ 0 ) − n | Σ 0 | 2 over ( H, D ), where H ∈ R n × k and D > 0 is diagonal of size n Proposition The approximate FA problem admits a (nonunique) solution 5

Lifted version of the problem Definitions � � � � Σ 11 Σ 12 Σ ∈ R ( n + k ) × ( n + k ) : Σ = Σ = > 0 Σ 21 Σ 22 Two subsets of Σ will play a special role Σ 0 = { Σ ∈ Σ : Σ 11 = Σ 0 } HH ⊤ + D � � �� HQ Σ 1 = Σ ∈ Σ : Σ = ( HQ ) ⊤ Q ⊤ Q Elements of Σ 1 will often be denoted by Σ( H, D, Q ) Remark Y ∼ N (0 , Σ 0 ) admits an exact FA model of size k iff Σ 0 ∩ Σ 1 � = ∅ 6

Lifted problem Problem D (Σ ′ || Σ 1 ) min Σ ′ ∈ Σ 0 , Σ 1 ∈ Σ 1 Proposition Let Σ 0 be given. It holds that H,D D (Σ 0 || HH ⊤ + D ) = D (Σ ′ || Σ 1 ) min min Σ ′ ∈ Σ 0 , Σ 1 ∈ Σ 1 7

First partial minimization Problem D (Σ ′ || Σ) min Σ ′ ∈ Σ 0 This problem has a unique solution 8

First partial minimization - general solution Proposition Let ( Y, X ) ∼ Q = Q Y,X and let P = P Y,X : P Y = P 0 � � P = for a given P 0 ≪ Q Y , then D ( P || Q ) = D ( P ∗ || Q ) min P ∈ P where P ∗ is given by P ∗ Y = P 0 , P ∗ X | Y = Q X | Y Moreover, for any P ∈ P , one has the Pythagorean law D ( P || Q ) = D ( P || P ∗ ) + D ( P ∗ || Q ) 9

First partial minimization – normal case Proposition Let Q ∼ N (0 , Σ) and P 0 ∼ N (0 , Σ 0 ) where Σ ∈ Σ and Σ 0 ∈ R n × n , then D (Σ ′ || Σ) min Σ ′ ∈ Σ 0 is attained by P ∗ ∼ N (0 , Σ ∗ ) with Σ 0 Σ − 1   Σ 0 11 Σ 12 Σ ∗ = Σ 21 Σ − 1 Σ 22 − Σ 21 Σ − 1 11 (Σ 11 − Σ 0 )Σ − 1   11 Σ 0 11 Σ 12 10

Second partial minimization Problem min D (Σ || Σ 1 ) Σ 1 ∈ Σ 1 This problem has a unique solution Σ ∗ 1 = Σ ∗ ( H ∗ , D ∗ , Q ∗ ) 11

Second partial minimization – normal case Notation For M square let ∆( M ) be the diagonal ∆( M ) ii = M ii Proposition An optimal point is ( H ∗ , D ∗ , Q ∗ ) with H ∗ = Σ 12 Σ − 1 / 2 22 D ∗ = ∆(Σ 11 − Σ 12 Σ − 1 22 Σ 21 ) Q ∗ = Σ 1 / 2 22 thus: Σ 12 Σ − 1 22 Σ 21 + ∆(Σ 11 − Σ 12 Σ − 1 � � 22 Σ 21 ) Σ 12 Σ ∗ 1 = Σ 21 Σ 22 moreover D (Σ || Σ( H, D, Q )) = D (Σ || Σ ∗ 1 ) + D (Σ ∗ 1 || Σ( H, D, Q )) for any Σ( H, D, Q ) ∈ Σ 1 12

Alternating minimization algorithm Given Σ 0 > 0, pick ( H 0 , D 0 , Q 0 ) and let Σ (0) = Σ( H 0 , D 0 , Q 0 ) 1 construct the sequence Σ (0) → Σ (1) → Σ (2) → Σ ′ (1) − → Σ ′ (2) − − − − → . . . 1 1 1 where D (Σ ′ ( t +1) || Σ ( t ) D (Σ ′ || Σ ( t ) . 1 ) = min 1 ) Σ ′ ∈ Σ 0 and D (Σ ′ ( t +1) || Σ ( t +1) D (Σ ′ ( t +1) || Σ 1 ) . ) = min 1 Σ 1 ∈ Σ 1 13

Algorithm At the t -th iteration the matrices H t , D t and Q t are available. � Q ⊤ t Q t − Q ⊤ t H ⊤ t ( H t H ⊤ t + D t ) − 1 H t Q t Q t +1 = � 1 / 2 + Q ⊤ t H ⊤ t ( H t H ⊤ t + D t ) − 1 Σ 0 ( H t H ⊤ t + D t ) − 1 H t Q t H t +1 = Σ 0 ( H t H ⊤ t + D t ) − 1 H t Q t Q − 1 t +1 D t +1 = ∆(Σ 0 − H t +1 H ⊤ t +1 ) 14

Algorithm Notice The update rules can be written in terms of ( H t , D t ) only R t = I − H ⊤ t ( H t H ⊤ t + D t ) − 1 ( H t H ⊤ t + D t − Σ 0 )( H t H ⊤ t + D t ) − 1 H t t + D t ) − 1 H t R − 1 / 2 H t +1 = Σ 0 ( H t H ⊤ t D t +1 = ∆(Σ 0 − H t +1 H ⊤ t +1 ) 15

Some properties of the algorithm Proposition (a) D t > 0 (b) R t is invertible (c) If H 0 is of full column rank, so is H t If Σ 0 = H t H ⊤ (e) t + D t the algorithm stops (f) The objective function decreases at each iteration (g) The limit points ( H, D ) of the algorithm satisfy the relations H = (Σ 0 − HH ⊤ ) D − 1 H, D = ∆(Σ 0 − HH ⊤ ) 16

Approximate Factor Analysis Models Lorenzo Finesso, Peter Spreij - PowerPoint PPT Presentation

Approximate Factor Analysis Models Lorenzo Finesso, Peter Spreij Brixen July 19, 2007 1 0 0 1 1 0 P 1 = 2 2 1 1 0 2 2 1 1 2 0 2 1 1 P 2 = 2 0 2

Triadic Factor Analysis Cynthia Glodeanu Institute of Algebra, TU Dresden October 19, 2010.

Factor Models: A Review James J. Heckman The University of Chicago Econ 312, Winter 2019

Confirmatory Factor Analysis and Exploratory-Confirmatory Factor Analysis Maximum

Probabilistic Graphical Models 10-708 Factor Analysis and State Space Factor Analysis and State

Week 7 Video 5 Factor Analysis Factor Analysis You have a whole lot of variables Can

Attribute Grammars intermediate syntax semantics representation Language Implementation 2

Certainty Factor certainty factor CF (is the certainty factor in the hypothesis H due to

(IHBG) Competitive NOFA Training Rating Factor 3: Soundness of Approach 1 Rating Factor 3

Predicting condition specific transcription factors for target gene. Kaur Alasoo 19.09.2012

Rating Factor 1 Review Rating Factor 1 Capacity of the Applicant 1 Rating Factor Review 2

Approximate Computing Is Dead; Long Live Approximate Computing Adrian Sampson Cornell Hardware

Approximate Nearest Neighbors Search Approximate Nearest Neighbors Search in High Dimensions in

Factor Analysis and Beyond Chris Williams School of Informatics, University of Edinburgh October

Backward Analysis via Over-Approximate Abstraction and Under-Approximate Subtraction Alexey

Approximate inference: Sampling methods Probabilistic Graphical Models Sharif University of

Approximate Knowledge Compilation by Online Collapsed Importance Sampling Tal Friedman and Guy

Dynamic Embedding on Textual Networks via a Gaussian Process Presenter : Pengyu Cheng Joint work

AnImprovedAnalytical SuperscalarMicroprocessor MemoryModel MemoryModel

Probability: Terminology and Examples 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Discussion

COL863: Quantum Computation and Information Ragesh Jaiswal, CSE, IIT Delhi Ragesh Jaiswal, CSE,

Gravitational Waves from Phase Transition in a QCD-like hidden sector Mayumi Aoki (Kanazawa

Computing Nearby Non-Trivial Smith Forms Joseph Haraldson with Mark Giesbrecht and George Labahn

Mesh Simplification Mesh Simplification 1 Spring 2010 The The Law The The Law Law of Law

MA162: Finite mathematics . Jack Schmidt University of Kentucky April 11, 2012 Schedule: HW