Canonical Correlations for Group Symmetry Models Jesse Crawford - - PowerPoint PPT Presentation

canonical correlations for group symmetry models
SMART_READER_LITE
LIVE PREVIEW

Canonical Correlations for Group Symmetry Models Jesse Crawford - - PowerPoint PPT Presentation

Canonical Correlations for Group Symmetry Models Jesse Crawford Department of Mathematics Tarleton State University jcrawford@tarleton.edu March 28, 2009 Jesse Crawford (Tarleton State University) March 28, 2009 1 / 29 Outline Canonical


slide-1
SLIDE 1

Canonical Correlations for Group Symmetry Models

Jesse Crawford

Department of Mathematics Tarleton State University jcrawford@tarleton.edu

March 28, 2009

Jesse Crawford (Tarleton State University) March 28, 2009 1 / 29

slide-2
SLIDE 2

Outline

1

Canonical Correlations

2

Group Symmetry Models

3

Generalized Canonical Correlations

Jesse Crawford (Tarleton State University) March 28, 2009 2 / 29

slide-3
SLIDE 3

Outline

1

Canonical Correlations

2

Group Symmetry Models

3

Generalized Canonical Correlations

Jesse Crawford (Tarleton State University) March 28, 2009 3 / 29

slide-4
SLIDE 4

Developed by Hotelling (1936). Theory covered in Chapter 12 of Anderson (1984).

Jesse Crawford (Tarleton State University) March 28, 2009 4 / 29

slide-5
SLIDE 5

Setting

X1, . . . , XN are i.i.d. normally distributed random vectors with mean zero and covariance matrix Σ ∈ PD(I). RI = RJ1 ⊕ RJ2 Xn =

  • X (1)

n

X (2)

n

  • Testing problem: are X (1)

n

and X (2)

n

independent?

Jesse Crawford (Tarleton State University) March 28, 2009 5 / 29

slide-6
SLIDE 6

Notation

Suppose x ∈ RJ1 and y ∈ RJ2 Linear combinations: xtX (1)

n

and ytX (2)

n

CovΣ(x, y) := xtΣ12y VΣ(x) := xtΣ11x VΣ(y) := ytΣ22y

Jesse Crawford (Tarleton State University) March 28, 2009 6 / 29

slide-7
SLIDE 7

Definitions

The first canonical correlation is c1 = max{CovΣ(x, y) x ∈ RJ1, y ∈ RJ2, VΣ(x) = VΣ(y) = 1}. Suppose the maximum is attained at (x1, y1). (x1, y1) is the first pair of canonical covariates.

Jesse Crawford (Tarleton State University) March 28, 2009 7 / 29

slide-8
SLIDE 8

Definitions

The second canonical correlation is c2 = max{CovΣ(x, y) x ∈ RJ1, y ∈ RJ2, VΣ(x) = VΣ(y) = 1, CovΣ(x, x1) = CovΣ(y, y1) = 0}, Max is attained at (x2, y2), the second pair of canonical covariates. . . . The kth canonical correlation is ck = max{CovΣ(x, y) x ∈ RJ1, y ∈ RJ2, VΣ(x) = VΣ(y) = 1, CovΣ(x, xi) = CovΣ(y, yi) = 0, i = 1, . . . , k − 1}, Max is attained at (xk, yk), the kth pair of canonical covariates.

Jesse Crawford (Tarleton State University) March 28, 2009 8 / 29

slide-9
SLIDE 9

Results

Canonical correlations: c1, . . . , cJ2 Canonical covariate pairs: (x1, y1), . . . , (xJ2, yJ2).

Theorem

ck is the kth largest root of

  • −cΣ11

Σ12 Σ21 −cΣ22

  • = 0,

and the canonical covariates satisfy −ckΣ11 Σ12 Σ21 −ckΣ22 xk yk

  • = 0.

Jesse Crawford (Tarleton State University) March 28, 2009 9 / 29

slide-10
SLIDE 10

Proof.

c1 is the maximum value of xtΣ12y subject to the constraints xtΣ11x = 1 ytΣ22y = 1 Apply Lagrange multipliers to prove results for c1 and (x1, y1). Complete proof by inducting on J2.

Jesse Crawford (Tarleton State University) March 28, 2009 10 / 29

slide-11
SLIDE 11

Relation to the Maximal Invariant Statistic

A = A1 A2

  • , A1 ∈ GL(J1), A2 ∈ GL(J2)

Group Actions:

◮ A · x = Ax, for x ∈ RI×N ◮ A · Σ = AΣAt, for Σ ∈ PD(I)

Testing problem is invariant under these actions The family of empirical canonical correlations is a maximal invariant statistic.

Jesse Crawford (Tarleton State University) March 28, 2009 11 / 29

slide-12
SLIDE 12

Empirical Canonical Correlations and Eigenvalues

ˆ Σ = S = S11 S12 S21 S22

  • ˆ

Σ0 = S11 S22

  • Residual

R = ˆ Σ − ˆ Σ0 =

  • S12

S21

  • Empirical canonical correlations satisfy
  • −ckS11

S12 S21 −ckS22

  • = 0

|R − ck ˆ Σ0| = 0

Jesse Crawford (Tarleton State University) March 28, 2009 12 / 29

slide-13
SLIDE 13

Empirical Canonical Correlations and Eigenvalues

|R − ck ˆ Σ0| = 0 Empirical canonical correlations are eigenvalues of R wrt. ˆ Σ0. Canonical covariates satisfy (R − ck ˆ Σ0) xk yk

  • = 0

uk = 1 2 xk yk

  • is an eigenvector of R wrt. ˆ

Σ0 corresponding to ck vk = 1 2

  • xk

−yk

  • is an eigenvector corresponding to − ck

Jesse Crawford (Tarleton State University) March 28, 2009 13 / 29

slide-14
SLIDE 14

Empirical Canonical Covariates and Eigenvectors

xk = uk + vk yk = uk − vk

Jesse Crawford (Tarleton State University) March 28, 2009 14 / 29

slide-15
SLIDE 15

Outline

1

Canonical Correlations

2

Group Symmetry Models

3

Generalized Canonical Correlations

Jesse Crawford (Tarleton State University) March 28, 2009 15 / 29

slide-16
SLIDE 16

Theory: Andersson and Madsen (1998), Appendix A Ten Fundamental Testing Problems: Andersson, Brøns, and Jensen (1983)

Jesse Crawford (Tarleton State University) March 28, 2009 16 / 29

slide-17
SLIDE 17

Pattern Covariance Matrices

Testing covariance structure of a multivariate normal distribution.

Example (Testing Complex Structure)

H0 : Σ = A −B B A

  • Example (Testing Independence)

H0 : Σ = Σ11 Σ22

  • Example (Bartlett’s Test)

H0 : Σ = Γ Γ

  • vs. H : Σ =

Σ11 Σ22

  • Jesse Crawford (Tarleton State University)

March 28, 2009 17 / 29

slide-18
SLIDE 18

Notation

G ≤ O(I) is a compact group GLG(I) = {Invertible matrices that commute with G} PDG(I) = {Positive definite matrices that commute with G}

Group Symmetry Model

X1, X2, . . . , XN i.i.d. Normal(0, Σ) HG : Σ ∈ PDG(I)

Jesse Crawford (Tarleton State University) March 28, 2009 18 / 29

slide-19
SLIDE 19

Example (Testing Complex Structure)

G0 =

  • ±1I, ±

−1J 1J

  • Example (Testing Independence)

G0 =

  • ±1I, ±

1J1 −1J2

  • Example (Bartlett’s Test)

G0 = 1J −1J

  • ,

1J 1J

  • Jesse Crawford (Tarleton State University)

March 28, 2009 19 / 29

slide-20
SLIDE 20

Estimation

ˆ Σ = ΨG(S) =

  • G

gSgt dg ˆ Σ = 1 |G|

  • g∈G

gSgt

Example (Testing Complex Structure)

ˆ Σ0 = 1 2 S11 + S22 S12 − S21 S21 − S12 S11 + S22

  • ˆ

Σ ∼ generalized Wishart distribution on PDG(I).

Jesse Crawford (Tarleton State University) March 28, 2009 20 / 29

slide-21
SLIDE 21

Testing

G ≤ G0 PDG0(I) ⊆ PDG(I) Testing problem: H0 : Σ ∈ PDG0(I) vs. H : Σ ∈ PDG(I) Actions of A ∈ GLG(I)

◮ A · x = Ax, for x ∈ RI×N ◮ A · Σ = AΣAt, for Σ ∈ PDG(I)

Maximal invariant: eigenvalues of R = ˆ Σ − ˆ Σ0 wrt. ˆ Σ0.

Jesse Crawford (Tarleton State University) March 28, 2009 21 / 29

slide-22
SLIDE 22

Questions

Can canonical correlations be generalized? Are these maximally invariant eigenvalues canonical correlations?

Jesse Crawford (Tarleton State University) March 28, 2009 22 / 29

slide-23
SLIDE 23

Outline

1

Canonical Correlations

2

Group Symmetry Models

3

Generalized Canonical Correlations

Jesse Crawford (Tarleton State University) March 28, 2009 23 / 29

slide-24
SLIDE 24

Setting

X1, X2, . . . , XN i.i.d. Normal(0, Σ) Θ0 ⊆ Θ ⊆ PD(I) Testing problem: H0 : Σ ∈ Θ0 vs. H : Σ ∈ Θ t : Θ → Θ0 ˆ Σ0 = t(ˆ Σ)

Jesse Crawford (Tarleton State University) March 28, 2009 24 / 29

slide-25
SLIDE 25

Definitions

c1 = max{CovΣ(x, y) Covt(Σ)(x, y) = 0, VΣ(x) = VΣ(y) = 1} ck = max{CovΣ(x, y) Covt(Σ)(x, y) = 0, VΣ(x) = VΣ(y) = 1

◮ for i = 1, . . . , k − 1, ◮ Covt(Σ)(x, xi) = Covt(Σ)(x, yi) = 0, ◮ Covt(Σ)(y, xi) = Covt(Σ)(y, yi) = 0} Jesse Crawford (Tarleton State University) March 28, 2009 25 / 29

slide-26
SLIDE 26

Results

Eigenvalues of Σ − t(Σ) wrt. t(Σ): λ1 ≥ · · · ≥ λI ck = λk − λk+1−k λk + λI+1−k + 2, for k = 1, . . . , ⌊ I

2⌋

Covariate pairs: characterized in terms of eigenvectors

Proof.

WLOG, t(Σ) = 1I and Σ − t(Σ) = Λ = Diag(λ1, . . . , λI) Apply Lagrange multipliers to c1 = max{xtΛy x, y ∈ RI, xt(1I + Λ)x = yt(1I + Λ)y = 1, xty = 0} Induct on I

Jesse Crawford (Tarleton State University) March 28, 2009 26 / 29

slide-27
SLIDE 27

Canonical Correlations for Group Symmetry Models

Θ = PDG(I) Θ0 = PDG0(I) t = ΨG Eigenvalues of ˆ Σ − ˆ Σ0 wrt. ˆ Σ0 λ1, λ2, . . . , −λ2, −λ1 ck = λk, for k = 1, . . . , ⌊ I

2⌋

Covariate pairs: characterized in terms of eigenvectors

Jesse Crawford (Tarleton State University) March 28, 2009 27 / 29

slide-28
SLIDE 28

Further Research

Eigenvalues related to other testing problems, such as graphical models. Unbiasedness of likelihood ratio tests for group symmetry models.

Jesse Crawford (Tarleton State University) March 28, 2009 28 / 29

slide-29
SLIDE 29

References

Andersson, S.A., Brøns, H.K., and Tolver Jensen, S. (1983). Distribution of Eigenvalues in multivariate statistical analysis. Ann.

  • Statist. 11 392-415.

Andersson, S.A. and Madsen, J. (1998). Symmetry and lattice conditional independence in a multivariate normal distribution.

  • Ann. Statist. 26 525-572.

Hotelling, Harold (1936). Relations Between Two Sets of Variates. Biometrika 28 321-377.

Jesse Crawford (Tarleton State University) March 28, 2009 29 / 29