Grbner Bases of Gaussian Graphical Models Alex Fink, Jenna Rajchgot, - - PowerPoint PPT Presentation

gr bner bases of gaussian graphical models
SMART_READER_LITE
LIVE PREVIEW

Grbner Bases of Gaussian Graphical Models Alex Fink, Jenna Rajchgot, - - PowerPoint PPT Presentation

Grbner Bases of Gaussian Graphical Models Alex Fink, Jenna Rajchgot, Seth Sullivant Queen Mary University, University of Michigan, North Carolina State University October 4, 2015 Seth Sullivant (NCSU) Gaussian Graphical Models October 4,


slide-1
SLIDE 1

Gröbner Bases of Gaussian Graphical Models

Alex Fink, Jenna Rajchgot, Seth Sullivant

Queen Mary University, University of Michigan, North Carolina State University

October 4, 2015

Seth Sullivant (NCSU) Gaussian Graphical Models October 4, 2015 1 / 14

slide-2
SLIDE 2

Gaussian Graphical Models

Graphical models are a flexible framework for building statistical models on large collections of random variables. Edges of different types represent different types of interactions between neighboring random variables.

directed edges: i→j bidirected edges: i ↔ j undirected edges: i − j

Graph is used to express both

conditional independence structures parametric representations of the model.

For jointly normal random variables, graph structure relates variables to their neighbors via linear relationships with possibly correlated error terms.

Seth Sullivant (NCSU) Gaussian Graphical Models October 4, 2015 2 / 14

slide-3
SLIDE 3

Mixed Graphs

G = (V, B, D) graph with directed edges D (i→j) and bidirected edges B (i ↔ j) Vertex set V = [m] := {1, 2, . . . m} G is acyclic: i→j ∈ D implies i < j. 1 2 3 4

Seth Sullivant (NCSU) Gaussian Graphical Models October 4, 2015 3 / 14

slide-4
SLIDE 4

Gaussian Graphical Models

Gaussian graphical model is a statistical model that associates a family of normal distributions to the graph G = (V, B, D). PDm = cone of symmetric positive definite matrices Let PD(B) := {M ∈ PDm : Mij = 0 if i = j and i ↔ j / ∈ B} ǫ ∈ Rm, ǫ ∼ N(0, Ω) with Ω ∈ PD(B) For i→j ∈ D, let λij ∈ R Define X ∈ Rm recursively by Xj =

  • i:i→j∈D

λijXi + ǫj.

Seth Sullivant (NCSU) Gaussian Graphical Models October 4, 2015 4 / 14

slide-5
SLIDE 5

Example

1 2 3 4 ǫ ∼ N(0, Ω) Ω =     ω11 ω22 ω24 ω33 ω42 ω44     X1 = ǫ1, X2 = λ12X1 + ǫ2, X3 = λ13X1 + λ23X2 + ǫ3, X4 = λ34X3 + ǫ4

Seth Sullivant (NCSU) Gaussian Graphical Models October 4, 2015 5 / 14

slide-6
SLIDE 6

Matrix Factorization

Let Λ m × m upper triangular matrix such that Λij = λij if i→j ∈ D

  • therwise.

Proposition

X from the graph G = (V, B, D) is distributed N(0, Σ) where Σ = (I − Λ)−TΩ(I − Λ)−1. Note that (I − Λ)−1 = I + Λ + Λ2 + · · · + Λm−1.

Seth Sullivant (NCSU) Gaussian Graphical Models October 4, 2015 6 / 14

slide-7
SLIDE 7

The Algebraic Perspective

Let RD = {Λ ∈ Rm×m : λij = 0 if i→j / ∈ D} PD(B) := {M ∈ PDm : Mij = 0 if i = j and i ↔ j / ∈ B}

Definition

The Gaussian graphical model MG ⊆ PDm consists of all covariance matrices Σ, that arise for some choice of Λ ∈ RD and Ω ∈ PD(B). More algebraically, we have a polynomial map φG : RD × PD(B) → PDm, φG(Λ, Ω) = (I − Λ)−TΩ(I − Λ)−1. MG = imφG. We would like to “understand” φG and MG.

Seth Sullivant (NCSU) Gaussian Graphical Models October 4, 2015 7 / 14

slide-8
SLIDE 8

Constraints on Gaussian Graphical Models

Problem

Find a generating set or Gröbner basis of IG = I(MG) = {f ∈ R[σij : i, j ∈ [m]] : f(Σ) = 0 for all Σ ∈ MG}.

1 2 3 4

IG = |Σ12,13|, |Σ123,234|

Seth Sullivant (NCSU) Gaussian Graphical Models October 4, 2015 8 / 14

slide-9
SLIDE 9

Constraints on Gaussian Graphical Models

Problem

Find a generating set or Gröbner basis of IG = I(MG) = {f ∈ R[σij : i, j ∈ [m]] : f(Σ) = 0 for all Σ ∈ MG}.

1 2 3 4

IG = |Σ12,13|, |Σ123,234|

1 2 3 4

IG = σ12σ13σ14σ23 − σ11σ14σ2

23 − σ12σ2 13σ24 + σ11σ13σ23σ24

−σ2

12σ14σ33 + σ11σ14σ22σ33 + σ2 12σ13σ34 − σ11σ13σ22σ34

Seth Sullivant (NCSU) Gaussian Graphical Models October 4, 2015 8 / 14

slide-10
SLIDE 10

Trek Separation

A trek from i to j is a path in G from i to j with no sequence of edges k→l←m, k ↔ l←m, k→l ↔ m, or k ↔ l ↔ m.

Definition

Let A, B, C, and D be four subsets of V(G) (not necessarily disjoint). We say that (C, D) t-separates A from B if every trek from A to B passes through either a vertex in C on the A-side of the trek, or a vertex in D on the B-side of the trek.

C A D B

Seth Sullivant (NCSU) Gaussian Graphical Models October 4, 2015 9 / 14

slide-11
SLIDE 11

Theorem (S-Talaska-Draisma)

The matrix ΣA,B has rank ≤ d if and only if there are C, D ⊂ [n] with #C + #D ≤ d such that (C, D) t-separate A from B in G. Then all (d + 1) × (d + 1) minors of ΣA,B belong to IG.

Example

A c B

({c}, {c}) t-separates A from B. ΣA,B has rank at most 2. All 3 × 3 minors of ΣA,B belong to IG.

Seth Sullivant (NCSU) Gaussian Graphical Models October 4, 2015 10 / 14

slide-12
SLIDE 12

When do Determinantal Constraints Generate IG?

Question

What conditions on the graph G guarantee that the t-separation determinantal constraints generate IG?

Theorem (Sullivant 2008)

If G is a tree, then IG is generated in degree 1 and 2 by conditional independence constraints. In particular, IG generated by the t-separation determinantal constraints.

1 2 3 4 5

Proof Sketch.

For trees IG is a toric ideal. Do binomial manipulations.

Seth Sullivant (NCSU) Gaussian Graphical Models October 4, 2015 11 / 14

slide-13
SLIDE 13

1 2 3 4 5 6

Definition

A generalized Markov chain is a mixed graph G = (V, B, D) such that: If i → j ∈ D then i < j, If i → j ∈ D and i ≤ k < l ≤ j then k → l ∈ D, and If i ↔ j ∈ B and i ≤ k < l ≤ j then k ↔ l ∈ B.

Theorem (Fink-Rajchgot-S (2015))

If G is a generalized Markov chain then IG is generated by the t-separation determinantal constraints implied by G, and they form a Gröbner basis in a suitable lexicographic term order.

Proof Sketch.

Relate generalized Markov chain parametrization to parametrization of type B matrix Schubert varieties.

Seth Sullivant (NCSU) Gaussian Graphical Models October 4, 2015 12 / 14

slide-14
SLIDE 14

Further Results and Open Problems

We have extended t-separation characterization of determinantal constraints to ancestral graphs and AMP chain graphs. Connections to toric varieties and matrix Schubert varieties allow us to characterize the vanishing ideals for some graphs. How to determine general combinatorial descriptions of other hidden variable constraints?

1 2 3 4

  • σ11

σ11 σ12 σ13 σ12 σ12 σ22 σ23 σ13 σ23 σ33 σ14 σ24 σ34

  • = 0

Seth Sullivant (NCSU) Gaussian Graphical Models October 4, 2015 13 / 14

slide-15
SLIDE 15

References

  • A. Fink, J. Rajchgot, S. Sullivant. Symmetric matrix Schubert varieties and Gaussian graphical models. (2015) In

preparation. Knutson, Allen. Frobenius splitting, point-counting, and degeneration. (2009) :0911.4941 Knutson, Allen; Miller, Ezra. Gröbner geometry of Schubert polynomials. Ann. of Math. (2) 161 (2005), no. 3, 1245–1318.

  • S. Sullivant. Algebraic geometry of Gaussian Bayesian networks. Adv. in Appl. Math. 40 (2008), no. 4, 482–513.

0704.0918

  • S. Sullivant, K. Talaska and J. Draisma. Trek separation for Gaussian graphical models. Annals of Statistics 38 no.3 (2010)

1665-1685 0812.1938 Seth Sullivant (NCSU) Gaussian Graphical Models October 4, 2015 14 / 14