Polynomial systems of graphical models Elizabeth Gross University - - PowerPoint PPT Presentation

polynomial systems of graphical models
SMART_READER_LITE
LIVE PREVIEW

Polynomial systems of graphical models Elizabeth Gross University - - PowerPoint PPT Presentation

Polynomial systems of graphical models Elizabeth Gross University of Hawaii at M anoa ICERM Nonlinear Algebra, Graphical Models Working Group: Bibhas Adhikari, Alexandros Grosdos, Marc H ark onen, Cvetelina Hill, Sara Lamboglia,


slide-1
SLIDE 1

seal.jpg

Polynomial systems of graphical models

Elizabeth Gross University of Hawai‘i at M¯ anoa ICERM Nonlinear Algebra, Graphical Models Working Group: Bibhas Adhikari, Alexandros Grosdos, Marc H¨ ark¨

  • nen,

Cvetelina Hill, Sara Lamboglia, Samantha Sherman, Elias Tsigaridas, Dane Wilburne

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-2
SLIDE 2

Graphical Models

Gaussian graphical models Undirected graphs Described in detail in ”Gaussian Graphical Models: An Algebraic and Geometric Perspective”, Uhler (2017) Conjecture on ML Degree of cycles in Section 7.4, Lectures on Algebraic Statistics, Drton–Sturfmels–Sullivant (2009) Linear Structural Equation Models Directed graphs, mixed graphs Overview in ”Algebraic Problems in Structural Equation Modeling”, Drton (2018)

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-3
SLIDE 3

seal.jpg

Polynomial systems of graphical models

Elizabeth Gross University of Hawai‘i at M¯ anoa ICERM Nonlinear Algebra, Graphical Models Working Group: Bibhas Adhikari, Alexandros Grosdos, Marc H¨ ark¨

  • nen,

Cvetelina Hill, Sara Lamboglia, Samantha Sherman, Elias Tsigaridas, Dane Wilburne

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-4
SLIDE 4

seal.jpg

Polynomial systems of graphical models

Elizabeth Gross University of Hawai‘i at M¯ anoa ICERM Nonlinear Algebra, Graphical Models Working Group: Bibhas Adhikari, Alexandros Grosdos, Marc H¨ ark¨

  • nen,

Cvetelina Hill, Sara Lamboglia, Samantha Sherman, Elias Tsigaridas, Dane Wilburne

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-5
SLIDE 5

Structural Equation Models

RD =

  • Λ 2 RV ×V : λij = 0 if i ! j /

2 D RD

reg = subset of matrices Λ 2 RD

for which I Λ is invertible. PDV = cone of pos def symmetric V ⇥ V matrices. PD(B) = {Ω 2 PDV : ωij = 0 if i 6= j and i $ j / 2 B}. Definition The linear structural equation model given by a mixed graph G = (V , D, B) on V = [m] is the family of all probability distributions on Rm with covariance matrix Σ = (I Λ)−TΩ(I Λ)−1 for Λ 2 RD

reg and Ω 2 PD(B).

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-6
SLIDE 6

Structural Equation Models

RD =

  • Λ 2 RV ×V : λij = 0 if i ! j /

2 D RD

reg = subset of matrices Λ 2 RD

for which I Λ is invertible. PDV = cone of pos def symmetric V ⇥ V matrices. PD(B) = {Ω 2 PDV : ωij = 0 if i 6= j and i $ j / 2 B}. Definition The linear structural equation model given by a mixed graph G = (V , D, B) on V = [m] is the family of all probability distributions on Rm with covariance matrix Σ = (I Λ)−TΩ(I Λ)−1 for Λ 2 RD

reg and Ω 2 PD(B).

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-7
SLIDE 7

Identifiability

The covariance parametrization is φG : RD ⇥ PD(B) ! PDV (Λ, Ω) 7! (I Λ)TΩ(I Λ)1 Definition The fiber of a pair (Λ, Ω) 2 RD

reg ⇥ PD(B) is

FG(Λ, Ω) =

  • (Λ0, Ω0) 2 RD

reg ⇥ PD(B) : φG (Λ0, Ω0) = φG(Λ, Ω)

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-8
SLIDE 8

Identifiability

The covariance parametrization is φG : RD ⇥ PD(B) ! PDV (Λ, Ω) 7! (I Λ)TΩ(I Λ)1 Definition The fiber of a pair (Λ, Ω) 2 RD

reg ⇥ PD(B) is

FG(Λ, Ω) =

  • (Λ0, Ω0) 2 RD

reg ⇥ PD(B) : φG (Λ0, Ω0) = φG(Λ, Ω)

If the map φG is injective, then we call the model global identifiable. If the map φG is generically injective, then we call the model generically identifiable. If the map φG is generically k-to-one, then we call the model generically locally identifiable. In this case, we call k the identifiability degree of the model.

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-9
SLIDE 9

Identifiability

The covariance parametrization is φG : RD ⇥ PD(B) ! PDV (Λ, Ω) 7! (I Λ)TΩ(I Λ)1 Definition The fiber of a pair (Λ, Ω) 2 RD

reg ⇥ PD(B) is

FG(Λ, Ω) =

  • (Λ0, Ω0) 2 RD

reg ⇥ PD(B) : φG (Λ0, Ω0) = φG(Λ, Ω)

If the map φG is injective, then we call the model global identifiable. If the map φG is generically injective, then we call the model generically identifiable. If the map φG is generically k-to-one, then we call the model generically locally identifiable. In this case, we call k the identifiability degree of the model.

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-10
SLIDE 10

Another set of equations

Lemma (Foygel–Draisma–Drton (2012), Drton (2018)) Let G = (V , D, B) be a mixed graph, and let Σ = φG(Λ0, Ω0) for Λ0 2 RD

reg and Ω0 2 PD(B). Then the fiber FG(Λ0, Ω0) is

isomorphic to the set of matrices Λ 2 RD

reg that solve the equation

system: Fij = [(I Λ)TΣ(I Λ)]ij = 0 i 6= j, i $ j / 2 B

  • r more explicitly:

Fij = σij X

k→i

λkiσkj X

l→j

λljσil + X

k→i

X

l→j

λkiσklλlj = 0 Need to be careful about spurious solutions (det(I Λ) = 0).

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-11
SLIDE 11

Another set of equations

Lemma (Foygel–Draisma–Drton (2012), Drton (2018)) Let G = (V , D, B) be a mixed graph, and let Σ = φG(Λ0, Ω0) for Λ0 2 RD

reg and Ω0 2 PD(B). Then the fiber FG(Λ0, Ω0) is

isomorphic to the set of matrices Λ 2 RD

reg that solve the equation

system: Fij = [(I Λ)TΣ(I Λ)]ij = 0 i 6= j, i $ j / 2 B

  • r more explicitly:

Fij = σij X

k→i

λkiσkj X

l→j

λljσil + X

k→i

X

l→j

λkiσklλlj = 0 Need to be careful about spurious solutions (det(I Λ) = 0).

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-12
SLIDE 12

Another set of equations

Lemma (Foygel–Draisma–Drton (2012), Drton (2018)) Let G = (V , D, B) be a mixed graph, and let Σ = φG(Λ0, Ω0) for Λ0 2 RD

reg and Ω0 2 PD(B). Then the fiber FG(Λ0, Ω0) is

isomorphic to the set of matrices Λ 2 RD

reg that solve the equation

system: Fij = [(I Λ)TΣ(I Λ)]ij = 0 i 6= j, i $ j / 2 B

  • r more explicitly:

Fij = σij X

k→i

λkiσkj X

l→j

λljσil + X

k→i

X

l→j

λkiσklλlj = 0 Need to be careful about spurious solutions (det(I Λ) = 0).

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-13
SLIDE 13

Another set of equations

Lemma (Foygel–Draisma–Drton (2012), Drton (2018)) Let G = (V , D, B) be a mixed graph, and let Σ = φG(Λ0, Ω0) for Λ0 2 RD

reg and Ω0 2 PD(B). Then the fiber FG(Λ0, Ω0) is

isomorphic to the set of matrices Λ 2 RD

reg that solve the equation

system: Fij = [(I Λ)TΣ(I Λ)]ij = 0 i 6= j, i $ j / 2 B

  • r more explicitly:

Fij = σij X

k→i

λkiσkj X

l→j

λljσil + X

k→i

X

l→j

λkiσklλlj = 0 Need to be careful about spurious solutions (det(I Λ) = 0).

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-14
SLIDE 14

Identifiability Results

Theorem Acyclic and no bidirected edges ⇒ globally identifiable. Theorem (Drton–Foygel–Sullivant) G does not contain a subgraph whose B is connected and D has a unique sink ⇒ globally identifiable. Theorem (Brito–Pearl) Acyclic and simple ⇒ generically identifiable Theorem (Brito–Pearl) G criterion ⇒ generically identifiable. Theorem (Foygel–Draisma–Drton) Half-trek criterion ⇒ generically identifiable. Extended by: Chen (2015), Drton–Weihs (2016), and Weihs–Robinson–Dufresne– Kenkel–Kubjas–McGee–Nguyen– Robeva (2018)

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-15
SLIDE 15

Identifiability Degree Results

Theorem (ICERM Group) Simple ) generically locally identifiable (finite-to-one). Theorem (Drton–Foygel–Sullivant) The identifiability degree of a cycle with length 3 is 2. Theorem (Foygel–Draisma–Drton) Analysis of the identifiability degree of mixed graphs with up to five nodes. The maximum identifiability degree observed was 10.

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-16
SLIDE 16

Identifiability Degree Results

Theorem (ICERM Group) Simple ) generically locally identifiable (finite-to-one). Theorem (Drton–Foygel–Sullivant) The identifiability degree of a cycle with length 3 is 2. Theorem (Foygel–Draisma–Drton) Analysis of the identifiability degree of mixed graphs with up to five nodes. The maximum identifiability degree observed was 10.

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-17
SLIDE 17

Identifiability Degree Results

Theorem (ICERM Group) Simple ) generically locally identifiable (finite-to-one). Theorem (Drton–Foygel–Sullivant) The identifiability degree of a cycle with length 3 is 2. Theorem (Foygel–Draisma–Drton) Analysis of the identifiability degree of mixed graphs with up to five nodes. The maximum identifiability degree observed was 10.

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-18
SLIDE 18

Tian’s Decomposition

Theorem (Tian (2005)) The degree of identifiability of a mixed graph G is the product of the degrees of identifiability of its mixed components G[C], C 2 C(G). In particular, φG is (generically) injective if and only if each φG[C] is so, for C 2 C(G).

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-19
SLIDE 19

Identifiability Degree Results

Theorem (ICERM Group) The identifiability degree of a cycle plus an incoming edge is 1. Theorem (ICERM Group) The degree of a cycle plus an

  • utgoing edge is 2.

Theorem (ICERM Group) Let C0, ..., Cn be cycles and let vCi be a vertex in Ci for every i. Consider the graph G obtained by adding the edge vCi ! vCi+1 for every i = 0, ..., n 1. Then the identifiability degree is 2. Conjecture (ICERM Group) Gluing over a vertex or along an edge a chain of cycles has identifiability degree 1.

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-20
SLIDE 20

Identifiability Degree Results

Theorem (ICERM Group) The identifiability degree of a cycle plus an incoming edge is 1. Theorem (ICERM Group) The degree of a cycle plus an

  • utgoing edge is 2.

Theorem (ICERM Group) Let C0, ..., Cn be cycles and let vCi be a vertex in Ci for every i. Consider the graph G obtained by adding the edge vCi ! vCi+1 for every i = 0, ..., n 1. Then the identifiability degree is 2. Conjecture (ICERM Group) Gluing over a vertex or along an edge a chain of cycles has identifiability degree 1.

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-21
SLIDE 21

Identifiability Degree Results

Theorem (ICERM Group) The identifiability degree of a cycle plus an incoming edge is 1. Theorem (ICERM Group) The degree of a cycle plus an

  • utgoing edge is 2.

Theorem (ICERM Group) Let C0, ..., Cn be cycles and let vCi be a vertex in Ci for every i. Consider the graph G obtained by adding the edge vCi ! vCi+1 for every i = 0, ..., n 1. Then the identifiability degree is 2. Conjecture (ICERM Group) Gluing over a vertex or along an edge a chain of cycles has identifiability degree 1.

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-22
SLIDE 22

Identifiability Degree Results

Theorem (ICERM Group) The identifiability degree of a cycle plus an incoming edge is 1. Theorem (ICERM Group) The degree of a cycle plus an

  • utgoing edge is 2.

Theorem (ICERM Group) Let C0, ..., Cn be cycles and let vCi be a vertex in Ci for every i. Consider the graph G obtained by adding the edge vCi ! vCi+1 for every i = 0, ..., n 1. Then the identifiability degree is 2. Conjecture (ICERM Group) Gluing over a vertex or along an edge a chain of cycles has identifiability degree 1.

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-23
SLIDE 23

Questions

Can the identifiability degree of a mixed graph be arbitrarily large? What is the relationship between the number of vertices n of a mixed graph G and the range of possible identifiability degrees for G? For a fixed n is there a way to build a mixed graph G with maximum identifiability degree? We need your help!

Elizabeth Gross, UH M¯ anoa Structural Equation Models

slide-24
SLIDE 24

Mahalo! Thank you!

Elizabeth Gross, UH M¯ anoa Structural Equation Models