polynomial systems of graphical models
play

Polynomial systems of graphical models Elizabeth Gross University - PowerPoint PPT Presentation

Polynomial systems of graphical models Elizabeth Gross University of Hawaii at M anoa ICERM Nonlinear Algebra, Graphical Models Working Group: Bibhas Adhikari, Alexandros Grosdos, Marc H ark onen, Cvetelina Hill, Sara Lamboglia,


  1. Polynomial systems of graphical models Elizabeth Gross University of Hawai‘i at M¯ anoa ICERM Nonlinear Algebra, Graphical Models Working Group: Bibhas Adhikari, Alexandros Grosdos, Marc H¨ ark¨ onen, Cvetelina Hill, Sara Lamboglia, Samantha Sherman, Elias Tsigaridas, Dane Wilburne seal.jpg Elizabeth Gross, UH M¯ anoa Structural Equation Models

  2. Graphical Models Gaussian graphical models Undirected graphs Described in detail in ”Gaussian Graphical Models: An Algebraic and Geometric Perspective”, Uhler (2017) Conjecture on ML Degree of cycles in Section 7.4, Lectures on Algebraic Statistics , Drton–Sturfmels–Sullivant (2009) Linear Structural Equation Models Directed graphs, mixed graphs Overview in ”Algebraic Problems in Structural Equation Modeling”, Drton (2018) Elizabeth Gross, UH M¯ anoa Structural Equation Models

  3. Polynomial systems of graphical models Elizabeth Gross University of Hawai‘i at M¯ anoa ICERM Nonlinear Algebra, Graphical Models Working Group: Bibhas Adhikari, Alexandros Grosdos, Marc H¨ ark¨ onen, Cvetelina Hill, Sara Lamboglia, Samantha Sherman, Elias Tsigaridas, Dane Wilburne seal.jpg Elizabeth Gross, UH M¯ anoa Structural Equation Models

  4. Polynomial systems of graphical models Elizabeth Gross University of Hawai‘i at M¯ anoa ICERM Nonlinear Algebra, Graphical Models Working Group: Bibhas Adhikari, Alexandros Grosdos, Marc H¨ ark¨ onen, Cvetelina Hill, Sara Lamboglia, Samantha Sherman, Elias Tsigaridas, Dane Wilburne seal.jpg Elizabeth Gross, UH M¯ anoa Structural Equation Models

  5. Structural Equation Models R D = Λ 2 R V × V : λ ij = 0 if i ! j / � 2 D R D reg = subset of matrices Λ 2 R D for which I � Λ is invertible. PD V = cone of pos def symmetric V ⇥ V matrices. PD ( B ) = { Ω 2 PD V : ω ij = 0 if i 6 = j and i $ j / 2 B } . Definition The linear structural equation model given by a mixed graph G = ( V , D , B ) on V = [ m ] is the family of all probability distributions on R m with covariance matrix Σ = ( I � Λ ) − T Ω ( I � Λ ) − 1 for Λ 2 R D reg and Ω 2 PD ( B ). Elizabeth Gross, UH M¯ anoa Structural Equation Models

  6. Structural Equation Models R D = Λ 2 R V × V : λ ij = 0 if i ! j / � 2 D R D reg = subset of matrices Λ 2 R D for which I � Λ is invertible. PD V = cone of pos def symmetric V ⇥ V matrices. PD ( B ) = { Ω 2 PD V : ω ij = 0 if i 6 = j and i $ j / 2 B } . Definition The linear structural equation model given by a mixed graph G = ( V , D , B ) on V = [ m ] is the family of all probability distributions on R m with covariance matrix Σ = ( I � Λ ) − T Ω ( I � Λ ) − 1 for Λ 2 R D reg and Ω 2 PD ( B ). Elizabeth Gross, UH M¯ anoa Structural Equation Models

  7. Identifiability The covariance parametrization is φ G : R D ⇥ PD ( B ) ! PD V ( Λ , Ω ) 7! ( I � Λ ) � T Ω ( I � Λ ) � 1 Definition The fiber of a pair ( Λ , Ω ) 2 R D reg ⇥ PD ( B ) is ( Λ 0 , Ω 0 ) 2 R D � reg ⇥ PD ( B ) : φ G ( Λ 0 , Ω 0 ) = φ G ( Λ , Ω ) F G ( Λ , Ω ) = Elizabeth Gross, UH M¯ anoa Structural Equation Models

  8. Identifiability The covariance parametrization is φ G : R D ⇥ PD ( B ) ! PD V ( Λ , Ω ) 7! ( I � Λ ) � T Ω ( I � Λ ) � 1 Definition The fiber of a pair ( Λ , Ω ) 2 R D reg ⇥ PD ( B ) is ( Λ 0 , Ω 0 ) 2 R D � reg ⇥ PD ( B ) : φ G ( Λ 0 , Ω 0 ) = φ G ( Λ , Ω ) F G ( Λ , Ω ) = If the map φ G is injective, then we call the model global identifiable . If the map φ G is generically injective, then we call the model generically identifiable . If the map φ G is generically k -to-one, then we call the model generically locally identifiable . In this case, we call k the identifiability degree of the model. Elizabeth Gross, UH M¯ anoa Structural Equation Models

  9. Identifiability The covariance parametrization is φ G : R D ⇥ PD ( B ) ! PD V ( Λ , Ω ) 7! ( I � Λ ) � T Ω ( I � Λ ) � 1 Definition The fiber of a pair ( Λ , Ω ) 2 R D reg ⇥ PD ( B ) is ( Λ 0 , Ω 0 ) 2 R D � reg ⇥ PD ( B ) : φ G ( Λ 0 , Ω 0 ) = φ G ( Λ , Ω ) F G ( Λ , Ω ) = If the map φ G is injective, then we call the model global identifiable . If the map φ G is generically injective, then we call the model generically identifiable . If the map φ G is generically k -to-one, then we call the model generically locally identifiable . In this case, we call k the identifiability degree of the model. Elizabeth Gross, UH M¯ anoa Structural Equation Models

  10. Another set of equations Lemma (Foygel–Draisma–Drton (2012), Drton (2018)) Let G = ( V , D , B ) be a mixed graph, and let Σ = φ G ( Λ 0 , Ω 0 ) for Λ 0 2 R D reg and Ω 0 2 PD ( B ) . Then the fiber F G ( Λ 0 , Ω 0 ) is isomorphic to the set of matrices Λ 2 R D reg that solve the equation system: F ij = [( I � Λ ) T Σ ( I � Λ )] ij = 0 i 6 = j , i $ j / 2 B or more explicitly: X X X X F ij = σ ij � λ lj σ il + λ ki σ kl λ lj = 0 λ ki σ kj � k → i l → j k → i l → j Need to be careful about spurious solutions (det( I � Λ ) = 0). Elizabeth Gross, UH M¯ anoa Structural Equation Models

  11. Another set of equations Lemma (Foygel–Draisma–Drton (2012), Drton (2018)) Let G = ( V , D , B ) be a mixed graph, and let Σ = φ G ( Λ 0 , Ω 0 ) for Λ 0 2 R D reg and Ω 0 2 PD ( B ) . Then the fiber F G ( Λ 0 , Ω 0 ) is isomorphic to the set of matrices Λ 2 R D reg that solve the equation system: F ij = [( I � Λ ) T Σ ( I � Λ )] ij = 0 i 6 = j , i $ j / 2 B or more explicitly: X X X X F ij = σ ij � λ lj σ il + λ ki σ kl λ lj = 0 λ ki σ kj � k → i l → j k → i l → j Need to be careful about spurious solutions (det( I � Λ ) = 0). Elizabeth Gross, UH M¯ anoa Structural Equation Models

  12. Another set of equations Lemma (Foygel–Draisma–Drton (2012), Drton (2018)) Let G = ( V , D , B ) be a mixed graph, and let Σ = φ G ( Λ 0 , Ω 0 ) for Λ 0 2 R D reg and Ω 0 2 PD ( B ) . Then the fiber F G ( Λ 0 , Ω 0 ) is isomorphic to the set of matrices Λ 2 R D reg that solve the equation system: F ij = [( I � Λ ) T Σ ( I � Λ )] ij = 0 i 6 = j , i $ j / 2 B or more explicitly: X X X X F ij = σ ij � λ lj σ il + λ ki σ kl λ lj = 0 λ ki σ kj � k → i l → j k → i l → j Need to be careful about spurious solutions (det( I � Λ ) = 0). Elizabeth Gross, UH M¯ anoa Structural Equation Models

  13. Another set of equations Lemma (Foygel–Draisma–Drton (2012), Drton (2018)) Let G = ( V , D , B ) be a mixed graph, and let Σ = φ G ( Λ 0 , Ω 0 ) for Λ 0 2 R D reg and Ω 0 2 PD ( B ) . Then the fiber F G ( Λ 0 , Ω 0 ) is isomorphic to the set of matrices Λ 2 R D reg that solve the equation system: F ij = [( I � Λ ) T Σ ( I � Λ )] ij = 0 i 6 = j , i $ j / 2 B or more explicitly: X X X X F ij = σ ij � λ lj σ il + λ ki σ kl λ lj = 0 λ ki σ kj � k → i l → j k → i l → j Need to be careful about spurious solutions (det( I � Λ ) = 0). Elizabeth Gross, UH M¯ anoa Structural Equation Models

  14. Identifiability Results Theorem Theorem (Brito–Pearl) Acyclic and no bidirected edges G criterion ⇒ generically ⇒ globally identifiable. identifiable. Theorem Theorem (Foygel–Draisma–Drton) (Drton–Foygel–Sullivant) G does not contain a subgraph Half-trek criterion ⇒ generically identifiable. whose B is connected and D has a unique sink ⇒ globally Extended by: Chen (2015), identifiable. Drton–Weihs (2016), and Weihs–Robinson–Dufresne– Theorem (Brito–Pearl) Kenkel–Kubjas–McGee–Nguyen– Acyclic and simple ⇒ generically Robeva identifiable (2018) Elizabeth Gross, UH M¯ anoa Structural Equation Models

  15. Identifiability Degree Results Theorem (ICERM Group) Simple ) generically locally identifiable (finite-to-one). Theorem (Drton–Foygel–Sullivant) The identifiability degree of a cycle with length � 3 is 2. Theorem (Foygel–Draisma–Drton) Analysis of the identifiability degree of mixed graphs with up to five nodes. The maximum identifiability degree observed was 10. Elizabeth Gross, UH M¯ anoa Structural Equation Models

  16. Identifiability Degree Results Theorem (ICERM Group) Simple ) generically locally identifiable (finite-to-one). Theorem (Drton–Foygel–Sullivant) The identifiability degree of a cycle with length � 3 is 2. Theorem (Foygel–Draisma–Drton) Analysis of the identifiability degree of mixed graphs with up to five nodes. The maximum identifiability degree observed was 10. Elizabeth Gross, UH M¯ anoa Structural Equation Models

  17. Identifiability Degree Results Theorem (ICERM Group) Simple ) generically locally identifiable (finite-to-one). Theorem (Drton–Foygel–Sullivant) The identifiability degree of a cycle with length � 3 is 2. Theorem (Foygel–Draisma–Drton) Analysis of the identifiability degree of mixed graphs with up to five nodes. The maximum identifiability degree observed was 10. Elizabeth Gross, UH M¯ anoa Structural Equation Models

  18. Tian’s Decomposition Theorem (Tian (2005)) The degree of identifiability of a mixed graph G is the product of the degrees of identifiability of its mixed components G [ C ] , C 2 C ( G ) . In particular, φ G is (generically) injective if and only if each φ G [ C ] is so, for C 2 C ( G ) . Elizabeth Gross, UH M¯ anoa Structural Equation Models

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend