Existence of maximum likelihood estimators for 3 toric network - PowerPoint PPT Presentation

Existence of maximum likelihood estimators for 3 toric network models Sonja Petrovi´ c (University of Illinois at Chicago) Penn State University Toric geometry and applications Leuven, Belgium June 8, 2011 Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 1 / 17 / 23

Algebraic statistics Fact (Guiding principle) Many important statistical models correspond to algebraic or semi-algebraic sets of parameters. The geometry of these parameter spaces determines the behavior of widely used statistical inference procedures. A typical question: dimension, degree, singularities? Generators and Gr¨ obner bases: A typical question: the ideal of a given toric/secant/join variety? When a model is algebraic, use tools from algebraic geometry and computational algebra software packages ( 4ti2 , Macaulay2 , polymake , Singular , ...) Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 2 / 17 / 23

The Beta model for random undirected graphs In a pairwise comparison of n items, for every ordered pair ( i , j ): x i , j := the number of pairwise comparisons involving objects i and j in which i emerged as a winner. Fix N i , j := x i , j + x j , i , the total number of comparisons. Definition (Set of all possible outcomes) S n := { x i , j : i < j and x i , j ∈ { 0 , 1 , . . . , N i , j }} ⊂ N ( n 2 ) . Definition (Parametrization of the beta model) For each β ∈ R n : e β i + β j 1 p i , j = and p j , i = 1 − p i , j = ∀ i � = j . 1 + e β i + β j , 1 + e β i + β j The model M β for n vertices consists of all p i , j ’s of this form. Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 3 / 17 / 23

The Beta model for random undirected graphs It is parametrized by the vertex-edge incidence matrix of a complete graph:   1 1 1 0 0 0 1 0 0 1 1 0   A 4 =   0 1 0 1 0 1   0 0 1 0 1 1 rows indexed by the vertices; columns indexed by ( i , j ) with i < j . Definition (The model polytope) S n := conv { A n x , x ∈ S n } Example Represent the graph with an edge { 1 , 2 } and a triple edge { 1 , 3 } as x = [1 , 3 , 0 , . . . , 0] T ∈ S n . Corresponding point in the model polytope is A n x = [4 , 1 , 3 , 0 , . . . ] . Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 4 / 17 / 23

MLE existence problem Given the graph x ∈ S n : Goal (Motivating problem: maximum likelihood estimation) Determine parameter values that “best explain” this data. This is done by maximizing the log-likelihood function: p x ij � MLE ( p ) := argmax p ∈M n ij . i < j Definition If MLE(p) is achieved on the boundary of the model M , it is called the extended MLE. In this case, the MLE is said not to exist. Problem Determine the points x for which MLE ( p ) does not exist. Nonexistence of the MLE implies that only certain entries of p are estimable. (i.e., certain linear combinations of the natural parameters) Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 5 / 17 / 23

A small example of data leading to a nonexistent MLE × 0 N 1 , 2 × e β i + β j (Recall M β : p i , j = 1+ e β i + β j . ) × N 3 , 4 0 × × 0 1 2 × 0 0.5 0.5 3 × 2 1 1 × 0.5 0.5 2 1 × 3 0.5 0.5 × 1 1 2 0 × 0.5 0.5 0 × Left : data exhibiting the above pattern, when N i , j = 3 for all i � = j . Right : table of the extended MLE of the estimated probabilities. Under natural parametrization, the supremum of the log-likelihood is achieved in the limit for any sequence of natural parameters { β ( k ) } of the form β ( k ) = ( − c k , − c k , c k , c k ), where c k → ∞ as k → ∞ . Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 6 / 17 / 23

Polyhedral methods Theorem (Standard statistical theory of exponential families) The MLE for an observed graph x exists if and only if A n x ∈ int( S n ) . A general algorithm for deciding this problem and finding the relevant facial sets are presented in Eriksson, Fienberg, Rinaldo, Sullivant (’06). Problem Understand the model polytope for M β . Extend to other random graph models. Rinaldo, Fienberg (May ’11) provide a more thorough treatment of algorithms for log-linear (toric) models. Rinaldo, P., Fienberg (May ’11) specialize these to network models. Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 7 / 17 / 23

Motivation = size of random graph Example: The Collective Dynamics of Smoking in a Large Social Network (James Fowler) Node border= gender (red=female, blue=male). Arrow color = relation (purple=friend, green=spouse). Node color = smoking behavior (white=nonsmoker, gray=smoker); darker shades = more cigarettes consumed per day. Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 8 / 17 / 23

The polytope of degree sequences Definition The polytope of degree sequences is P n := convhull ( { Ax , x ∈ G n } ) . Facet-defining inequalities of P n are known (Mehadev-Peled ’96). Theorem (Rinaldo-P.-Fienberg) Let x ∈ S n be the observed vector of edge counts. The MLE exists if and only if x j , i x i , j � � + ∈ int ( P n ) , i = 1 , . . . , n . N i , j N i , j j < i j > i Example (Stanley) f ( P 8 ) = (334982 , 1726648 , 3529344 , 3679872 , 2074660 , 610288 , 81144 , 3322 , 1) . We used polymake for the computations on small polytopes. Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 9 / 17 / 23

Facial sets of the model polytope Proposition (Rinaldo-P.-Fienberg) A point y belongs to the interior of some face F of P n if and only if there exists a set F ⊂ { ( i , j ) , i < j } such that for any p = { p i , j : i < j , p i , j ∈ [0 , 1] } satisfying y = A n p , p i , j ∈ { 0 , 1 } if ( i , j ) �∈ F and p i , j ∈ (0 , 1) if ( i , j ) ∈ F . F is called a facial set of S n , and F c a co-facial set. The MLE does not exist for the graph x if and only if the set { ( i , j ): i < j , x i , j = 0 or N i , j } contains a co-facial set. Facial sets specify which probability parameters are estimable: only the probabilities { p i , j , ( i , j ) ∈ F} . Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 10 / 17 / 23

Example: Co-facial sets for nonexistent MLEs × 0 × 0 N 1 , 2 × N 1 , 2 × 0 0 × N 3 , 4 N 3 , 2 × 0 × N 4 , 2 × × 0 0 0 × 0 0 N 1 , 2 × N 1 , 2 × 0 N 1 , 3 × N 1 , 3 N 2 , 3 × N 4 , 1 × × × N 1 , 2 0 × 0 0 N 2 , 3 × N 2 , 4 × Table: Co-facial sets for P 4 (empty cells indicate any entry values). Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 11 / 17 / 23

Two related toric models where main Theorem applies Definition (Random graphs with fixed degree sequence) In the special case when N ij = 1 , the support S n reduces to G n := { 0 , 1 } ( n 2 ) , undirected simple graphs on n nodes. Corollary (RPF) A conjecture in Chatterjee-Diaconis-Sly (’10) is true: for the random graph model, the MLE exists if and only if d ( x ) ∈ int P n . Definition (The Rasch model) A random bipartite graph model, the support being G k , l , the set of bipartite graphs on k and l vertices. Theorem (RPF) The MLE of the Rasch model parameters exists if and only if d ( x ) ∈ int P p , q , the polytope of bipartite degree sequences. Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 12 / 17 / 23

Extensions (1) Removing the sampling constraint: Let quantities N i , j be random! Theorem (Thanks to Haase and Yu) The model polytope has 3 n facets, and is obtained from the product of simplices by removing the vertices { e i × e ′ i } , i = 1 , . . . , n. (2) Specialize (1) to directed graphs without multiple edges. This is the Bradley-Terry model for pairwise comparisons. Theorem (Zermelo ’29, Ford ’57) If the graph is strongly connected, then the MLE exists. Algorithms for detecting co-facial sets still apply. The matrix of the � n � model polytope has dimension ( + n ) × n ( n − 1). 2 Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 13 / 17 / 23

Extensions (3) A directed random graph model used in social networking: the p 1 model (Holland-Leinhardt ’81). � n � The model polytope is the Minkowski sum of polytopes. 2 Example (n=4) 4 10 = 1 , 048 , 576 different graphs x. Three cases of the p 1 model: 1 There are 225 , 025 points A 4 x, and the MLE exists for 7 , 983 . 2 349 , 500 , the MLE exists in 12 , 684 cases 3 583 , 346 , the MLE never exists. Theorem (Rinaldo-P.-Fienberg) Sufficient conditions for MLE existence, with large probability, as n grows. In the case of fixed degree sequence graphs, our asymptotic results improve those of Chatterjee-Diaconis ’11. Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 14 / 17 / 23

Existence of maximum likelihood estimators for 3 toric network - PowerPoint PPT Presentation

Existence of maximum likelihood estimators for 3 toric network models Sonja Petrovi c (University of Illinois at Chicago) Penn State University Toric geometry and applications Leuven, Belgium June 8, 2011 Sonja Petrovi c (UIC Penn

L-estimators, R-estimators, Redescending M gr. Jakub Petr asek Estimators Revision Seminar

Maximum Likelihood properties Maximum parsimony Maximum likelihood Experimental design

Snapshots from the History of Toric Geometry David A. Cox Geometry 19701988 Toric Geometry

The rational homology of real toric manifolds Alexander I. Suciu Toric manifolds. In a seminal

Computing toric degenerations of flag varieties Lara Bossinger SIAM 31. July, 2017 Computing

Phylogenetic trees IV Maximum Likelihood Gerhard Jger ESSLLI 2016 Gerhard Jger Maximum

Maximum likelihood models Tues. Feb. 27, 2018 1 Overview of today Informal notion of

Curve Fitting Re-visited, Bishop1.2.5 Maximum Likelihood Bishop 1.2.5 Model Likelihood

Maximum Likelihood Estimation CS 446 Maximum likelihood: abstract formulation Weve had one

Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Maximum likelihood

Maximum Likelihood Estimation CS 446 Maximum likelihood: abstract formulation Weve had one

Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Output of the estimation

Phylogenetic trees IV Maximum Likelihood Gerhard Jger Words, Bones, Genes, Tools February 28,

15-388/688 - Practical Data Science: Maximum likelihood estimation, nave Bayes J. Zico Kolter

Maximum likelihood parameter estimation Maximum likelihood parameter estimation For an HMM

MAXIMUM CARDS MAXIMUM CARDS What is a Maximum Card ? The Maximum Card is the one which contains a

scheduled for mastectomy compared with those scheduled for breast conserving surgery Lynsey

Automorphism groups of edge-transitive maps (Pilsen 2016) Gareth Jones University of

On the Shape of a Set In Honour of Lino Di Martino Topics in Groups and their Representations

Hypergraph Decompositions and Toric Ideals Elizabeth Gross and Kaie Kubjas June 9, 2015 Toric

Introduction du workshop et prsentation : Mangroves in a changing world 1 / 97 00001 - 00:00:01

Two Main Goals I. Improve student writing II. Reduce your grading time 2 1 12/18/2013 What do

Neural Program Synthesis Rishabh Singh, Google Brain Great Collaborators! Deep Learning and

Contents A Typical PPP Structure Key Investor Issues Key Bankability Issues

Sambuz

Useful Links

Newsletter

Mail Us