existence of maximum likelihood estimators for 3 toric
play

Existence of maximum likelihood estimators for 3 toric network - PowerPoint PPT Presentation

Existence of maximum likelihood estimators for 3 toric network models Sonja Petrovi c (University of Illinois at Chicago) Penn State University Toric geometry and applications Leuven, Belgium June 8, 2011 Sonja Petrovi c (UIC Penn


  1. Existence of maximum likelihood estimators for 3 toric network models Sonja Petrovi´ c (University of Illinois at Chicago) Penn State University Toric geometry and applications Leuven, Belgium June 8, 2011 Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 1 / 17 / 23

  2. Algebraic statistics Fact (Guiding principle) Many important statistical models correspond to algebraic or semi-algebraic sets of parameters. The geometry of these parameter spaces determines the behavior of widely used statistical inference procedures. A typical question: dimension, degree, singularities? Generators and Gr¨ obner bases: A typical question: the ideal of a given toric/secant/join variety? When a model is algebraic, use tools from algebraic geometry and computational algebra software packages ( 4ti2 , Macaulay2 , polymake , Singular , ...) Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 2 / 17 / 23

  3. The Beta model for random undirected graphs In a pairwise comparison of n items, for every ordered pair ( i , j ): x i , j := the number of pairwise comparisons involving objects i and j in which i emerged as a winner. Fix N i , j := x i , j + x j , i , the total number of comparisons. Definition (Set of all possible outcomes) S n := { x i , j : i < j and x i , j ∈ { 0 , 1 , . . . , N i , j }} ⊂ N ( n 2 ) . Definition (Parametrization of the beta model) For each β ∈ R n : e β i + β j 1 p i , j = and p j , i = 1 − p i , j = ∀ i � = j . 1 + e β i + β j , 1 + e β i + β j The model M β for n vertices consists of all p i , j ’s of this form. Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 3 / 17 / 23

  4. The Beta model for random undirected graphs It is parametrized by the vertex-edge incidence matrix of a complete graph:   1 1 1 0 0 0 1 0 0 1 1 0   A 4 =   0 1 0 1 0 1   0 0 1 0 1 1 rows indexed by the vertices; columns indexed by ( i , j ) with i < j . Definition (The model polytope) S n := conv { A n x , x ∈ S n } Example Represent the graph with an edge { 1 , 2 } and a triple edge { 1 , 3 } as x = [1 , 3 , 0 , . . . , 0] T ∈ S n . Corresponding point in the model polytope is A n x = [4 , 1 , 3 , 0 , . . . ] . Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 4 / 17 / 23

  5. MLE existence problem Given the graph x ∈ S n : Goal (Motivating problem: maximum likelihood estimation) Determine parameter values that “best explain” this data. This is done by maximizing the log-likelihood function: p x ij � MLE ( p ) := argmax p ∈M n ij . i < j Definition If MLE(p) is achieved on the boundary of the model M , it is called the extended MLE. In this case, the MLE is said not to exist. Problem Determine the points x for which MLE ( p ) does not exist. Nonexistence of the MLE implies that only certain entries of p are estimable. (i.e., certain linear combinations of the natural parameters) Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 5 / 17 / 23

  6. A small example of data leading to a nonexistent MLE × 0 N 1 , 2 × e β i + β j (Recall M β : p i , j = 1+ e β i + β j . ) × N 3 , 4 0 × × 0 1 2 × 0 0.5 0.5 3 × 2 1 1 × 0.5 0.5 2 1 × 3 0.5 0.5 × 1 1 2 0 × 0.5 0.5 0 × Left : data exhibiting the above pattern, when N i , j = 3 for all i � = j . Right : table of the extended MLE of the estimated probabilities. Under natural parametrization, the supremum of the log-likelihood is achieved in the limit for any sequence of natural parameters { β ( k ) } of the form β ( k ) = ( − c k , − c k , c k , c k ), where c k → ∞ as k → ∞ . Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 6 / 17 / 23

  7. Polyhedral methods Theorem (Standard statistical theory of exponential families) The MLE for an observed graph x exists if and only if A n x ∈ int( S n ) . A general algorithm for deciding this problem and finding the relevant facial sets are presented in Eriksson, Fienberg, Rinaldo, Sullivant (’06). Problem Understand the model polytope for M β . Extend to other random graph models. Rinaldo, Fienberg (May ’11) provide a more thorough treatment of algorithms for log-linear (toric) models. Rinaldo, P., Fienberg (May ’11) specialize these to network models. Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 7 / 17 / 23

  8. Motivation = size of random graph Example: The Collective Dynamics of Smoking in a Large Social Network (James Fowler) Node border= gender (red=female, blue=male). Arrow color = relation (purple=friend, green=spouse). Node color = smoking behavior (white=nonsmoker, gray=smoker); darker shades = more cigarettes consumed per day. Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 8 / 17 / 23

  9. The polytope of degree sequences Definition The polytope of degree sequences is P n := convhull ( { Ax , x ∈ G n } ) . Facet-defining inequalities of P n are known (Mehadev-Peled ’96). Theorem (Rinaldo-P.-Fienberg) Let x ∈ S n be the observed vector of edge counts. The MLE exists if and only if x j , i x i , j � � + ∈ int ( P n ) , i = 1 , . . . , n . N i , j N i , j j < i j > i Example (Stanley) f ( P 8 ) = (334982 , 1726648 , 3529344 , 3679872 , 2074660 , 610288 , 81144 , 3322 , 1) . We used polymake for the computations on small polytopes. Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 9 / 17 / 23

  10. Facial sets of the model polytope Proposition (Rinaldo-P.-Fienberg) A point y belongs to the interior of some face F of P n if and only if there exists a set F ⊂ { ( i , j ) , i < j } such that for any p = { p i , j : i < j , p i , j ∈ [0 , 1] } satisfying y = A n p , p i , j ∈ { 0 , 1 } if ( i , j ) �∈ F and p i , j ∈ (0 , 1) if ( i , j ) ∈ F . F is called a facial set of S n , and F c a co-facial set. The MLE does not exist for the graph x if and only if the set { ( i , j ): i < j , x i , j = 0 or N i , j } contains a co-facial set. Facial sets specify which probability parameters are estimable: only the probabilities { p i , j , ( i , j ) ∈ F} . Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 10 / 17 / 23

  11. Example: Co-facial sets for nonexistent MLEs × 0 × 0 N 1 , 2 × N 1 , 2 × 0 0 × N 3 , 4 N 3 , 2 × 0 × N 4 , 2 × × 0 0 0 × 0 0 N 1 , 2 × N 1 , 2 × 0 N 1 , 3 × N 1 , 3 N 2 , 3 × N 4 , 1 × × × N 1 , 2 0 × 0 0 N 2 , 3 × N 2 , 4 × Table: Co-facial sets for P 4 (empty cells indicate any entry values). Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 11 / 17 / 23

  12. Two related toric models where main Theorem applies Definition (Random graphs with fixed degree sequence) In the special case when N ij = 1 , the support S n reduces to G n := { 0 , 1 } ( n 2 ) , undirected simple graphs on n nodes. Corollary (RPF) A conjecture in Chatterjee-Diaconis-Sly (’10) is true: for the random graph model, the MLE exists if and only if d ( x ) ∈ int P n . Definition (The Rasch model) A random bipartite graph model, the support being G k , l , the set of bipartite graphs on k and l vertices. Theorem (RPF) The MLE of the Rasch model parameters exists if and only if d ( x ) ∈ int P p , q , the polytope of bipartite degree sequences. Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 12 / 17 / 23

  13. Extensions (1) Removing the sampling constraint: Let quantities N i , j be random! Theorem (Thanks to Haase and Yu) The model polytope has 3 n facets, and is obtained from the product of simplices by removing the vertices { e i × e ′ i } , i = 1 , . . . , n. (2) Specialize (1) to directed graphs without multiple edges. This is the Bradley-Terry model for pairwise comparisons. Theorem (Zermelo ’29, Ford ’57) If the graph is strongly connected, then the MLE exists. Algorithms for detecting co-facial sets still apply. The matrix of the � n � model polytope has dimension ( + n ) × n ( n − 1). 2 Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 13 / 17 / 23

  14. Extensions (3) A directed random graph model used in social networking: the p 1 model (Holland-Leinhardt ’81). � n � The model polytope is the Minkowski sum of polytopes. 2 Example (n=4) 4 10 = 1 , 048 , 576 different graphs x. Three cases of the p 1 model: 1 There are 225 , 025 points A 4 x, and the MLE exists for 7 , 983 . 2 349 , 500 , the MLE exists in 12 , 684 cases 3 583 , 346 , the MLE never exists. Theorem (Rinaldo-P.-Fienberg) Sufficient conditions for MLE existence, with large probability, as n grows. In the case of fixed degree sequence graphs, our asymptotic results improve those of Chatterjee-Diaconis ’11. Sonja Petrovi´ c (UIC – Penn State) www.math.uic.edu/ ∼ petrovic June 8, 2011 14 / 17 / 23

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend