Mallows ranking models: maximum likelihood estimate and regeneration - PowerPoint PPT Presentation

Mallows ranking models: maximum likelihood estimate and regeneration Wenpin Tang Department of Mathematics, UCLA June 14, 2019 Wenpin Tang Department of Mathematics, UCLA

Background Ranked data appear in many problems of social choice , user recommendation , information retrieval ... Examples : ranking candidates by voters in political elections ; preference list of competing items collected from consumers ; document retrieval by aggregating a ranked list of webpages output by various search algorithms. Wenpin Tang Department of Mathematics, UCLA

Mathematical models Ranking = Permutation. Given n items, a ranking π ∈ S n is described by word list : ( π ( 1 ) , π ( 2 ) , . . . , π ( n )) , ranked list : ( π − 1 ( 1 ) | π − 1 ( 2 ) | . . . | π − 1 ( n )) . π ( i ) = j : the item i has rank j , and π − 1 ( j ) = i : the j th most preferred is item i . Mallows model : P θ,π 0 , d ( π ) ∝ e − θ d ( π,π 0 ) for π ∈ S n , θ > 0 is the dispersion parameter , π 0 is the central ranking , d ( · , · ) is a discrepancy function which is right invariant : d ( π, σ ) = d ( π ◦ σ − 1 , id ) for π, σ ∈ S n . Wenpin Tang Department of Mathematics, UCLA

Mallows model Diaconis’ list of d ( · , · ) : Mallows’ θ model : d ( π, σ ) = � n i = 1 ( π ( i ) − σ ( i )) 2 is the Spearman’s rho , Mallows’ φ model : d ( π, σ ) = inv ( π ◦ σ − 1 ) is the Kendall’s tau ... Mallows’ φ model is more interesting, since it is an instance of two large models, Fligner and Verducci (’86, ’88) : distance-based ranking models , multistage ranking models . Correctness measure : inversion table ( s j ( π )) 1 ≤ j ≤ n − 1 � s j ( π ) := π − 1 ( j ) − 1 − 1 { π − 1 ( j ′ ) <π − 1 ( j ) } . j ′ < j n � exp ( − θ s j ( π ◦ π − 1 P π 0 ,θ ∝ 0 )) . j = 1 Wenpin Tang Department of Mathematics, UCLA

MLE � θ , � π 0 MLE � θ : easy by convex optimization. MLE � π 0 : Kemeny’s consensus ranking problem N � inv ( π i ◦ π − 1 � π 0 := argmin π 0 0 ) . i = 1 This problem is NP-hard , with a few heuristic algorithms. Theoretical properties of � θ , � π 0 : Are the MLEs � θ , � π 0 consistent ? 1 Is the MLE � θ unbiased ? 2 How fast do MLEs � π 0 converge to π 0 ? 3 Not well studied, only Mukherjee (’16) considered � θ . Wenpin Tang Department of Mathematics, UCLA

Properties of MLE Theorem Let � θ , � π 0 be the MLE of θ , π 0 with N samples. 1 E θ,π 0 � θ > θ. 2 � � � − N � � − N 2 cosh θ cosh θ ≤ P θ,π 0 ( � π 0 � = π 0 ) ≤ ( n − H n ) n ! . π N 2 2 Hint : For π ∼ Mallows’ φ , inv ( π ) is decomposed as independent truncated geometric variables . Then apply LDP bounds. Wenpin Tang Department of Mathematics, UCLA

Infinite Mallows models Motivation : Tackle the problem of ranking a large number items − → infinite ranking/permutation models.   t �  − θ s j ( π ◦ π − 1  , P θ,π 0 ( π ) ∝ exp 0 ) j = 1 regarded as a t -marginal of random permutation of N + . Theory : Pitman and Tang Regenerative random permutations of integers , AoP (’19) − → Infinite Mallows model enjoys the regenerative property : it is a concatenation of i.i.d. indecomposable blocks ( 2 , 3 , 4 , 1 , 6 , 8 , 7 , 10 , 5 , 9 , 12 , 13 , 11 , . . . ) � �� L 1 = 4 L 2 = 6 L 3 = 3 Wenpin Tang Department of Mathematics, UCLA

‘t’ selection algorithm Question : How to choose the model size t ? 1 Fact : E L = ( e − θ ; e − θ ) ∞ . With ‘t’ selected, we fit a Generalized Mallows model :   t � θ j s j ( π ◦ π − 1  −  . P � θ,π 0 ( π ) ∝ exp 0 ) j = 1 Wenpin Tang Department of Mathematics, UCLA

Synthetic data T ABLE : Accuracy of estimated rank & average training time for 50 simulated data with t max = 10 (resp. t max = 20, t max = 40) and � θ = ( 1 , 0 . 975 , . . . , 0 . 775 , 0 , . . . ) (resp. � θ = ( 1 , 0 . 975 , . . . , 0 . 525 , 0 , . . . ) , � θ = ( 1 , 0 . 975 , . . . , 0 . 025 , 0 , . . . ) ) by the IGM model of model size t = 1, t = 10 and Algorithm. t max = 10 IGM( t = 1) IGM( t = 10) A LGO 100 % 100 % 100 % A CC . EST . RANK 100 % 100 % 100 % 100 % 100 % 100 % A VE . TIME 1 . 56 S 1 . 56 S 1 . 56 S 14 . 45 S 2 . 80 S t max = 20 IGM( t = 1) IGM( t = 10) A LGO 100 % 100 % A CC . EST . RANK 94 % 100 % 100 % 100 % 100 % A VE . TIME 5 . 73 S 5 . 73 S 5 . 73 S 54 . 45 S 24 . 42 S t max = 40 IGM( t = 1) IGM( t = 10) A LGO 100 % 100 % A CC . EST . RANK 82 % 100 % 100 % 100 % 100 % A VE . TIME 70 . 26 S 70 . 26 S 70 . 26 S 684 . 65 S 391 . 20 S Wenpin Tang Department of Mathematics, UCLA

The algorithm is also applied to other data as APA data, university’s homepage search data... Thank you for your attention ! Wenpin Tang Department of Mathematics, UCLA

Mallows ranking models: maximum likelihood estimate and regeneration - PowerPoint PPT Presentation

Mallows ranking models: maximum likelihood estimate and regeneration Wenpin Tang Department of Mathematics, UCLA June 14, 2019 Wenpin Tang Department of Mathematics, UCLA Background Ranked data appear in many problems of social choice , user

Maximum Likelihood properties Maximum parsimony Maximum likelihood Experimental design

Maximum likelihood models Tues. Feb. 27, 2018 1 Overview of today Informal notion of

Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Maximum likelihood

Lecture 3. Inadmissibility of Maximum Likelihood Estimate and James-Stein Estimator Yuan Yao

Phylogenetic trees IV Maximum Likelihood Gerhard Jger ESSLLI 2016 Gerhard Jger Maximum

Curve Fitting Re-visited, Bishop1.2.5 Maximum Likelihood Bishop 1.2.5 Model Likelihood

Maximum likelihood parameter estimation Maximum likelihood parameter estimation For an HMM

15-388/688 - Practical Data Science: Maximum likelihood estimation, nave Bayes J. Zico Kolter

Maximum Likelihood Estimation CS 446 Maximum likelihood: abstract formulation Weve had one

Maximum Likelihood Estimation CS 446 Maximum likelihood: abstract formulation Weve had one

Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Output of the estimation

Phylogenetic trees IV Maximum Likelihood Gerhard Jger Words, Bones, Genes, Tools February 28,

MAXIMUM CARDS MAXIMUM CARDS What is a Maximum Card ? The Maximum Card is the one which contains a

Outline n Maximum likelihood (ML) n Priors, and maximum a posteriori (MAP) n

CS 287 Advanced Robotics (Fall 2019) Lecture 13: Kalman Smoother, Maximum A Posteriori, Maximum

Chapter 8: Estimation In this chapter we will cover: 1. The likelihood and maximum likelihood

TWO DEPENDENCE MEASURES FOR MULTIVARIATE EXTREME VALUE DISTRIBUTIONS WEISSMAN, ISHAY Technion

Center for Science of Information Bryn Mawr Structural Information: Howard MIT Progress

NEAT001/ANRS 143 Trial Darunavir/r + Raltegravir versus Darunavir/r + TDF-FTC NEAT001/ANRS143:

antiplatelet therapy as appropriate duration) study Masato Nakamura, MD. Phd. Disclosures Grants

Divide-Conquer-Glue Algorithms Divide-and-conquer. Mergesort and Counting Inversions Divide

Correlations between Students Behaviour in Learning Management Systems and their Learning

Computational social choice Lirong Xia Todays schedule Computational social choice: the

Introduction Consider the following scenario: You have just developed a new algorithm A that,