just sort it
play

Just Sort It! A Simple and E ff ective Approach to Active Preference - PowerPoint PPT Presentation

Just Sort It! A Simple and E ff ective Approach to Active Preference Learning Lucas Maystre, Matthias Grossglauser School of Computer and Communication Sciences, EPFL ICML August 8 th , 2017 Goal Learning a ranking from noisy pairwise


  1. Just Sort It! A Simple and E ff ective Approach to Active Preference Learning Lucas Maystre, Matthias Grossglauser School of Computer and Communication Sciences, EPFL ICML — August 8 th , 2017

  2. Goal Learning a ranking from noisy pairwise comparisons . some outcomes are 
 inconsistent with the ranking Recover the ranking accurately , but sample sparingly . choose pairs of items adaptively based on previous observations 2

  3. ... 3

  4. Main Idea Use a sorting algorithm ! Ground-truth ranking Random ranking Outputs of Quicksort with noise Quicksort Quicksort Quicksort ML estimator ML estimate ranking 4

  5. Why Sorting-based AL ? Prior work : greedy active learning strategies 
 [Houlsby et al. NIPS 2012 , Chen et al. WSDM 2013 , Wang et al. KDD 2014 , ...] r details. Time (in seconds) to select the n +1-st T [s] pair among n items n = 10 2 n = 10 3 n = 10 4 Strategy uncertainty 0 . 05 0 . 5 11 entropy 0 . 3 40 — ε < 10 -5 
 KL-divergence 0 . 9 71 — orders of magnitude faster Mergesort ε ε ε Quicksort ε ε ε random ε ε ε ... and simpler to implement 5

  6. This Work Theory Practice Accuracy of output of single call to Empirical evaluation of sorting- Quicksort based active learning on real data vs. Quicksort Quicksort Quicksort ML estimator ranking 6

  7. Noise Model Bradley-Terry model [Zermelo 1928 , Bradley & Terry 1952 ] 1 p ( i � j ) = 1 + e − ( θ i − θ j ) θ 1 θ 2 θ 3 R error is likely error is unlikely 7

  8. Sorting Algorithm We analyze Quicksort . 2 5 5 4 7 1 3 6 2 4 1 1 3 5 7 6 6 1 2 4 3 3 5 6 7 1 2 3 4 5 6 7 8

  9. Model Parameters 1 p ( i � j ) = 1 + e − ( θ i − θ j ) Di ff iculty of ranking the items We assume that parameters are depends on | θ 2 - θ 1 |, | θ 3 - θ 2 |, ... drawn from a Poisson point process . Our approach : postulate a i.i.d. θ i +1 − θ i ∼ Exp( λ ) distribution over the parameters s.t. n = 20 E [ | θ i +1 − θ i | ] = λ − 1 controls the average amount of noise 9

  10. Main Result We measure the rank displacement of Theorem : if noise is Bradley-Terry and Quicksort's output σ . i.i.d. θ i +1 − θ i ∼ Exp( λ ) n X with high probability, ∆ ( σ ) = | σ ( i ) − i | i =1 ∆ ( σ ) = O ( λ 2 n ) Rank of item i in Quicksort's output with high probability, True rank of item i max | σ ( i ) − i | = O ( λ log n ) i 10

  11. Sketch of Proof Lemma : Displacement of Quicksort's w.h.p., ∆ ( σ ) = O ( λ 2 n ) output can be bounded by a sum over “individual errors” . ( 1 if outcome is incorrect z ij = 0 otherwise X ∆ ( σ ) ≤ 2 | i − j | comparison pair ( i,j ) ∈ E X E [ ∆ ( σ )] ≤ 2 | i − j | E [ z ij ] pairs sampled by Quicksort i<j that resulted in an error ∞ X k E [ z i,i + k ] = O ( λ 2 n ) ≤ 2 n Does not assume anything about the k =1 noise generating process. decreases exponentially fast in k 11

  12. Experimental Results Sushi dataset GIFGIF dataset × 10 6 1 . 6 2500 uncertainty Mergesort 1 . 4 Quicksort entropy 2000 KL-div random 1 . 2 Mergesort Displacement 1 . 0 Quicksort 1500 random 0 . 8 1000 0 . 6 0 . 4 500 0 . 2 0 0 . 0 0 2000 4000 6000 8000 10000 0 . 1 0 . 2 0 . 3 0 . 4 0 . 5 0 . 6 0 . 7 0 . 8 0 . 9 1 . 0 × 10 6 Number of comparisons c Number of comparisons c Matches the performance of alternatives at a fraction of computational cost . 12

  13. Conclusions • Use sorting algorithms to learn a GIFGIF dataset ranking from noisy comparisons! • works well in practice • computationally inexpensive • Some theoretical results under assumptions on the noise 6170 animated GIF images 
 2.7+ million pairwise comparisons 13

  14. 14

  15. How To Select Comparison Pairs? Batch setting Sequential setting Role of the spectral gap of the Greedy active learning strategies (EIG, comparison graph . uncertainty sampling, ...) [Negahban et al. NIPS 2012 ] 
 [Houlsby et al. NIPS 2012 ] 
 [Hajek et al. NIPS 2014 ] 
 [Chen et al. WSDM 2013 ] 
 [Vojnovic et al. ICML 2016 ] [Ailon et al. NIPS 2011 ] Bandit approaches [Szörényi et al. NIPS 2015 ] 
 [Heckel et al. arXiv 2016 ] Dueling bandits [Yue et al. COLT 2009 ] 15

  16. Empirical Validation max i | σ ( i ) − i | ∆ ( σ ) × 10 4 9 100 8 λ = 2 λ = 2 80 7 λ = 4 λ = 4 6 λ = 6 λ = 6 60 5 λ = 8 λ = 8 4 40 3 2 20 1 0 0 10 1 10 2 10 3 10 4 10 5 0 2000 4000 6000 8000 10000 Number of items n Number of items n 16

  17. Practical AL Strategy In practice : comparison budget is more than that of a single call to Quicksort. ∆ ( ˆ σ ) 900 800 Copeland 700 ML estimate 600 n = 200, λ = 4.0 500 400 300 200 100 0 0 20 40 60 80 100 Number of runs m 17

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend