Just Sort It!
A Simple and Effective Approach to Active Preference Learning
Lucas Maystre, Matthias Grossglauser School of Computer and Communication Sciences, EPFL
ICML — August 8th, 2017
Goal: learning a ranking from noisy pairwise comparisons.
2
Recover the ranking accurately, but sample sparingly.
- Noise: some outcomes are inconsistent with the ranking.
- Active learning: choose pairs of items adaptively, based on previous observations.
3
[Figure: ground-truth ranking, outputs of Quicksort with noise, and the resulting ML estimate]
4
Use a sorting algorithm!
[Diagram: starting from a random ranking, repeated calls to Quicksort feed comparison outcomes into an ML estimator, which outputs a ranking]
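As a rough illustration, the sorting-based strategy can be sketched as randomized Quicksort driven by a pairwise-comparison oracle. This is a hypothetical sketch, not the authors' code; `compare` stands in for whatever (noisy) comparison source is available:

```python
import random

def noisy_quicksort(items, compare):
    """Quicksort driven by a pairwise-comparison oracle.

    compare(a, b) returns True if it judges that a ranks before b;
    with a noisy oracle the output is only an approximate ranking.
    """
    if len(items) <= 1:
        return list(items)
    rest = list(items)
    # Pick a random pivot and partition the rest by querying the oracle once per item.
    pivot = rest.pop(random.randrange(len(rest)))
    left, right = [], []
    for x in rest:
        (left if compare(x, pivot) else right).append(x)
    return noisy_quicksort(left, compare) + [pivot] + noisy_quicksort(right, compare)
```

With a truthful comparator this is ordinary Quicksort; with a noisy one, each pair is queried at most once, which is what keeps the query budget low.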
5
Prior work: greedy active learning strategies [Houlsby et al. NIPS 2012, Chen et al. WSDM 2013, Wang et al. KDD 2014, ...]
Time (in seconds) to select the (n+1)-st pair among n items (ε < 10⁻⁵):

Strategy        n = 10²   n = 10³   n = 10⁴
uncertainty     0.05      0.5       11
entropy         0.3       40        —
KL-divergence   0.9       71        —
Mergesort       ε         ε         ε
Quicksort       ε         ε         ε
random          ε         ε         ε
... and simpler to implement
6
[Diagram: repeated Quicksort calls feed an ML estimator that outputs a ranking]

Theory: accuracy of the output of a single call to Quicksort
vs.
Practice: empirical evaluation of sorting-based active learning on real data
7
Bradley-Terry model [Zermelo 1928; Bradley & Terry 1952]

[Figure: parameters θ1, θ2, θ3 on the real line ℝ; when two parameters are close an error is likely, when they are far apart an error is unlikely]
We analyze Quicksort.
8
[Figure: example trace of Quicksort partitioning the sequence 2 5 4 7 1 3 6]
9
Difficulty of ranking the items depends on the gaps |θ2 − θ1|, |θ3 − θ2|, ...

Our approach: postulate a distribution over the parameters such that E[|θi+1 − θi|] = λ⁻¹, i.e., the parameters are drawn from a Poisson point process with i.i.d. gaps

θi+1 − θi ∼ Exp(λ)

where λ controls the average amount of noise. Comparison outcomes follow the Bradley-Terry model:

p(i ≻ j) = 1 / (1 + e^−(θi − θj))

[Figure: sample of n = 20 parameters on the real line]
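A minimal sketch of this generative model, with hypothetical helper names (`sample_parameters`, `bt_prob` are illustrative, not from the paper's code):

```python
import math
import random

def sample_parameters(n, lam, rng=random):
    """Utilities whose successive gaps are i.i.d. Exp(lam), so that
    E[theta_{i+1} - theta_i] = 1/lam (a Poisson process on the line)."""
    thetas = [0.0]
    for _ in range(n - 1):
        thetas.append(thetas[-1] + rng.expovariate(lam))
    return thetas

def bt_prob(theta_i, theta_j):
    """Bradley-Terry probability that item i beats item j."""
    return 1.0 / (1.0 + math.exp(theta_j - theta_i))

def noisy_outcome(theta_i, theta_j, rng=random):
    """Sample one comparison outcome under the Bradley-Terry model."""
    return rng.random() < bt_prob(theta_i, theta_j)
```

Larger λ means smaller average gaps, hence win probabilities closer to 1/2 and noisier outcomes.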
We measure the rank displacement of Quicksort's output σ.
10
∆(σ) = ∑_{i=1}^{n} |σ(i) − i|

where σ(i) is the rank of item i in Quicksort's output and i is its true rank.
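The displacement metric is straightforward to compute. A small helper (hypothetical names), with items labeled by their true rank so that the output order is a permutation of 0..n−1:

```python
def displacement(order):
    """Total displacement Delta(sigma): `order` lists the true ranks
    (0-based) in the order produced by the sort; each item contributes
    |output position - true rank|."""
    return sum(abs(pos - rank) for pos, rank in enumerate(order))

def max_displacement(order):
    """Largest per-item displacement, max_i |sigma(i) - i|."""
    return max(abs(pos - rank) for pos, rank in enumerate(order))
```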
Theorem: if the noise is Bradley-Terry and the gaps are i.i.d. θi+1 − θi ∼ Exp(λ), then with high probability

max_i |σ(i) − i| = O(λ log n)  and  ∆(σ) = O(λ²n).
Lemma: the displacement of Quicksort's output decomposes into its “individual errors”.
11
∆(σ) ≤ 2 ∑_{(i,j)∈E} |i − j|

where E is the set of pairs sampled by Quicksort whose comparison outcome was an error. The lemma does not assume anything about the noise-generating process; under the model above it yields ∆(σ) = O(λ²n) w.h.p.
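The lemma can be checked empirically with an instrumented Quicksort that logs the error set E. This is a hypothetical sketch, not the authors' code; it labels items by their true rank so an erroneous outcome is detectable:

```python
def quicksort_logging_errors(items, compare, errors):
    """Quicksort over items labeled by their true rank (0-based).

    Every compared pair whose outcome disagrees with the true order is
    appended to `errors` (the set E in the lemma)."""
    if len(items) <= 1:
        return list(items)
    pivot, rest = items[0], items[1:]
    left, right = [], []
    for x in rest:
        before = compare(x, pivot)
        if before != (x < pivot):  # outcome contradicts the true order
            errors.append((x, pivot))
        (left if before else right).append(x)
    return (quicksort_logging_errors(left, compare, errors)
            + [pivot]
            + quicksort_logging_errors(right, compare, errors))
```

After any run, the lemma guarantees `displacement of the output <= 2 * sum(abs(i - j) for (i, j) in errors)`.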
zij = 1 if the outcome for comparison pair (i, j) is incorrect, 0 otherwise.

E[∆(σ)] ≤ 2 ∑_{i<j} |i − j| E[zij] ≤ 2n ∑_{k=1}^{∞} k E[zi,i+k] = O(λ²n)

since E[zi,i+k] decreases exponentially fast in k.
12
[Figure: displacement vs. number of comparisons c on the Sushi dataset (strategies: uncertainty, entropy, KL-div, Mergesort, Quicksort, random) and on the GIFGIF dataset (Mergesort, Quicksort, random)]
Sorting matches the performance of the alternatives at a fraction of the computational cost.
13
Takeaway: sorting is a simple and effective approach to learning a ranking from noisy comparisons, with guarantees under assumptions on the noise.

GIFGIF dataset: 6,170 animated GIF images; 2.7+ million pairwise comparisons.
14
15
Batch setting: role of the spectral gap of the comparison graph. [Negahban et al., NIPS 2012; Hajek et al., NIPS 2014; Vojnovic et al., ICML 2016]
Sequential setting: greedy active learning strategies (EIG, uncertainty sampling, ...). [Houlsby et al., NIPS 2012; Chen et al., WSDM 2013; Ailon et al., NIPS 2011]
Bandit approaches: dueling bandits. [Yue et al., COLT 2009; Szörényi et al., NIPS 2015; Heckel et al., arXiv 2016]
16
[Figure: ∆(σ) vs. number of items n (left) and max_i |σ(i) − i| vs. n on a log axis (right), for λ = 2, 4, 6, 8]
In practice, the comparison budget often exceeds that of a single call to Quicksort.
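When the budget allows m independent Quicksort runs, their outputs can be aggregated; one option shown in the experiments is the Copeland score. A sketch under assumed conventions (each ranking lists items best-first; the function name is illustrative):

```python
def copeland_aggregate(rankings):
    """Combine m (noisy) rankings of the same items into one.

    Each item scores a point for every opponent it precedes in a strict
    majority of the rankings (its Copeland score); items are returned in
    order of decreasing score."""
    items = list(rankings[0])
    m = len(rankings)
    # Position of each item in each ranking, for O(1) pairwise lookups.
    pos = [{x: i for i, x in enumerate(r)} for r in rankings]
    score = {
        a: sum(
            sum(p[a] < p[b] for p in pos) > m / 2
            for b in items if b != a
        )
        for a in items
    }
    return sorted(items, key=lambda a: -score[a])
```

The ML estimate under the Bradley-Terry model is the other aggregation compared in the plot; Copeland is cheaper but typically less accurate.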
17
[Figure: ∆(σ̂) vs. number of Quicksort runs m, aggregating via Copeland scores vs. the ML estimate; n = 200, λ = 4.0]