PRanking with Ranking Koby Crammer Technion Israel Institute of - - PowerPoint PPT Presentation

pranking with ranking
SMART_READER_LITE
LIVE PREVIEW

PRanking with Ranking Koby Crammer Technion Israel Institute of - - PowerPoint PPT Presentation

PRanking with Ranking Koby Crammer Technion Israel Institute of Technology Based on joint work with Yoram Singer at the Hebrew University of Jerusalem Problem Machine Prediction Users Rating Ranking 3 3 0 3 1 Loss Ranking


slide-1
SLIDE 1

PRanking with Ranking

Based on joint work with Yoram Singer at the Hebrew University of Jerusalem

Koby Crammer

Technion – Israel Institute of Technology

slide-2
SLIDE 2

Problem

3 3 1

Machine Prediction User’s Rating Ranking Loss

3

slide-3
SLIDE 3

Ranking – Formal Description

  • Instances
  • Labels
  • Structure
  • Ranking rule
  • Ranking Loss
  • Algorithm works in rounds
  • On each round the ranking

algorithm :

– Gets an input instance – Outputs a rank as prediction – Receives the correct rank- value – Computes loss – Updates the rank-prediction rule

Online Framework Problem Setting

1 2  3  4  5

x1 is preferred over x2

slide-4
SLIDE 4

Goal

  • Algorithms Loss
  • Loss of a fixed function
  • Regret
  • No statistical assumptions over data
  • The algorithm should do well irrespectively of

specific sequence of inputs and target labels

Lt = yi − ˆ y

i i=1 t

Lt f

( ) =

yi − f xi

( )

i=1 t

Lt − inf

f ∈FLt f

( )

slide-5
SLIDE 5

Background

Binary Classification

slide-6
SLIDE 6

1 2

w

The Perceptron Algorithm

Rosenblatt, 1958

  • Hyperplane w
slide-7
SLIDE 7

1 2

w

(X,1)

  • Hyperplane w
  • Get new instance x
  • Classify x :

sign( )

The Perceptron Algorithm

Rosenblatt, 1958

slide-8
SLIDE 8
  • Hyperplane w
  • Get new instance x
  • Classify x :

sign( )

  • Update (in case of a mistake)

w

1 2 1 2

w

(X,1)

The Perceptron Algorithm

Rosenblatt, 1958

slide-9
SLIDE 9

A Function Class for Ranking

slide-10
SLIDE 10

Our Approach to Ranking

  • Project
slide-11
SLIDE 11

Our Approach to Ranking

  • Project
  • Apply Thresholds

< >

4 3 2 1 Rank

slide-12
SLIDE 12

Update of a Specific Algorithm

it if its not Broken Least change as possible One step at a time

slide-13
SLIDE 13

PRank

  • Direction w,

Thresholds

w 5 1 3 2 4

Rank Levels Thresholds

slide-14
SLIDE 14

PRank

  • Direction w,

Thresholds

  • Rank a new instance x

w 5 1 3 2 4

slide-15
SLIDE 15

PRank

  • Direction w,

Thresholds

  • Rank a new instance x
  • Get the correct rank y

w 5 1 3 2 4

Correct Rank Interval

slide-16
SLIDE 16

PRank

  • Direction w,

Thresholds

  • Rank a new instance x
  • Get the correct rank y
  • Compute Error-Set E

w 5 1 3 2 4

slide-17
SLIDE 17

PRank – Update

  • Direction w,

Thresholds

  • Rank a new instance x
  • Get the correct rank y
  • Compute Error-Set E
  • Update :

w

slide-18
SLIDE 18

PRank – Update

  • Direction w,

Thresholds

  • Rank a new instance x
  • Get the correct rank y
  • Compute Error-Set E
  • Update :

– –

w x x w

slide-19
SLIDE 19

PRank – Summary of Update

  • Direction w,

Thresholds

  • Rank a new instance x
  • Get the correct rank y
  • Compute Error-Set E
  • Update :

– –

w x x w x

slide-20
SLIDE 20

Predict : Get the true rank y Compute Error set : Get an instance x Maintain No

?

Yes Update

The PRank Algorithm

slide-21
SLIDE 21

Analysis

Two Lemmas

slide-22
SLIDE 22

Consistency

  • Can the following happen?

w b 4 b 2 b 2 b 3 b 1

slide-23
SLIDE 23
  • Can the following happen?
  • The order of the thresholds is preserved

after each round of PRank : .

Consistency

No

w

b

4

b

2

b

2

b

3

b

1

slide-24
SLIDE 24

Given :

  • Arbitrary input sequence

Easy Case:

  • Assume there exists a model that ranks all

the input instances correctly

– The total loss the algorithm suffers is bounded

Hard Case:

  • In general

– A “regret” is bounded

Regret Bound

Lt − inf

f ∈F

˜ L

t f

( )

slide-25
SLIDE 25

Margin(x,y) = min

Ranking Margin

w 1 2 4 5 3 ,

slide-26
SLIDE 26

Margin(x,y) = min

Ranking Margin

w 1 2 4 5 3 ,

slide-27
SLIDE 27

Margin(x,y) = min

Ranking Margin

w 1 2 4 5 3 ,

slide-28
SLIDE 28

Margin(x,y) = min

Ranking Margin

Margin = min Margin

w 1 2 4 5 3 ,

slide-29
SLIDE 29
  • Input sequence ,
  • Norm of instances is bounded
  • Ranked correctly by a normalized ranker

with Margin>0

Mistake Bound

Number of Mistakes PRank Makes

Given : Then :

slide-30
SLIDE 30

Exploit Structure

Loss Range Structure Classification

None

Regression

Metric

Ranking

Order

Under Constraint Over Constraint

slide-31
SLIDE 31

Other Approaches

  • Treat Ranking as Classification or Regression
  • Reduce a ranking problem into a classification

problem over pair of examples

– Not simple to combine preferences predictions over pairs into a singe consistent ordering – No simple adaptation for online settings

Basu, Hirsh, Cohen 1998 Freund, Lyer, Schapire, Singer 1998 Herbrich, Graepel, Obermayer 2000 E.g. E.g.

slide-32
SLIDE 32

Empirical Study

slide-33
SLIDE 33

An Illustration

PRank Ranking MC-Perceptron Classification Widrow-Hoff Regression

  • Five concentric ellipses
  • Training set of 50 points
  • Three approaches
  • Pranking
  • Classification
  • Regression
slide-34
SLIDE 34

Each-Movie database

  • 74424 registered Users
  • 1648 listed Movies
  • Users ranking of movies
  • 7451 Users saw >100 movies
  • 1801 Users saw >200 movies
slide-35
SLIDE 35

Ranking Loss, 100 Viewers

Rank Loss Round WH MC-Perceptron PRank Over constrained Under constrained Accurately constrained Regression Classification PRank

slide-36
SLIDE 36

Ranking Loss, 200 Viewers

WH MC-Perceptron PRank Round Rank Loss Regression Classification PRank

slide-37
SLIDE 37

Demonstration

slide-38
SLIDE 38

(1) User choose movies from this list (2) Movies chosen and ranked by user

slide-39
SLIDE 39

(3) Press the ‘learn’ key. The systems learns the user’s taste (4) The system re-ranks the training set (5) The system re-ranks a new fresh set of yet unseen movies

slide-40
SLIDE 40

(6) Press the ‘flip’ button to see what movies you should not view (7) The flipped list

slide-41
SLIDE 41
  • Many alternatives to formulate ranking
  • Choose one that models best your problem
  • Exploit and Incorporate structure
  • Specifically:

– Online algorithm for ranking problems via projections and conservative update of the projection’s direction and the threshold values – Experiments on a synthetic dataset and on Each- Movie data set indicate that the PRank algorithm performs better then algorithms for classification and regression