Invited Talk:
Object Ranking
Toshihiro Kamishima
http://www.kamishima.net/
National Institute of Advanced Industrial Science and Technology (AIST), Japan Preference Learning Workshop (PL-09) @ ECML/PKDD 2009, Bled, Slovenia
Object Ranking: the task of learning a function for ranking
Discussion of methods for this task, connecting them with probability distributions of rankings, and several properties of object ranking methods
2
Order / Ranking
prefer → not prefer
“I prefer fatty tuna to squid,” but the degree of preference is not specified
Fatty Tuna ≻ Squid ≻ cucumber roll
What's object ranking?
Definition of an object ranking task Connection with regression and ordinal regression Measuring the degree of preference
Probability distributions of rankings
Thurstonian, paired comparison, distance-based, and multistage
Six methods for object ranking
Cohenʼs method, RankBoost, SVOR (a.k.a. RankingSVM), OrderSVM, ERR, and ListNet
Properties of object ranking methods
Absolute and relative ranking
Conclusion
3
4
sample order set: O1 = x1≻x2≻x3, O2 = x1≻x5≻x2, O3 = x2≻x1
feature space: each object x1, …, x5 is represented by a feature vector
ranking method: learns a ranking function from the sample orders and the feature vectors of objects
unordered objects Xu = {x1, x3, x4, x5} (feature values are known)
estimated order: Ôu = x1≻x5≻x4≻x3
5
Object Ranking: regression targeting orders
generative model of regression:
input (X1, X2, X3) → regression curve → true values (Y1, Y2, Y3) → additive noise → sample (Y′1, Y′2, Y′3)
generative model of object ranking:
input (X1, X2, X3) → ranking function → regression order (X1 ≻ X3 ≻ X2) → permutation noise (random permutation) → sample order (X1 ≻ X2 ≻ X3)
6
Ordinal Regression [McCullagh 80, Agresti 96]: regression whose target variable is ordered categorical. An ordered categorical variable can take one of a predefined set of values that are ordered.
Differences between “ordered categories” and “orders”:
Ordered category: the # of grades is finite (ex: for the domain {good, fair, poor}, the # of grades is limited to three); absolute information is contained (ex: “good” indicates absolutely preferred).
Order: the # of grades is infinite; it contains purely relative information (ex: “x1 ≻ x2” indicates only that x1 is relatively preferred to x2).
Object ranking is a more general problem than ordinal regression as a learning task.
7
Ranking Method
Objects are sorted according to the degree of preference.
ex: the user prefers item A most and item B least: itemA ≻ itemC ≻ itemB
Scoring Method / Rating Method
Scales with scores (ex: 1, 2, 3, 4, 5) or ratings (ex: gold, silver, bronze) are used.
ex: the user selects “5” on a five-point scale if he/she prefers item A
8
Difficulty in calibration over subjects / items:
Mappings from the preference in users' minds to rating scores differ among users.
Standardizing rating scores by subtracting the user/item mean score is very important for good prediction [Herlocker+ 99, Bell+ 07].
Replacing scores with rankings contributes to good prediction, even if scores are standardized [Kamishima 03, Kamishima+ 06].
Presentation bias: the wrong presentation of rating scales causes biases in scores.
When neutral scores are prohibited, users select positive scores more frequently [Cosley+ 03].
Showing predicted scores affects users' evaluations [Cosley+ 03].
9
Lack of absolute information:
Orders don't provide the absolute degree of preference; even if “x1 ≻ x2” is specified, x1 might be the second worst.
Difficulty in evaluating many objects:
The ranking method is not suited to evaluating many objects at the same time; users cannot correctly sort hundreds of objects, and in such a case have to sort small groups of objects many times.
10
4 types of distributions for rankings [Critchlow+ 91, Marden 95]
The permutation noise part of the generative model of object ranking (regression order + permutation noise) is modeled using probability distributions:
Thurstonian: objects are sorted according to the objects' scores
Paired comparison: objects are ordered pairwise, and these ordered pairs are combined
Distance-based: distributions are defined based on the distance between a modal order and a sample one
Multistage: objects are sequentially arranged from top to end
11
Thurstonian model (a.k.a. order statistics model) [Thurstone 27]
Objects are sorted according to the objects' scores: for each object, a score is sampled from the associated distribution, and the objects are sorted according to the sampled scores.
Normal distribution of scores: Thurstone's law of comparative judgment.
Gumbel distribution of scores: the CDF is 1 − exp(−exp((x_i − µ_i)/σ)).
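As a concrete illustration, the Gumbel case can be sketched in a few lines (a hypothetical helper, not from the talk): draw one score per object by inverting the CDF above, then sort by the sampled scores.

```python
import math
import random

def thurstonian_sample(mus, sigma=1.0, rng=random):
    """Sample one ranking from a Thurstonian model with Gumbel score noise.

    mus: dict mapping an object to its location parameter mu_i.
    Inverting the CDF 1 - exp(-exp((x - mu)/sigma)) gives
    x = mu + sigma * log(-log(1 - u)) for uniform u in (0, 1).
    Returns the objects sorted from highest to lowest sampled score.
    """
    scores = {}
    for obj, mu in mus.items():
        u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)  # keep both logs finite
        scores[obj] = mu + sigma * math.log(-math.log(1.0 - u))
    return sorted(scores, key=scores.get, reverse=True)
```

Repeating the draw yields a distribution over rankings whose modal order follows the location parameters µ_i.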
12
Paired comparison model
Objects are ordered pairwise, and these ordered pairs are combined.
If the pairwise results are cyclic (ex: A > B, C > A, B > C), abandon and retry; if they are acyclic (ex: A > B, A > C, B > C), generate the order A > B > C.
Babington Smith model [Babington Smith 50]: saturated model with nC2 parameters.
Bradley-Terry model [Bradley+ 52]: parameterization Pr[x_i ≻ x_j] = v_i / (v_i + v_j).
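The “abandon and retry” generation can be sketched as follows for the Bradley-Terry parameterization (a hypothetical sampler, assuming a positive worth v_i per object; it uses the fact that a tournament is acyclic exactly when its win counts are a permutation of 0..n−1):

```python
import random
from itertools import combinations

def sample_order_bradley_terry(worth, rng=None):
    """Draw one total order under the Bradley-Terry paired comparison model.

    Each pair is ordered independently with Pr[i beats j] = v_i / (v_i + v_j);
    if the resulting pairwise orders are cyclic, abandon and retry.
    worth: dict mapping an object to its positive parameter v_i.
    """
    rng = rng or random.Random()
    objects = list(worth)
    while True:
        wins = {obj: 0 for obj in objects}
        for i, j in combinations(objects, 2):
            winner = i if rng.random() < worth[i] / (worth[i] + worth[j]) else j
            wins[winner] += 1
        # Acyclic (transitive) tournaments are exactly those whose win
        # counts are 0, 1, ..., n-1 in some order.
        if sorted(wins.values()) == list(range(len(objects))):
            return sorted(objects, key=wins.get, reverse=True)
```

For three objects the expected number of retries is small; the rejection rate grows with n, which is one reason the saturated Babington Smith model is rarely used directly.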
13
Spearman distance: the squared Euclidean distance between two rank vectors.
ex: O1 = D > B > A > C and O2 = A > B > C > D have rank vectors (3, 2, 4, 1) and (1, 2, 3, 4) over the objects (A, B, C, D).
Kendall distance: the # of discordant pairs between two orders.
ex: O1 = B > A > C and O2 = A > B > C decompose into the ordered pairs {B > A, A > C, B > C} and {A > B, A > C, B > C}; only the pair (A, B) is discordant.
Spearman footrule: the Manhattan distance between two rank vectors.
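All three distances follow directly from rank vectors; a small sketch (helper names are mine) that reproduces the examples above:

```python
from itertools import combinations

def rank_vector(order):
    """Map an order (listed from most to least preferred) to 1-based ranks."""
    return {x: r for r, x in enumerate(order, start=1)}

def spearman_distance(o1, o2):
    """Squared Euclidean distance between the two rank vectors."""
    r1, r2 = rank_vector(o1), rank_vector(o2)
    return sum((r1[x] - r2[x]) ** 2 for x in r1)

def spearman_footrule(o1, o2):
    """Manhattan distance between the two rank vectors."""
    r1, r2 = rank_vector(o1), rank_vector(o2)
    return sum(abs(r1[x] - r2[x]) for x in r1)

def kendall_distance(o1, o2):
    """Number of object pairs on which the two orders disagree."""
    r1, r2 = rank_vector(o1), rank_vector(o2)
    return sum(1 for x, y in combinations(o1, 2)
               if (r1[x] - r1[y]) * (r2[x] - r2[y]) < 0)
```

On the slide's examples: the Spearman distance between D>B>A>C and A>B>C>D is 4 + 0 + 1 + 9 = 14, and the Kendall distance between B>A>C and A>B>C is 1.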
14
Distance-based model [Mallows 57]
Distributions are defined based on the distance between orders:
Pr[O] = C(λ) exp(−λ d(O, O0))
where C(λ) is a normalization factor, O0 is the modal order/ranking, λ is a dispersion parameter, and d is a distance.
Spearman distance gives Mallows' θ model; Kendall distance gives Mallows' φ model. These are special cases (φ = 1 or θ = 1, respectively) of Mallows' model, a paired comparison model defined by:
Pr[x_i ≻ x_j] = θ^(i−j) φ^(−1) / (θ^(i−j) φ^(−1) + θ^(j−i) φ)
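For small n the distance-based model can be evaluated by brute force; a sketch of the Kendall-distance (φ-model) case, with the normalization factor C(λ) computed by enumerating all permutations (function names are mine):

```python
import math
from itertools import combinations, permutations

def kendall(o1, o2):
    """Number of discordant object pairs between two orders."""
    pos = {x: k for k, x in enumerate(o2)}
    return sum(1 for i, j in combinations(range(len(o1)), 2)
               if pos[o1[i]] > pos[o1[j]])

def mallows_prob(order, modal, lam):
    """Pr[O] = C(lam) * exp(-lam * d(O, O0)) under the Kendall distance,
    normalized by enumerating all n! permutations (small n only)."""
    z = sum(math.exp(-lam * kendall(list(p), modal))
            for p in permutations(modal))
    return math.exp(-lam * kendall(list(order), modal)) / z
```

With λ = 0 every order is equally likely; as λ grows, probability mass concentrates on the modal order O0.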
15
Multistage model
Objects are sequentially arranged from top to end.
Plackett-Luce model [Plackett 75]: with a parameter θ_x per object, the probability of the order A > C > D > B is built stage by stage:
Pr[A] = θ_A / (θ_A + θ_B + θ_C + θ_D)   (the param of the top object over the total sum of params)
Pr[A>C | A] = θ_C / (θ_B + θ_C + θ_D)   (the param for A is eliminated)
Pr[A>C>D | A>C] = θ_D / (θ_B + θ_D)   (the param of the next ranked object over the remaining params)
Pr[A>C>D>B | A>C>D] = θ_B / θ_B = 1
Pr[A>C>D>B] = Pr[A] · Pr[A>C | A] · Pr[A>C>D | A>C] · 1
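The stage-by-stage construction translates directly into code (a minimal sketch; `theta` maps each object to its positive parameter):

```python
def plackett_luce_prob(order, theta):
    """Probability of a complete order under the Plackett-Luce model:
    at each stage the next object is chosen with probability
    theta[x] / (sum of theta over the not-yet-ranked objects)."""
    remaining = sum(theta[x] for x in order)
    prob = 1.0
    for x in order:
        prob *= theta[x] / remaining
        remaining -= theta[x]
    return prob
```

Because each stage renormalizes over the not-yet-ranked objects, the probabilities of all n! orders sum to one.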
16
Object Ranking Methods: connection between distributions and the permutation noise model
Thurstonian: Expected Rank Regression (ERR)
Paired comparison: Cohen's method
Distance-based: RankBoost, Support Vector Ordinal Regression (SVOR, a.k.a. RankingSVM), OrderSVM
Multistage: ListNet
permutation noise model: orders are permuted according to the distributions of rankings
regression order model: representation of the most probable rankings
loss function: the definition of the “goodness of model”
17
regression order model:
linear ordering (based on the preference of object i to object j): Cohen's method. This is known as the Linear Ordering Problem in the OR literature [Grötschel+ 84], and is NP-hard => greedy search solution, O(n^2).
sorting by scores (a score per object i): ERR, RankBoost, SVOR, OrderSVM, ListNet. Computational complexity for sorting is O(n log(n)).
18
Cohen's method [Cohen+ 99]
permutation noise model = paired comparison; regression order model = linear ordering
Training sample orders (ex: A≻B≻C, D≻E≻B≻C, A≻D≻C) are decomposed into ordered pairs (AB, AC, BC; DE, DB, DC, …; AD, AC, DC), from which the preference function that one object precedes the other is learned:
f(x_i, x_j) = Pr[x_i ≻ x_j; x_i, x_j]
Unordered objects can then be sorted by solving the linear ordering problem.
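The O(n²) greedy search can be sketched as follows (assuming an already-learned preference function `pref(a, b)`; following Cohen et al.'s greedy scheme, the object with the largest “potential” is emitted first):

```python
def greedy_linear_ordering(objects, pref):
    """Greedy approximation to the NP-hard linear ordering problem.

    pref(a, b): learned preference that object a precedes object b.
    Repeatedly append the remaining object with the largest potential,
    sum(pref(a, b) - pref(b, a)) over the other remaining objects b.
    """
    rest = list(objects)
    ordered = []
    while rest:
        best = max(rest, key=lambda a: sum(pref(a, b) - pref(b, a)
                                           for b in rest if b != a))
        ordered.append(best)
        rest.remove(best)
    return ordered
```

When the learned preferences are fully consistent, the greedy order coincides with the exact optimum; otherwise it is only an approximation.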
19
RankBoost [Freund+ 03]
permutation noise model = distance-based (Kendall distance); regression order model = sorting by scores
A linear combination of weak hypotheses is found by boosting; each weak hypothesis h_t supplies partial information about the target order (for a pair A, B, either h_t(A) ≻ h_t(B) or h_t(B) ≻ h_t(A)).
score function: f(x) = Σ_{t=1..T} α_t h_t(x)
This function is learned so as to minimize the number of discordant pairs, i.e., to minimize the Kendall distance between the samples and the regression order.
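The final ranker is just the weighted vote of the weak hypotheses; a sketch of that last step only (the boosting loop that fits the weights α_t is omitted):

```python
def rankboost_score(x, hypotheses, alphas):
    """RankBoost's combined score function f(x) = sum_t alpha_t * h_t(x)."""
    return sum(a * h(x) for a, h in zip(alphas, hypotheses))

def rank_by_score(objects, hypotheses, alphas):
    """Sort objects by the combined score, most preferred first."""
    return sorted(objects,
                  key=lambda x: rankboost_score(x, hypotheses, alphas),
                  reverse=True)
```

Any method in the “sorting by scores” family produces its estimated order this way, differing only in how the score function is learned.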
Support Vector Ordinal Regression (SVOR; a.k.a. RankingSVM) [Herbrich+ 98, Joachims 02]
20
permutation noise model = distance-based (Kendall distance); regression order model = sorting by scores
Find a score function that maximally separates preferred objects from non-preferred objects.
ex: the sample orders A > B > C and A > D > C induce score(A) > score(B) > score(C) and score(A) > score(D) > score(C), with margins such as margin_AB, margin_BC, margin_AC between the scores of ordered pairs.
Objective: maximize the margin_XY over the ordered pairs (X, Y).
21
OrderSVM [Kazawa+ 05]
permutation noise model = distance-based (Spearman footrule); regression order model = sorting by scores
Find a score function that maximally separates higher-ranked objects from lower-ranked ones on average.
ex: for the sample order A > B > C, at rank 1 the score of A is separated from those of B and C (margin1_AB, margin1_AC); at rank 2 the scores of A and B are separated from that of C (margin2_AC, margin2_BC).
Objective: maximize the margins margin^j_XY, summed over the separation ranks j.
22
SVOR (RankingSVM): minimizing the # of misclassifications in the orders of object pairs, i.e., minimizing the Kendall distance between the regression order and the samples.
OrderSVM: separating the objects ranked lower than j-th from the higher ones, with these separations summed over all ranks j; the # of misclassifications over the separation thresholds equals the absolute difference between ranks (ex: if object A is ranked 3rd in a sample and 5th in the regression order, it is misclassified at thresholds 3 and 4), i.e., minimizing the Spearman footrule between the regression order and the samples.
23
Expected Rank Regression (ERR) [Kamishima+ 05]
permutation noise model = Thurstonian; regression order model = sorting by scores
Expected ranks in a complete order are estimated from the samples, and a score function is learned by regression from pairs of expected ranks and feature vectors of all objects.
complete order (ex: A ≻ B ≻ C ≻ D ≻ E ≻ F): consists of all possible objects, free from permutation noise, unobserved
sample order (ex: D ≻ C ≻ A): consists of sub-sampled objects, with permutation noise, observed
Because expected ranks are treated as the location parameters of score distributions, this method is based on the Thurstonian model.
24
ex: the unobserved complete order is A < B < C < D < E; the objects C and E are missed, so the observed sample order is A < B < D.
expected rank: from an object's rank in the observed order, the expectation of its rank in the unobserved complete order is computed via order statistics [Arnold+ 92].
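Under this order-statistics view, the expected rank of the object at position r (1-based) in a sample order of length k, out of n objects in total, can be estimated as r(n+1)/(k+1); a sketch of that step (the closed form is my paraphrase of the ERR estimation step, not copied from the talk):

```python
def expected_ranks(sample_order, n_total):
    """Estimate expected ranks in the unobserved complete order of n_total
    objects from a sample order of length k: the object at position r gets
    expected rank r * (n_total + 1) / (k + 1)."""
    k = len(sample_order)
    return {x: r * (n_total + 1) / (k + 1)
            for r, x in enumerate(sample_order, start=1)}
```

A score function is then fit by ordinary regression from the objects' feature vectors to these expected ranks.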
25
ListNet [Cao+ 07]
permutation noise model = multistage; regression order model = sorting by scores
A straightforward modification of the Plackett-Luce model, with parameters optimized using neural networks: the parameters of objects are replaced with score functions f(x_i) of object features, so at each stage the score of the next ranked object is divided by the sum of scores of the not-yet-ranked objects. The score functions f(x_i) are linear, and their weights are estimated by maximum likelihood.
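A minimal sketch of the resulting likelihood (linear scores, exponentiated to keep the Plackett-Luce parameters positive; the optimization loop that fits the weights is omitted):

```python
import math

def listnet_nll(order_features, w):
    """Negative log-likelihood of one sample order under Plackett-Luce with
    linear scores: the parameter of an object with feature vector x is
    exp(w . x).  order_features lists the feature vectors of the ranked
    objects from most to least preferred."""
    params = [math.exp(sum(wk * xk for wk, xk in zip(w, x)))
              for x in order_features]
    nll, remaining = 0.0, sum(params)
    for p in params:
        nll -= math.log(p / remaining)  # stage-wise choice probability
        remaining -= p
    return nll
```

Minimizing this quantity over the sample orders is the maximum-likelihood estimation described above.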
26
absolute ranking function: ex: if {A, B, C} are sorted as A > B > C and C is replaced with D, giving {A, B, D}, then A must always be ranked higher than B; in other words, only D>A>B, A>D>B, or A>B>D is allowed.
relative ranking function: any ranking function other than an absolute one; the relative order of two objects may depend on the other objects being ranked.
If you know Arrow's impossibility theorem, the absolute property is related to its condition I (independence of irrelevant alternatives).
27
For IR or recommendation tasks, absolute ranking functions should be learned: for example, the fact that an apple is preferred to another item should not depend on which other objects are ranked together.
Only a few tasks are suited for relative ranking.
regression order model: sorting by scores yields an absolute ranking function; linear ordering yields a relative ranking function.
28
Learning from relevance feedback is a typical absolute ranking task.
ex: the ranked list for a query Q is 1: document A, 2: document B, 3: document C, 4: document D, 5: document E. The user scans this list from the top and selects the third document, C; documents A and B were checked but not selected. This behavior implies the relevance feedbacks C>A and C>B.
Object ranking methods can be used to update documents' relevance based on these feedbacks.
[Joachims 02, Radlinski+ 05]
29
Example of a relative ranking task: Multi-Document Summarization (MDS) [Bollegala+ 05]
Important sentences are extracted from multiple documents, and a summary is generated from them. Generating a summary requires sorting the sentences appropriately, so a ranking function is learned from samples of correctly sorted sentences.
features of sentences: chronological info, precedence, relevance among sentences
The appropriate order of a sentence is influenced by its relevance to the surrounding sentences, so absolute ranking functions are not appropriate for this task.
30
Two types of noise:
order noise: the positions of objects in an order are perturbed (ex: A ≻ C ≻ B instead of the noiseless order A ≻ B ≻ C)
attribute noise: the perturbation in attribute values xi = (xi1, . . . , xik)
31
[figure: vertical axis = prediction concordance (good/bad), horizontal axis = noise level (low/high); left panel = order noise (0%–10%), right panel = attribute noise (0%–160%); curves for ERR and SVOR]
Non-SVM-based methods are robust against order noise; SVM-based methods are robust against attribute noise [Kamishima+ 05].
32
SVM-based methods solve object ranking tasks as classification (A>B or A<B), where points correspond to object pairs in an attribute space:
attribute noise: points move in the attribute space; a slight change in features never influences the results if the point stays on the same side of the decision boundary.
order noise: an order in the samples is changed; the changed points become support vectors with high probability, and the results are seriously influenced.
33
Non-SVM-based methods (classes A>B vs. B>A):
order noise: samples are moved from B>A to A>B; the results are not influenced if the majority class between these two does not change.
attribute noise: any little change in features influences the loss function, due to the lack of robustness features like the hinge loss of SVMs.
34
Efficiency / Accuracy: we compared the prediction accuracies of the object ranking methods except for ListNet [Kamishima+ 05]. Though several differences were observed, matching the method to the target task is primarily important. The two SVM-based methods are slower than the non-SVM methods, and our ERR is fast in almost all cases.
Powerful linear model: linear models for ranking functions are more powerful than in standard regression or classification, because any monotonic transformation of a score function induces the same ranking, so a linear score function can represent any ranking whose score is a monotonic transform of a linear function.
35
In this talk, we:
define the object ranking task and discuss its relation with regression and ordinal regression
introduce four types of distributions for rankings: Thurstonian, paired comparison, distance-based, and multistage
show six methods for object ranking tasks: Cohen's method, RankBoost, SVOR (= RankingSVM), OrderSVM, ERR, and ListNet
propose the notion of absolute and relative ranking tasks
discuss the prediction accuracy of object ranking methods
SUSHI data: preferences in sushi surveyed by the ranking method: http://www.kamishima.net/sushi/
36
[Agresti 96] A. Agresti, "Categorical Data Analysis", John Wiley & Sons, 2nd ed. (1996)
[Arnold+ 92] B. C. Arnold et al., "A First Course in Order Statistics", John Wiley & Sons (1992)
[Babington Smith 50] B. Babington Smith, "Discussion on Professor Ross's Paper", JRSS (B), vol.12 (1950)
[Bell+ 07] R. M. Bell & Y. Koren, "Scalable Collaborative Filtering with Jointly Derived Neighborhood Interpolation Weights", ICDM2007
[Bradley+ 52] R. A. Bradley & M. E. Terry, "Rank Analysis of Incomplete Block Designs — I. The Method of Paired Comparisons", Biometrika, vol.39 (1952)
[Bollegala+ 05] D. Bollegala et al., "A Machine Learning Approach to Sentence Ordering for Multidocument Summarization and its Evaluation", IJCNLP2005
37
[Cao+ 07] Z. Cao et al., "Learning to Rank: From Pairwise Approach to Listwise Approach", ICML2007
[Cohen+ 99] W. W. Cohen et al., "Learning to Order Things", JAIR, vol.10 (1999)
[Cosley+ 03] D. Cosley et al., "Is Seeing Believing? How Recommender Interfaces Affect Users' Opinions", SIGCHI 2003
[Critchlow+ 91] D. E. Critchlow et al., "Probability Models on Rankings", Journal of Mathematical Psychology, vol.35 (1991)
[Freund+ 03] Y. Freund et al., "An Efficient Boosting Algorithm for Combining Preferences", JMLR, vol.4 (2003)
[Grötschel+ 84] M. Grötschel et al., "A Cutting Plane Algorithm for the Linear Ordering Problem", Operations Research, vol.32 (1984)
[Herbrich+ 98] R. Herbrich et al., "Learning Preference Relations for Information Retrieval", ICML1998 Workshop: Text Categorization and Machine Learning
38
[Herlocker+ 99] J. L. Herlocker et al., "An Algorithmic Framework for Performing Collaborative Filtering", SIGIR1999
[Joachims 02] T. Joachims, "Optimizing Search Engines Using Clickthrough Data", KDD2002
[Kamishima 03] T. Kamishima, "Nantonac Collaborative Filtering: Recommendation Based on Order Responses", KDD2003
[Kamishima+ 05] T. Kamishima et al., "Supervised Ordering — An Empirical Survey", ICDM2005
[Kamishima+ 06] T. Kamishima et al., "Nantonac Collaborative Filtering — Recommendation Based on Multiple Order Responses", DMSS Workshop 2006
[Kazawa+ 05] H. Kazawa et al., "Order SVM: a kernel method for …", Systems and Computers in Japan, vol.36 (2005)
39
[Mallows 57] C. L. Mallows, "Non-Null Ranking Models. I", Biometrika, vol.44 (1957)
[Marden 95] J. I. Marden, "Analyzing and Modeling Rank Data", Chapman & Hall (1995)
[McCullagh 80] P. McCullagh, "Regression Models for Ordinal Data", JRSS (B), vol.42 (1980)
[Thurstone 27] L. L. Thurstone, "A Law of Comparative Judgment", Psychological Review, vol.34 (1927)
[Plackett 75] R. L. Plackett, "The Analysis of Permutations", JRSS (C), vol.24 (1975)
[Radlinski+ 05] F. Radlinski & T. Joachims, "Query Chains: Learning to Rank from Implicit Feedback", KDD2005