SLIDE 1

Modeling User Behavior and Interactions

Lecture 3: Improving Ranking with Behavior Data

Eugene Agichtein Emory University


Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)

slide-2
SLIDE 2

Lecture 3 Plan

  • 1. Review: Learning to Rank
  • 2. Exploiting User Behavior for Ranking:

– Automatic relevance labels
– Enriching feature space

  • 3. Implementation and System Issues

– Dealing with scale
– Dealing with data sparseness

  • 4. New Directions

– Active learning
– Ranking for diversity

SLIDE 3

Review: Learning to Rank

  • Goal: instead of using fixed retrieval models, learn them:

– Usually: supervised learning on document/query pairs embedded in a high-dimensional feature space
– Labeled by relevance of the document to the query
– Features: provided by IR methods

  • Given training instances:

– (xq,d, yq,d) for q = {1..N}, d = {1 .. Nq}

  • Learn a ranking function

– f(xq,1, …, xq,Nq)
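As a concrete sketch of this setup, the fragment below builds a toy training set of (feature vector, label) pairs per query and fits a perceptron-style pairwise ranker. All names and the two-feature toy data are illustrative, not from the lecture.

```python
# Hypothetical sketch of the learning-to-rank setup: each (query, document)
# pair is a feature vector x_{q,d} with a relevance label y_{q,d}; a pairwise
# ranker learns weights w so that w.x_{q,i} > w.x_{q,j} whenever y_{q,i} > y_{q,j}.

def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))

def train_pairwise(training, dims, epochs=50, lr=0.1):
    """Perceptron-style pairwise ranker over per-query lists of (features, label)."""
    w = [0.0] * dims
    for _ in range(epochs):
        for docs in training.values():
            for xi, yi in docs:
                for xj, yj in docs:
                    if yi > yj and dot(w, xi) <= dot(w, xj):
                        # Misordered pair: nudge w toward the preferred document.
                        w = [wk + lr * (a - b) for wk, a, b in zip(w, xi, xj)]
    return w

# Toy data: feature = (BM25 score, PageRank); higher label = more relevant.
training = {
    "q1": [((2.0, 0.1), 2), ((1.0, 0.9), 1), ((0.2, 0.2), 0)],
    "q2": [((1.5, 0.3), 1), ((0.5, 0.5), 0)],
}
w = train_pairwise(training, dims=2)
ranked = sorted(training["q1"], key=lambda d: -dot(w, d[0]))
```

On this toy data the learned weights rank q1's documents in label order.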

SLIDE 4

Ordinal Regression Approaches

  • Learn multiple thresholds:

– Maintain T thresholds (b1, …, bT), b1 < b2 < … < bT => learn the model parameters + (b1, …, bT)
– Chu & Keerthi, "New Approaches to Support Vector Ordinal Regression." ICML 2005
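A minimal sketch of how learned thresholds turn a real-valued score into an ordinal grade; the threshold values here are made up.

```python
import bisect

# Threshold-based ordinal prediction: given a learned score f(x) and
# thresholds b1 < ... < bT, the grade is the number of thresholds the
# score exceeds (0..T, i.e. T+1 ordinal grades).

def ordinal_grade(score, thresholds):
    """Map a real-valued score to a grade in 0..T via sorted thresholds."""
    return bisect.bisect_right(thresholds, score)

thresholds = [-1.0, 0.5, 2.0]   # T = 3 thresholds => 4 ordinal grades
grades = [ordinal_grade(s, thresholds) for s in (-2.0, 0.0, 1.0, 3.0)]
```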

  • Learn multiple classifiers:

– Use T different training sets, train classifiers C1..CT, then combine (sum) their outputs
– T. Qin et al., "Ranking with Multiple Hyperplanes." SIGIR 2007

  • Optimize pairwise preferences:

– RankNet: Burges et al., "Learning to Rank Using Gradient Descent." ICML 2005

  • Optimize rank-based measures:

– Directly optimize (N)DCG via a local approximation of the gradient
– LambdaRank: C. Burges et al., "Learning to Rank with Non-Smooth Cost Functions." NIPS 2006
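The pairwise-preference objective can be illustrated with RankNet's cross-entropy loss on the score difference. This is a bare-bones sketch of the loss alone, not the full gradient-descent training loop.

```python
import math

# RankNet pairwise objective (Burges et al., ICML 2005): for a pair where
# document i should rank above document j, the model outputs scores s_i, s_j
# and the loss is the cross entropy of P(i > j) = sigmoid(s_i - s_j).

def ranknet_pair_loss(s_i, s_j):
    """Cross-entropy loss for a pair with target 'i preferred over j'."""
    p = 1.0 / (1.0 + math.exp(-(s_i - s_j)))   # modeled P(i > j)
    return -math.log(p)

well_ordered = ranknet_pair_loss(2.0, 0.0)   # s_i >> s_j: small loss
mis_ordered  = ranknet_pair_loss(0.0, 2.0)   # s_i << s_j: large loss
```

When the two scores are equal the loss is ln 2, and it grows as the pair becomes more misordered.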

SLIDE 5

Learning to Rank Summary

  • Many learning algorithms available to choose from
  • Require training data (feature vectors + labels)
  • Where does training data come from?


– "Expert" human judges (TREC, editors, …)
– Users: logs of user behavior

  • Rest of this lecture:

– Learning formulation and setup to train and use learning-to-rank algorithms

SLIDE 6

Approaches to Use Behavior Data

  • Use “clicks” as new training examples

– Joachims, KDD 2002
– Radlinski & Joachims, KDD 2005

  • Incorporate behavior data as additional features

– Richardson et al., WWW 2006
– Agichtein et al., SIGIR 2006
– Bilenko and White, WWW 2008
– Zhu and Mishne, KDD 2009

SLIDE 7

Recap: Available Behavior Data

SLIDE 8

Training Examples from Click Data

[ Joachims 2002 ]
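The slide's figure is not reproduced here, but the idea of harvesting training examples from clicks can be sketched with the "a clicked result is preferred over skipped results ranked above it" heuristic, in the spirit of Joachims (2002); the helper name is mine.

```python
# Extract pairwise training preferences from a single result page with clicks:
# each clicked result is preferred over every higher-ranked result that the
# user skipped ("click > skip above").

def preferences_from_clicks(ranking, clicked):
    """ranking: list of doc ids top-to-bottom; clicked: set of clicked doc ids."""
    prefs = []
    for pos, doc in enumerate(ranking):
        if doc in clicked:
            for above in ranking[:pos]:
                if above not in clicked:
                    prefs.append((doc, above))   # doc preferred over 'above'
    return prefs

# A click on the third result yields two preferences over the skipped results.
prefs = preferences_from_clicks(["d1", "d2", "d3", "d4"], clicked={"d3"})
```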

SLIDE 9

Loss Function

[ Joachims 2002 ]

SLIDE 10

Learned Retrieval Function

[ Joachims 2002 ]

SLIDE 11

Features

[ Joachims 2002 ]

SLIDE 12

Results

[ Joachims 2002 ]

Summary:
– The learned ranker outperforms all base methods in the experiment
– Learning from clickthrough data is possible
– Relative preferences are useful training data

SLIDE 13

Extension: Query Chains

[Radlinski & Joachims, KDD 2005]

SLIDE 14

Query Chains (Cont’d)

[Radlinski & Joachims, KDD 2005]

SLIDE 15

Query Chains (Results)

[Radlinski & Joachims, KDD 2005]

  • Query Chains add slight improvement over clicks

SLIDE 16

Lecture 3 Plan

  • Review: Learning to Rank
  • Exploiting User Behavior for Ranking:

– Automatic relevance labels
– Enriching the ranking feature space

1. Implementation and System Issues

– Dealing with scale
– Dealing with data sparseness

2. New Directions

– Active learning
– Ranking for diversity
– Fun and games

SLIDE 17

Incorporating Behavior for Static Rank

[Richardson et al., WWW2006]

[Diagram: Web → crawl → build index → answer queries; static rank informs which pages to crawl, the (efficient) index order, and dynamic ranking]

SLIDE 18

fRank: Machine Learning for Static Ranking

[Richardson et al., WWW2006]

[Diagram: page features from the Web (words on page, # inlinks, PageRank, contains 'Viagra', …) feed a machine-learning model that outputs fRank]

SLIDE 19

Features: Summary

[Richardson et al., WWW2006]

  • Popularity
  • Anchor text and inlinks
  • Page
  • Domain
  • PageRank

SLIDE 20

Features: Popularity

[Richardson et al., WWW2006]

  • Data from MSN Toolbar
  • Smoothed

Function     Example
Exact URL    cnn.com/2005/tech/wikipedia.html?v=mobile
No Params    cnn.com/2005/tech/wikipedia.html
Page         wikipedia.html
URL-1        cnn.com/2005/tech
URL-2        cnn.com/2005
…
Domain       cnn.com
Domain+1     cnn.com/2005
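A hypothetical sketch of the URL generalization functions in the popularity-feature table (the function and key names are mine, not the paper's):

```python
from urllib.parse import urlsplit

# Derive the back-off variants of a URL used to aggregate popularity counts:
# exact URL, parameters stripped, page name, progressively shorter path
# prefixes (URL-1, URL-2, ...), and the domain.

def url_variants(url):
    parts = urlsplit("//" + url)               # treat the input as scheme-less
    path_segs = [s for s in parts.path.split("/") if s]
    variants = {
        "exact": url,
        "no_params": parts.netloc + parts.path,
        "page": path_segs[-1] if path_segs else parts.netloc,
        "domain": parts.netloc,
    }
    # URL-1, URL-2, ...: drop one trailing path segment at a time.
    for k in range(1, len(path_segs)):
        variants[f"url-{k}"] = parts.netloc + "/" + "/".join(path_segs[:-k])
    return variants

v = url_variants("cnn.com/2005/tech/wikipedia.html?v=mobile")
```

Applied to the table's example URL, this reproduces the No Params, Page, URL-1, URL-2, and Domain rows.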

SLIDE 21

Features: Anchor, Page, Domain

[Richardson et al., WWW2006]

  • Anchor text and inlinks

– Total amount of anchor text, unique anchor text words, number of inlinks, etc.

  • Page

– 8 Features based on page alone: Words in body, frequency of most common term, etc.

  • Domain

– Averages in domain: average #outlinks, etc.

SLIDE 22

Data

[Richardson et al., WWW2006]

  • Human judgments
  • 1. Randomly choose query from MSN users
  • 2. Choose top URLs returned by the search engine
  • 3. Rate quality of URL for that query
  • 500k (Query,URL,Rating) tuples
  • Judged URLs biased to good pages

– Results apply to index ordering, relevance
– Crawl ordering requires an unbiased sample

SLIDE 23

Becoming Query Independent

[Richardson et al., WWW2006]

  • (Query,URL,Rating) → (URL,Rating)


  • Take maximum rating for each URL

– Good page if relevant for at least one query

  • Queries are common → likely correct index order and relevance order

SLIDE 24

Measure

[Richardson et al., WWW2006]

  • Goal: find the static ranking algorithm that most correctly reproduces the judged order

pairwise accuracy = |H ∩ S| / |H|

where H is the set of pairs ordered by the human judges and S is the set of pairs ordered the same way by the static ranking.

  • Fraction of pairs that, when the humans claim one is better than the other, the static rank algorithm orders correctly
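A direct sketch of this measure, assuming integer human ratings and real-valued static-rank scores:

```python
from itertools import combinations

# Pairwise accuracy: over all document pairs the human judges order strictly
# (the set H), count the fraction the static ranking scores in the same
# direction (H intersected with S).

def pairwise_accuracy(human_ratings, static_scores):
    """Both arguments map doc id -> value; higher is better."""
    agree, total = 0, 0
    for a, b in combinations(human_ratings, 2):
        if human_ratings[a] == human_ratings[b]:
            continue                     # only strictly ordered pairs are in H
        total += 1
        human_prefers_a = human_ratings[a] > human_ratings[b]
        static_prefers_a = static_scores[a] > static_scores[b]
        agree += (human_prefers_a == static_prefers_a)
    return agree / total

# The static scores invert one of the three judged pairs (d2 vs. d3).
acc = pairwise_accuracy({"d1": 3, "d2": 2, "d3": 1},
                        {"d1": 0.9, "d2": 0.1, "d3": 0.5})
```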

SLIDE 25

RankNet, Burges et al., ICML 2005

[Richardson et al., WWW2006]

[Diagram: Feature Vector and Label in; NN output; the error is a function of the label and the output]

SLIDE 26

RankNet [Burges et al. 2005]

[Richardson et al., WWW2006]

  • Training Phase:

– Present pair of vectors with label1 > label2

SLIDE 27

RankNet [Burges et al. 2005]

[Richardson et al., WWW2006]

  • Training Phase:

– Present pair of vectors with label1 > label2

[Diagram: Feature Vector1, Label1 → NN output 1]

SLIDE 28

RankNet [Burges et al. 2005]

[Richardson et al., WWW2006]

  • Training Phase:

– Present pair of vectors with label1 > label2

[Diagram: Feature Vector2, Label2 → NN output 2]

SLIDE 29

RankNet [Burges et al. 2005]

[Richardson et al., WWW2006]

  • Training Phase:

– Present pair of vectors with label1 > label2

NN output 1, NN output 2; the error is a function of both outputs (desire output1 > output2)

SLIDE 30

RankNet [Burges et al. 2005]

[Richardson et al., WWW2006]

  • Test Phase:

– Present individual vector and get score

[Diagram: Feature Vector → NN output]

SLIDE 31

Experimental Methodology

[Richardson et al., WWW2006]

  • Split ratings

– 84% training set
– 8% validation set
– 8% test set

  • Training set: train RankNet
  • Validation set: choose the best net
  • Test set: measure pairwise accuracy

SLIDE 32

Accuracy of Each Feature Set

[Richardson et al., WWW2006]

Feature Set    Accuracy (%)
PageRank       56.70
Popularity     60.82
Anchor         59.09
Page           63.93
Domain         59.03
All Features   67.43

  • Accuracy with only the given feature set
  • Every feature set outperformed PageRank

  • Best feature sets contain no link information
SLIDE 33

Qualitative Evaluation

[Richardson et al., WWW2006]

  • Top ten URLs for PageRank vs. fRank

PageRank                        fRank
google.com                      google.com
apple.com/quicktime/download    yahoo.com
amazon.com                      americanexpress.com
yahoo.com                       hp.com
microsoft.com/windows/ie        target.com
apple.com/quicktime             bestbuy.com
mapquest.com                    dell.com
ebay.com                        autotrader.com
mozilla.org/products/firefox    dogpile.com
ftc.gov                         bankofamerica.com

(The PageRank list is technology-oriented; the fRank list is consumer-oriented.)
SLIDE 34

Behavior for Dynamic Ranking

[Agichtein et al., SIGIR2006]

Sample behavior features (from Lecture 2):

Group         Feature             Description
Presentation  ResultPosition      Position of the URL in the current ranking
Presentation  QueryTitleOverlap   Fraction of query terms in the result title
Clickthrough  DeliberationTime    Seconds between the query and the first click
Clickthrough  ClickFrequency      Fraction of all clicks landing on the page
Clickthrough  ClickDeviation      Deviation from the expected click frequency
Browsing      DwellTime           Result page dwell time
Browsing      DwellTimeDeviation  Deviation from the expected dwell time for the query

SLIDE 35

Feature Merging: Details

[Agichtein et al., SIGIR2006]

Query: SIGIR, fake results with fake feature values:

Result URL           BM25  PageRank  …  Clicks  DwellTime  …
sigir2007.org        2.4   0.5       …  ?       ?          …
sigir2006.org        1.4   1.1       …  150     145.2      …
acm.org/sigs/sigir/  1.2   2         …  60      23.5       …

  • Value scaling:

– Binning vs. log-linear vs. linear (e.g., μ=0, σ=1)

  • Missing values:

– 0? (what does 0 mean for feature values normalized so that μ=0?)

  • "Real-time" use: significant architecture/system problems
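One plausible reading of the scaling and missing-value choices above (an assumption, not the paper's exact recipe): z-score each feature over the values that are present, then impute missing cells as 0, i.e. the mean.

```python
import math

# Z-score a feature column to mean 0, std 1, excluding missing values (None)
# from the statistics. Imputing 0 after scaling means "assume the average" --
# one reading of the slide's question about what 0 means for normalized features.

def zscore_with_missing(values):
    present = [v for v in values if v is not None]
    mu = sum(present) / len(present)
    sigma = math.sqrt(sum((v - mu) ** 2 for v in present) / len(present)) or 1.0
    return [0.0 if v is None else (v - mu) / sigma for v in values]

# The '?' cells from the table become 0.0 after normalization.
clicks = zscore_with_missing([None, 150, 60])
```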

SLIDE 36

Results for Incorporating Behavior into Ranking

[Agichtein et al., SIGIR2006]

[Figure: NDCG at K = 1..10 for RN, Rerank-All, and RN+All]

Method    MAP    Gain
RN        0.270
RN+ALL    0.321  0.052 (19.13%)
BM25      0.236
BM25+ALL  0.292  0.056 (23.71%)

SLIDE 37

Which Queries Benefit Most

[Agichtein et al., SIGIR2006]

[Figure: query frequency and average NDCG gain, bucketed by original ranking quality]
Most gains are for queries with poor original ranking

SLIDE 38

Lecture 3 Plan

  • Review: Learning to Rank
  • Exploiting User Behavior for Ranking:
  • Automatic relevance labels
  • Enriching feature space

1. Implementation and System Issues

– Dealing with data sparseness
– Dealing with scale

2. New Directions

– Active learning
– Ranking for diversity
– Fun and games

SLIDE 39

Extension to Unseen Queries/Documents: Search Trails

[Bilenko and White, WWW 2008]

  • Trails start with a search engine query
  • Continue until a terminating event

– Another search
– Visit to an unrelated site (social networks, webmail)
– Timeout, browser homepage, browser closing
SLIDE 40

Probabilistic Model

[Bilenko and White, WWW 2008]

  • IR via language modeling [Zhai-Lafferty, Lavrenko]

  • Query-term distribution gives more mass to rare terms
  • Term-website weights combine dwell time and counts
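A hedged sketch of this model as I read the slide (the exact formulas are in the paper; the IDF-style term weighting, helper names, and toy numbers below are my assumptions):

```python
import math

# Sketch of a trail-based relevance model: query terms are weighted by rarity
# (IDF-like, normalized to sum to 1), and each term-website weight stands in
# for a dwell-time-and-count association learned from search trails.

def term_weights(query_terms, doc_freq, n_trails):
    """Give more mass to rare terms via an IDF-style weight."""
    raw = {t: math.log(n_trails / doc_freq.get(t, 1)) for t in query_terms}
    z = sum(raw.values())
    return {t: w / z for t, w in raw.items()}

def site_relevance(query_terms, doc_freq, n_trails, term_site_weight, site):
    w = term_weights(query_terms, doc_freq, n_trails)
    return sum(w[t] * term_site_weight.get((t, site), 0.0) for t in query_terms)

# Toy numbers: 'jaguar' is rarer than 'car', so it dominates the score.
doc_freq = {"car": 1000, "jaguar": 10}
tsw = {("jaguar", "jaguar.com"): 0.9, ("car", "jaguar.com"): 0.2}
rel = site_relevance(["car", "jaguar"], doc_freq, 10000, tsw, "jaguar.com")
```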

SLIDE 41

Results: Learning to Rank

[Bilenko and White, WWW 2008]

Add Rel(q, di) as a feature to RankNet

[Figure: NDCG@1, NDCG@3, NDCG@10 for Baseline, Baseline+Heuristic, Baseline+Probabilistic, and Baseline+Probabilistic+RW]

SLIDE 42

BBM: Bayesian Browsing Model from Petabyte-scale Data, Liu et al, KDD 2009

Scalability: (Peta?)bytes of Click Data

[Diagram: for a query with results URL1..URL4, latent relevance variables S1..S4 and snippet examination events E1..E4 generate the observed clickthroughs C1..C4]

BBM: Bayesian Browsing Model

SLIDE 43

Training BBM: One-Pass Counting

BBM: Bayesian Browsing Model from Petabyte-scale Data, Liu et al., KDD 2009

[Diagram: one-pass counting to find R_j]

SLIDE 44

Training BBM on MapReduce

BBM: Bayesian Browsing Model from Petabyte-scale Data, Liu et al, KDD 2009

  • Map: emit((q, u), idx)
  • Reduce: construct the count vector
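An in-memory sketch of this map/reduce counting step. Note the real BBM index also encodes the distance from the preceding click; the simplified index here is positional only, an assumption for illustration.

```python
from collections import defaultdict

# Map phase: emit ((query, url), index) for every impression.
# Reduce phase: build a per-(query, url) count vector plus a click total.

def map_phase(sessions):
    """sessions: list of (query, shown_urls, clicked_urls)."""
    for query, shown, clicked in sessions:
        for idx, url in enumerate(shown):
            yield (query, url), (idx, url in clicked)

def reduce_phase(pairs, n_positions=4):
    counts = defaultdict(lambda: [0] * n_positions)
    clicks = defaultdict(int)
    for (q, u), (idx, was_clicked) in pairs:
        counts[(q, u)][idx] += 1       # impressions of u at this index
        clicks[(q, u)] += was_clicked  # total clicks on (q, u)
    return counts, clicks

sessions = [("sigir", ["d1", "d2"], {"d2"}), ("sigir", ["d2", "d1"], {"d2"})]
counts, clicks = reduce_phase(map_phase(sessions))
```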

SLIDE 45

Model Comparison on Efficiency

BBM: Bayesian Browsing Model from Petabyte-scale Data, Liu et al, KDD 2009

57 times faster

SLIDE 46

Large-Scale Experiment

BBM: Bayesian Browsing Model from Petabyte-scale Data, Liu et al, KDD 2009

  • Setup:

– 8 weeks of data, 8 jobs
– Job k uses the first k weeks of data

  • Experiment platform

– SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets [Chaiken et al, VLDB’08]

SLIDE 47

Scalability of BBM

BBM: Bayesian Browsing Model from Petabyte-scale Data, Liu et al, KDD 2009

  • Increasing computation load

– More queries, more URLs, more impressions

  • Near-constant elapsed time: 3 hours
  • Scans 265 terabytes of data
  • Full posteriors for 1.15 billion (query, URL) pairs

[Figure: computation load and elapsed time on SCOPE]

SLIDE 48

Lecture 3 Plan

Review: Learning to Rank
Exploiting User Behavior for Ranking:

  • Automatic relevance labels
  • Enriching feature space

Implementation and System Issues

  • Dealing with data sparseness
  • Dealing with Scale

New Directions

  • Active learning
  • Ranking for diversity

SLIDE 49

New Direction: Active Learning

[Radlinski & Joachims, KDD 2007]

  • Goal: learn the relevances with as little training data as possible.

  • Search involves a three-step process:
  • 1. Given relevance estimates, pick a ranking to display to users.
  • 2. Given a ranking, users provide feedback: user clicks provide pairwise relevance judgments.
  • 3. Given feedback, update the relevance estimates.

SLIDE 50

Overview of Approach

[Radlinski & Joachims, KDD 2007]

  • Available information:

1. Have an estimate of the relevance of each result.
2. Can obtain pairwise comparisons of the top few results.
3. Do not have absolute relevance information.

  • Goal: Learn the document relevance quickly.
  • Will address four questions:

1. How to represent knowledge about document relevance.
2. How to maintain this knowledge as we collect data.
3. Given our knowledge, what is the best ranking?
4. What rankings do we show users to get useful data?

SLIDE 51

1: Representing Document Relevance

[Radlinski & Joachims, KDD 2007]

  • Given a query q, let M* = (µ*_1, …, µ*_|C|) be the true relevance values of the documents.
  • Model knowledge of M* with a Bayesian posterior:

P(M|D) = P(D|M) P(M) / P(D)

  • Assume P(M|D) is a spherical multivariate normal:

P(M|D) = N(ν_1, …, ν_|C|; σ_1², …, σ_|C|²)

SLIDE 52

1: Representing Document Relevance

[Radlinski & Joachims, KDD 2007]

  • Given a fixed query, maintain knowledge about relevance as clicks are observed.

– This tells us which documents we are sure about, and which ones need more data.

SLIDE 53

2: Maintaining P(M|D)

[Radlinski & Joachims, KDD 2007]

Model noisy pairwise judgments with the Bradley-Terry model ['52]. Adding a Gaussian prior, apply an off-the-shelf algorithm to maintain the posterior: the Glicko rating system, commonly used for chess [Glickman 1999].
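A sketch of the Bradley-Terry building block (Glicko additionally tracks per-item uncertainty, which is omitted here; the update rule below is a generic gradient step, not Glicko's exact formula):

```python
import math

# Bradley-Terry model: P(i beats j) = e^{mu_i} / (e^{mu_i} + e^{mu_j}).
# Fit the latent ratings mu by gradient ascent on the log-likelihood of
# observed pairwise outcomes (here: pairwise click judgments).

def p_beats(mu_i, mu_j):
    return 1.0 / (1.0 + math.exp(mu_j - mu_i))

def update(mu, outcomes, lr=0.5, steps=100):
    """outcomes: list of (winner, loser) document ids; mu: dict id -> rating."""
    for _ in range(steps):
        for w, l in outcomes:
            g = 1.0 - p_beats(mu[w], mu[l])   # gradient of the observed win
            mu[w] += lr * g
            mu[l] -= lr * g
    return mu

# One observed preference d1 > d2 pushes the ratings apart.
mu = update({"d1": 0.0, "d2": 0.0}, [("d1", "d2")])
```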

SLIDE 54

3: Ranking (Inference)

[Radlinski & Joachims, KDD 2007]

  • Want to assign relevances M = (µ_1, …, µ_|C|) such that L(M, M*) is small, but M* is unknown.

  • Minimize expected loss (pairwise):

SLIDE 55

4: Getting Useful Data

[Radlinski & Joachims, KDD 2007]

  • Problem: we could present the ranking based on the current best estimate of relevance.

– Then the data we get would always be about the documents already ranked highly.

  • Instead, optimize the ranking shown to users:
  • 1. Pick the top two docs to minimize future loss
  • 2. Append the current best-estimate ranking.

SLIDE 56

4: Exploration Strategies

[Radlinski & Joachims, KDD 2007]

SLIDE 57

4: Loss Functions

[Radlinski & Joachims, KDD 2007]

SLIDE 58

Results: TREC Data

[Radlinski & Joachims, KDD 2007]

Optimizing for relevance estimates better than for ordering

SLIDE 59

Need for Diversity (in IR)

[Predicting Diverse Subsets Using Structural SVMs, Y. Yue and Joachims, ICML 2008]

  • Ambiguous Queries

– Users with different information needs issuing the same textual query ("Jaguar")

  • Informational (Exploratory) Queries

– User interested in "a specific detail or entire breadth of knowledge available" [Swaminathan et al., 2008]
– Want results with high information diversity

SLIDE 60

Optimizing for Diversity

[Predicting Diverse Subsets Using Structural SVMs, Y. Yue and Joachims, ICML 2008]

  • Long interest in the IR community
  • Requires inter-document dependencies

– Impossible given current learning-to-rank methods

  • Problem: no consensus on how to measure diversity.

– Formulate as predicting diverse subsets

  • Experiment:

– Use training data with explicitly labeled subtopics (TREC 6-8 Interactive Track)
– Use a loss function to encode subtopic loss
– Train using structural SVMs [Tsochantaridis et al., 2005]

SLIDE 61

Representing Diversity

[Predicting Diverse Subsets Using Structural SVMs, Y. Yue and Joachims, ICML 2008]

  • Existing datasets with manual subtopic labels

– E.g., "Use of robots in the world today"

  • Nanorobots
  • Space mission robots
  • Underwater robots

– Manual partitioning of the total information regarding a query
– Relatively reliable

SLIDE 62

Example

[Predicting Diverse Subsets Using Structural SVMs, Y. Yue and Joachims, ICML 2008]

  • Choose K documents with maximal information coverage.
  • For K = 3, the optimal set is {D1, D2, D10}

SLIDE 63

Maximizing Subtopic Coverage

[Predicting Diverse Subsets Using Structural SVMs, Y. Yue and Joachims, ICML 2008]

  • Goal: select K documents which collectively cover as many subtopics as possible.
  • Perfect selection takes (n choose K) time.

– Set cover problem.

  • Greedy selection gives a (1 - 1/e)-approximation bound.

– Special case of Max Coverage (Khuller et al., 1997)
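The greedy selection can be sketched directly; the document-to-subtopic assignments below are made up for illustration.

```python
# Greedy Max Coverage: repeatedly pick the document that covers the most
# not-yet-covered subtopics. Achieves the (1 - 1/e) approximation bound.

def greedy_cover(doc_subtopics, k):
    """doc_subtopics: dict doc id -> set of subtopic ids; pick k docs."""
    covered, chosen = set(), []
    for _ in range(k):
        candidates = [d for d in doc_subtopics if d not in chosen]
        best = max(candidates, key=lambda d: len(doc_subtopics[d] - covered))
        chosen.append(best)
        covered |= doc_subtopics[best]
    return chosen, covered

docs = {"D1": {1, 2, 3}, "D2": {3, 4}, "D3": {4, 5}, "D4": {1, 2}}
chosen, covered = greedy_cover(docs, k=2)
```

With k=2 the greedy picks D1 first (3 new subtopics), then D3 (2 new), covering all five subtopics.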

SLIDE 64

Weighted Word Coverage

[Predicting Diverse Subsets Using Structural SVMs, Y. Yue and Joachims, ICML 2008]

  • More distinct words = more information

– Weight word importance
– Does not depend on human labels

  • Goal: select K documents which collectively cover as many distinct (weighted) words as possible

– Greedy selection also yields the (1 - 1/e) bound.
– Need to find a good weighting function (a learning problem).

SLIDE 65

Example

[Predicting Diverse Subsets Using Structural SVMs, Y. Yue and Joachims, ICML 2008]

Word benefits: V1 = 1, V2 = 2, V3 = 3, V4 = 4, V5 = 5

Document word counts:

      V1  V2  V3  V4  V5
D1            X   X   X
D2        X       X   X
D3    X   X   X   X

Marginal benefit:

        D1  D2  D3  Best
Iter 1  12  11  10  D1
Iter 2

SLIDE 66

Example (cont’d)

[Predicting Diverse Subsets Using Structural SVMs, Y. Yue and Joachims, ICML 2008]

Word benefits: V1 = 1, V2 = 2, V3 = 3, V4 = 4, V5 = 5

Document word counts:

      V1  V2  V3  V4  V5
D1            X   X   X
D2        X       X   X
D3    X   X   X   X

Marginal benefit:

        D1  D2  D3  Best
Iter 1  12  11  10  D1
Iter 2  --   2   3  D3
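The marginal-benefit computation in the example can be reproduced with a few lines; the document-word sets are read off the slide's example table (D1 = {V3, V4, V5}, etc.).

```python
# Greedy weighted word coverage: each not-yet-covered word V contributes its
# weight to a document's marginal benefit; pick the highest at each iteration.

def marginal_benefit(doc_words, covered, weights):
    return sum(weights[w] for w in doc_words - covered)

def greedy_weighted(docs, weights, k):
    covered, picks, trace = set(), [], []
    for _ in range(k):
        gains = {d: marginal_benefit(ws, covered, weights)
                 for d, ws in docs.items() if d not in picks}
        best = max(gains, key=gains.get)
        trace.append((dict(gains), best))   # record the table row
        picks.append(best)
        covered |= docs[best]
    return picks, trace

weights = {"V1": 1, "V2": 2, "V3": 3, "V4": 4, "V5": 5}
docs = {"D1": {"V3", "V4", "V5"}, "D2": {"V2", "V4", "V5"},
        "D3": {"V1", "V2", "V3", "V4"}}
picks, trace = greedy_weighted(docs, weights, k=2)
```

The trace matches the slide's table: iteration 1 gives benefits 12/11/10 and picks D1; iteration 2 gives D2 = 2, D3 = 3 and picks D3.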

SLIDE 67

Results: TREC data

[Predicting Diverse Subsets Using Structural SVMs Y. Yue and Joachims, ICML 2008]

  • 12/4/1 train/validation/test split

– Approx. 500 documents in the training set

  • Permuted until all 17 queries were tested once
  • Set K = 5 (some queries have very few documents)
  • SVM-div: uses term-frequency thresholds to define importance levels
  • SVM-div2: in addition uses TF-IDF thresholds

SLIDE 68

Results: TREC data

[Predicting Diverse Subsets Using Structural SVMs Y. Yue and Joachims, ICML 2008]

Method            Loss
Random            0.469
Okapi             0.472
Unweighted Model  0.471
Essential Pages   0.434
SVM-div           0.349
SVM-div2          0.382

Methods                   W / T / L
SVM-div vs. Ess. Pages    14 / 0 / 3 **
SVM-div2 vs. Ess. Pages   13 / 0 / 4
SVM-div vs. SVM-div2      9 / 6 / 2

SLIDE 69

Results: TREC data

[Predicting Diverse Subsets Using Structural SVMs Y. Yue and Joachims, ICML 2008]

Can expect further benefit from having more training data.

SLIDE 70

Summary

Predicting Diverse Subsets Using Structural SVMs Y. Yue and Joachims, ICML 2008

  • Formulated diversified retrieval as predicting diverse subsets

– Efficient training and prediction algorithms

  • Used weighted word coverage as a proxy for information coverage.

  • Encode diversity criteria using loss function

– Weighted subtopic loss

http://projects.yisongyue.com/svmdiv/

SLIDE 71

Lecture 3 Summary

Review: Learning to Rank
Exploiting User Behavior for Ranking:

  • Automatic relevance labels
  • Enriching feature space

Implementation and System Issues

  • Dealing with data sparseness
  • Dealing with Scale

New Directions

  • Active learning
  • Ranking for diversity

SLIDE 72

Key References and Further Reading

Joachims, T. Optimizing search engines using clickthrough data. KDD 2002.
Agichtein, E., Brill, E., and Dumais, S. Improving web search ranking by incorporating user behavior information. SIGIR 2006.
Radlinski, F. and Joachims, T. Query chains: learning to rank from implicit feedback. KDD 2005.
Radlinski, F. and Joachims, T. Active exploration for learning rankings from clickthrough data. KDD 2007.
Bilenko, M. and White, R. Mining the search trails of surfing crowds: identifying relevant websites from user activity. WWW 2008.
Yue, Y. and Joachims, T. Predicting diverse subsets using structural SVMs. ICML 2008.
