Online Learning to Rank with Features Authors: Shuai Li, Tor - PowerPoint PPT Presentation

Online Learning to Rank with Features Authors: Shuai Li, Tor Lattimore, Csaba Szepesvári The Chinese University of Hong Kong DeepMind University of Alberta

Learning to Rank Amazon, YouTube, Facebook, Netflix, Taobao 1

Online Learning to Rank • Show the user the list C tk K • Objective: Maximize the expected number of clicks 2 • There are L items and K ≤ L positions • At each time t = 1 , 2 , . . . , • Choose an ordered list A t = ( a t 1 , . . . , a t K ) • Receive click feedback C t 1 , . . . , C tK ∈ { 0 , 1 } , per position [ T ] ∑ ∑ E t = 1 k = 1

Click Models • Position-Based Model (PBM) click model • Make as few assumptions as possible about the • Generic model and position bias position k can be factored into item attractiveness • Click models describe how users interact with item • Assumes the user click probability on an item a of after click • Further assumes there is a satisfaction probability • Dependent Click Model (DCM) stops position K , clicks at the first satisfying item and • Assumes the user checks the list from position 1 to • Cascade Model (CM) lists 3 ✗ ✓ ✗ ✓ ✗

RecurRank the examination probability, which satisfies reasonable assumptions • Use first position for exploration • Use remaining positions for exploitation, rank best items first • Split items and positions when the phase ends • Recursively call the algorithm with increased phase 4 • Each item a is represented by a feature vector x a ∈ R d • The attractiveness of item a is α ( a ) = θ ⊤ x a • Click probability factors: P t ( C ti = 1 ) = α ( a t i ) χ ( A t , i ) where χ is • RecurRank (Recursive Ranking) • For each phase ℓ

Example 4 Instance 2 Instance 3 t 1 1 3 3 a 1 a 2 a 3 Instance 4 t 2 3 5 a 8 a 4 a 5 3 6 8 a 6 a 7 a 8 a 12 Instance 5 Instance 6 t 3 a 25 a 4 1 1 8 a 1 a 8 8 t Instance 1 a 50 1 a 3 4 2 3 5 a 2 a 1 2 A || ℓ = 1 �� · · · · · · ��

Example t 2 1 a 8 a 25 Instance 2 Instance 3 t 1 1 3 3 a 1 a 2 a 3 Instance 4 3 4 4 5 a 4 a 5 3 6 8 a 6 a 7 a 8 a 12 Instance 5 Instance 6 t 3 8 a 4 5 a 2 8 a 1 a 8 a 50 t Instance 1 1 3 a 1 1 a 3 A || ℓ = 2 �� A || ℓ = 1 �� · · A · || ℓ = 2 �� · · · · · · �� · · · ��

Example t 2 a 4 1 a 8 a 25 Instance 2 Instance 3 t 1 1 3 a 1 a 2 a 3 Instance 4 3 4 4 5 a 4 a 5 3 6 8 a 6 a 7 a 8 a 12 Instance 5 Instance 6 t 3 8 5 3 a 2 t Instance 1 1 1 a 8 a 1 a 50 a 1 a 3 8 A A || || ℓ = 2 ℓ = 3 �� A || ℓ = 1 �� · · A · || ℓ = 2 �� · · · · · · �� · · · ��

Example t 2 a 4 a 8 1 Instance 2 Instance 3 t 1 1 3 a 1 a 2 a 3 Instance 4 4 4 5 a 4 a 5 6 8 a 6 a 7 a 8 a 12 Instance 5 Instance 6 t 3 8 a 25 5 a 1 a 50 t Instance 1 1 1 3 a 1 a 8 a 3 8 a 2 A A || || ℓ = 2 ℓ = 3 �� A || ℓ = 1 �� A || ℓ = 3 �� · · A · || ℓ = 2 �� A · || · · ℓ = 3 �� · · · �� · · · · · · ��

Example Instance 4 a 4 a 8 a 25 1 Instance 2 Instance 3 t 1 1 3 a 1 a 2 a 3 t 2 4 4 5 a 4 a 5 6 8 a 6 a 7 a 8 a 12 Instance 5 Instance 6 t 3 8 5 a 8 a 50 t Instance 1 a 3 8 1 a 2 1 a 1 a 1 3 A A || || ℓ = 2 ℓ = 3 �� · · · A || ℓ = 1 �� A || ℓ = 3 �� · · · · · A · || ℓ = 2 �� A · || · · ℓ = 3 �� · · · �� · · · · · · · · · ��

(a) CM (b) PBM 10 2 700k 10 1 600k 10 0 500k Regret Regret 400k 10 −1 300k 10 −2 200k 10 −3 100k 10 −4 0 0 50k 100k 150k 200k 0 500k 1m 1.5m 2m Time t Time t —CascadeLinUCB —RecurRank —TopRank Results • Regret bound • Improves over existing bound O 6 √ R ( T ) = O ( K dT log( LT )) (√ ) K 3 LT log( T )

Results • Regret bound —TopRank —CascadeLinUCB —RecurRank 6 • Improves over existing bound O √ R ( T ) = O ( K dT log( LT )) (√ ) K 3 LT log( T ) (a) CM (b) PBM 10 2 700k 10 1 600k 10 0 500k Regret Regret 400k 10 −1 300k 10 −2 200k 10 −3 100k 10 −4 0 0 50k 100k 150k 200k 0 500k 1m 1.5m 2m Time t Time t

Thank you! 6

References i Sumeet Katariya, Branislav Kveton, Csaba Szepesvari, and Zheng Wen. Dcm bandits: Learning to rank with multiple clicks. In International Conference on Machine Learning , pages 1215–1224, 2016. Branislav Kveton, Csaba Szepesvari, Zheng Wen, and Azin Ashkan. Cascading bandits: Learning to rank in the cascade model. In International Conference on Machine Learning , pages 767–776, 2015. Paul Lagrée, Claire Vernade, and Olivier Cappe. Multiple-play bandits in the position-based model. In Advances in Neural Information Processing Systems , pages 1597–1605, 2016. 7

References ii Tor Lattimore, Branislav Kveton, Shuai Li, and Csaba Szepesvari. Toprank: A practical algorithm for online stochastic ranking. In The Conference on Neural Information Processing Systems , 2018. Shuai Li, Tor Lattimore, and Csaba Szepesvári. Online learning to rank with features. arXiv preprint arXiv:1810.02567 , 2018. Shuai Li, Baoxiang Wang, Shengyu Zhang, and Wei Chen. Contextual combinatorial cascading bandits. In International Conference on Machine Learning , pages 1245–1253, 2016. Shuai Li and Shengyu Zhang. Online clustering of contextual cascading bandits. In The AAAI Conference on Artificial Intelligence , 2018. 8

References iii Weiwen Liu, Shuai Li, and Shengyu Zhang. Contextual dependent click bandit algorithm for web recommendation. In International Computing and Combinatorics Conference , pages 39–50. Springer, 2018. Masrour Zoghi, Tomas Tunys, Mohammad Ghavamzadeh, Branislav Kveton, Csaba Szepesvari, and Zheng Wen. Online learning to rank in stochastic click models. In International Conference on Machine Learning , pages 4199–4208, 2017. 9

References iv Shi Zong, Hao Ni, Kenny Sung, Nan Rosemary Ke, Zheng Wen, and Branislav Kveton. Cascading bandits for large-scale recommendation problems. In Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence , pages 835–844. AUAI Press, 2016. 10

Online Learning to Rank with Features Authors: Shuai Li, Tor - PowerPoint PPT Presentation

Online Learning to Rank with Features Authors: Shuai Li, Tor Lattimore, Csaba Szepesvri The Chinese University of Hong Kong DeepMind University of Alberta Learning to Rank Amazon, YouTube, Facebook, Netflix, Taobao 1 Online Learning to

2 3 4 5 8 9 MINNEAPOLIS MILWAUKEE MSA RANK #16 MSA RANK #39 CHICAGO MSA RANK #3

On the minimum rank of a graph Jisu Jeong June 21, 2013 Jisu Jeong On the minimum rank of a

COMPANY PROFILE WATER FEATURES 1 WATER FEATURES 2 WATER FEATURES 3 WATER FEATURES 4 WATER

10. Learning to Rank Outline 10.1. Why Learning to Rank (LeToR)? 10.2. Pointwise, Pairwise,

A new family of maximum rank distance codes or: Maximum rank distance codes and finite semifields

1 SVD applications: rank, column, row, and null spaces Rank : the rank of a matrix is equal to:

Online Learning Lorenzo Rosasco MIT, 9.520 L. Rosasco Online Learning About this class Goal

Learning to Rank Learning to Rank with Partially-Labeled Data with Partially-Labeled Data Kevin

Cross-Domain Learning-to-rank with SVM Erheng Zhong 1 1 Department of Computer Science and

Learning to Rank with Learning to Rank with Partially-Labeled Data Partially-Labeled Data Kevin

2018 - 2019 Teacher Salary Comparison Report 0-Year 5-Year 10-Year 15-Year 20-Year District

Introduction to rank-based cryptography Philippe Gaborit University of Limoges, France ASCRYPTO

Web Mining Mining content Simple rank is confused by rank sinks, e.g. two pages that

Parallel Numerical Algorithms Chapter 6 Matrix Models Section 6.2 Low Rank Approximation

Selection Problem Rank Given n unsorted elements, determine the Rank of an element is its

Selection Problem Rank Given n unsorted elements, determine the Rank of an element is its

Outline Contagion Contagion Basic Contagion Basic Contagion Models Models Complex Networks,

Scaling the Cascades Interconnect-aware FPGA implementation of Machine Learning problems Anand

Object Detection using Haar like Features CS 395T: Visual Recognition and Search Harshdeep

Willump: A Statistically-Aware End-to-end Optimizer for ML Inference Peter Kraft , Daniel Kang,

IN WHATSAPP GROUPS Josemar Alves Caetano, Gabriel Magno, Marcos Gonalves, Jussara Almeida,

Projections of random fractals and measures and Liouville quantum gravity Kenneth Falconer

Secure Joins with MapReduce Xavier Bultel 1 Radu Ciucanu 2 Matthieu Giraud 3 Pascal Lafourcade 3

RE-PACRR: A Context and Density-Aware Neural Information Retrieval Model Kai Hui 1 , Andrew Yates