Transfer to Rank for Heterogeneous One-Class Collaborative Filtering - PowerPoint PPT Presentation

Transfer to Rank for Heterogeneous One-Class Collaborative Filtering Weike Pan 1 , Qiang Yang 2 ∗ , Wanling Cai 1 , Yaofeng Chen 2 , Qing Zhang 1 , Xiaogang Peng 1 ∗ and Zhong Ming 1 ∗ panweike@szu.edu.cn, qyang@cse.ust.hk, wanling cai@qq.com, chenyaofeng@email.szu.edu.cn, qingzhang1992@qq.com, pengxg@szu.edu.cn, mingz@szu.edu.cn 1 College of Computer Science and Software Engineering and National Engineering Laboratory for Big Data System Computing Technology Shenzhen University, Shenzhen, China 2 Department of Computer Science and Engineering Hong Kong University of Science and Technology, Hong Kong, China Pan et al., (SZU & HKUST) HOCCF (RoToR) ACM TOIS 1 / 28

Introduction Problem Definition Heterogeneous One-Class Collaborative Filtering (HOCCF) Input: Browses B = { ( u , i ) } and Purchases P = { ( u , i ′ ) } Goal: Rank the not yet purchased items, i.e., I\P u , for each end user u ∈ U Pan et al., (SZU & HKUST) HOCCF (RoToR) ACM TOIS 2 / 28

Introduction Challenges Ambiguity of browses regarding the users’ preferences 1 Scarcity of purchases as compared with the browse data 2 Heterogeneity arising from different feedback 3 Pan et al., (SZU & HKUST) HOCCF (RoToR) ACM TOIS 3 / 28

Introduction Overall of Our Solution Role-based Transfer to Rank (RoToR) : In integrative RoToR: we leverage browses into the preference learning task of purchases, in which we take each user as a sophisticated customer (i.e., mixer ) that is able to take different types of feedback into consideration. In sequential RoToR, we aim to simplify the integrative one by decomposing it into two dependent phases according to a typical shopping process. Pan et al., (SZU & HKUST) HOCCF (RoToR) ACM TOIS 4 / 28

Introduction Advantages of Our Solution We design a novel transfer learning solution from the perspective 1 of users’ roles of mixer, browser and purchaser, which addresses the challenges of ambiguity, scarcity and heterogeneity well. Pan et al., (SZU & HKUST) HOCCF (RoToR) ACM TOIS 5 / 28

Introduction Notations (1/3) Table: Some notations and explanations (1/3). n = |U| user number m = |I| item number u ∈ U user ID i , i ′ ∈ I item ID R = { ( u , i ) } universe of all possible (user, item) pairs P = { ( u , i ) } (user, item) pairs denoting purchases P u set of items purchased by user u B = { ( u , i ′ ) } (user, item) pairs denoting browses B u set of items browsed by user u A sampled negative feedback from R\P r ui r ui = 1 if ( u , i ) ∈ P and r ui = − 1 if ( u , i ) ∈ A Pan et al., (SZU & HKUST) HOCCF (RoToR) ACM TOIS 6 / 28

Introduction Notations (2/3) Table: Some notations and explanations (2/3). d ∈ R number of latent dimensions b u ∈ R user bias b i ∈ R item bias U u · ∈ R 1 × d user-specific latent feature vector V i · ∈ R 1 × d item-specific latent feature vector W i ′ · ∈ R 1 × d item-specific latent feature vector Pan et al., (SZU & HKUST) HOCCF (RoToR) ACM TOIS 7 / 28

Introduction Notations (3/3) Table: Some notations and explanations (3/3). ˆ r ui predicted preference of user u to item i r ( F ) r uij , ˆ ˆ preference difference in pairwise preference learning uij , r ( F ) ˆ predicted preference of user u to item i in a factorization-based method ui r ( N ) ˆ predicted preference of user u to item i in a neighborhood-based method ui learned similarity between item i ′ and item i s ( ℓ ) i ′ i r ( N ′ ) ˆ predicted preference of user u to item i in item-oriented CF ui predefined similarity (Jaccard index) between item i ′ and item i s ( p ) i ′ i N i a nearest set of items of item i T iteration number in the algorithm Pan et al., (SZU & HKUST) HOCCF (RoToR) ACM TOIS 8 / 28

Method Integrative RoToR We follow the seminal work on integrating heterogeneous feedback of ratings and examinations [Koren, 2008], and have the estimated preference of user u to item i as follows, r ( F ) r ( N ) ˆ r ui = ˆ + ˆ (1) ui , ui r ( F ) = b u + b i + U u · V T where ˆ i · and ui r ( N ) i ′ ∈B u s ( ℓ ) 1 1 ˆ √ √ i ′ ∈B u W i ′ · V T = � i ′ i = � i · are the prediction rules ui |B u | |B u | of the classical factorization-based method and the neighborhood-based method, respectively. Pan et al., (SZU & HKUST) HOCCF (RoToR) ACM TOIS 9 / 28

Method Integrative RoToR with Pointwise Preference Learning With the positive feedback in P and the sampled negative feedback in A , we have the objective function for pointwise preference learning [Johnson, 2014], � min f ui (2) Θ ( u , i ) ∈P∪A where Θ = { U u · , b u , u = 1 , 2 , . . . , n ; W i · , V i · , b i , i = 1 , 2 , . . . , m } are model parameters to be learned, and f ui = log ( 1 + exp ( − r ui ˆ r ui )) + α u F + α v F + α w F + β u u + β v 2 || U u · || 2 2 || V i · || 2 i ′ ∈B u || W i ′ · || 2 2 b 2 2 b 2 � i is the 2 tentative objective function for the ( u , i ) pair. Notice that the prediction rule is r ( F ) r ( N ) 1 ˆ r ui = ˆ + ˆ = b u + b i + U u · V T √ i ′ ∈B u W i ′ · V T i · + � i · , and r ui = 1 if ui ui |B u | ( u , i ) ∈ P and r ui = − 1 if ( u , i ) ∈ A , denoting a positive and negative preference, respectively. Pan et al., (SZU & HKUST) HOCCF (RoToR) ACM TOIS 10 / 28

Method Integrative RoToR with Pairwise Preference Learning We adopt the classical pairwise objective function for purchases for preference learning [Rendle et al., 2009], � � � min f uij , (3) ˜ Θ u ∈U i ∈P u j ∈I\P u where ˜ Θ = { U u · , u = 1 , 2 , . . . , n ; W i · , V i · , b i , i = 1 , 2 , . . . , m } denotes the set of parameters to be learned, and 2 � U u · � 2 + α v 2 � V i · � 2 + α v 2 � V j · � 2 + r uj ) + α u f uij = − ln σ (ˆ r ui − ˆ 2 � b i � 2 + β v 2 � b j � 2 is the tentative objective α w i ′ ∈B u � W i ′ · � 2 F + β v � 2 function for every two (user, item) pairs, i.e., ( u , i ) and ( u , j ) . Notice that the prediction rule is r ( F ) r ( N ) 1 ˆ r ui = ˆ + ˆ = b i + U u · V T √ i ′ ∈B u W i ′ · V T i · + � i · because the user ui ui |B u | bias b u will be of no use in preference difference ˆ r ui − ˆ r uj . Pan et al., (SZU & HKUST) HOCCF (RoToR) ACM TOIS 11 / 28

Method Sequential RoToR (1/2) We propose to decompose the integrative variant, i.e., RoToR(int.), and combine the two units in a sequential and coarse-to-fine manner, i.e., first neighborhood-based method and then factorization-based method. This mechanism is designed to make the preference learning task from less aggressive to more aggressive. Mathematically, we represent the decomposition from an integrative manner to a sequential manner as follows, r ( N ′ ) r ( N ) r ( F ) r ( F ) ˆ + ˆ ≈ ˆ → ˆ ui , (4) ui ui ui where “ ≈ ” and “ → ” are the decomposition (or approximation) procedure and the sequential relationship, respectively. Pan et al., (SZU & HKUST) HOCCF (RoToR) ACM TOIS 12 / 28

Method Sequential RoToR (2/2) In the first phase of RoToR(seq.), we obtain a candidate list of items via a neighborhood-based method, i.e., item-oriented collaborative filtering (ICF). Specifically, the prediction rule is as follows, r ( N ′ ) s ( p ) � ˆ = (5) i ′ i , ui i ′ ∈N i ∩ ( P u ∪B u ) where s ( p ) i ′ i is a predefined similarity (Jaccard index) between item i ′ and item i based on P ∪ B , and N i contains the most similar neighbors of item i . Notice that we treat purchases and browses the same and union two sets of user behaviors when calculating the predefined similarity with the goal of identifying some likely to be examined items in this phase. Pan et al., (SZU & HKUST) HOCCF (RoToR) ACM TOIS 13 / 28

Method Sequential RoToR with Pointwise Preference Learning For pointwise preference learning in the second phase of sequential RoToR, i.e., RoToR(poi.,seq.), we have a similar objective function to that of Eq.(2) in RoToR(poi.,int.) as follows, f ( F ) � min ui , (6) Φ ( u , i ) ∈P∪A where Φ = { U u · , b u , u = 1 , 2 , . . . , n ; V i · , b i , i = 1 , 2 , . . . , m } are the model parameters, and f ( F ) r ( F ) F + β u u + β v = log ( 1 + exp ( − r ui ˆ ui )) + α u 2 || U u · || 2 F + α v 2 || V i · || 2 2 b 2 2 b 2 i is ui the tentative prediction rule defined on the ( u , i ) pair. r ( F ) Notice that the prediction rule is ˆ = b u + b i + U u · V T i · . And r ui = 1 if ui ( u , i ) ∈ P and r ui = − 1 if ( u , i ) ∈ A denote the positive and negative preference orientation, respectively. Pan et al., (SZU & HKUST) HOCCF (RoToR) ACM TOIS 14 / 28

Transfer to Rank for Heterogeneous One-Class Collaborative Filtering - PowerPoint PPT Presentation

Transfer to Rank for Heterogeneous One-Class Collaborative Filtering Weike Pan 1 , Qiang Yang 2 , Wanling Cai 1 , Yaofeng Chen 2 , Qing Zhang 1 , Xiaogang Peng 1 and Zhong Ming 1 panweike@szu.edu.cn, qyang@cse.ust.hk, wanling

2 3 4 5 8 9 MINNEAPOLIS MILWAUKEE MSA RANK #16 MSA RANK #39 CHICAGO MSA RANK #3

On the minimum rank of a graph Jisu Jeong June 21, 2013 Jisu Jeong On the minimum rank of a

A new family of maximum rank distance codes or: Maximum rank distance codes and finite semifields

1 SVD applications: rank, column, row, and null spaces Rank : the rank of a matrix is equal to:

Coverage in Heterogeneous Coverage in Heterogeneous Networks Xiaoli Chu King s College

Radiative Transfer Radiative Transfer Radiative transfer is a branch of atmospheric physics. We

Web Mining Mining content Simple rank is confused by rank sinks, e.g. two pages that

Rank one perturbations of unitary operators and Clarks model in general situation Sergei Treil

What Is Multicast? Key: Unicast transfer Broadcast transfer Unicast Multicast transfer

2018 - 2019 Teacher Salary Comparison Report 0-Year 5-Year 10-Year 15-Year 20-Year District

Introduction to rank-based cryptography Philippe Gaborit University of Limoges, France ASCRYPTO

Parallel Numerical Algorithms Chapter 6 Matrix Models Section 6.2 Low Rank Approximation

10. Learning to Rank Outline 10.1. Why Learning to Rank (LeToR)? 10.2. Pointwise, Pairwise,

Selection Problem Rank Given n unsorted elements, determine the Rank of an element is its

Selection Problem Rank Given n unsorted elements, determine the Rank of an element is its

Multiple-Rank Updates to Matrix Factorizations Zack 8/30/2013 Outline u Introduction u

DRUPAL PERFORMANCE A Surgical Approach 1 1 @mandclu MARTIN ANDERSON-CLUTZ 2

Making Decisions via Simulation [Law, Ch. 10], [Handbook of Sim. Opt.], [Haas, Sec. 6.3.6] Peter

What is a factor? Introduction to R for Finance Stocks or bonds? Investment = 2 stock = 1

Ranking Factors of Team Success Nataliia Pobiedina, Julia Neidhardt, Maria del Carmen Calatrava

Oberseminar Convergence Mechanisms for a Smart Space App Store Bibek Shrestha

Google matrix of the world trade network Leonardo Ermann CNEA (Buenos Aires, Argentina) Colab.

Introduction to the Million Songs Dataset Jamen Long Data Scientist DataCamp Building

Soumyajit Gupta, Mucahid Kutlu, Vivek Khetan, and Matthew Lease ECIR 2019, Cologne, Germany, So