CS249: ADVANCED DATA MINING Recommender Systems II Instructor: - PowerPoint PPT Presentation

CS249: ADVANCED DATA MINING Recommender Systems II Instructor: Yizhou Sun yzsun@cs.ucla.edu May 31, 2017

Recommender Systems • Recommendation via Information Network Analysis • Hybrid Collaborative Filtering with Information Networks • Graph Regularization for Recommendation • Summary 2

Traditional View of Recommendation Revolutionary Avatar Titanic Aliens Road 3

Recommendation Paradigm feedback user user-item feedback recommender system recommendation product features Content-Based Methods Collaborative Filtering Hybrid Methods E.g., K-Nearest Neighbor (Sarwar WWW’01) , Matrix E.g., (Balabanovic Comm. ACM’ 97, Zhang SIGIR’02) E.g., Content-Based CF (Antonopoulus , IS’06) , Factorization (Hu ICDM’08, Koren IEEE- CS’09) , External Knowledge CF (Ma WSDM’11) Probabilistic Model (Hofmann SIGIR’03) external knowledge 4

An Example of Traditional Method: Matrix Factorization 𝑆 : Rating Matrix 𝑆 : Estimated Rating Matrix 5

Challenges • How to address the data sparsity and cold start issues? • How to leverage different sources of information? 6

Solution: A Heterogeneous Information Network View of Recommendation Revolutionary Avatar Titanic Aliens Road James Romance Cameron Zoe Leonardo Kate Adventure Saldana Dicaprio Winslet 7

What Are Information Networks? • A network where each node represents an entity (e.g., user in a social network) and each link (e.g., friendship) a relationship between entities. • Nodes/links may have attributes, labels, and weights. • Links may carry rich semantic information. 8

We are living in a connected world! 9

Even in Biomedical Domain Side Symptom Disease Effect Gene carriedBy Patient Drug contain Microbe cause Disease Compound similarTo 10

Recommendation Paradigm feedback user user-item feedback recommender system recommendation product features Content-Based Methods Collaborative Filtering Hybrid Methods E.g., K-Nearest Neighbor (Sarwar WWW’01) , Matrix E.g., (Balabanovic Comm. ACM’ 97, Zhang SIGIR’02) E.g., Content-Based CF (Antonopoulus , IS’06) , Factorization (Hu ICDM’08, Koren IEEE- CS’09) , External Knowledge CF (Ma WSDM’11) Probabilistic Model (Hofmann SIGIR’03) external knowledge 12

Problem Definition feedback user implicit user feedback recommender system recommendation hybrid collaborative filtering with information networks information network 13

Recommend with Trust and Distrust Relationships [Ma et al., RecSys’09] • Users can be easily influenced by the friends they trust , and prefer their friends’ recommendations. Where to have dinner? Good Ask Very Good Ask Ask Cheap & Delicious 14

Trust and Distrust Graph 𝑻 𝑼 : Trust Graph 𝑻 𝑬 : Distrust Graph R: User Item Rating Matrix 15

Recommendation with Trust and Distrust Relationships 𝑻 𝑼 : Trust Graph 𝑻 𝑬 : Distrust Graph 16

Results • Dataset: Epinions • Metric: RMSE 17

Hybrid Collaborative Filtering with Networks • Utilizing network relationship information can enhance the recommendation quality • However, most of the previous studies only use single type of relationship between users or items (e.g., social network Ma,WSDM’ 11 , trust relationship Ester, KDD’ 10 , service membership Yuan, RecSys’ 11 ) 18

The Heterogeneous Information Network View of Recommender System Revolution Avatar Titanic Aliens -ary Road James Romance Cameron Zoe Leonardo Kate Adventure Saldana Dicaprio Winslet 19

Relationship Heterogeneity Alleviates Data Sparsity Collaborative filtering methods suffer from data sparsity issue # of ratings A small number Most users and items have of users and items a small number of ratings have a large number of ratings # of users or items • Heterogeneous relationships complement each other • Users and items with limited feedback can be connected to the network by different types of paths • Connect new users or items (cold start) in the information network 20

Relationship Heterogeneity Based Personalized Recommendation Models (Yu et al., WSDM’14) Different users may have different behaviors or preferences Two levels of personalization Data level James Cameron fan • Most recommendation methods use Aliens one model for all users and rely on personal feedback to achieve 80s Sci-fi fan personalization Model level Sigourney Weaver fan • With different entity relationships, we can learn personalized models for Different users may be interested in the same different users to further distinguish movie for different reasons their differences 21

Preference Propagation-Based Latent Features genre: drama King Kong Bob Naomi Watts Charlie tag: Oscar Nomination Ralph Fiennes Alice Titanic skyfall revolutionary Kate Winslet Sam Mendes road Calculate latent- Generate L different Propagate user features for users meta-path (pa path h typ ypes) es) implicit feedback and items for each connecting users along each meta- meta-path with NMF and items path related method 22

Recommendation Models Observation 1 : Different meta-paths may have different importance Global Recommendation Model features for user i and item j ranking score (1) the q-th meta-path Observation 2 : Different users may require different models Personalized Recommendation Model user-cluster similarity L (2) c total soft user clusters 23

Parameter Estimation • Bayesian personalized ranking (Rendle UAI’ 09) • Objective function sigmoid function min (3) Θ for each correctly ranked item pair i.e., 𝑣 𝑗 gave feedback to 𝑓 𝑏 but not 𝑓 𝑐 Generate For each user Soft cluster users personalized model cluster, learn one with NMF + k-means for each user on the model with Eq. (3) fly with Eq. (2) Learning Personalized Recommendation Model 24

Experiment Setup • Datasets • Comparison methods: • Popularity: recommend the most popular items to users • Co-click: conditional probabilities between items • NMF: non-negative matrix factorization on user feedback • Hybrid-SVM: use Rank-SVM with plain features (utilize both user feedback and information network) 25

Performance Comparison p HeteRec personalized recommendation (HeteRec-p) provides the best recommendation results 26

Performance under Different Scenarios p p user HeteRec – p consistently outperform other methods in different scenarios better recommendation results if users provide more feedback better recommendation for users who like less popular items 27

From Graph Regularization Point of View • Why additional links help? • They define new similarity metrics between users or items. • How to integrate this assumption into recommendation? • Use graph regularization to force two entities to be similar in latent space, if they are similar in graph • The original form of graph regularization 2 = 𝑔 ′ 𝑀𝑔 1 • 2 ∑𝑥 𝑗𝑘 𝑔 𝑗 − 𝑔 𝑘 • 𝑥 𝑗𝑘 ∶ 𝑡𝑗𝑛𝑗𝑚𝑏𝑠𝑗𝑢𝑧 𝑝𝑔 𝑜𝑝𝑒𝑓 𝑗 𝑏𝑜𝑒 𝑘 • 𝑔 𝑗 : some latent representation for node i • L : Laplacian matrix of W , i.e., 𝑀 = 𝐸 − 𝑋, • 𝑥ℎ𝑓𝑠𝑓 𝐸 𝑗𝑡 𝑏 𝑒𝑗𝑏𝑕𝑝𝑜𝑏𝑚 𝑛𝑏𝑢𝑠𝑗𝑦 𝑏𝑜𝑒 𝐸 𝑗𝑗 = ∑ 𝑘 𝑥 𝑗𝑘 29

Recommender Systems with Social Regularization [Ma et al., WSDM’11] • Input: Social Relation + Rating Matrix 30

Two Regularization Forms • Model 1: Average-based Regularization • We are similar to the average of our friends • Model2: Individual-based Regularization • We are similar to each of our friends Similarity can be propagated via friends: transitivity! 31

How to compute similarity between two users? • Cosine similarity (VSS) • Pearson correlation coefficient (PCC) 32

Results 33

Meta-Path-based Regularization [Yu et al., IJCAI- HINA’13] • What if it is more than one type of relation? Rating Data Heterogeneous Information Network • Solution: • Use meta-path to generate similarity relation between items, e.g., movie-director-movie • Learn the importance score for each meta-path 34

Notations • We have n users and m items. • • By computing similarity scores of all item pairs along certain meta-path, we can get a similarity matrix • • With L different meta-paths, we can calculate L similarity matrices as • 35

Objective Function Regularization on U V Approximate R with U V product Regularization on θ , Similar items measured from HIN which is the importance should have similar low-rank score for each meta-path representations 36

Equivalent Objective Function Using Graph Laplacian Similar items measured from HIN should have similar low-rank representations 37

Dataset • We combine IMDb + MovieLens100K We random sample training datasets of different sizes (0.4, 0.6, and 0.8) 38

CS249: ADVANCED DATA MINING Recommender Systems II Instructor: - PowerPoint PPT Presentation

CS249: ADVANCED DATA MINING Recommender Systems II Instructor: Yizhou Sun yzsun@cs.ucla.edu May 31, 2017 Recommender Systems Recommendation via Information Network Analysis Hybrid Collaborative Filtering with Information Networks

Web Mining Web Mining Web Mining Web Mining Web mining is the use of data mining techniques

CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS 1: Introduction Instructor: Yizhou Sun

CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS Overview of Networks Instructor: Yizhou

Introduction What is data mining? to Data Mining: On what kind of data? Data Mining

Web Mining Web Mining Web mining is the use of data mining techniques to automatically

Introduction What is data mining? to Data mining functionalities Data Mining Major

Data mining Machine Intelligence Thomas D. Nielsen September 2008 Data mining September 2008

DATA MINING LECTURE 2 What is data? The data mining pipeline What is Data Mining? Data

CS6220: DATA MINING TECHNIQUES Chapter 7: Advanced Pattern Mining Instructor: Yizhou Sun

Data Mining 2020 Frequent Pattern Mining (2) Ad Feelders Universiteit Utrecht October 2, 2020

Web MINING Web MINING Overview Overview Dr Ahmed Rafea Rafea Dr Ahmed 1 Web Mining Outline

LECTURE 1: INTRODUCTION TO DATA MINING Dr. Dhaval Patel CSE, IIT-Roorkee What is data mining?

Data Mining Based Detection Methods Data Mining in Intrusion detection Feng Pan Outline

DATA MINING LECTURE 1 Introduction What is data mining? After years of data mining there is

Cement, Aggregates, Mining Presentation Cement, Aggregates and Mining Cement, Aggregates and

Frequent Pattern Mining Frequent Sequence Mining Frequent Tree Mining Christian Borgelt

The Biodiscovery Pipeline is Discontinuous Sampling In situ Each step may take a significant

Very high dimensional causal structure and Markov boundary discovery: key algorithmic

Browsers on the move 2007 - 05 to 2008 - 06 Michael ( tm ) Smith mike@w3 . org Prologue : Biggest

Eco-evolutionary theory of gut microbiome dysbiosis marco.candela@unibo.it All macro-organisms

WELCOME Global Environmental Impact Dr. Teik C. Lim Provost and Vice President for Academic

Comparing the 2015 Pease Perfluorochemical (PFC) Blood Test Results to Other Populations Tested

Federal Lead-Based Paint Regulations: CDBG-DR Rehabilitation Programs 2020 CDBG-DR and CDBG-MIT

I have nothing to disclose The findings and conclusions in this report are those of the Bruce