A Performance Prediction Approach to Enhance Collaborative Filtering - PowerPoint PPT Presentation

A Performance Prediction Approach to Enhance Collaborative Filtering Performance Alejandro Bellogín and Pablo Castells { alejandro.bellogin , pablo.castells}@uam.es Universidad Autónoma de Madrid Escuela Politécnica Superior European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Introduction: Recommender Systems    * argmaxutility , i u i u  i I (Adomavicius & Tuzhilin 2005) European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Introduction: Recommender Systems    * argmaxutility , i u i u  i I  Collaborative filtering (Adomavicius & Tuzhilin 2005)        utility , sim , u i k u v r , v i    v N u European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Is similarity enough?  No, we propose the following modification          γ sim , utility , u i k v u v r , v i    v N u  Related work: • Experts (Amatriain et al. 2009) • Power users (Lathia et al. 2008) • Trust (Kwon et al. 2009, O‘Donovan & Smyth 2005) • Dealing with users with little overlapping – Significance weighting: n/50 (Herlocker et al. 2002) – Confidence (Clements et al. 2007) European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Our approach  Predict ―neighbor performance‖  (  )  Adaptation of query performance prediction techniques • User / item clarity  Check predictive power • Correlation against ―neighbor goodness‖  Enhance CF performance with dynamic weights on neighbors European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Performance prediction in IR  Mostly addressed as query performance (Hauff et al. 2008)  Query clarity (Cronen-Townsend et al. 2002) • Distance (relative entropy) between query and collection language models   | P w q       clarity | log q P w q   2 P w  w V coll               | | | , | | P w q P w d P d q P q d P w d q   d R w q q              | | 1 P w d P w d P w ml coll  Query clarity captures the (lack of) ambiguity in a query with respect to the collection • Queries whose likely relevant documents are a mix of disparate topics receive a lower score than those with a topically-coherent result set. • Strong correlation between query clarity and the performance (average precision) of the result set European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Predicting good neighbors  User ―clarity‖, item ―clarity‖…?  Many possible ways to map query clarity to elements in CF  For instance, for user clarity:   | p v u        γ( ) | log u clarity u p v u   2 p v  v U c         | | | p v u p v i p i u  : ( , ) i rat u i 1    p v c U European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Evaluation  Correlation between predictor and performance metric • How do we define the ―performance‖ of a neighbor?  Final performance improvements when dynamic weights are introduced • Metric: RMSE  Dataset: • MovieLens (100K)  Two variables: • Neighborhood size • Sparsity (number of available ratings)  Baseline: • Standard user-based kNN CF with Pearson similarity European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Assessing predictive power  A neighbor performance metric is needed  Proposed approximation to ― neighbor goodness ‖ How does a user affect the total MAE of the system NG(u) ~ ―total MAE reduction by u‖ ~ ―MAE without u‖ – ―MAE with u‖ 1 1           CE CE v v  U U u | | | | R R               v U u v U u U u U u          CE | , , | v r v i r v i X X  : ( , ) i rat v i  Observed results • Pearson correlation of 0.18 (50% sparsity, p-value < 0.05) European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Dynamic neighbor weights in CF 1,32 b) Neighbourhood size: 500 a) Neighbourhood size: 100 1,30 1,28 1,26 1,24 Standard CF 1,22 1,20 Clarity-enhanced CF 1,18 1,16 RMSE 1,14 1,12 1,10 1,08 1,06 1,04 1,02 1,00 10 20 30 40 50 60 70 80 90 10 20 30 40 50 60 70 80 90 % of ratings for training Performance comparison for different rating density European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Dynamic neighbor weights in CF 1,14 b) 80% training a) 60% training 1,13 1,12 1,11 1,10 Standard CF 1,09 1,08 Clarity-enhanced CF RMSE 1,07 1,06 1,05 1,04 1,03 1,02 1,01 1,00 100 150 200 250 300 350 400 450 500 100 150 200 250 300 350 400 450 500 Neighbourhood size Performance comparison for different neighbourhood sizes European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Conclusions  Performance prediction for neighbor selection in CF  Positive though moderate correlations values • Revise NG: is it an adequate metric? • Improve predictor  Performance improvements using dynamic weights for neighbors • Higher difference for small neighborhoods European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Future work  Alternative variants of clarity based predictor • Even  ( u, v, i, … )  Analysis of user performance metric  Further comparison with other predictors: variance, social- based, time-based  Predicting performance can be useful in many recommendation and personalization scenarios • Hybrid recommender systems, personalized IR, rank fusion European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Thank you European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Bibliography  Adomavicius, G., Tuzhilin, A.: Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering 17(6), 734--749 (2005) Amatriain, X., Lathia, N., Pujol, J. M., Kwak, H., and Oliver, N: The wisdom of the few: a collaborative  filtering approach based on expert opinions from the web. In SIGIR 2009, pp. 532-539 (2009)  Clements, M., de Vries, A. P., Pouwelse, J. A., Wang, J., and Reinders, M. J. T. "Evaluation of neighbourhood selection methods in decentralized recommendation systems," in Workshop on Large Scale Distributed Systems for Information Retrieval (LSDS-IR) (2007) Cronen-Townsend, S., Zhou, Y. & Croft, B. W. (2002), Predicting query performance, in ‗SIGIR ‘ 02:  Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval‘, ACM Press, New York, NY, USA, pp. 299 – 306. Hauff, C., Hiemstra, D., de Jong, F.: A survey of pre-retrieval query performance predictors. In: 17 th  ACM conference on Information and knowledge management (CIKM 2008), pp. 1419 — 1420. ACM Press, New York (2008) Herlocker, J., Konstan, J. A., and Riedl, J. An empirical analysis of design choices in neighborhood-  based collaborative filtering algorithms. Inf. Retr., 5(4), 287-310 (2002).  Kwon, K., Cho, J., and Park, Y. Multidimensional credibility model for neighbor selection in collaborative recommendation. Expert Syst. Appl., 36(3):7114-7122 (2009) Lathia, N., Hailes, S., and Capra, L: kNN CF: a temporal social network. In RecSys '08, pp. 227-234  (2008)  O'Donovan, J. and Smyth, B. Trust in recommender systems. In IUI '05, pp 167-174 (2005) European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Predicting good neighbors   |  Many possible ways p w v        γ( ) | log v clarity v p w v   2 p w for the mapping  w U c         | | | p w v p w i p i v  : ( , ) 0 i rat v i              | | 1 p w i p w i p w ml c               User clarity: | | 1 p i v p i v p i ml c   , r w i    | p w i    ml , r u i  w U   , r v i    | p i v    ml , r v j  j I 1 1       , p v p i c c U I European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Correlation  Pearson correlation: user clarity vs neighbor goodness % training 10% 20% 30% 40% 50% 60% 70% 80% 90% correlation -0.10 0.10 0.18 0.18 0.18 0.17 0.17 0.15 0.15  Direct correlation • When calculated with significant data  Not strong values European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

A Performance Prediction Approach to Enhance Collaborative Filtering - PowerPoint PPT Presentation

A Performance Prediction Approach to Enhance Collaborative Filtering Performance Alejandro Bellogn and Pablo Castells { alejandro.bellogin , pablo.castells}@uam.es Universidad Autnoma de Madrid Escuela Politcnica Superior European

Enhance OpenSSH for Fun and Security Enhance OpenSSH for Fun and Security Enhance OpenSSH for Fun

Structured Prediction Introduction What is structured prediction? CS 6355: Structured Prediction

Branch Prediction Branch Prediction vs vs Execution Time Execution Time Prediction

Using lasso and related estimators for prediction Di Liu StataCorp July 12, 2019 1 / 20

Prediction and Odds 18.05 Spring 2017 Probabilistic Prediction Also called probabilistic

Using Stata 16s lasso features for prediction and inference Di Liu StataCorp 1 / 50

CS 104 Computer Organization and Design Branch Prediction CS104:Branch Prediction 1 Branch

Exercise 7a: Additional Intra Prediction Modes Implement Additional Block Prediction Modes Add

Complex Prediction Problems A novel approach to multiple Structured Output Prediction Yasemin

DeepLoc Data set statistics & performance Protein prediction II Gregor Sturm, Johannes Rest,

Wiener chaos approach for optimal Introduction Wiener Chaos prediction Trigonometric

Prediction-Guided Performance-Energy Trade-off for Interactive Applications Daniel Lo Taejoon

(seasonal) prediction systems Arun Kumar Climate Prediction Center College Park, Maryland, USA

Summary of part I: prediction and RL Prediction is important for action selection The

Prediction and Odds 18.05 Spring 2014 January 1, 2017 1 / 20 Probabilistic Prediction Also

Prediction and Odds 18.05 Spring 2014 January 1, 2017 1 / 26 Probabilistic Prediction Also

Why learn how to build recommendation engines? Jamen Long Data Scientist DataCamp Building

Recommender Systems Jia-Bin Huang Virginia Tech Spring 2019 ECE-5424G / CS-5824 Administrative

Objective Taking recommendation technology to the masses Helping researchers and

Effjcient Similarity Computation for Collaborative Filtering in Dynamic Environments Olivier

Song Recommendation Engine Tianlai Karaoke App Gao Wa, Cui Xiaoting, Zheng Feng Advisor:

A Privacy-Protecting Architecture for Collaborative Filtering via Forgery and Suppression of

Please feel free to include these slides in your own material, or modify them as you see fit. If

Recommendation Systems Stony Brook University CSE545, Fall 2016 From Frequent to Recommended

Sambuz

Useful Links

Newsletter

Mail Us

A Performance Prediction Approach to Enhance Collaborative Filtering - PowerPoint PPT Presentation

A Performance Prediction Approach to Enhance Collaborative Filtering Performance Alejandro Bellogn and Pablo Castells { alejandro.bellogin , pablo.castells}@uam.es Universidad Autnoma de Madrid Escuela Politcnica Superior European

Enhance OpenSSH for Fun and Security Enhance OpenSSH for Fun and Security Enhance OpenSSH for Fun

Structured Prediction Introduction What is structured prediction? CS 6355: Structured Prediction

Branch Prediction Branch Prediction vs vs Execution Time Execution Time Prediction

Using lasso and related estimators for prediction Di Liu StataCorp July 12, 2019 1 / 20

Prediction and Odds 18.05 Spring 2017 Probabilistic Prediction Also called probabilistic

Using Stata 16s lasso features for prediction and inference Di Liu StataCorp 1 / 50

CS 104 Computer Organization and Design Branch Prediction CS104:Branch Prediction 1 Branch

Exercise 7a: Additional Intra Prediction Modes Implement Additional Block Prediction Modes Add

Complex Prediction Problems A novel approach to multiple Structured Output Prediction Yasemin

DeepLoc Data set statistics &amp; performance Protein prediction II Gregor Sturm, Johannes Rest,

Wiener chaos approach for optimal Introduction Wiener Chaos prediction Trigonometric

Prediction-Guided Performance-Energy Trade-off for Interactive Applications Daniel Lo Taejoon

(seasonal) prediction systems Arun Kumar Climate Prediction Center College Park, Maryland, USA

Summary of part I: prediction and RL Prediction is important for action selection The

Prediction and Odds 18.05 Spring 2014 January 1, 2017 1 / 20 Probabilistic Prediction Also

Prediction and Odds 18.05 Spring 2014 January 1, 2017 1 / 26 Probabilistic Prediction Also

Why learn how to build recommendation engines? Jamen Long Data Scientist DataCamp Building

Recommender Systems Jia-Bin Huang Virginia Tech Spring 2019 ECE-5424G / CS-5824 Administrative

Objective Taking recommendation technology to the masses Helping researchers and

Effjcient Similarity Computation for Collaborative Filtering in Dynamic Environments Olivier

Song Recommendation Engine Tianlai Karaoke App Gao Wa, Cui Xiaoting, Zheng Feng Advisor:

A Privacy-Protecting Architecture for Collaborative Filtering via Forgery and Suppression of

Please feel free to include these slides in your own material, or modify them as you see fit. If

Recommendation Systems Stony Brook University CSE545, Fall 2016 From Frequent to Recommended

Sambuz

Useful Links

Newsletter

Mail Us

DeepLoc Data set statistics & performance Protein prediction II Gregor Sturm, Johannes Rest,