A Performance Prediction Approach to Enhance Collaborative Filtering - - PowerPoint PPT Presentation

a performance prediction approach to enhance
SMART_READER_LITE
LIVE PREVIEW

A Performance Prediction Approach to Enhance Collaborative Filtering - - PowerPoint PPT Presentation

A Performance Prediction Approach to Enhance Collaborative Filtering Performance Alejandro Bellogn and Pablo Castells { alejandro.bellogin , pablo.castells}@uam.es Universidad Autnoma de Madrid Escuela Politcnica Superior European


slide-1
SLIDE 1

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

A Performance Prediction Approach to Enhance Collaborative Filtering Performance

Alejandro Bellogín and Pablo Castells

{alejandro.bellogin, pablo.castells}@uam.es

Universidad Autónoma de Madrid Escuela Politécnica Superior

slide-2
SLIDE 2

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Introduction: Recommender Systems

(Adomavicius & Tuzhilin 2005)

 

*

argmaxutility ,

u i I

i u i

slide-3
SLIDE 3

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Introduction: Recommender Systems

  • Collaborative filtering (Adomavicius & Tuzhilin 2005)

   

 

,

utility , sim ,

v i v N u

u i k u v r

 

 

*

argmaxutility ,

u i I

i u i

slide-4
SLIDE 4

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Is similarity enough?

  • No, we propose the following modification
  • Related work:
  • Experts (Amatriain et al. 2009)
  • Power users (Lathia et al. 2008)
  • Trust (Kwon et al. 2009, O‘Donovan & Smyth 2005)
  • Dealing with users with little overlapping

– Significance weighting: n/50 (Herlocker et al. 2002) – Confidence (Clements et al. 2007)

     

 

,

utility , γ sim ,

v i v N u

u i k v u v r

 

slide-5
SLIDE 5

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Our approach

  • Predict ―neighbor performance‖ ( )
  • Adaptation of query performance prediction techniques
  • User / item clarity
  • Check predictive power
  • Correlation against ―neighbor goodness‖
  • Enhance CF performance with dynamic weights on neighbors
slide-6
SLIDE 6

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Performance prediction in IR

  • Mostly addressed as query performance (Hauff et al. 2008)
  • Query clarity (Cronen-Townsend et al. 2002)
  • Distance (relative entropy) between query and collection language models
  • Query clarity captures the (lack of) ambiguity in a query with

respect to the collection

  • Queries whose likely relevant documents are a mix of disparate topics receive a

lower score than those with a topically-coherent result set.

  • Strong correlation between query clarity and the performance (average

precision) of the result set

               

 

       

2

| clarity | log | | | , | | | | 1

q

w V coll q d R w q ml coll

P w q q P w q P w P w q P w d P d q P q d P w d P w d P w d P w  

  

     

  

slide-7
SLIDE 7

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Predicting good neighbors

  • User ―clarity‖, item ―clarity‖…?
  • Many possible ways to map query clarity to elements in CF
  • For instance, for user clarity:

               

2 : ( , )

| γ( ) | log | | | 1

v U c i rat u i c

p v u u clarity u p v u p v p v u p v i p i u p v U

 

   

 

slide-8
SLIDE 8

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Evaluation

  • Correlation between predictor and performance metric
  • How do we define the ―performance‖ of a neighbor?
  • Final performance improvements when dynamic weights are introduced
  • Metric: RMSE
  • Dataset:
  • MovieLens (100K)
  • Two variables:
  • Neighborhood size
  • Sparsity (number of available ratings)
  • Baseline:
  • Standard user-based kNN CF with Pearson similarity
slide-9
SLIDE 9

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Assessing predictive power

  • A neighbor performance metric is needed
  • Proposed approximation to ―neighbor goodness‖

How does a user affect the total MAE of the system NG(u) ~ ―total MAE reduction by u‖ ~ ―MAE without u‖ – ―MAE with u‖

  • Observed results
  • Pearson correlation of 0.18 (50% sparsity, p-value < 0.05)

         

 

 

     

: ( , )

1 1 CE CE | | | | CE | , , |

U U u v U u v U u U u U u X X i rat v i

v v R R v r v i r v i

       

   

  

slide-10
SLIDE 10

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Performance comparison for different rating density

Dynamic neighbor weights in CF

1,00 1,02 1,04 1,06 1,08 1,10 1,12 1,14 1,16 1,18 1,20 1,22 1,24 1,26 1,28 1,30 1,32 10 20 30 40 50 60 70 80 90 10 20 30 40 50 60 70 80 90 RMSE % of ratings for training

a) Neighbourhood size: 100

Standard CF Clarity-enhanced CF

b) Neighbourhood size: 500

slide-11
SLIDE 11

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Dynamic neighbor weights in CF

Performance comparison for different neighbourhood sizes

1,00 1,01 1,02 1,03 1,04 1,05 1,06 1,07 1,08 1,09 1,10 1,11 1,12 1,13 1,14 100 150 200 250 300 350 400 450 500 100 150 200 250 300 350 400 450 500 RMSE Neighbourhood size

a) 60% training

Standard CF Clarity-enhanced CF

b) 80% training

slide-12
SLIDE 12

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Conclusions

  • Performance prediction for neighbor selection in CF
  • Positive though moderate correlations values
  • Revise NG: is it an adequate metric?
  • Improve predictor
  • Performance improvements using dynamic weights for

neighbors

  • Higher difference for small neighborhoods
slide-13
SLIDE 13

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Future work

  • Alternative variants of clarity based predictor
  • Even (u, v, i, …)
  • Analysis of user performance metric
  • Further comparison with other predictors: variance, social-

based, time-based

  • Predicting performance can be useful in many

recommendation and personalization scenarios

  • Hybrid recommender systems, personalized IR, rank fusion
slide-14
SLIDE 14

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Thank you

slide-15
SLIDE 15

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Bibliography

  • Adomavicius, G., Tuzhilin, A.: Toward the next generation of recommender systems: A survey of the

state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering 17(6), 734--749 (2005)

  • Amatriain, X., Lathia, N., Pujol, J. M., Kwak, H., and Oliver, N: The wisdom of the few: a collaborative

filtering approach based on expert opinions from the web. In SIGIR 2009, pp. 532-539 (2009)

  • Clements, M., de Vries, A. P., Pouwelse, J. A., Wang, J., and Reinders, M. J. T. "Evaluation of

neighbourhood selection methods in decentralized recommendation systems," in Workshop on Large Scale Distributed Systems for Information Retrieval (LSDS-IR) (2007)

  • Cronen-Townsend, S., Zhou, Y. & Croft, B. W. (2002), Predicting query performance, in ‗SIGIR ‘02:

Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval‘, ACM Press, New York, NY, USA, pp. 299–306.

  • Hauff, C., Hiemstra, D., de Jong, F.: A survey of pre-retrieval query performance predictors. In: 17th

ACM conference on Information and knowledge management (CIKM 2008), pp. 1419—1420. ACM Press, New York (2008)

  • Herlocker, J., Konstan, J. A., and Riedl, J. An empirical analysis of design choices in neighborhood-

based collaborative filtering algorithms. Inf. Retr., 5(4), 287-310 (2002).

  • Kwon, K., Cho, J., and Park, Y. Multidimensional credibility model for neighbor selection in

collaborative recommendation. Expert Syst. Appl., 36(3):7114-7122 (2009)

  • Lathia, N., Hailes, S., and Capra, L: kNN CF: a temporal social network. In RecSys '08, pp. 227-234

(2008)

  • O'Donovan, J. and Smyth, B. Trust in recommender systems. In IUI '05, pp 167-174 (2005)
slide-16
SLIDE 16

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Predicting good neighbors

  • Many possible ways

for the mapping

  • User clarity:

                                             

2 : ( , ) 0

| γ( ) | log | | | | | 1 | | 1 , | , , | , 1 1 ,

w U c i rat v i ml c ml c ml w U ml j I c c

p w v v clarity v p w v p w p w v p w i p i v p w i p w i p w p i v p i v p i r w i p w i r u i r v i p i v r v j p v p i U I    

   

            

   

slide-17
SLIDE 17

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Correlation

  • Pearson correlation: user clarity vs neighbor goodness

 Direct correlation

  • When calculated with significant data

Not strong values

% training 10% 20% 30% 40% 50% 60% 70% 80% 90% correlation

  • 0.10

0.10 0.18 0.18 0.18 0.17 0.17 0.15 0.15

slide-18
SLIDE 18

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Performance comparison for different rating density

Performance results I

0,80 0,82 0,84 0,86 0,88 0,90 0,92 0,94 0,96 0,98 10 20 30 40 50 60 70 80 90 10 20 30 40 50 60 70 80 90 MAE % of ratings for training

a) Neighbourhood size: 100

Standard CF Clarity-enhanced CF

b) Neighbourhood size: 500

slide-19
SLIDE 19

European Conference on Information Retrieval 2010 March 28-31, Milton Keynes, United Kingdom

Performance results II

Performance comparison for different neighbourhood sizes

0,80 0,81 0,82 0,83 0,84 0,85 0,86 0,87 0,88 100 150 200 250 300 350 400 450 500 100 150 200 250 300 350 400 450 500 MAE Neighbourhood size

a) 60% training

Standard CF Clarity-enhanced CF

b) 80% training