Recommender Systems Francesco Ricci Free University of - PowerPoint PPT Presentation

Recommender Systems Francesco Ricci Free University of Bozen-Bolzano Italy fricci@unibz.it

Content p The paradox of choice and information overload p Personalization p Recommender systems p Step 1: preference elicitation p Step 2: preference prediction - rating estimation techniques n Contextualization n Groups p Step 3: recommendations' presentation p Issues and problems 3

Explosion of Choice A trip to a local supermarket : p 85 different varieties and brands of crackers n 285 varieties of cookies. n 165 varieties of “ juice drinks ” n 75 iced teas n 275 varieties of cereal n 120 different pasta sauces n 80 different pain relievers n 40 options for toothpaste n 95 varieties of snacks (chips, pretzels, etc.) n 61 varieties of sun tan oil and sunblock n 360 types of shampoo, conditioner, gel, and mousse. n 90 different cold remedies and decongestants. n 230 soups, including 29 different chicken soups n 175 different salad dressings and if none of them suited, 15 extra-virgin n olive oils and 42 vinegars and make one ’ s own

New Domains for Choice p Telephone Services p Retirement Pensions p Medical Care p News p Choosing how to work p Choosing how to love p Choosing how to be

Choice and Well-Being p We have more choice , more freedom, autonomy, and self determination p Increased choice should improve well-being: n added options can only make us better off: those who care will benefit, and those who do not care can always ignore the added options p Various assessment of well-being have shown that increased affluence have accompanied by decreased well-being .

Neuroscience and Information Overload p Neuroscientists have discovered that unproductivity and loss of drive can result from decision overload p Our brains ( 120 bits per second ) are configured to make a certain number of decisions per day and once we reach that limit, we can’t make any more p After the limit is reached we can have trouble separating the trivial from the important. 7

Information Overload p Internet = information overload = having too much information to make a decision or remain informed about a topic p To make a decision or remain informed about a topic you must perform exploratory search (e.g., comparison, knowledge acquisition, product selection, etc.) n not aware of the range of available options n may not know what to search n if presented with some results may not be able to choose. 8

eCommerce Personalization p “If I have 3 million customers on the Web, I should have 3 million stores on the Web” n Jeff Bezos , CEO and founder, Amazon.com n Degree in Computer Science n $34.2 billion (net worth), ranked no. 15 in the Forbes list of the America's Wealthiest People 9

Amazon.it 10

Movie Recommendation – YouTube Recommendations account for about 60% of all video clicks from the home page. 11

Who is this company? p "Italians are emotional, the Swiss are punctual" p This shopping site is making billions by tailoring its services to European stereotypes Zalando : Europe’s largest dedicated online apparel retailer, with several thousand employees facilitating annual sales topping € 2.2 billion. 12 http://qz.com/482553

Consumer Attitudes 13

The Long Tail p Economic model in which the market for non-hits (typically large numbers of low-volume items) could be significant and sometimes even greater than the market for big hits (typically small numbers of high-volume items). 14

Goal p Recommend items that are good for you! n relevant n improve well being n rational choices n optimal 15

Step 1: Preference Elicitation 16

Last.fm – Preference Elicitation

Rating Recommendations 18

Alternative Methods 19

Remembering p D. Kahneman (nobel prize): what we remember about an experience is determined by ( peak-end rule ) n How the experience felt when it was at its peak (best or worst) n How it felt when it ended p We rely on this summary later to remind how the experience felt and decide whether to have that experience again p So how well do we know what we want? n It is doubtful that we prefer an experience to another very similar just because the first ended better. 20

Step 2: Model Building 21

Movie rating data Training data Test data user movie date user movie date score score 1 21 5/7/02 1 1 62 1/6/05 ? 1 213 8/2/04 5 1 96 9/13/04 ? 2 345 3/6/01 4 2 7 8/18/05 ? 2 123 5/1/05 4 2 3 11/22/05 ? 2 768 7/15/02 3 3 47 6/13/02 ? 3 76 1/22/01 5 3 15 8/12/01 ? 4 45 8/3/00 4 4 41 9/1/00 ? 5 568 9/10/05 1 4 28 8/27/05 ? 5 342 3/5/03 2 5 93 4/4/05 ? 5 234 12/28/00 2 5 74 7/16/03 ? 6 76 8/11/02 5 6 69 2/14/04 ? 6 56 6/15/03 4 6 83 10/3/03 ? 22

Matrix of ratings Items Users 23

Item-to-Item Collaborative Filtering target neigh. neigh. Suppose the prediction is made using two nearest-neighbors, and p that the items most similar to “Titanic” are “Forrest Gump” and “Wall-E” Similarity of items: w titanic, forrest = 0.85, w titanic, wall-e = 0.75 p r* eric, titanic = (0.85*5 + 0.75*4)/(0.85 + 0.75) = 4.53 p 24

User-Based Collaborative Filtering p A collection of n users U and a collection of m items I p A n ´ m matrix of ratings r ui , with r ui = ? if user u did not rate item i p Prediction for user u and item j is computed as * = r ∑ r u + K w uv ( r vj − r v ) uj A set of neighbours of v ∈ N j ( u ) u that have rated j p Where, r u is the average rating of user u , K is a normalization factor such that the absolute values of w uv sum to 1, and ∑ Pearson ( r uj − r u )( r vj − r v ) Correlation of j ∈ I uv w uv = ∑ u ) 2 ∑ v ) 2 users u and v ( r uj − r ( r vj − r 25 j ∈ I uv j ∈ I uv

Latent Factor Models serious Braveheart The Color Amadeus Purple Lethal Sense and Weapon Sensibility Ocean ’ s 11 Geared Geared towards towards males females The Lion King Dumb and Dumber The Princess Independence Diaries Day 26 escapist

“ Core ” Recommendation Techniques [Burke, 2002] 27

Content-Based Recommender with Centroid Not interesting Documents Interesting Documents Centroid sports Centroid Doc2 Doc1 User Model politics Doc1 is estimated more interesting than Doc2 28

Recommendations are often wrong p Recommenders tend to recommend items similar to those browsed or purchased in the past 30

Context-Aware Computing p Gartner Top 10 strategic technology trends for IT p "Context-aware computing is a style of computing in which situational and environmental information about people, places and things is used to anticipate immediate needs and proactively offer enriched, situation-aware and usable content, functions and experiences." http://www.gartner.com/it-glossary/context-aware-computing-2 31

Google Now 32 https://www.google.com/landing/now/

Types of Context - Mobile p Physical context n time, position, and activity of the user, [Fling, 2009] weather, light, and temperature ... p Social context n the presence and role of other people around the user p Interaction media context n the device used to access the system and the type of media that are browsed and personalized (text, music, images, movies, …) p Modal context n The state of mind of the user, the user’s goals, mood, experience, and cognitive capabilities. 33

Factors influencing Holiday Decision Internal to the tourist External to the tourist Personal Availability of Motivators Advice of travel products Personality agents Disposable Income Information obtained from tourism organization and Health media Family Word-of-mouth commitments Decision recommendations Past experience Political restrictions: visa, terrorism Works commitments Hobbies and interests Health problems Knowledge of Special promotion and potential holidays offers Lifestyle Attitudes, opinions Climate and perceptions [Swarbrooke & Horner , 2006]

Traditional contextual pre-filtering Only ratings acquired in exactly the same context are q used Ratings predicted Prediction filtering rating model "sunny" ratings in-context ratings target context Hypothesis: pre-filtering can be enhanced by q exploiting semantic similarities between contexts 35

Distributional semantics of context p Assumption : two contexts are similar if their composing conditions influence ratings similarly Condition User1 User2 User3 User4 User5 User6 User7 1 -0.7 0 0.9 0.1 -0.6 0 0.7 -0.8 0.5 0.8 0.4 -0.2 0 -0.5 0.7 0.2 -1 0.9 0.8 0.5 36

Semantic contextual pre-filtering q Key idea: reuse ratings acquired in similar contexts ≠ semantic ≈ similarities Ratings Prediction predicted filtering model rating "similar context" in-context ratings ratings target context 37

Semantic Pre-Filtering vs. state of the art % = MAE (mean absolute error) reduction with respect to a context-free Matrix Factorization model (the higher, the better) 20% 15% 10% 5% 0% Tourism Music Adom Comoda Movie Library UI-Splitting CAMF Semantic Pre-Filtering 38

Recommender Systems Francesco Ricci Free University of - PowerPoint PPT Presentation

Recommender Systems Francesco Ricci Free University of Bozen-Bolzano Italy fricci@unibz.it 1 2 3 Content p The paradox of choice and information overload p Personalization p Recommender systems p Step 1: preference elicitation p Step 2:

Web Mining and Recommender Systems Recommender Systems: Introduction Learning Goals

2. Recommender Systems Recommenders Everywhere Advanced Topics in Information Retrieval /

Affect- and Personality-based Recommender Systems Part II: Acquisition, Usage in Recommender

On the Economics of Recommender Systems Emilio Calvano Center for Studies in Econ and Finance U.

Privacy in Recommender Systems CompSci 590.03 Instructor: Ashwin Machanavajjhala Lecture 21:

CSE 255 Lecture 5 Data Mining and Predictive Analytics Recommender Systems Why

Content- -based Recommender Systems based Recommender Systems Content problems, challenges

CSE 158 Lecture 7 Web Mining and Recommender Systems Recommender Systems Announcements

Web Mining and Recommender Systems Advanced Recommender Systems: Bayesian Personalized Ranking

CSE 158 Lecture 7 Web Mining and Recommender Systems Recommender Systems Announcements

CSE 258 Web Mining and Recommender Systems Advanced Recommender Systems This week

Ruiqi Guo, Philip Sun, Erik Lindgren, Quan Geng, David Simcha, Felix Chern, Sanjiv Kumar Overview

CSE 258 Web Mining and Recommender Systems Advanced Recommender Systems This week

Web Mining and Recommender Systems Advanced Recommender Systems This week Methodological papers

CSE 258 Lecture 7 Web Mining and Recommender Systems Recommender Systems Announcements

Recommender Systems Research Challenges Francesco Ricci Free University of Bozen-Bolzano

CSCW Issues and Theory Ov erview computer systems ha v e group impact A l l not

DISK methods for yield fluids Karol Cascavita Directed by: Alexandre Ern Supervised by: Xavier

Accelerating MUS Extraction with Recursive Model Rotation Anton Belov and Joao Marques-Silva

Lecture 18: Multi-view reconstruction 1 Announcements If you have a question: just ask it

Analyst Meeting Four Seasons Hotel Bangkok Minor International PLC MINTs 1Q07 25 th May 2007

Knowledge Representation Part VII Protg / RDFS / OWL / ++ Jan Pettersen Nytun, UiA 1 S O

Title: Scalable Video Coding based DASH for efficient usage of network resources Presenter: Yago

Announcements CS 188: Artificial Intelligence Spring 2010 P2: Due tonight W3: Expectimax,