

SLIDE 1

Lessons from the Netflix Prize: Going beyond the algorithms

Yehuda Koren

Haifa

SLIDE 2

We Know What You Ought To Be Watching This Summer

SLIDE 3

SLIDE 4

“We’re quite curious, really. To the tune of one million dollars.” – Netflix Prize rules

  • Goal: improve on Netflix's existing movie recommendation technology, Cinematch
  • Criterion: reduction in root mean squared error (RMSE)
  • Oct '06: contest began
  • Oct '07: $50K progress prize for 8.43% improvement
  • Oct '08: $50K progress prize for 9.44% improvement
  • Sept '09: $1 million grand prize for 10.06% improvement
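For reference, the criterion has the standard definition (not printed on the slide): with T the set of withheld (user, movie) test pairs, r_ui the true rating and r̂_ui the prediction,

```latex
\mathrm{RMSE} \;=\; \sqrt{\frac{1}{|\mathcal{T}|} \sum_{(u,i)\in\mathcal{T}} \bigl(\hat{r}_{ui} - r_{ui}\bigr)^{2}}
```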
SLIDE 5

Movie rating data

Training data (user, movie, score):

    user  movie  score
       1     21      1
       1    213      5
       2    345      4
       2    123      4
       2    768      3
       3     76      5
       4     45      4
       5    568      1
       5    342      2
       5    234      2
       6     76      5
       6     56      4

Test data (same format, scores withheld):

    user  movie  score
       1     62      ?
       1     96      ?
       2      7      ?
       2      3      ?
       3     47      ?
       3     15      ?
       4     41      ?
       4     28      ?
       5     93      ?
       5     74      ?
       6     69      ?
       6     83      ?

  • Training data
    – 100 million ratings
    – 480,000 users
    – 17,770 movies
    – 6 years of data: 2000-2005
  • Test data
    – Last few ratings of each user (2.8 million)
  • Dates of ratings are given
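As an aside, here is a minimal sketch (not from the slides) of holding a training set of this shape in memory; it assumes the ratings have been pre-flattened into 0-indexed (user, movie, rating) integer arrays:

```python
# Pack ~100M (user, movie, rating) triples into a sparse matrix.
# 100M ratings over 480K users x 17,770 movies is only ~1.2% density,
# which is why sparse storage is the natural representation.
import numpy as np
from scipy.sparse import csr_matrix

NUM_USERS, NUM_MOVIES = 480_000, 17_770

def to_sparse(users, movies, ratings):
    """users, movies: int arrays; ratings: 1-5 scores for each pair."""
    return csr_matrix(
        (ratings.astype(np.float32), (users, movies)),
        shape=(NUM_USERS, NUM_MOVIES),
    )
```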

SLIDE 6

Data >> Models

  • Very limited feature set
    – User, movie, date
    – Places focus on models/algorithms
  • Major steps forward were associated with incorporating new data features
    – Temporal effects
    – Selection bias:
      • What movies a user rated
      • Daily rating counts
SLIDE 7

Multiple sources of temporal dynamics

  • Item-side effects:
    – Product perception and popularity are constantly changing
    – Seasonal patterns influence items' popularity
  • User-side effects:
    – Customers continually redefine their tastes
    – Transient, short-term bias; anchoring
    – Drifting rating scale
    – Change of rater within a household

SLIDE 8

Something Happened in Early 2004…

[Plot: ratings over time, annotated at early 2004]

SLIDE 9

Are movies getting better with time?

SLIDE 10

Temporal dynamics - challenges

  • Multiple effects: both items and users are changing over time ⇒ scarce data per target
  • Inter-related targets: signal needs to be shared among users, the foundation of collaborative filtering ⇒ cannot isolate into multiple independent problems
  ⇒ Common "concept drift" methodologies won't hold. E.g., underweighting older instances is unappealing
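Rather than discarding old data, the published work from this effort lets the model parameters themselves drift. As one concrete example (from Koren's paper "Collaborative Filtering with Temporal Dynamics", KDD '09, not from this slide), the baseline biases become functions of time:

```latex
b_{ui}(t) \;=\; \mu + b_u + \alpha_u\,\mathrm{dev}_u(t) + b_i + b_{i,\mathrm{Bin}(t)},
\qquad
\mathrm{dev}_u(t) \;=\; \operatorname{sign}(t - t_u)\,\lvert t - t_u\rvert^{\beta}
```

Here t_u is the mean date of u's ratings, Bin(t) is a coarse time bin capturing item drift, and α_u, β are learned or tuned; every user still contributes to the shared, time-binned item parameters, so the signal stays pooled across users.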

SLIDE 11

Effect of daily rating counts

  • The number of ratings a user gave on the same day is an important indicator
  • It affects different movies differently

Credit to: Martin Piotte and Martin Chabbert
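A minimal sketch (mine, not from the slides) of computing that indicator; the DataFrame column names are assumptions:

```python
# For each rating, count how many ratings its user gave on that same day.
import pandas as pd

def add_daily_counts(ratings: pd.DataFrame) -> pd.DataFrame:
    """ratings has columns ['user', 'movie', 'date', 'score'], date = day."""
    ratings = ratings.copy()
    ratings["daily_count"] = (
        ratings.groupby(["user", "date"])["movie"].transform("count")
    )
    # Models typically feed in log(daily_count) or a bucketed version,
    # since the effect (next slide) is roughly logarithmic in the count.
    return ratings
```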

SLIDE 12

Memento vs Patch Adams

[Two bar charts of mean rating as a function of how many ratings the user gave that same day, in log-spaced buckets from 1 up to 257+. Left: Memento (127,318 samples); right: Patch Adams (121,769 samples). The two titles show visibly different trends.]

Credit to: Martin Piotte and Martin Chabbert

SLIDE 13

Why daily rating counts

  • The number of ratings a user gave on a date is a proxy for how long ago the movie was seen
    – Some movies age better than others
  • Also, two distinct rating tasks:
    – Seed Netflix recommendations
    – Rate movies as you see them
  • Related to selection bias?
SLIDE 14

Components of a rating predictor

In standard notation, with μ the overall mean rating, the prediction decomposes into a baseline part and an interaction part:

    r̂_ui = μ + b_u (user bias) + b_i (movie bias) + (user-movie interaction)

User-movie interaction
  • Characterizes the matching between users and movies
  • Attracts most research in the field
  • Benefits from algorithmic and mathematical innovations

Baseline predictor
  • Separates users and movies
  • Often overlooked
  • Benefits from insights into users' behavior
  • Among the main practical contributions of the competition

Biases matter!

SLIDE 15

A baseline predictor

  • We have expectations about the rating user u will give movie i, even without estimating u's attitude towards movies like i:
    – Rating scale of user u
    – Values of other ratings the user gave recently (day-specific mood, anchoring, multi-user accounts)
    – (Recent) popularity of movie i
    – Selection bias; related to the number of ratings the user gave on the same day
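A minimal sketch of such a bias-only baseline, r̂_ui = μ + b_u + b_i, using the regularized ("shrunk") per-movie and per-user averages common in the Netflix Prize literature; the lambda values here are hypothetical and would be tuned on a validation set:

```python
import numpy as np

def fit_baseline(users, movies, scores, n_users, n_movies,
                 lambda1=25.0, lambda2=10.0):
    """users, movies: int arrays; scores: float array of ratings."""
    mu = scores.mean()

    # Movie bias: shrunk mean of (r - mu) per movie.
    b_i = np.zeros(n_movies)
    cnt_i = np.zeros(n_movies)
    np.add.at(b_i, movies, scores - mu)
    np.add.at(cnt_i, movies, 1.0)
    b_i /= (lambda1 + cnt_i)  # shrinks rarely-rated movies towards 0

    # User bias: shrunk mean of the residual (r - mu - b_i) per user.
    b_u = np.zeros(n_users)
    cnt_u = np.zeros(n_users)
    np.add.at(b_u, users, scores - mu - b_i[movies])
    np.add.at(cnt_u, users, 1.0)
    b_u /= (lambda2 + cnt_u)

    return mu, b_u, b_i

def predict(mu, b_u, b_i, users, movies):
    return mu + b_u[users] + b_i[movies]
```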

SLIDE 16

Sources of Variance in Netflix data

    Total variance:    1.276  (100%)
      Unexplained:     0.732  ( 57%)
      Biases:          0.415  ( 33%)
      Personalization: 0.129  ( 10%)

    1.276 = 0.732 + 0.415 + 0.129

SLIDE 17

What drives user preferences?

  • Do they like certain genres, actors, directors, keywords, etc.?
  • Well, some do, but this is far from a complete characterization!
  • E.g., a recent paper is titled: "Recommending new movies: even a few ratings are more valuable than metadata" [Pilaszy and Tikk, '09]
  • User motives are latent, barely interpretable in human language
  • They can be captured when data is abundant
SLIDE 18

Wishful perception

[A 2-D map of movies along two intuitive axes: "geared towards females" vs. "geared towards males", and "serious" vs. "escapist". Plotted titles: The Princess Diaries, The Lion King, Braveheart, Lethal Weapon, Independence Day, Amadeus, The Color Purple, Dumb and Dumber, Ocean's 11, Sense and Sensibility.]

SLIDE 19

Complex reality…

SLIDE 20

Ratings are not given at random!

[Bar charts comparing the distribution of ratings in Yahoo! survey answers, Yahoo! music ratings, and Netflix ratings]

Marlin, Zemel, Roweis, Slaney, "Collaborative Filtering and the Missing at Random Assumption", UAI 2007

SLIDE 21

Which movies do users rate?

  • A powerful source of information: characterize users by which movies they rated, rather than how they rated them
  • ⇒ A dense binary representation of the data: alongside the sparse rating matrix R = [r_ui] (users × movies), form the binary matrix B = [b_ui], with b_ui = 1 if user u rated movie i and 0 otherwise; unlike R, every entry of B is known

[Illustration: the sparse matrix of 1-5 ratings next to its fully-filled 0/1 counterpart]
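A minimal sketch of that transformation, assuming the ratings already live in a SciPy CSR matrix as in the earlier loading sketch:

```python
from scipy.sparse import csr_matrix

def binarize(R: csr_matrix) -> csr_matrix:
    """Turn a sparse rating matrix into its rated/not-rated pattern."""
    B = R.copy()
    B.data[:] = 1.0  # every stored rating becomes a 1
    # B is still stored sparsely, but conceptually every entry is known:
    # the implicit zeros are real "did not rate" observations, which is
    # what makes this representation "dense" in information.
    return B
```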

SLIDE 22

Ensembles are Valuable for Prediction

  • Our final solution was a linear blend of over 700 prediction sets
    – Some of the 700 were themselves blends
  • Difficult, or impossible, to build a grand unified model
  • Blending techniques: linear regression, neural networks, gradient boosted decision trees, and more…
  • Mega-blends are not needed in practice
    – A handful of simple models achieves 90% of the improvement of the full blend
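A minimal sketch of the linear-blend idea (mine, not the team's actual code): fit ridge-regularized weights over the prediction sets on a held-out set; the function names and ridge parameter are illustrative:

```python
import numpy as np

def fit_linear_blend(preds, targets, ridge=1e-3):
    """preds: (n_samples, n_models) array, one column per prediction set.

    Solves ridge-regularized least squares for blend weights on a
    held-out ("probe") set; neural nets or boosted trees can replace
    this linear combiner.
    """
    X = np.column_stack([preds, np.ones(len(preds))])  # add intercept
    A = X.T @ X + ridge * np.eye(X.shape[1])
    return np.linalg.solve(A, X.T @ targets)

def blend(preds, w):
    X = np.column_stack([preds, np.ones(len(preds))])
    return X @ w
```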

SLIDE 23

Yehuda Koren
Yahoo! Research
yehuda@yahoo-inc.com