Recommender Systems Jia-Bin Huang Virginia Tech Spring 2019 - - PowerPoint PPT Presentation

โ–ถ
recommender systems
SMART_READER_LITE
LIVE PREVIEW

Recommender Systems Jia-Bin Huang Virginia Tech Spring 2019 - - PowerPoint PPT Presentation

Recommender Systems Jia-Bin Huang Virginia Tech Spring 2019 ECE-5424G / CS-5824 Administrative HW 4 due April 10 Unsupervised Learning Clustering, K-Mean Expectation maximization Dimensionality reduction Anomaly detection


slide-1
SLIDE 1

Recommender Systems

Jia-Bin Huang Virginia Tech

Spring 2019

ECE-5424G / CS-5824

slide-2
SLIDE 2

Administrative

  • HW 4 due April 10
slide-3
SLIDE 3

Unsupervised Learning

  • Clustering, K-Mean
  • Expectation maximization
  • Dimensionality reduction
  • Anomaly detection
  • Recommendation system
slide-4
SLIDE 4

Motivating example: Monitoring machines in a data center

๐‘ฆ1 (CPU load) ๐‘ฆ2 (Memory use) ๐‘ฆ1 (CPU load) ๐‘ฆ2 (Memory use)

slide-5
SLIDE 5

Multivariate Gaussian (normal) distribution

  • ๐‘ฆ โˆˆ ๐‘†๐‘œ. Donโ€™t model ๐‘ž ๐‘ฆ1 , ๐‘ž ๐‘ฆ2 , โ‹ฏ separately
  • Model ๐‘ž ๐‘ฆ all in one go.
  • Parameters: ๐œˆ โˆˆ ๐‘†๐‘œ, ฮฃ โˆˆ ๐‘†๐‘œร—๐‘œ (covariance matrix)
  • ๐‘ž ๐‘ฆ; ๐œˆ, ฮฃ =

1 2๐œŒ ๐‘œ/2 ฮฃ 1/2 exp โˆ’ ๐‘ฆ โˆ’ ๐œˆ โŠคฮฃโˆ’1(๐‘ฆ โˆ’ ๐œˆ)

slide-6
SLIDE 6

Multivariate Gaussian (normal) examples

ฮฃ = 1 1 ฮฃ = 0.6 0.6 ฮฃ = 2 2 ๐‘ฆ1 ๐‘ฆ2 ๐‘ฆ1 ๐‘ฆ2 ๐‘ฆ1 ๐‘ฆ2

slide-7
SLIDE 7

Multivariate Gaussian (normal) examples

ฮฃ = 1 1 ฮฃ = 0.6 1 ฮฃ = 2 1 ๐‘ฆ1 ๐‘ฆ2 ๐‘ฆ1 ๐‘ฆ2 ๐‘ฆ1 ๐‘ฆ2

slide-8
SLIDE 8

Multivariate Gaussian (normal) examples

ฮฃ = 1 1 ฮฃ = 1 0.5 0.5 1 ฮฃ = 1 0.8 0.8 1 ๐‘ฆ1 ๐‘ฆ2 ๐‘ฆ1 ๐‘ฆ2 ๐‘ฆ1 ๐‘ฆ2

slide-9
SLIDE 9

Anomaly detection using the multivariate Gaussian distribution

  • 1. Fit model ๐‘ž ๐‘ฆ by setting

๐œˆ = 1 ๐‘› เท

๐‘—=1 ๐‘›

๐‘ฆ(๐‘—) ฮฃ = 1 ๐‘› เท

๐‘—=1 ๐‘›

(๐‘ฆ(๐‘—)โˆ’๐œˆ)(๐‘ฆ(๐‘—) โˆ’ ๐œˆ)โŠค 2 Give a new example ๐‘ฆ, compute ๐‘ž ๐‘ฆ; ๐œˆ, ฮฃ = 1 2๐œŒ ๐‘œ/2 ฮฃ 1/2 exp โˆ’ ๐‘ฆ โˆ’ ๐œˆ โŠคฮฃโˆ’1(๐‘ฆ โˆ’ ๐œˆ) Flag an anomaly if ๐‘ž ๐‘ฆ < ๐œ—

slide-10
SLIDE 10

Original model ๐‘ž ๐‘ฆ1; ๐œˆ1, ๐œ1

2 ๐‘ž ๐‘ฆ2; ๐œˆ2, ๐œ2 2 โ‹ฏ ๐‘ž ๐‘ฆ๐‘œ; ๐œˆ๐‘œ, ๐œ๐‘œ 2

Manually create features to capture anomalies where ๐‘ฆ1, ๐‘ฆ2 take unusual combinations of values Computationally cheaper (alternatively, scales better) OK even if training set size is small

Original model

๐‘ž ๐‘ฆ; ๐œˆ, ฮฃ = 1 2๐œŒ ๐‘œ/2 ฮฃ 1/2 exp(โˆ’ ๐‘ฆ โˆ’ ๐œˆ โŠคฮฃโˆ’1(๐‘ฆ

slide-11
SLIDE 11

Recommender Systems

  • Motivation
  • Problem formulation
  • Content-based recommendations
  • Collaborative filtering
  • Mean normalization
slide-12
SLIDE 12

Recommender Systems

  • Motivation
  • Problem formulation
  • Content-based recommendations
  • Collaborative filtering
  • Mean normalization
slide-13
SLIDE 13

You may also like..?

slide-14
SLIDE 14
slide-15
SLIDE 15

Recommender Systems

  • Motivation
  • Problem formulation
  • Content-based recommendations
  • Collaborative filtering
  • Mean normalization
slide-16
SLIDE 16

Example: Predicting movie ratings

  • User rates movies using zero to five stars

Movie Alice (1) Bob (2) Carol (3) Dave (4) Love at last

5 5

Romance forever

5 ? ?

Cute puppies of love

? 4 ?

Nonstop car chases

5 4

Swords vs. karate

5 ?

  • ๐‘œ๐‘ฃ = no. users
  • ๐‘œ๐‘› = no. movies
  • ๐‘  ๐‘—, ๐‘˜ = 1 if user ๐‘˜ has

rated movie ๐‘—

  • ๐‘ง(๐‘—,๐‘˜) = rating given by

user ๐‘˜ to movie ๐‘—

slide-17
SLIDE 17

Recommender Systems

  • Motivation
  • Problem formulation
  • Content-based recommendations
  • Collaborative filtering
  • Mean normalization
slide-18
SLIDE 18

Content-based recommender systems

Movie Alice (1) Bob (2) Carol (3) Dave (4) ๐‘ฆ1 (romance) ๐‘ฆ2 (action) Love at last

5 5 0.9

Romance forever

5 ? ? 1.0 0.01

Cute puppies

  • f love

? 4 ? 0.99

Nonstop car chases

5 4 0.1 1.0

Swords vs. karate

5 ? 0.9

For each user ๐‘˜, learn a parameter ๐œ„(๐‘˜) โˆˆ ๐‘†3. Predict user ๐‘˜ as rating movie ๐‘— with (๐œ„ ๐‘˜ )โŠค๐‘ฆ(๐‘—) stars.

slide-19
SLIDE 19

Content-based recommender systems

Movie Alice (1) Bob (2) Carol (3) Dave (4) ๐‘ฆ1 (romance) ๐‘ฆ2 (action) Love at last

5 5 0.9

Romance forever

5 ? ? 1.0 0.01

Cute puppies

  • f love

? 4 ? 0.99

Nonstop car chases

5 4 0.1 1.0

Swords vs. karate

5 ? 0.9

For each user ๐‘˜, learn a parameter ๐œ„(๐‘˜) โˆˆ ๐‘†3. Predict user ๐‘˜ as rating movie ๐‘— with (๐œ„ ๐‘˜ )โŠค๐‘ฆ(๐‘—) stars.

๐‘ฆ(3) = 1 0.99 ๐œ„ 1 = 5 (๐œ„ 1 )โŠค๐‘ฆ(3) = 5 โˆ— 0.99 = 4.95

slide-20
SLIDE 20

Problem formulation

  • ๐‘  ๐‘—, ๐‘˜ = 1 if user ๐‘˜ has rated movie ๐‘—
  • ๐‘ง(๐‘—,๐‘˜) = rating given by user ๐‘˜ to movie ๐‘—
  • ๐œ„(๐‘˜) = parameter vector for user ๐‘˜
  • ๐‘ฆ(๐‘—) = feature vector for user ๐‘—
  • For each user ๐‘˜, predicted rating: (๐œ„ ๐‘˜ )โŠค๐‘ฆ(๐‘—)
  • ๐‘›(๐‘˜) = no. of movies rated by user j

Goal: learn ๐œ„(๐‘˜):

min

๐œ„(๐‘˜)

1 2๐‘›(๐‘˜) เท

๐‘—:๐‘  ๐‘—,๐‘˜ =1

(๐œ„ ๐‘˜ )โŠค ๐‘ฆ ๐‘— โˆ’ ๐‘ง ๐‘—,๐‘˜

2 +

๐œ‡ 2๐‘›(๐‘˜) เท

๐‘™=1 ๐‘œ

๐œ„๐‘™

๐‘˜ 2

slide-21
SLIDE 21

Optimization objective

  • Learn ๐œ„ ๐‘˜ (parameter for user ๐‘˜):

min

๐œ„(๐‘˜)

1 2 เท

๐‘—:๐‘  ๐‘—,๐‘˜ =1

(๐œ„ ๐‘˜ )โŠค ๐‘ฆ ๐‘— โˆ’ ๐‘ง ๐‘—,๐‘˜

2 + ๐œ‡

2 เท

๐‘™=1 ๐‘œ

๐œ„๐‘™

๐‘˜ 2

Learn ๐œ„ 1 , ๐œ„ 2 , โ‹ฏ , ๐œ„ ๐‘œ๐‘ฃ : min

๐œ„ 1 ,๐œ„ 2 ,โ‹ฏ,๐œ„ ๐‘œ๐‘ฃ

1 2 เท

๐‘˜=1 ๐‘œ๐‘ฃ

เท

๐‘—:๐‘  ๐‘—,๐‘˜ =1

(๐œ„ ๐‘˜ )โŠค ๐‘ฆ ๐‘— โˆ’ ๐‘ง ๐‘—,๐‘˜

2 + ๐œ‡

2 เท

๐‘˜=1 ๐‘œ๐‘ฃ

เท

๐‘™=1 ๐‘œ

๐œ„๐‘™

๐‘˜ 2

slide-22
SLIDE 22

Optimization algorithm

min

๐œ„(๐‘˜)

1 2 เท

๐‘˜=1 ๐‘œ๐‘ฃ

เท

๐‘—:๐‘  ๐‘—,๐‘˜ =1

(๐œ„ ๐‘˜ )โŠค ๐‘ฆ ๐‘— โˆ’ ๐‘ง ๐‘—,๐‘˜

2 + ๐œ‡

2 เท

๐‘˜=1 ๐‘œ๐‘ฃ

เท

๐‘™=1 ๐‘œ

๐œ„๐‘™

๐‘˜ 2

Gradient descent update:

๐œ„๐‘™

๐‘˜ โ‰” ๐œ„๐‘™ ๐‘˜ โˆ’ ๐›ฝ ฯƒ๐‘—:๐‘  ๐‘—,๐‘˜ =1

๐œ„ ๐‘˜

โŠค ๐‘ฆ ๐‘—

โˆ’ ๐‘ง ๐‘—,๐‘˜ ๐‘ฆ๐‘™

๐‘—

(for ๐‘™ = 0) ๐œ„๐‘™

๐‘˜ โ‰” ๐œ„๐‘™ ๐‘˜ โˆ’ ๐›ฝ ฯƒ๐‘—:๐‘  ๐‘—,๐‘˜ =1( ๐œ„ ๐‘˜ โŠค ๐‘ฆ ๐‘—

โˆ’ ๐‘ง ๐‘—,๐‘˜ ) ๐‘ฆ๐‘™

๐‘— + ๐œ‡ ๐œ„๐‘™ (๐‘˜)

(for ๐‘™ โ‰  0)

slide-23
SLIDE 23

Recommender Systems

  • Motivation
  • Problem formulation
  • Content-based recommendations
  • Collaborative filtering
  • Mean normalization
slide-24
SLIDE 24

Problem motivation

Movie Alice (1) Bob (2) Carol (3) Dave (4) ๐‘ฆ1 (romance) ๐‘ฆ2 (action) Love at last

5 5 0.9

Romance forever

5 ? ? 1.0 0.01

Cute puppies

  • f love

? 4 ? 0.99

Nonstop car chases

5 4 0.1 1.0

Swords vs. karate

5 ? 0.9

slide-25
SLIDE 25

Problem motivation

Movie Alice (1) Bob (2) Carol (3) Dave (4) ๐‘ฆ1 (romance) ๐‘ฆ2 (action) Love at last

5 5 ? ?

Romance forever

5 ? ? ? ?

Cute puppies

  • f love

? 4 ? ? ?

Nonstop car chases

5 4 ? ?

Swords vs. karate

5 ? ? ?

๐œ„ 1 = 5 ๐œ„ 2 = 5 ๐œ„ 3 = 5 ๐œ„ 4 = 5 ๐‘ฆ 1 = ? ? ?

slide-26
SLIDE 26

Optimization algorithm

  • Given ๐œ„ 1 , ๐œ„ 2 , โ‹ฏ , ๐œ„ ๐‘œ๐‘ฃ , to learn ๐‘ฆ(๐‘—):

min

๐‘ฆ(๐‘—)

1 2 เท

๐‘˜:๐‘  ๐‘—,๐‘˜ =1

(๐œ„ ๐‘˜ )โŠค ๐‘ฆ ๐‘— โˆ’ ๐‘ง ๐‘—,๐‘˜

2 + ๐œ‡

2 เท

๐‘™=1 ๐‘œ

๐‘ฆ๐‘™

(๐‘—) 2

  • Given ๐œ„ 1 , ๐œ„ 2 , โ‹ฏ , ๐œ„ ๐‘œ๐‘ฃ , to learn ๐‘ฆ(1), ๐‘ฆ(2), โ‹ฏ , ๐‘ฆ(๐‘œ๐‘›):

min

๐‘ฆ(1),๐‘ฆ(2),โ‹ฏ,๐‘ฆ(๐‘œ๐‘›)

1 2 เท

๐‘—=1 ๐‘œ๐‘›

เท

๐‘˜:๐‘  ๐‘—,๐‘˜ =1

(๐œ„ ๐‘˜ )โŠค ๐‘ฆ ๐‘— โˆ’ ๐‘ง ๐‘—,๐‘˜

2 + ๐œ‡

2 เท

๐‘—=1 ๐‘œ๐‘›

เท

๐‘™=1 ๐‘œ

๐‘ฆ๐‘™

(๐‘—) 2

slide-27
SLIDE 27

Collaborative filtering

  • Given ๐‘ฆ 1 , ๐‘ฆ 2 , โ‹ฏ , ๐‘ฆ ๐‘œ๐‘› (and movie ratings),

Can estimate ๐œ„ 1 , ๐œ„ 2 , โ‹ฏ , ๐œ„ ๐‘œ๐‘ฃ

  • Given ๐œ„ 1 , ๐œ„ 2 , โ‹ฏ , ๐œ„ ๐‘œ๐‘ฃ

Can estimate ๐‘ฆ 1 , ๐‘ฆ 2 , โ‹ฏ , ๐‘ฆ ๐‘œ๐‘›

slide-28
SLIDE 28

Collaborative filtering optimization

  • bjective
  • Given ๐‘ฆ 1 , ๐‘ฆ 2 , โ‹ฏ , ๐‘ฆ ๐‘œ๐‘› , estimate ๐œ„ 1 , ๐œ„ 2 , โ‹ฏ , ๐œ„ ๐‘œ๐‘ฃ

min

๐œ„ 1 ,๐œ„ 2 ,โ‹ฏ,๐œ„ ๐‘œ๐‘ฃ

1 2 เท

๐‘˜=1 ๐‘œ๐‘ฃ

เท

๐‘—:๐‘  ๐‘—,๐‘˜ =1

(๐œ„ ๐‘˜ )โŠค ๐‘ฆ ๐‘— โˆ’ ๐‘ง ๐‘—,๐‘˜

2 + ๐œ‡

2 เท

๐‘˜=1 ๐‘œ๐‘ฃ

เท

๐‘™=1 ๐‘œ

๐œ„๐‘™

๐‘˜ 2

  • Given ๐œ„ 1 , ๐œ„ 2 , โ‹ฏ , ๐œ„ ๐‘œ๐‘ฃ , estimate ๐‘ฆ 1 , ๐‘ฆ 2 , โ‹ฏ , ๐‘ฆ ๐‘œ๐‘›

min

๐‘ฆ(1),๐‘ฆ(2),โ‹ฏ,๐‘ฆ(๐‘œ๐‘›)

1 2 เท

๐‘—=1 ๐‘œ๐‘›

เท

๐‘˜:๐‘  ๐‘—,๐‘˜ =1

(๐œ„ ๐‘˜ )โŠค ๐‘ฆ ๐‘— โˆ’ ๐‘ง ๐‘—,๐‘˜

2 + ๐œ‡

2 เท

๐‘—=1 ๐‘œ๐‘›

เท

๐‘™=1 ๐‘œ

๐‘ฆ๐‘™

(๐‘—) 2

slide-29
SLIDE 29

Collaborative filtering optimization objective

  • Given ๐‘ฆ 1 , ๐‘ฆ 2 , โ‹ฏ , ๐‘ฆ ๐‘œ๐‘› , estimate ๐œ„ 1 , ๐œ„ 2 , โ‹ฏ , ๐œ„ ๐‘œ๐‘ฃ

min

๐œ„ 1 ,๐œ„ 2 ,โ‹ฏ,๐œ„ ๐‘œ๐‘ฃ

1 2 เท

๐‘˜=1 ๐‘œ๐‘ฃ

เท

๐‘—:๐‘  ๐‘—,๐‘˜ =1

(๐œ„ ๐‘˜ )โŠค ๐‘ฆ ๐‘— โˆ’ ๐‘ง ๐‘—,๐‘˜

2 + ๐œ‡

2 เท

๐‘˜=1 ๐‘œ๐‘ฃ

เท

๐‘™=1 ๐‘œ

๐œ„๐‘™

๐‘˜ 2

  • Given ๐œ„ 1 , ๐œ„ 2 , โ‹ฏ , ๐œ„ ๐‘œ๐‘ฃ , estimate ๐‘ฆ 1 , ๐‘ฆ 2 , โ‹ฏ , ๐‘ฆ ๐‘œ๐‘›

min

๐‘ฆ(1),๐‘ฆ(2),โ‹ฏ,๐‘ฆ(๐‘œ๐‘›)

1 2 เท

๐‘—=1 ๐‘œ๐‘›

เท

๐‘˜:๐‘  ๐‘—,๐‘˜ =1

(๐œ„ ๐‘˜ )โŠค ๐‘ฆ ๐‘— โˆ’ ๐‘ง ๐‘—,๐‘˜

2 + ๐œ‡

2 เท

๐‘—=1 ๐‘œ๐‘›

เท

๐‘™=1 ๐‘œ

๐‘ฆ๐‘™

(๐‘—) 2

  • Minimize ๐‘ฆ 1 , ๐‘ฆ 2 , โ‹ฏ , ๐‘ฆ ๐‘œ๐‘› and ๐œ„ 1 , ๐œ„ 2 , โ‹ฏ , ๐œ„ ๐‘œ๐‘ฃ simultaneously

๐พ = 1 2 เท

๐‘˜:๐‘  ๐‘—,๐‘˜ =1

(๐œ„ ๐‘˜ )โŠค ๐‘ฆ ๐‘— โˆ’ ๐‘ง ๐‘—,๐‘˜

2 + ๐œ‡

2 เท

๐‘˜=1 ๐‘œ๐‘ฃ

เท

๐‘™=1 ๐‘œ

๐œ„๐‘™

๐‘˜ 2

+ ๐œ‡ 2 เท

๐‘—=1 ๐‘œ๐‘›

เท

๐‘™=1 ๐‘œ

๐‘ฆ๐‘™

(๐‘—) 2

slide-30
SLIDE 30

Collaborative filtering optimization objective

๐พ(๐‘ฆ 1 , ๐‘ฆ 2 , โ‹ฏ , ๐‘ฆ ๐‘œ๐‘› , ๐œ„ 1 , ๐œ„ 2 , โ‹ฏ , ๐œ„ ๐‘œ๐‘ฃ ) = 1 2 เท

๐‘  ๐‘—,๐‘˜ =1

(๐œ„ ๐‘˜ )โŠค ๐‘ฆ ๐‘— โˆ’ ๐‘ง ๐‘—,๐‘˜

2 + ๐œ‡

2 เท

๐‘˜=1 ๐‘œ๐‘ฃ

เท

๐‘™=1 ๐‘œ

๐œ„๐‘™

๐‘˜ 2

+ ๐œ‡ 2 เท

๐‘—=1 ๐‘œ๐‘›

เท

๐‘™=1 ๐‘œ

๐‘ฆ๐‘™

(๐‘—) 2

slide-31
SLIDE 31

Collaborative filtering algorithm

  • Initialize ๐‘ฆ 1 , ๐‘ฆ 2 , โ‹ฏ , ๐‘ฆ ๐‘œ๐‘› , ๐œ„ 1 , ๐œ„ 2 , โ‹ฏ , ๐œ„ ๐‘œ๐‘ฃ to small random values
  • Minimize ๐พ(๐‘ฆ 1 , ๐‘ฆ 2 , โ‹ฏ , ๐‘ฆ ๐‘œ๐‘› , ๐œ„ 1 , ๐œ„ 2 , โ‹ฏ , ๐œ„ ๐‘œ๐‘ฃ ) using gradient

descent (or an advanced optimization algorithm). For every ๐‘˜ = 1 โ‹ฏ ๐‘œ๐‘ฃ, ๐‘— = 1, โ‹ฏ , ๐‘œ๐‘›: ๐‘ฆ๐‘™

๐‘˜ โ‰” ๐‘ฆ๐‘™ ๐‘˜ โˆ’ ๐›ฝ

เท

๐‘˜:๐‘  ๐‘—,๐‘˜ =1

( ๐œ„ ๐‘˜

โŠค ๐‘ฆ ๐‘—

โˆ’ ๐‘ง ๐‘—,๐‘˜ ) ๐œ„๐‘™

๐‘— + ๐œ‡ ๐‘ฆ๐‘™ (๐‘—)

๐œ„๐‘™

๐‘˜ โ‰” ๐œ„๐‘™ ๐‘˜ โˆ’ ๐›ฝ

เท

๐‘—:๐‘  ๐‘—,๐‘˜ =1

( ๐œ„ ๐‘˜

โŠค ๐‘ฆ ๐‘—

โˆ’ ๐‘ง ๐‘—,๐‘˜ ) ๐‘ฆ๐‘™

๐‘— + ๐œ‡ ๐œ„๐‘™ (๐‘˜)

  • For a user with parameter ๐œ„ and movie with (learned) feature ๐‘ฆ, predict

a star rating of ๐œ„โŠค๐‘ฆ

slide-32
SLIDE 32

Collaborative filtering

Movie Alice (1) Bob (2) Carol (3) Dave (4) Love at last

5 5

Romance forever

5 ? ?

Cute puppies of love

? 4 ?

Nonstop car chases

5 4

Swords vs. karate

5 ?

slide-33
SLIDE 33

Collaborative filtering

  • Predicted ratings:

๐‘Œ = โˆ’ ๐‘ฆ 1

โŠค โˆ’

โˆ’ ๐‘ฆ 2

โŠค โˆ’

โ‹ฎ โˆ’ ๐‘ฆ ๐‘œ๐‘›

โŠค โˆ’

ฮ˜ = โˆ’ ๐œ„ 1

โŠคโˆ’

โˆ’ ๐œ„ 2

โŠคโˆ’

โ‹ฎ โˆ’ ๐œ„ ๐‘œ๐‘ฃ

โŠค โˆ’

Y = Xฮ˜โŠค

Low-rank matrix factorization

slide-34
SLIDE 34

Finding related movies/products

  • For each product ๐‘—, we learn a feature vector ๐‘ฆ(๐‘—) โˆˆ ๐‘†๐‘œ

๐‘ฆ1: romance, ๐‘ฆ2: action, ๐‘ฆ3: comedy, โ€ฆ

  • How to find movie ๐‘˜ relate to movie ๐‘—?

Small ๐‘ฆ(๐‘—) โˆ’ ๐‘ฆ(๐‘˜) movie j and I are โ€œsimilarโ€

slide-35
SLIDE 35

Recommender Systems

  • Motivation
  • Problem formulation
  • Content-based recommendations
  • Collaborative filtering
  • Mean normalization
slide-36
SLIDE 36

Users who have not rated any movies

1 2 เท

๐‘  ๐‘—,๐‘˜ =1

(๐œ„ ๐‘˜ )โŠค ๐‘ฆ ๐‘— โˆ’ ๐‘ง ๐‘—,๐‘˜

2 + ๐œ‡

2 เท

๐‘˜=1 ๐‘œ๐‘ฃ

เท

๐‘™=1 ๐‘œ

๐œ„๐‘™

๐‘˜ 2

+ ๐œ‡ 2 เท

๐‘—=1 ๐‘œ๐‘›

เท

๐‘™=1 ๐‘œ

๐‘ฆ๐‘™

(๐‘—) 2

๐œ„(5) = 0

Movie Alice (1) Bob (2) Carol (3) Dave (4) Eve (5) Love at last

5 5 ?

Romance forever

5 ? ? ?

Cute puppies

  • f love

? 4 ? ?

Nonstop car chases

5 4 ?

Swords vs. karate

5 ? ?

slide-37
SLIDE 37

Users who have not rated any movies

1 2 เท

๐‘  ๐‘—,๐‘˜ =1

(๐œ„ ๐‘˜ )โŠค ๐‘ฆ ๐‘— โˆ’ ๐‘ง ๐‘—,๐‘˜

2 + ๐œ‡

2 เท

๐‘˜=1 ๐‘œ๐‘ฃ

เท

๐‘™=1 ๐‘œ

๐œ„๐‘™

๐‘˜ 2

+ ๐œ‡ 2 เท

๐‘—=1 ๐‘œ๐‘›

เท

๐‘™=1 ๐‘œ

๐‘ฆ๐‘™

(๐‘—) 2

๐œ„(5) = 0

Movie Alice (1) Bob (2) Carol (3) Dave (4) Eve (5) Love at last

5 5

Romance forever

5 ? ?

Cute puppies

  • f love

? 4 ?

Nonstop car chases

5 4

Swords vs. karate

5 ?

slide-38
SLIDE 38

Mean normalization

For user ๐‘˜, on movie ๐‘— predict: ๐œ„ ๐‘˜

โŠค ๐‘ฆ(๐‘—) + ๐œˆ๐‘—

User 5 (Eve): ๐œ„ 5 = 0 ๐œ„ 5

โŠค ๐‘ฆ(๐‘—) + ๐œˆ๐‘—

Learn ๐œ„(๐‘˜), ๐‘ฆ(๐‘—)

slide-39
SLIDE 39

Recommender Systems

  • Motivation
  • Problem formulation
  • Content-based recommendations
  • Collaborative filtering
  • Mean normalization