SLIDE 1
Content-based recommendation systems (based on chapter 9 of Mining of Massive Datasets, by Rajaraman, Leskovec, and Ullman)
Fernando Lobo
Data mining
Content-based Recommendation Systems: focus on properties of
◮ set of actors
◮ director
◮ year the movie was made
◮ genre
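The movie features listed above can be gathered into a single item profile. The following is only a minimal sketch: the `MovieProfile` class, the `to_feature_set` helper, the `actor:`/`director:` prefixes, and the placeholder names are all hypothetical and are not taken from the slides.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MovieProfile:
    """Hypothetical item profile holding the four features from the list."""
    actors: frozenset
    director: str
    year: int
    genre: str


def to_feature_set(profile):
    """Flatten a profile into a set of prefixed string features.

    The prefixes keep, for example, an actor and a director with the
    same name apart. This encoding is an assumption for illustration.
    """
    features = {f"actor:{name}" for name in profile.actors}
    features.add(f"director:{profile.director}")
    features.add(f"year:{profile.year}")
    features.add(f"genre:{profile.genre}")
    return features


# Placeholder values, not real data.
movie = MovieProfile(
    actors=frozenset({"Actor A", "Actor B"}),
    director="Director X",
    year=1999,
    genre="drama",
)
features = to_feature_set(movie)
```

Representing a profile as a set of features is one option among several; it makes set-based comparisons between items straightforward.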
◮ document collections
◮ images
◮ TF_wk = 1/20.
◮ TF.IDF for word w in document k is 1/20 × 10 = 1/2.
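The arithmetic above can be checked with a short sketch. It assumes the usual definitions from the book: TF is the count of the word in the document divided by the count of the most frequent word in that document, and IDF is log2(N / n), where N is the number of documents and n is the number containing the word. The concrete figures (one occurrence of w, a maximum count of 20, 2^20 documents, 2^10 of them containing w) are assumptions chosen so that TF = 1/20 and IDF = 10, matching the numbers on the slide. The function names are illustrative.

```python
import math


def tf(term_count, max_count_in_doc):
    """Term frequency, normalised by the most frequent term in the document."""
    return term_count / max_count_in_doc


def idf(n_docs, n_docs_with_term):
    """Inverse document frequency, using a base-2 logarithm."""
    return math.log2(n_docs / n_docs_with_term)


def tf_idf(term_count, max_count_in_doc, n_docs, n_docs_with_term):
    """TF.IDF score of a term in one document."""
    return tf(term_count, max_count_in_doc) * idf(n_docs, n_docs_with_term)


# Assumed figures: w occurs once in document k, the most frequent word
# in k occurs 20 times, and w appears in 2**10 of 2**20 documents.
tf_wk = tf(1, 20)
idf_w = idf(2**20, 2**10)
score = tf_idf(1, 20, 2**20, 2**10)
```

With these inputs `tf_wk` is 1/20 and `idf_w` is 10, so `score` comes out at about 1/2, in line with the slide.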
◮ Jaccard distance between sets of words
◮ cosine distance between sets, treated as vectors
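Both distance measures can be sketched for sets of words as follows. This is a hedged illustration: the function names and example word sets are made up, and the cosine distance is taken here to be the angle (in radians) between the sets viewed as 0/1 vectors. Some texts use 1 minus the cosine instead, so treat the choice as an assumption.

```python
import math


def jaccard_distance(a, b):
    """1 minus the ratio of intersection size to union size."""
    if not a and not b:
        return 0.0  # convention chosen here: two empty sets are identical
    return 1.0 - len(a & b) / len(a | b)


def cosine_distance(a, b):
    """Angle in radians between two non-empty sets seen as 0/1 vectors.

    For 0/1 vectors the dot product is the intersection size and each
    vector's length is the square root of its set size.
    """
    if not a or not b:
        raise ValueError("cosine distance is undefined for an empty set")
    cosine = len(a & b) / math.sqrt(len(a) * len(b))
    # Clamp to guard against rounding slightly above 1.0.
    return math.acos(min(1.0, cosine))


# Illustrative word sets, not taken from the slides.
doc1 = {"recommendation", "system", "movie"}
doc2 = {"movie", "system", "rating", "user"}

jd = jaccard_distance(doc1, doc2)  # intersection has 2 words, union has 5
cd = cosine_distance(doc1, doc2)
```

Here the two sets share 2 of 5 distinct words, so the Jaccard distance is 1 - 2/5; the cosine distance is the angle whose cosine is 2 / sqrt(3 × 4).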