using ratings posters for anime manga recommendations
play

Using Ratings & Posters for Anime & Manga Recommendations - PowerPoint PPT Presentation

Intro Recommender Systems Our method Experiments Using Ratings & Posters for Anime & Manga Recommendations Jill-Jnn Vie 13 Florian Yger 2 Ryan Lahfa 3 Basile Clement 3 Hisashi Kashima1 4 Kvin Cocchi3 Thomas Chalumeau3 1 RIKEN


  1. Intro Recommender Systems Our method Experiments Using Ratings & Posters for Anime & Manga Recommendations Jill-Jênn Vie 13 Florian Yger 2 Ryan Lahfa 3 Basile Clement 3 Hisashi Kashima1 4 Kévin Cocchi3 Thomas Chalumeau3 1 RIKEN Center for Advanced Intelligence Project (Tokyo) 2 Université Paris-Dauphine (France) 3 Mangaki (Paris, France) 4 Kyoto University

  2. Intro Recommender Systems Our method Experiments Mangaki.fr User can rate anime or manga (works) And receive recommendations And reorder their watchlist Code is 100% on GitHub Awards from Microsoft and Japan Foundation Organized a data challenge with Kyoto University

  3. Intro Recommender Systems Our method Experiments RIKEN Center for Advanced Intelligence Project New AI lab near Tokyo Station (opened in 2016) 8 accepted papers at NIPS 2017

  4. Intro Recommender Systems Our method Experiments Authors Florian Yger Ryan Lahfa Jill-Jênn Vie Hisashi Kashima Florian Yger was visiting RIKEN AIP Kévin Cocchi & Thomas Chalumeau were interns at Mangaki

  5. Intro Recommender Systems Our method Experiments Outline 1. Usual algorithms for recommender systems Content-based Collaborative filtering 2. Our method Extracting tags from posters Blending models 3. Experiments Dataset: Mangaki Results

  6. Intro Recommender Systems Our method Experiments Recommender Systems Problem Every user rates few items (1 %) How to infer missing ratings? Example Sacha ? 5 2 ? Ondine 4 1 ? 5 Pierre 3 3 1 4 Joëlle 5 ? 2 ?

  7. Intro Recommender Systems Our method Experiments Recommender Systems Problem Every user rates few items (1 %) How to infer missing ratings? Example Sacha 3 5 2 2 Ondine 4 1 4 5 Pierre 3 3 1 4 Joëlle 5 2 2 5

  8. Intro Recommender Systems Our method Experiments Usual techniques Content-based (work features: directors, genre, etc.) Linear regression Sparse linear regression (LASSO) Collaborative filtering (solely based on ratings) K -nearest neighbors Matrix factorization: Singular value decomposition Alternating least squares Stochastic gradient descent Hybrid recommender systems (combine those two) The proposed method

  9. Pearl Harbor Donald ? Shinzō ? Angela Justin Angela Justin Emmanuel An Inconvenient Truth Pearl Harbor Paprika Ratings Emmanuel Shinzō An Inconvenient Truth Donald Intro Recommender Systems Our method Experiments Example: K -Nearest Neighbors 3 1 3 2 2 − 3 2 − 4 − 1 4 4 − 1 − 3

  10. Donald Justin Similarity Justin Donald Angela Emmanuel ? Angela Shinzo Angela Emmanuel Justin An Inconvenient Truth Pearl Harbor Paprika Ratings Donald Emmanuel Shinzō Shinzō Intro Recommender Systems Our method Experiments Example: K -Nearest Neighbors 3 1 3 2 2 − 3 2 − 4 3 , 5 − 1 4 4 − 1 − 3 1 0 , 649 − 0 , 809 0 , 612 0 , 090 0 , 649 1 − 0 , 263 0 , 514 − 0 , 555 − 0 , 809 − 0 , 263 1 − 0 , 811 − 0 , 073 0 , 612 0 , 514 − 0 , 811 1 − 0 , 523 0 , 090 − 0 , 555 − 0 , 073 − 0 , 523 1

  11. Intro Recommender Systems Our method Experiments Matrix factorization → reduce dimension to generalize   R 1 P R 2     R = = = . C  .  .     R n � C : 2k users × 20 profiles R : 2k users × 15k works ⇐ ⇒ P : 20 profiles × 15k works R Bob is a linear combination of profiles P 1 , P 2 , etc.. Interpreting Key Profiles If P P 1 : adventure P 2 : romance P 3 : plot twist And C u 0 , 2 − 0 , 5 0 , 6 ⇒ u likes a bit adventure, hates romance, loves plot twists.

  12. Intro Recommender Systems Our method Experiments Weighted Alternating Least Squares (Zhou, 2008) R ratings, U user features, V work features. R = UV T r ALS � U i · V j . ⇒ r ij ≃ ˆ ij Objective function to minimize i , j known ( r ij − U i · V j ) 2 + λ �� i N i || U i || 2 + � j M j || V j || 2 � U , V �→ � where: N i : number of ratings by user i M j : number of ratings for item j Algorithm Until convergence (~ 10 iterations): Fix U find V (just linear regression → least squares) Fix V find U

  13. Intro Recommender Systems Our method Experiments Visualizing first two components of anime V j Closer points mean similar taste

  14. Intro Recommender Systems Our method Experiments Find your taste by plotting first two columns of U i You will like anime that are in your direction

  15. Intro Recommender Systems Our method Experiments Drawback with collaborative filtering Issue: Item Cold-Start If no ratings are available for a work j ⇒ Its features V j cannot be trained :-( No way to distinguish between unrated works. But we have posters! On Mangaki, almost all works have a poster How to extract information?

  16. Intro Recommender Systems Our method Experiments Illustration2Vec (Saito and Matsui, 2015) CNN (VGG-16) pretrained on ImageNet, trained on Danbooru (1.5M illustrations with tags) 502 most frequent tags kept, outputs tag weights

  17. Intro Recommender Systems Our method Experiments LASSO for sparse linear regression T matrix of 15000 works × 502 tags ( t jk : tag k appears in item j ) Each user is described by its preferences P i → a sparse row of weights over tags. Estimate user preferences P i such that r LASSO � P i T T r ij ≃ ˆ j . ij Least Absolute Shrinkage and Selection Operator (LASSO) 1 2 �R i − P i T T � P i �→ 2 + α � P i � 1 . 2 N i where N i is the number of items rated by user i . Interpretation and explanation of user preferences You seem to like magical girls but not blonde hair ⇒ Look! All of them are brown hair! Buy now!

  18. Intro Recommender Systems Our method Experiments Combine models Which model should be choose between ALS and LASSO? Answer Both! Methods boosting, bagging, model stacking, blending. r ij � α j ˆ r ALS r LASSO Idea find α j s.t. ˆ + (1 − α j )ˆ . ij ij

  19. 1 Intro Recommender Systems Our method Experiments Examples of α j α j α j : R j �→ 1 Number of ratings R j Mimics ALS r ALS r LASSO r ij � 1ˆ ˆ + 0ˆ . ij ij

  20. 1 Intro Recommender Systems Our method Experiments Examples of α j α j α j : R j �→ 0 Number of ratings R j Mimics LASSO r ALS r LASSO r ij � 0ˆ ˆ + 1ˆ . ij ij We call this gate the Steins;Gate.

  21. 1 Intro Recommender Systems Our method Experiments Examples of α j α j α j : R j �→ 1 ⇔ R j ≥ γ γ Number of ratings R j � r ALS ˆ if item j was rated at least γ times r BALSE ij ˆ = ij r LASSO ˆ otherwise ij But we can’t: Not differentiable!

  22. 1/2 1 Intro Recommender Systems Our method Experiments Examples of α j α j α j : R j �→ 1 / (1 + exp( − β ( R j − γ ))) γ Number of ratings R j r BALSE r ALS r LASSO ˆ = σ ( β ( R j − γ ))ˆ + (1 − σ ( β ( R j − γ )))ˆ ij ij ij β and γ are learned by stochastic gradient descent. We call this gate the Steins;Gate.

  23. Illustration2Vec posters tags LASSO ALS ratings Intro Recommender Systems Our method Experiments Blended Alternate Least Squares with Explanation β , γ We call this model BALSE.

  24. Intro Recommender Systems Our method Experiments Dataset: Mangaki 2300 users 15000 works anime / manga / OST 340000 ratings fav / like / dislike / neutral / willsee / wontsee

  25. Intro Recommender Systems Our method Experiments Evaluation: Root Mean Squared Error (RMSE) If we predict ˆ r ij for each user-work pair ( i , j ) to test among n , while truth is r ij : � � 1 � � r ij − r ij ) 2 . RMSE (ˆ r , r ) = (ˆ � n i , j

  26. Intro Recommender Systems Our method Experiments Cross-validation 80% of the ratings are used for training 20% of the ratings are kept for testing Differents sets of items: Whole test set of works 1000 works least rated (1.5%) Cold-start: works not seen in the training set (only the posters)

  27. 1.316 1.299 1.247 1.150 BALSE 1.358 1.347 1.446 LASSO 1.493 1.157 ALS Cold-start items 1000 least rated (1.5%) Test set RMSE Intro Recommender Systems Our method Experiments Results

  28. Intro Recommender Systems Our method Experiments Summing up We presented BALSE, a model that: uses information in the ratings (collaborative filtering) uses information in the posters using CNNs (content-based) combine them in a nonlinear way to improve the recommendations, and explain them. Further work Use latent features (not only tags) of the posters (IJCAI 2016) End-to-end training (not separately)

  29. Intro Recommender Systems Our method Experiments Thank you! Try it: https://mangaki.fr Twitter: @MangakiFR Read the article Using Posters to Recommend Anime and Mangas in a Cold-Start Scenario github.com/mangaki/balse (PDF on arXiv, front page of HNews) Results of Mangaki Data Challenge: research.mangaki.fr 1. Ronnie Wang (Microsoft Suzhou, China) 2. Kento Nozawa (Tsukuba University, Japan) 3. Jo Takano (Kobe University, Japan)

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend