Shefali Garg Fangyan Sun Music dataset is too big while life is - PowerPoint PPT Presentation

Shefali Garg Fangyan Sun

Music dataset is too big while life is short!!!! You need someone to teach you how to manage and give you wise suggestions according to your taste! Music service providers need a more efficient system to attraction their clients!

Music User‘s listening Prediction of history songs that user Recommender &music will listen to information Syste tem Our system: off-line system

 Features:  Too big dataset:  Large-scale: 1 000 000 users  Difficult to implement the whole 15000 000 songs dataset, so need to create a  Open small dataset by ourselves  Implicit feedback  Content:  Format of the Dataset:  Triplets (user, song, count)  Hdf5 files  Meta-data, content-analysis  Need to be opened by a Python  No users’ demographic information, Wrapper timestamp Features & Content Difficulties linked to the Data

1.Popularity 2.Same artist Content based based model greatest hits Model Latent factor Nearest ... Model Neighborhood 3.Collaborative 4.Content- SVD filtering based Model

 Idea  Pros:  Idea is simple Sort songs by popularity in a 1.  Easy to implement decreasing order  Served as basel eline For each user, recommend the 2. songs in order of popularity,  Cons: except those already in the user’s profile  Not personalized (users and songs’ information is not taken into account)  Some songs will never be listend Idea & Steps Pros & Cons

 Pros:  Idea  Idea is simple Sort songs by popularity in a 1.  Easy to implement decreasing order  Minimum personalized For each user, the ranking of 2. songs is re-ordered to place  Cons: songs by artists recommend the songs in the  Only single-meta-data is used 3.  Maximally conservative: doesn’t new order, except those already explore the space byond songs with in the user’s profile which the user is likely already familiar Idea & Steps Pros & Cons

Idea: songs that are often listened by the same user Idea: users who listen to the same songs in the tend to be similar and are more likely to be listened past tend to have similar interests and will probably together in future by some other user. listen to the same songs in future. Item-based User-based

 Similarity != Based on item’s description 1. and user’s preference profile recommendation (no Not based on choices of notion of personalization) 2. other users with similar interests We make recommendations 3.  Majority of songs have too by looking for music whose few listeners, so difficult to features are very similar to the tastes of the user “collaborate” And Why? What’s content -based model?

 1. Create a space of songs according to songs features. We find out neighborhood of each song.  2. We look at each user’s profile and suggest songs which are neighbors to the songs that he listens to

 Idea: SVD  Personalized  Listening histories are influenced  Meta-data is fully used, all the by a set of factors specific to the information is well explored domain (e.g. Genre, artist).  It works well in many tested cases These factors are in general not  obvious and we need to infer those so called latent factors from the data. Users and songs are characterized  by latent factors. Idea

 Matrix M, a user-song play count matrix 1 0 1 1 0 0 ... 1 1 0 0 0 0 0 1 ...

 Off-line evaluation  Truncated mAP (mean Average Precision)

1 0 1 1 0 0 ... Haven’t listend to a song != 1. dislike it. The « 0 » gives a lot 1 1 0 0 0 0 confusion and little confidence. 0 1 2. We use weighted matrix ... factorization 3. Each entry is weighted by a confidence function so as to put more confidence on non- zero entries

 First latent factors capture properties of the most popular items, while the additional latent factors represent more refined features related to unpopular items.  Number of latent factors influences the quality of long-tail items differently than head items.

[1] McFee, B., BertinMahieux,T., Ellis, D. P., Lanckriet, G. R. (2012, April). The million  song dataset challenge . In Proceedings of the 21st international conference companion on World Wide Web (pp. 909916).ACM. [2] Aiolli, F. (2012). A preliminary y study y on a recommender system for the million  songs dataset challenge . PREFERENCE LEARNING: PROBLEMS AND APPLICATIONS IN AI [3] Koren, Yehuda. "Recommender system utilizing collaborative filtering combining  explicit and implicit feedback with both neighborhood and latent factor models." U.S. Patent No. 8,037,080. 11 Oct. 2011. [4] Cremonesi, Paolo, Yehuda Koren, and Roberto Turrin. "Performance of recommender  algorithms on top-n recommendation tasks." Proceedings of the fourth ACM conference on Recommender systems . ACM, 2010

Any questions or suggestions?

Shefali Garg Fangyan Sun Music dataset is too big while life is - PowerPoint PPT Presentation

Shefali Garg Fangyan Sun Music dataset is too big while life is short!!!! You need someone to teach you how to manage and give you wise suggestions according to your taste! Music service providers need a more efficient system to attraction

Final Presentation May 6, 2011 Brandon Borkholder Mark Dickerson Shefali Garg Aren Knutsen

Project Proposal February 17, 2011 Brandon Borkholder Mark Dickerson Shefali Garg Aren Knutsen

CORRELATING SPEECH PROCESSING IN DEEP LEARNING AND COMPUT ATIONAL NEUROSCIENCE Shefali Garg

Supervising Unsupervised Learning Vikas K. Garg & Adam Kalai Vikas K. Garg & Adam Kalai

Fault-Tolerant Services in Distributed Systems Usin Vijay K. Garg email: garg@ece.utexas.edu

Sun and Grid John Barr Grid Business Development 07808 328351 john.barr@sun.com Sun and Grid

1 Code from Sun Bev Crair 2 beverly.crair@eng.sun.com Sun Microsystems, Inc. Overview

MEDICAL MANAGEMENT OF GLAUCOMA DR. RAVI THOMAS, DR. RAJUL PARIKH, DR. SHEFALI PARIKH IJO MAY

Introducing Sun Global Presentation 2013 Strictly Confidential. Sun Global Investments is

Studying the Sun with satellites and eclipses THE SUN The Sun is a variable star. That

Earth-Sun Relationships Energy received from the Sun drives weather and climate, so it is

Fraser Health South Asian Health Institute To Be A Global Leader Arun K. Garg, PhD, MD, FRCPC

Adaptive Garbled RAM from Adaptive Garbled RAM from Laconic Oblivious Transfer Sanjam Garg

Two-Round Secure Multiparty Computation Minimizing Public Key Operations Sanjam Garg

Memory Consistency Exceptions 18742-Computer Architecture and Systems Nithin, Deepali Garg

AutoSynch An Automatic-Signal Monitor Based on Predicate Tagging Vijay K. Garg Wei-Lun Hung

Temporal Learning and Sequence Modeling for a Job Recommender System Kuan Liu, Xing Shi, Anoop

Privacy in Recommender Systems CompSci 590.03 Instructor: Ashwin Machanavajjhala Lecture 21:

G enerative A dversarial User Model for R einforcement L earning Based R ecommendation S ystem

Recommender Systems Collabora2ve Filtering and Matrix Factoriza2on

Intelligently Recommending Key Bindings on Physical Keyboards with Demonstrations in Emacs Shudan

and Improve Recommender System Performance WWW 2016, Montral Tobias Schnabel , Paul N.

Designing a Conversational Travel Recommender System Based on Data-Driven Destination

SLIM : Sparse Linear Methods for Top-N Recommender Systems Xia Ning and George Karypis Computer

Shefali Garg Fangyan Sun Music dataset is too big while life is - PowerPoint PPT Presentation

Shefali Garg Fangyan Sun Music dataset is too big while life is short!!!! You need someone to teach you how to manage and give you wise suggestions according to your taste! Music service providers need a more efficient system to attraction

Final Presentation May 6, 2011 Brandon Borkholder Mark Dickerson Shefali Garg Aren Knutsen

Project Proposal February 17, 2011 Brandon Borkholder Mark Dickerson Shefali Garg Aren Knutsen

CORRELATING SPEECH PROCESSING IN DEEP LEARNING AND COMPUT ATIONAL NEUROSCIENCE Shefali Garg

Supervising Unsupervised Learning Vikas K. Garg &amp; Adam Kalai Vikas K. Garg &amp; Adam Kalai

Fault-Tolerant Services in Distributed Systems Usin Vijay K. Garg email: garg@ece.utexas.edu

Sun and Grid John Barr Grid Business Development 07808 328351 john.barr@sun.com Sun and Grid

1 Code from Sun Bev Crair 2 beverly.crair@eng.sun.com Sun Microsystems, Inc. Overview

MEDICAL MANAGEMENT OF GLAUCOMA DR. RAVI THOMAS, DR. RAJUL PARIKH, DR. SHEFALI PARIKH IJO MAY

Introducing Sun Global Presentation 2013 Strictly Confidential. Sun Global Investments is

Studying the Sun with satellites and eclipses THE SUN The Sun is a variable star. That

Earth-Sun Relationships Energy received from the Sun drives weather and climate, so it is

Fraser Health South Asian Health Institute To Be A Global Leader Arun K. Garg, PhD, MD, FRCPC

Adaptive Garbled RAM from Adaptive Garbled RAM from Laconic Oblivious Transfer Sanjam Garg

Two-Round Secure Multiparty Computation Minimizing Public Key Operations Sanjam Garg

Memory Consistency Exceptions 18742-Computer Architecture and Systems Nithin, Deepali Garg

AutoSynch An Automatic-Signal Monitor Based on Predicate Tagging Vijay K. Garg Wei-Lun Hung

Temporal Learning and Sequence Modeling for a Job Recommender System Kuan Liu, Xing Shi, Anoop

Privacy in Recommender Systems CompSci 590.03 Instructor: Ashwin Machanavajjhala Lecture 21:

G enerative A dversarial User Model for R einforcement L earning Based R ecommendation S ystem

Recommender Systems Collabora2ve Filtering and Matrix Factoriza2on

Intelligently Recommending Key Bindings on Physical Keyboards with Demonstrations in Emacs Shudan

and Improve Recommender System Performance WWW 2016, Montral Tobias Schnabel , Paul N.

Designing a Conversational Travel Recommender System Based on Data-Driven Destination

SLIM : Sparse Linear Methods for Top-N Recommender Systems Xia Ning and George Karypis Computer

Supervising Unsupervised Learning Vikas K. Garg & Adam Kalai Vikas K. Garg & Adam Kalai