Machine Learning Techniques Alejandro Bellogn, Ivn Cantador, Pablo - PowerPoint PPT Presentation

Discovering Relevant Preferences in a Personalised Recommender System using Machine Learning Techniques Alejandro Bellogín, Iván Cantador, Pablo Castells, Álvaro Ortigosa Escuela Politécnica Superior Universidad Autónoma de Madrid Campus de Cantoblanco, 28049 Madrid, Spain {alejandro.bellogin, ivan.cantador, pablo.castells, alvaro.ortigosa}@uam.es Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

Motivation  Complexity of recommender systems  Several variables affect the effectiveness of recommendations • Apply personalisation? • Use current context? • Consider all the users equally? • Consider all the items similarly? • …  Effectiveness is achieved by adequately handling these variables, but it is also an issue of knowing how relevant each one is  Can we learn which preferences are relevant to achieve effective recommendations? Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

Our approach  Log analysis of a personalised news recommender system  For each user-rated recommendation, a pattern is created • The attributes of the pattern correspond to the characteristics we aim to analyse, and their values are obtained from log information databases. • The class of the pattern can be assigned two possible values, correct or incorrect, depending on whether the user evaluated the recommendation as relevant or irrelevant Recommendation Relevant Algorithms preferences for ML Logs Patterns effective recommendations Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

Architecture Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

User profile representation  User preferences are described as vectors where measures the intensity    ( , ,..., )   1,1 u u u u u ,1 ,2 , , m m m m K m k of the interest of user for concept (a class u  c  m k or an instance) in a domain ontology  Items are assumed to be annotated by vectors d  n of concept weights, in the same  ( , ,..., ) d d d d ,1 ,2 , n n n n K vector-space as user preferences Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

Recommendation models  Preference expansion mechanism • Through explicit semantic relations with other concepts in the ontology  Personalised and context-aware score ( d n , u m ) = w p · pref ( d n , u m ) + w c · pref ( d n , u m , context) • pref(·,·) measures the relevance of a document for a user, using a cosine-based vector similarity • context is represented as a set of weighted concepts Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

Log database  The system monitors all the actions the user performs, and records them in a log database Table Attributes Browsing actionID , actionType, timestamp, sessionID , itemID, itemRankingPosition , itemRankingProfile, itemRankingContext, itemRankingCollaborative, itemRankingHybridUP, itemRankingHybridNUP, itemRankingHybridUPq, itemRankingHybridNUPq, topicSection, interestSituation, userProfileWeight , contextWeight , collaborative, scoreSearch Context updates actionID, actionType, timestamp, sessionID, context, origin, changeOfFocus Queries actionID, actionType, timestamp, sessionID, keywords, topicSection, interestSituation Recommendations actionID, actionType, timestamp, sessionID, recommendationType, userProfileWeight, contextWeight, collaborative, topicSection, interestSituation User accesses actionID, actionType, timestamp, sessionID User evaluations actionID , actionType, timestamp, sessionID , itemID, rating, userFeedback , tags, comments, topicSection, interestSituation, duration User preferences actionID, actionType, timestamp, sessionID, concept, weight, interestSituation User profiles actionID, actionType, timestamp, sessionID, userProfile User sessions sessionID, userID, timestamp Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

The News@hand system • A hybrid news recommender system which makes use of Semantic Web technologies to provide several on-line news recommendation services Long-term (profile) user preferences Short-term (context) user preferences Semantic expansion Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

Machine learning algorithms  Decision Trees: • Interpretable • Most informative attributes (entropy) • Revision J4.8 (Weka) Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

Evaluation  16 users (12 undergraduate/graduate students, 4 lecturers)  Task: find and evaluate those news items that were relevant to a given goal Profile Section Query Task goal World Q 1,1 pakistan News about media: TV, radio, Internet 1 News about software piracy, illegal Telecom Entertainment Q 1,2 music downloads, file sharing Business Q 2,1 dollar News about oil prices 2 Banking Headlines Q 2,2 fraud News about money losses Science Q 3,1 food News about cloning 3 News about children, young people, child Social care Headlines Q 3,2 internet safety, child abuse  Different configurations based on activation/deactivation of: personalisation and context-aware recommendations, semantic expansion  Each user classifies an item as relevant in general, relevant to the current goal or relevant to the profile Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

Results  Relevant preferences for personalised recommendations: • The bigger the user profile size, the more relevant the retrieved news • The system retrieves relevant news in the first page (top 5) • If the expansion is activated it is a very important preference  Relevant preferences for context-aware recommendations: • Best performance with a medium context size • Context usually needs personalisation to obtain good results  Meta-evaluation conclusions: • The evaluation was unbalanced in terms of difficulty of obtaining relevant news items for each task • Profile specificity ( Business , Entertainment sections; manual profiles) Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

Conclusions and future work  Machine Learning techniques are useful: • to learn those user and system features of a recommender system that are (un)favourable for correct recommendations • to learn deficiencies and weaknesses of the experiments conducted to measure the system performance  Next steps: • Make the system adaptive to the current status of the analysed preferences and evaluate the (potential) improvement • Similar evaluations with the collaborative group-oriented and multi-facet recommendation strategies of News@Hand Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

Questions Thank you Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

Split showing unbalanced anomaly Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

Evaluation: configurations Personalised Context-aware recommendations recommendations Without With With expansion expansion expansion w p =1 w p =1 w p =0 w p =0.5 User w c =0 w c =0 w c =1 w c =1 1 *Q 1,1 Q 2,1 Q 3,1 A Q 1,2 2 Q 2,2 *Q 3,2 A Q 2,1 Q 1,2 3 Q 3,1 A Q 3,2 *Q 1,1 Q 2,1 4 A Q 1,1 Q 1,2 Q 2,2 *Q 3,2 5 Q 1,2 *Q 2,2 Q 3,2 A Q 2,1 6 Q 2,1 Q 3,1 * A Q 3,2 Q 1,1 7 Q 3,2 A Q 1,1 Q 1,2 *Q 2,2 8 * A Q 2,2 Q 1,1 Q 2,1 Q 3,1 9 Q 1,1 Q 2,1 *Q 3,1 A Q 3,2 10 Q 2,2 Q 3,2 A Q 1,1 *Q 1,2 11 *Q 3,1 A Q 2,2 Q 1,1 Q 2,1 12 A Q 3,1 *Q 1,2 Q 2,2 Q 3,2 13 Q 1,2 Q 2,2 Q 3,2 * A Q 1,1 14 *Q 2,1 Q 3,1 A Q 2,2 Q 1,1 15 Q 3,2 * A Q 3,1 Q 1,2 Q 2,2 16 A Q 1,2 Q 1,1 *Q 2,1 Q 3,1 Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

Semantic contextualisation of user preferences  Nodes represent ontology concepts and edges are associated to semantic relations between those concepts. Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

Knowledge Representation • User profiles and item descriptions are represented as vectors u m =( u m,1 , u m,2 , … , u m,K ) and d n =( d n ,1 , d n,2 , … , d n,K ), where u m,k , d n,k in [-1,1] are the weights that measure the relevance of concept c k for user u m and item d n • Recommendation models are based on the definition of matching algorithms which make use of similarity measures based on cos ( u m , d n ) Preference Learning Workshop At ECML-PKDD 2008 Antwerp, Belgium, September 2008

Machine Learning Techniques Alejandro Bellogn, Ivn Cantador, Pablo - PowerPoint PPT Presentation

Discovering Relevant Preferences in a Personalised Recommender System using Machine Learning Techniques Alejandro Bellogn, Ivn Cantador, Pablo Castells, lvaro Ortigosa Escuela Politcnica Superior Universidad Autnoma de Madrid Campus

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

APPLIED MACHINE LEARNING Methods for Clustering K-means, Soft K-means DBSCAN 1 MACHINE

Welcome to the Machine Learning Toolbox! Machine Learning Toolbox Supervised learning caret

Introduction to Machine Learning COMPSCI 371D Machine Learning COMPSCI 371D Machine

INTRODUCTION TO MACHINE LEARNING Joseph C. Osborn CS 51A Spring 2020 Machine Learning is

Human and Machine Learning Tom Mitchell Machine Learning Department Carnegie Mellon University

Machine Learning Algorithms for Classification Machine Learning Algorithms for Classification

Machine Learning - Intro Aarti Singh Machine Learning 10-701/15-781 Sept 8, 2010 You tell me

MACHINE LEARNING Kernel Canonical Correlation Analysis 1 ADVANCED MACHINE LEARNING ADVANCED

Machine learning for finance Nathan George Data Science Professor DataCamp Machine Learning

1/88 Presentation: Advanced Techniques 2/88 Presentation: Advanced Techniques 3/88

Clustering Rankings in the Fourier Domain Stphan Clmenon and Romaric Gaudel and Jrmie

Functional Bid Landscape Forecasting for Display Advertising Yuchen Wang 1 Kan Ren 1 Weinan Zhang

Fast Training of Support Vector Machines for Survival Analysis Sebastian Plsterl 1 , Nassir

Rule-Based Classification Johannes Frnkranz Knowledge Engineering Group TU Darmstadt

Experiments in Value Function Approximation with Sparse Support Vector Regression Tobias Jung and

Distribution-Free Uncertainty Quantification for Kernel Methods by Gradient Perturbations Bal

Drawing Parallels between Multi-label Classification and Multi-target Regression Grigorios

Stochastic approximation for speeding up LSTD/LSPI (and least squares regression/LinUCB) Prashanth