to weight similarity measurement in collaborative filtering - PowerPoint PPT Presentation

EBCR : Empirical Bayes Concordance Rate to weight similarity measurement in collaborative filtering recommendations Y. Du , LGI2P, IMT Mines Alès S. Ranwez , LGI2P, IMT Mines Alès V. Ranwez , AGAP, Montpellier SupAgro N. Sutton-Charani , LGI2P, IMT Mines Alès

2 Collaborative Filtering recommender systems Rose : like movie

3 Collaborative Filtering recommender systems Alice Rose Bob : like movie

4 Collaborative Filtering recommender systems Alice Rose Bob : like movie

Ƹ 5 Memory-based collaboratif filtering algorithm I i 1 , i 2 ,…, i, …, i n Input : an User-Item-Rating matrix R U u 1 (5, ?,…, 1, …, 2) Output : {ො 𝒔 𝑣𝑗 | 𝑣 ∈ 𝑉, 𝑗 ∈ 𝐽 and 𝑠 𝑣𝑗 = unknown} u 2 (?, 1,…, ?, …, ?) … … u (5, 2,…, ?, …, ?) R m x n ... … u m-1 (?, ?,…, 1, …, 2) u m (5, 2,…, 2, …, 4) Algorithm : weighted average of ratings of u ’s neighbors 𝑙 𝑠 𝑣𝑗 = σ 𝑤=1 𝑠 𝑤𝑗 ∗ 𝑡𝑗𝑛(𝑣, 𝑤) 𝑙 σ 𝑤=1 𝑡𝑗𝑛(𝑣, 𝑤)

6 Most employed similarity measurements in CF approches PCC : P earson C orrelation C oefficient: - Linear correlation between two rating vectors COS : Cos ine similarity: v u MSD : M ean S quare D istance - Normalized distance of two vectors in an euclidien space

7 What is the problem here? PCC, COS, MSD: consider only the rating distributions of u and v restricted to their co-rated items, i.e. I u,v . i1 i2 i3 i4 i5 Alice 1 ∅ ∅ ∅ 5 I Rose,Alice = {i1} Rose 1 2 ∅ 3 1 I Rose,Bob Bob = {i1,i2,i3,i5} ∅ 1 2 2 1

8 Here is the problem ! PCC, COS, MSD : consider only the rating distributions of u and v restricted to their co-rated items, which ignores the number of co-rated items . Why do we have to consider the number of co-rated items, i.e. | I u,v | ? Alice 1 ∅ ∅ ∅ 5 PCC(R, A) = 1 > PCC(R, B) = 0.905 Rose COS(R, A) = 1 > COS(R, B) = 0.9798 1 2 ∅ 3 1 MSD(R, A) = 1 > MSD(R, B) = 0.8 Bob 1 2 2 ∅ 1 NOT Reliable as | I u,v | is small !!! So, the values need to be adjusted

Proposed method : EBCR ( E mpirical B ayes C oncordance R ate) 9 Discretize user ratings by three categories of taste T T ( u, u,i ) )

Proposed method : EBCR ( E mpirical B ayes C oncordance R ate) 10 CR: concordance rate of a given user pair • Set C u,v of f ( u and v v )’s c oncordantly co-rated items: i ∈ C u,v if T ( u,i ) = T ( v,i ) v : CR u,v = | C u,v | • CR of u and v u: (1, 1, ?, 5, 4, 5, ?, ?, ..., 2) | I u,v | v: (?, 2, 5, ?, ?, 1, 1, ?, ..., 5) u: (1, 5, 2) u: (dislike, like, dislike) v: (2, 1, 5) v: (dislike, dislike, like) | I u,v | = 3 C u,v = 1 CR u,v = 1/3 • Interpretation of CR : Probability of two users having the same taste on an item BUT, what if I u,v is small ? 1 2000 ( 1 ) 1 != 1 ( 2000 )

11 Here comes Empirical Bayes 1. Take all the CR rates as a Beta prior distribution 2. Find 𝜷 0 and 𝜸 𝟏 that best fit the data, i.e. CR rate set 3. Use the prior to adjust each CR value : 𝑫 𝒗𝒘 𝑫 𝒗𝒘 + 𝜷 𝟏 EBCR u,v ,v : : Figure taken from Google Image 𝑱 𝒗𝒘 𝑱 𝒗𝒘 + 𝜷 𝟏 + 𝜸 𝟏 Espérance de la 4. Use EBCR to weight similarity measurement : lois Beta( 𝜷 0, 𝜸 𝟏 ) sim ’( u,v) = sim(u,v) * EBCR u,v

12 Evaluation and results • Dataset : Movielens-1M → 1 million movie ratings of 6 040 users on 3 900 items • Evaluation metric : MAE (Mean Absolute Error) • Evaluation protocol : 10-folds cross validation better

13 State of advance and perspectives 3 rd Year 1 st Year 2 nd Year Submission 1 st Envisage paper for IC 2019 submitting EBCR conference, to ECAI2019 in Collaborate Ontology, accepted in May. 1. Literature English version knowledge graph, on RS knowledge base and 2. Literature on Proposition Model-based RS for knowledge- Juin. 2019 recommendation diversity RS + semantic based RS and explanation Oct. 2018 Oct. 2021 Apr. 2019 Nov. 2019 2 nd Submission for the LFA 2019 conference

Merci pour votre attention

15 Formulars

to weight similarity measurement in collaborative filtering - PowerPoint PPT Presentation

EBCR : Empirical Bayes Concordance Rate to weight similarity measurement in collaborative filtering recommendations Y. Du , LGI2P, IMT Mines Als S. Ranwez , LGI2P, IMT Mines Als V. Ranwez , AGAP, Montpellier SupAgro N. Sutton-Charani , LGI2P,

cProbLog: Restricting the Possible Worlds of Probabilistic Logic Programs Dimitar Shterionov

Semantic Similarity MultiJEDI ERC 259234 Semantic Similarity Semantic Similarity Mostly

MEASUREMENT Weight ESSENTIAL QUESTION: How do we know which unit to choose to measure weight?

Gemstones a Unit of Weight Gemstones a Unit of Weight The historical unit of weight

INTRODUCING Connecting Weight Loss Patients Directly to your Weight Loss Center Physicians Weight

Formulation and development of foods for weight management Paola Vitaglione Weight control and

/k Content 2/15 1. Introduction 2. Hamming weight 3. Rank weight 4. Extended rank weight

Align, Disambiguate, and Walk A Unified Approach for Measuring Semantic Similarity Semantic

Time- -dependent Similarity Measure dependent Similarity Measure Time Time-dependent Similarity

Unification of CSC and SE ABET Effor ts Similarity of CSC and SE Programs Similarity of CSC and

LECTURE 4 Similarity and Distance Recommender Systems SIMILARITY AND DISTANCE Thanks to: Tan,

I/O-EFFICIENT SIMILARITY JOIN R. Pagh, N. Pham, F. Silvestri, M. Stckel Similarity Join R = Q

COMP9313: Big Data Management High Dimensional Similarity Search Similarity Search Problem

DATA MINING LECTURE 4 Similarity and Distance Recommender Systems SIMILARITY AND DISTANCE

DATA MINING LECTURE 5 Similarity and Distance Sketching, Locality Sensitive Hashing SIMILARITY

Presentation to Ontario Smart Grid Working Group Who is Measurement Canada? Measurement: A part

COHN LOCALIZATION, GENERALIZED FREE PRODUCTS AND BOUNDARY LINKS ANDREW RANICKI (Edinburgh)

The Small World Problem Christoph Trattner Know-Center Graz University of Technology,

Overview Agenda A selection of relevant concepts from Graph and Network Theory Markus

RECSM Summer School: Social Media and Big Data Research Pablo Barber a London School of

Announcements Wednesday, October 25 The midterm will be returned in recitation on Friday.

3.1 Introduction to Determinants McDonald Fall 2018, MATH 2210Q, 3.1&3.2 Slides 3.1 Homework

Calculating Determinants Recursive Formula: Cofactor Expansion Assume A is n n matrix. Let A ij

Announcements Monday, October 23 Webwork due next week, No quiz this week. Chapter 3

to weight similarity measurement in collaborative filtering - PowerPoint PPT Presentation

EBCR : Empirical Bayes Concordance Rate to weight similarity measurement in collaborative filtering recommendations Y. Du , LGI2P, IMT Mines Als S. Ranwez , LGI2P, IMT Mines Als V. Ranwez , AGAP, Montpellier SupAgro N. Sutton-Charani , LGI2P,

cProbLog: Restricting the Possible Worlds of Probabilistic Logic Programs Dimitar Shterionov

Semantic Similarity MultiJEDI ERC 259234 Semantic Similarity Semantic Similarity Mostly

MEASUREMENT Weight ESSENTIAL QUESTION: How do we know which unit to choose to measure weight?

Gemstones a Unit of Weight Gemstones a Unit of Weight The historical unit of weight

INTRODUCING Connecting Weight Loss Patients Directly to your Weight Loss Center Physicians Weight

Formulation and development of foods for weight management Paola Vitaglione Weight control and

/k Content 2/15 1. Introduction 2. Hamming weight 3. Rank weight 4. Extended rank weight

Align, Disambiguate, and Walk A Unified Approach for Measuring Semantic Similarity Semantic

Time- -dependent Similarity Measure dependent Similarity Measure Time Time-dependent Similarity

Unification of CSC and SE ABET Effor ts Similarity of CSC and SE Programs Similarity of CSC and

LECTURE 4 Similarity and Distance Recommender Systems SIMILARITY AND DISTANCE Thanks to: Tan,

I/O-EFFICIENT SIMILARITY JOIN R. Pagh, N. Pham, F. Silvestri, M. Stckel Similarity Join R = Q

COMP9313: Big Data Management High Dimensional Similarity Search Similarity Search Problem

DATA MINING LECTURE 4 Similarity and Distance Recommender Systems SIMILARITY AND DISTANCE

DATA MINING LECTURE 5 Similarity and Distance Sketching, Locality Sensitive Hashing SIMILARITY

Presentation to Ontario Smart Grid Working Group Who is Measurement Canada? Measurement: A part

COHN LOCALIZATION, GENERALIZED FREE PRODUCTS AND BOUNDARY LINKS ANDREW RANICKI (Edinburgh)

The Small World Problem Christoph Trattner Know-Center Graz University of Technology,

Overview Agenda A selection of relevant concepts from Graph and Network Theory Markus

RECSM Summer School: Social Media and Big Data Research Pablo Barber a London School of

Announcements Wednesday, October 25 The midterm will be returned in recitation on Friday.

3.1 Introduction to Determinants McDonald Fall 2018, MATH 2210Q, 3.1&amp;3.2 Slides 3.1 Homework

Calculating Determinants Recursive Formula: Cofactor Expansion Assume A is n n matrix. Let A ij

Announcements Monday, October 23 Webwork due next week, No quiz this week. Chapter 3

3.1 Introduction to Determinants McDonald Fall 2018, MATH 2210Q, 3.1&3.2 Slides 3.1 Homework