cross domain recommendation via clustering on multi layer
play

Cross-Domain Recommendation via Clustering on Multi-Layer Graphs Al - PowerPoint PPT Presentation

Cross-Domain Recommendation via Clustering on Multi-Layer Graphs Al Aleksandr Fa Farseev, Ivan Samborskii, Andrey Filchenkov, Tat-Seng Chua By AleksandrFarseev http://farseev .com Aug 8 th , 2017 Venue Category Recommendation Collaborative


  1. Cross-Domain Recommendation via Clustering on Multi-Layer Graphs Al Aleksandr Fa Farseev, Ivan Samborskii, Andrey Filchenkov, Tat-Seng Chua By AleksandrFarseev http://farseev .com Aug 8 th , 2017

  2. Venue Category Recommendation Collaborative Venue Category Recommendation – recommendation of venue categories (i.e. restaurant, cinema) to user using information about his/her profile (i.e. past visits) and/or information about users from the same domain. Venue categories: Clothing Store Hotel Venue categories: Ice Cream Shop Total 764 different categories

  3. Idea 1: Utilization of Individual And Group Knowledge for Better Recommendation

  4. User Community-Based Collaborative Recommendation We perform venue category recommendation based on both individual and group knowledge => naturally models the impact of society on an individual's behavior during the selection of a new place to go: βˆ‘ 𝑀𝑓𝑑 0 0∈2 3 𝑠𝑓𝑑 𝑣 = 𝑑𝑝𝑠𝑒 𝛿 * 𝑀𝑓𝑑 , + πœ„ 𝐷 , +

  5. What do we need user communities for? + Users from the same community (extracted from multi-source data) may have similar location preferences + Search within user community significantly reduces search space during the recommendation process

  6. Example of User Communities (1) Community 1: Gingers Community K: Darker Hair

  7. User Relation and Community Representations One way to find user communities is to model users' relationships in the form of a graph so that dense subgraphs are considered to be user communities.

  8. Community Detection based on a single data source One of the commonly formulations is MinCut problem. For a given number k of subsets, the MinCut involves choosing a partition 𝐷 ; ,…, 𝐷 > such that it minimizes the expression: > 𝑑𝑣𝑒 𝐷 ; ,… ,𝐷 > = ? 𝑋(𝐷 B ,𝐷̅ B ) BE; *W is the sum of weights of edges attached to vertices in 𝐷 B

  9. How to solve MinCut problem? Approximation of MinCut as standard tr st trace mi minimi mization problem: m: H∈I JΓ—L tr 𝑉 O 𝑀𝑉 ,s.t. 𝑉 O 𝑉 = 𝐽 min which can be solved by Sp Spectral Clu lusterin ing: Calculates Laplacian matrix 𝑀 ∈ 𝑆 UΓ—U 1. 2. Builds matrix of the first 𝑙 eigenvectors 𝑉 ∈ 𝑆 UΓ—> correspond to the smallest eigenvalues of 𝑀 3. Clusters data in a new space 𝑉 using i.e. 𝑙 -means algorithm

  10. Idea 2: Utilization of Multi-Source Data

  11. Most of user actively use β‰ˆ 3 social networks Accounts Ac ~6 registered social network ~6 accounts per person* 5 Ac Active Usage 4 6 People actively use ~3 ~3 social platforms simultaneously* 3 7 2 8 1 9 0 10 * GlobalWebIndex. 2016. GWI Social report. http://www.globalwebindex.net/blog/internet-users-have-average-of-5-social-media-accounts

  12. Multi-source data describe user from multiple views

  13. Cross-Domain Venue Category Recommendation Cr Cross Domain - Ve Venue ca category reco commendation – recommendation of venue categories (i.e. restaurant, cinema) using information about his/her profile (i.e. past visits) and/or information about users from other sources (i.e. images, texts, location types). Venue categories: Clothing Store Hotel Ice Cream Shop Multi-Source Data:

  14. Community Detection must performed in a Cross-Source Manner… Problems: β€’ Data source integration β€’ Community detection

  15. How to represent multi-source data? Mu Multi-la layer graph – graph 𝐻 , where 𝐻 = 𝐻 B , 𝐻 B = π‘Š,𝐹 B

  16. Extending definition of spectral clustering [ H∈I JΓ—L ? tr 𝑉 O 𝑀 B 𝑉 , s.t.𝑉 O 𝑉 = 𝐽 min BE; [ H∈I JΓ—L tr 𝑉 O 𝑀 \,] 𝑉 , where 𝑀 \,] = ? 𝑀 B min BE; Such approximation could suffer from poor poor ge gene neralization on abi bility.

  17. Regularized Clustering on Multi-layer Graph -1 Use Gr Grassman Ma Manifolds to keep final latent representation β€œclose” to all layers of multi-layer graph*. Where projected distance between two spaces 𝑍 ; and 𝑍 b : b = 1 b ,where 𝐡 k is the Frobenius norm O βˆ’ 𝑍 b O 𝑒 defg 𝑍 ; ,𝑍 2 𝑍 ; 𝑍 b 𝑍 ; b k [ = 𝑙𝑁 βˆ’ ?tr(𝑇𝑇 O βˆ’ 𝑇 B 𝑇 B b [ O ) 𝑒 defg 𝑇, 𝑇 B BE; BE; * X. Dong, P. Frossard, P. Vandergheynst, and N. Nefedov. Clustering on multi-layer graphs via subspace analysis on grassmann manifolds. IEEE Transactions on Signal Processing, 2014.

  18. Regularized Clustering on Multi-layer Graph -2 Extends the objective function to introduce the subspace analysis regularization [ [ O 𝑉 O 𝑀 B 𝑉 + 𝛽 𝑉𝑉 O 𝑉 B 𝑉 B ,s.t. 𝑉 O 𝑉 = 𝐽 Hβˆˆβ„ JΓ—L ? tr min 𝑙𝑁 βˆ’ ? tr BE; BE; Hβˆˆβ„ JΓ—L tr 𝑉 O 𝑀 ]ft 𝑉 min [ O ) 𝑀 ]ft = ?(𝑀 B βˆ’ 𝛽𝑉 B 𝑉 B BE;

  19. Idea 4: Making use of Inter-Layer (Inter-Source) Relations

  20. Incorporating inter-layer relationship (1) By using distance on Grassman Manifolds, we present the new objective function for the 𝑗 th layer: [ v B O 𝑀 B 𝑉 v B + 𝛾 B v B 𝑉 v B O 𝑉 O v w βˆˆβ„ JΓ—L tr 𝑉 min 𝑙𝑁 βˆ’ ? π‘₯ B,g tr 𝑉 g 𝑉 g H gE;,gzB O 𝑀 v B { B 𝑉 v B v w βˆˆβ„ JΓ—L tr 𝑉 min H [ { B = 𝑀 B βˆ’ 𝛾 B O 𝑀 ? π‘₯ B,g tr 𝑉 g 𝑉 g gE;,gzB

  21. But how can we determine w |,} when computing i-th layer ? O 𝑀 v B { B 𝑉 v B v w βˆˆβ„ JΓ—L tr 𝑉 min H [ { B = 𝑀 B βˆ’ 𝛾 B O 𝑀 ? π‘₯ B,g tr 𝑉 g 𝑉 g gE;,gzB In Inter-la layer rela latio ionship ip graph 𝑺(𝑾,𝑭) – weighted graph which represents the similarity between layers. 𝑁 B,> βˆ’ 𝑁 g,> β€ž βˆ‘ 1 βˆ’ >Eb 𝑂 𝑂 βˆ’ 1 βˆ€ 𝑗,π‘˜ ∈ 𝐹, π‘₯ B,g = 𝐿 βˆ’ 1 where 𝑁 B,> is clustering co-occurrence matrix of layer 𝑗 , 𝑛 ‑,Λ† = 1, if users 𝑏 and 𝑐 assigned to the same cluster , and 0 otherwise.

  22. Final objective function Let’s combine equations from previous slides to define the final objective function: [ [ { B 𝑉 + 𝛽 v B 𝑉 v B O 𝑉 O 𝑀 𝑉𝑉 O 𝑉 min βˆˆβ„ JΓ—L ?tr 𝑙𝑁 βˆ’ ? tr = H BE; BE; [ βˆˆβ„ JΓ—L tr 𝑉 O ?(𝑀 { B βˆ’ 𝛽𝑉 v B 𝑉 v B O ) = min 𝑉 H BE;

  23. Problems β€’ Community detection β€’ Data source integration

  24. Recall: Community-Based Cross-Domain Recommendation We perform venue category recommendation based on both individual and group knowledge, where group knowledge is obtained from multiple sources: βˆ‘ 𝑀𝑓𝑑 0 0∈2 3 𝑠𝑓𝑑 𝑣 = 𝑑𝑝𝑠𝑒 𝛿 * 𝑀𝑓𝑑 , + πœ„ 𝐷 , +

  25. Foursquare Instagram NUS-MSS Dataset Dataset* is presented as a set of features, extracted from user-generated data in three social networks: - text based fromTwitter (LDA, LIWC, text features) - image based from Instagram (concepts) - location based from Foursquare (LDA, categories, Mobility Features) Foursquare categories is splited into two parts: 3 months data (train) and 2 months (test). Twitter * A. Farseev, N. Liqiang, M. Akbari, and T.-S. Chua. Ha Harvesting multiple so sources s for use ser profile learning: a Big data st study. ACM International Conference on Multimedia Retrieval (ICMR). China. June 23-26, 2015.

  26. Data Sources Text Features: Linguistic features: LIWC; Latent Topics Heuristic features: Writing behavior LIWC LDA Location Features: Location Semantics: Venue Category Distribution Mobility Location Type Mobility Features: Areas of Interest (AOI) Preferences Image Features Image Google Net Concepts Image Concept Distribution (Image Net) Images

  27. Evaluation Baselines Re Recommender Systems Co Community Detection Approaches β€’ 𝐣 β€” C ’ R recommendation without inter-layer 𝐃 πŸ’ 𝐒 βˆ’ 𝐌 Po Popular (PO POP) P) β€”recommendation based on user’s past regularization experience β€’ 𝐣 - 𝐌 β€’ 𝐍𝐩𝐞 β€” C ’ R recommendation without inter-layer 𝐃 πŸ’ 𝐒 βˆ’ 𝐌 Popular Al All (POP Al All) ) β€”recommendation based on experience of regularization and sub-space regularization all users 𝐃 πŸ’ 𝐒 βˆ’ 𝑫𝒑𝒏𝒏 β€” C ’ R recommendation without user Mu Multi-So Source Re-Ra Ranking (MSRR) RR) β€” linearly combines community extraction recommendation results from all data modalities 𝐃 πŸ’ 𝐒 (DB ) β€” C ’ R recommendation, where user Nearest Ne Ne Neighbor Collaborative Filtering (CF) β€” DBScan) recommendation based on top k most similar Foursquare users communities are detected by Density-Based clustering (DBScan) Ea Early Fusion (EF EF) β€” fuses multi-source data into a single feature 𝐃 πŸ’ 𝐒 (x means) β€” C ’ R recommendation, where user vector (x-me communities are detected by x-means clustering SV SVD++ β€” makes use of the β€œimplicit feedback” information 𝐃 πŸ’ 𝐒 (H (Hierarchical) β€” C ’ R recommendation, where user FMβ€” brings together the advantages of different factorization- FM communities are detected by Hierarchical Clustering based models via regularization. 𝐃 πŸ’ 𝐒 β€” Our Ap Approach

  28. Evaluation against other recommender systems

  29. Evaluation against other community detection approaches + Incorporation of group knowledge is is important + Multi-modal clustering performs better than single-source clustering + Incorporation of Inter-Source relationshipis crucial.

  30. Evaluation against source combinations + In different geo regions, different data sources are of different importance + Location data is more powerful than other data modalities

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend