Xing Zhao, Qingquan Song, James Caverlee and Xia Hu
Department of Computer Science and Engineering Texas A&M University, USA
1
Xing Zhao, Qingquan Song, James Caverlee and Xia Hu Department of - - PowerPoint PPT Presentation
Xing Zhao, Qingquan Song, James Caverlee and Xia Hu Department of Computer Science and Engineering Texas A&M University, USA 1 Da Dataset Statistics cs # 10 6 2.5 1 Cumsum Taking Up of Positive Samples Items Quantity Proportion
Xing Zhao, Qingquan Song, James Caverlee and Xia Hu
Department of Computer Science and Engineering Texas A&M University, USA
1
Items Quantity Proportion
Playlists 1,000,000 Unique Tracks 2,262,292 100% Unique tracks (freq ≥ 5) 599,341 96.05% Unique tracks (freq ≥ 100) 70,229 80.67% Unique albums 734,684 Unique artists 295,860
4
Track Appeared Times in Training Data
1 5 10 100 1000 10000 40000
Number of Remaining Tracks
#106 0.5 1 1.5 2 2.5
Cumsum Taking Up of Positive Samples
0.2 0.4 0.6 0.8 1
Therefore, in some part of our methods, we
DNCF C-Tree CC- Title
5
Playlist Continuation: For Task 2 to 10 Cold Start: For Task 1
3 7 5 21 3 43 6 81 8 32 7 13 14 6 5
Tracks (2,262,292) Words (9,817)
Word list 1: Track list 1 Word list 2: Track list 2
Word list 3: Track list 3
… … …
Cluster Recommend
New title: e.g. Pop Punk 2018 Summer
Word list Tracks Word list Tracks Word list Tracks Word list Tracks Word list Tracks
Normalize Pre- process …
6
i j
Track i is existed in 6 playlists whose title contain word j
Items Quantity unique titles 92,944 unique normalized titles 17,381 unique non-stop normalized words 9,817 playlist without title after processing 22,921
7
Steps:
emoji, punctuation, etc.
9817 x 2,262,292
similarity
new title
8
Highlight:
computation with high efficiency.
very good.
Pros:
basic matrix factorization model and MLP. Cons: Computationally not efficient to be directly applied on the target problem due to the huge item scope and the matrix sparsity.
9
He et al. , “Neural Collaborative Filtering”. WWW, 2017.
Neural Collaborative Filtering
10
Two modifications to address efficiency issue:
Training Phase: Constrained Negative Sampling. Testing Phase: Constrained Recommendation with Reordering.
dataset during training to protect the feasible embedding and prediction of all the testing data. (Task 2-10)
11
space to the space of the tracks appearing equal to or more than 100 times in the training data.
Track Appeared Times in Training Data
1 5 10 100 1000 10000 40000
Number of Remaining Tracks
#106 0.5 1 1.5 2 2.5
Cumsum Taking Up of Positive Samples
0.2 0.4 0.6 0.8 1
Training Phase: Constrained Negative Sampling.
tracks with an ensemble trick leveraging two types of predictions provided by the Word2Vec embedding.
12
popular tracks (>=100 times) during testing phase towards a more targeted prediction.
Testing Phase: Constrained Recommendation with Reordering.
DNCF Word2Vec (1) Word2Vec (2)
L
1
L2 L3
φ1 φ2 φ3
φ1 \ L
1 ∪ L2 ∪ L3
13
Highlight:
sequential seeding tracks;
A Playlist is:
consists of different tracks ,and these tracks always belong to a specific album of an artist;
in a specific playlist always have latent similarity, such as genres, style, listening sense, etc.
14
Phylogenetic Tree.
(Source: https://www.creative-biostructure.com/custom- phylogenetic-tree-construction-service-399.htm)
15
A Real Example (PID: 11548):
albums by 5 artists (2 rock bands and 3 pop punk bands)
Pop punk band Rock band
How do we compare the internal relationship? How do we compare it with another tree (external)?
16
Training Data: Complete Tree Testing Data: Incomplete Tree
External comparison
Incomplete Tree: A playlist
tracks (seed), which is waiting for recommending.
Steps:
complete trees;
distance between the incomplete tree T-test and complete tree T-train;
(leaves) from each T-train to the incomplete tree T-test, based on the score of each leaf.
17
Playlist 1 Playlist 2 Playlist 3 Playlist 4 Playlist n …
18
Highlight:
sequential seeding tracks;
CC-Title Final Recommendation ADNCF BDNCF AC-Tree BC-Tree
Num_handou t
Method 1 Method 2
19
Experiment Setting:
parameter tuning
RecSys 2018
20
21
22