Reddicommend
Content recommendations for Reddit
Reddicommend Content recommendations for Reddit Motivation Reddit - - PowerPoint PPT Presentation
Reddicommend Content recommendations for Reddit Motivation Reddit is the fourth-most visited site in the US (8th globally) The ability of users to find content relevant to their interests is important to its utility as a
Content recommendations for Reddit
globally)
interests is important to its utility as a content-aggregating site
recommendations
Implementation
Collaborative filtering using similarity measure based on the Pearson correlation:
subreddit 1 subreddit 2 subreddit 3 u s e r 1 u s e r 2 u s e r 3 u s e r 4 u s e r 5 subreddit 1 subreddit 2 subreddit 3 user 1 user 2 user 3 user 4 user 5 Subreddit correlation matrix
Scalable matrix multiplication in Spark
○ Dense-matrix multiplication is O(n3) (for
square matrices) ■ > 1 million users per batch ■ > 100,000 subreddits
Scalable matrix multiplication in Spark
0. 1. 0. 1. 0. 1. 3. 2. 0. 1. 0. 0. 0. 5. 0. 1.
= x
M N P M: [MatrixEntry(0, 0, 3), MatrixEntry(0, 1, 2), MatrixEntry(1, 0, 1)] N: [MatrixEntry(0, 1, 1), MatrixEntry(1, 1, 1), MatrixEntry(2, 1, 1)]
0. 1. 0. 1. 0. 1. 1. 2. 0. 0. 0. 0. 0. 3. 0. 0.
= x
M N P M: [MatrixEntry(0, 0, 1), MatrixEntry(0, 1, 2)] N: [MatrixEntry(0, 1, 1), MatrixEntry(1, 1, 1), MatrixEntry(2, 1, 1)] [(0, (0, 1)), (1, (0, 2)] [(0, (1, 1)), (1, (1, 1)), (2, (1, 1))]
Scalable matrix multiplication in Spark
0. 1. 0. 1. 0. 1. 1. 2. 0. 0. 0. 0. 0. 3. 0. 0.
= x
M N P [(0, (0, 1)), (1, (0, 2)] [(0, (1, 1)), (1, (1, 1)), (2, (1, 1))] [(0, ((0, 1), (1, 1))), (1, ((0, 2), (1, 1)))]
Scalable matrix multiplication in Spark
0. 1. 0. 1. 0. 1. 1. 2. 0. 0. 0. 0. 0. 3. 0. 0.
= x
M N P [(0, ((0, 1), (1, 1))), (1, ((0, 2), (1, 1)))] [((0, 1), 1), ((0, 1), 2)]
Scalable matrix multiplication in Spark
0. 1. 0. 1. 0. 1. 1. 2. 0. 0. 0. 0. 0. 3. 0. 0.
= x
M N P [(0, ((0, 1), (1, 1))), (1, ((0, 2), (1, 1)))] [((0, 1), 1), ((0, 1), 2)] [((0, 1), 3)]
Scalable matrix multiplication in Spark
0. 1. 0. 1. 0. 1. 1. 2. 0. 0. 0. 0. 0. 3. 0. 0.
= x
M N P [(0, ((0, 1), (1, 1))), (1, ((0, 2), (1, 1)))] [((0, 1), 1), ((0, 1), 2)] [((0, 1), 3)] [MatrixEntry(0, 1, 3)]
Scalable matrix multiplication in Spark
reddicommend.ddns.net
Oliver Hoidn
College
Washington Past and current interests:
programming