Reduce and Aggregate: Similarity Ranking in Multi-Categorical - PowerPoint PPT Presentation

Reduce and Aggregate: Similarity Ranking in Multi-Categorical Bipartite Graphs Alessandro Epasto J. Feldman*, S. Lattanzi*, S. Leonardi°, V. Mirrokni*. *Google Research °Sapienza U. Rome

Motivation ● Recommendation Systems: ● Bipartite graphs with Users and Items. ● Identify similar users and suggest relevant items. ● Concrete example: The AdWords case. ● Two key observations: ● Items belong to different categories. ● Graphs are often lopsided.

Modeling the Data as a Bipartite Graph 2$ Retailers Hundreds of Labels Nike Store 3$ New York 4$ Apparel 1$ Soccer Shoes 5$ Sport 2$ Equipment Soccer Ball Millions of Advertisers Billions of Queries

Personalized PageRank For a node v (the seed) and a probability alpha v u The stationary distribution assigns a similarity score to each node in the graph w.r.t. node v.

The Problem 2$ Retailers Hundreds of Labels Nike Store 3$ New York 4$ Apparel 1$ Soccer Shoes 5$ Sport 2$ Equipment Soccer Ball Millions of Advertisers Billions of Queries

Other Applications ● General approach applicable to several contexts: ● User , Movies , Genres : find similar users and suggest movies. ● Authors , Papers , Conferences : find related authors and suggest papers to read.

Semi-Formal Problem Definition Advertisers Queries

Semi-Formal Problem Definition Advertisers A Queries

Semi-Formal Problem Definition Advertisers A Queries Labels:

Semi-Formal Problem Definition Advertisers A Queries Goal: Find the nodes most Labels: “similar” to A.

How to Define Similarity? ● We address the computation of several node similarity measures: ● Neighborhood based: Common neighbors, Jaccard Coefficient, Adamic-Adar. ● Paths based: Katz. ● Random Walk based: Personalized PageRank. ● Experimental question: which measure is useful? ● Algorithmic questions: ● Can it scale to huge graphs? ● Can we compute it in real-time ?

Our Contribution ● Reduce and Aggregate: general approach to induce real-time similarity rankings in multi- categorical bipartite graphs, that we apply to several similarity measures. ● Theoretical guarantees for the precision of the algorithms. ● Experimental evaluation with real world data.

Personalized PageRank For a node v (the seed) and a probability alpha v u The stationary distribution assigns a similarity score to each node in the graph w.r.t. node v.

Challenges ● Our graphs are too big ( billions of nodes) even for very large-scale MapReduce systems. ● MapReduce is not real-time. ● We cannot pre-compute the rankings for each subset of labels.

Reduce and Aggregate 1) a b a a 2) b a c c 3) b c b c Reduce: Given the bipartite and a category construct a graph with only A nodes that preserves the ranking on the entire graph. Aggregate: Given a node v in A and the reduced graphs of the subset of categories interested determine the ranking for v.

Reduce (Precomputation) Advertisers Queries

Reduce (Precomputation) Advertisers Queries Precomputed Rankings

Reduce (Precomputation) Advertisers Queries Precomputed Precomputed Rankings Rankings

Reduce (Precomputation) Advertisers Queries Precomputed Precomputed Precomputed Rankings Rankings Rankings

Aggregate (Run Time) A Precomputed Precomputed Ranking of Rankings Rankings Red + Yellow

Reduce for Personalized PageRank Side A X Y X Y Side A Side B ● Markov Chain state aggregation theory (Simon and Ado, ’61; Meyer ’89, etc.). ● 750x reduction in the number of node while preserving correctly the PPR distribution on the entire graph .

Run-time Aggregation

Koury et al. Aggregation-Disaggregation Algorithm B A Step 1: Partition the Markov chain into DISJOINT subsets

Koury et al. Aggregation-Disaggregation Algorithm B A π B π A Step 2: Approximate the stationary distribution on each subset independently.

Koury et al. Aggregation-Disaggregation Algorithm P AA B A P AB π B π A P BA P BB Step 3: Consider the transition between subsets.

Koury et al. Aggregation-Disaggregation Algorithm P AA B A P AB π 0 π 0 B A P BA P BB Step 4: Aggregate the distributions. Repeat until convergence.

Aggregation in PPR A X Y π A Precompute the stationary distributions individually

Aggregation in PPR B X Y π B Precompute the stationary distributions individually

Aggregation in PPR A B The two subsets are not disjoint!

Our Approach π A π B Y X Y X ● The algorithm is based only on the reduced graphs with Advertiser-Side nodes. ● The aggregation algorithm is scalable and converges to the correct distribution.

Experimental Evaluation ● We experimented with publicly available and proprietary datasets: ● Query-Ads graph from Google AdWords > 1.5 billions nodes, > 5 billions edges. ● DBLP Author-Papers and Patent Inventor- Inventions graphs. ● Ground-Truth clusters of competitors in Google AdWords.

Patent Graph Precision vs Recall 1 Inter Jaccard Adamic-Adar 0.9 Katz PPR 0.8 0.7 0.6 Precision Precision 0.5 0.4 0.3 0.2 0.1 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Recall Recall

Google AdWords Precision Recall

Conclusions and Future Work ● It is possible to compute several similarity scores on very large bipartite graphs in real-time with g ood accuracy. ● Future work could focus on the case where categories are not disjoint is relevant.

Thank you for your attention

Reduction to the Query Side X Y π B π A

Reduction to the Query Side X Y π B π A This is the larger side of the graph.

Convergence after One Iteration Kendall-T au Correlation DBLP Patent 1 Query-Ads (cost) 0.8 au Kendall-T 0.6 0.4 0.2 0 10 20 30 40 50 All Position (k)

Convergence Approximation Error vs # Iterations DBLP (1 - Cosine) Patent (1 - Cosine) 1-Cosine Similarity 0.001 0.0001 1-Cosine 1e-05 1e-06 0 2 4 6 8 10 12 14 16 18 20 Iterations Iterations

Reduce and Aggregate: Similarity Ranking in Multi-Categorical - PowerPoint PPT Presentation

Reduce and Aggregate: Similarity Ranking in Multi-Categorical Bipartite Graphs Alessandro Epasto J. Feldman, S. Lattanzi, S. Leonardi, V. Mirrokni. Google Research Sapienza U. Rome Motivation Recommendation Systems: Bipartite

Aggregate Sampling Aggregate Stockpiles CIVL 3137 2 Stockpile Segregation CIVL 3137 3

1 So, similarity is not a Boolean notion It is Similarity Are they similar? relatively

Semantic Similarity MultiJEDI ERC 259234 Semantic Similarity Semantic Similarity Mostly

Asphalt Aggregate Specifications Aggregate Specifications In order to make good asphalt

Aggregate Blending Aggregate Blending To meet the gradation specifications for a concrete or

Easy and Hard Outline Constraint Ranking in OT The Constraint Ranking problem Making fast

Tutorial: TF-Ranking for sparse features Tutorial: TF-Ranking for sparse features This tutorial

Self-similar traffic 1 Self-similarity 2 Aggregate traffic - exact self-similarity Intuition:

Align, Disambiguate, and Walk A Unified Approach for Measuring Semantic Similarity Semantic

Time- -dependent Similarity Measure dependent Similarity Measure Time Time-dependent Similarity

Short-Run Aggregate Supply (SRAS) Video explanation in 2 minutes or 12 minutes AND Long-Run

Declarative MapReduce 10/29/2018 1 MapReduce Examples Filter Map Aggregate Map Reduce

Distributed Multi-modal Similarity Retrieval David Novak Seminar of DISA Lab, October 14, 2014

Online Submodular Set Cover, Ranking, and Repeated Active Learning Online Ranking: At each round,

Ranking candidate genes from Ranking candidate genes from perturbation experiments Niko

TVM for Ads Ranking @ Facebook Hao Lu, Ansha Yu, Yinghai Lu, Andrew Tulloch Ads Ranking at

Distance Measure for Querying Arrangements of Temporal Intervals Orestis Kostakis, Panagiotis

Phylogenetics Eliran Avni, Reuven Cohen, Sagi Snir Presentation by Ashu Gupta Motivation

On the Limitations of Unsupervised Bilingual Dictionary Induction Anders Sgaard Sebastian

Multilevel refinement based on neighborhood similarity Alan Valejo, Jorge Valverde-Rebaza, Brett

Paraphrase Recognition Using Machine Learning to Combine Similarity Measures Prodromos

Investor Similarity Affects Investment Decisions This Paper: investors who trade an asset care

Citation networks in economics Carlo D Ippoliti Carlo D Ippoliti Citation Networks in

t t tts

Reduce and Aggregate: Similarity Ranking in Multi-Categorical - PowerPoint PPT Presentation

Reduce and Aggregate: Similarity Ranking in Multi-Categorical Bipartite Graphs Alessandro Epasto J. Feldman*, S. Lattanzi*, S. Leonardi, V. Mirrokni*. *Google Research Sapienza U. Rome Motivation Recommendation Systems: Bipartite

Aggregate Sampling Aggregate Stockpiles CIVL 3137 2 Stockpile Segregation CIVL 3137 3

1 So, similarity is not a Boolean notion It is Similarity Are they similar? relatively

Semantic Similarity MultiJEDI ERC 259234 Semantic Similarity Semantic Similarity Mostly

Asphalt Aggregate Specifications Aggregate Specifications In order to make good asphalt

Aggregate Blending Aggregate Blending To meet the gradation specifications for a concrete or

Easy and Hard Outline Constraint Ranking in OT The Constraint Ranking problem Making fast

Tutorial: TF-Ranking for sparse features Tutorial: TF-Ranking for sparse features This tutorial

Self-similar traffic 1 Self-similarity 2 Aggregate traffic - exact self-similarity Intuition:

Align, Disambiguate, and Walk A Unified Approach for Measuring Semantic Similarity Semantic

Time- -dependent Similarity Measure dependent Similarity Measure Time Time-dependent Similarity

Short-Run Aggregate Supply (SRAS) Video explanation in 2 minutes or 12 minutes AND Long-Run

Declarative MapReduce 10/29/2018 1 MapReduce Examples Filter Map Aggregate Map Reduce

Distributed Multi-modal Similarity Retrieval David Novak Seminar of DISA Lab, October 14, 2014

Online Submodular Set Cover, Ranking, and Repeated Active Learning Online Ranking: At each round,

Ranking candidate genes from Ranking candidate genes from perturbation experiments Niko

TVM for Ads Ranking @ Facebook Hao Lu, Ansha Yu, Yinghai Lu, Andrew Tulloch Ads Ranking at

Distance Measure for Querying Arrangements of Temporal Intervals Orestis Kostakis, Panagiotis

Phylogenetics Eliran Avni, Reuven Cohen, Sagi Snir Presentation by Ashu Gupta Motivation

On the Limitations of Unsupervised Bilingual Dictionary Induction Anders Sgaard Sebastian

Multilevel refinement based on neighborhood similarity Alan Valejo, Jorge Valverde-Rebaza, Brett

Paraphrase Recognition Using Machine Learning to Combine Similarity Measures Prodromos

Investor Similarity Affects Investment Decisions This Paper: investors who trade an asset care

Citation networks in economics Carlo D Ippoliti Carlo D Ippoliti Citation Networks in

t t tts

Reduce and Aggregate: Similarity Ranking in Multi-Categorical Bipartite Graphs Alessandro Epasto J. Feldman, S. Lattanzi, S. Leonardi, V. Mirrokni. Google Research Sapienza U. Rome Motivation Recommendation Systems: Bipartite