
Efficient Algorithms for Public-Private Social Networks
Flavio Chierichetti (Sapienza University), Vahab Mirrokni (Google), Alessandro Epasto (Brown University), Ravi Kumar (Google), Silvio Lattanzi (Google)
KDD 2015, Sydney, Australia, August 11, 2015

Private-Public networks Idealized vision

Private-Public networks Reality My friends are private

Private-Public networks Reality My friends are private. [Figure: nodes A, B, C]

Private-Public networks Reality My friends are private Only my friends can see my friends

Private-Public networks Reality My friends are private. Only my friends can see my friends. [Figure: nodes A, B, C, D]

Private-Public networks Reality We are a private group. My friends are private. Only my friends can see my friends.

Private-Public networks Reality ~52% of NYC Facebook users hide their friends. We are a private group. My friends are private. Only my friends can see my friends.

Private-Public networks Reality ~52% of NYC Facebook users hide their friends. We are a private group. My friends are private. Only my friends can see my friends. There is no such thing as the Social Network!

Social network of User A Each user has his/her own personal Social Network! [Figure: User A]

Social network of User B Each user has his/her own personal Social Network! [Figure: User A, User B]

Computational implication The algorithms need to respect the privacy of the users. We can only use the data that the user can access. Naively, we need to run the algorithms once for each user on a different (and huge) graph!

Application: Friend suggestion Network signals are very useful: number of common neighbors, Personalized PageRank, etc. My friends are private. [Figure: nodes A, B, C, D]

Application: Friend suggestion Common Neighbors - Ideal World 1) Run the algorithm (in parallel) on the graph G. 2) For each user, suggest the top k users by common neighbors. … but there is no such graph G. My friends are private. [Figure: nodes A, B, C, D]

Application: Friend suggestion Common Neighbors - Real World Multiple graphs = Multiple answers! How many common neighbors do B and C have? Answer for A: One common neighbor: me! My friends are private. [Figure: nodes A, B, C, D]

Application: Friend suggestion Common Neighbors - Real World Multiple graphs = Multiple answers! How many common neighbors do B and C have? Answer for B: Zero common neighbors! We cannot suggest C to B as a friend based on common neighbors! My friends are private. [Figure: nodes A, B, C, D]
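The "multiple graphs, multiple answers" point can be made concrete with a small sketch. The toy edge lists below are hypothetical (the exact edges in the slide's figure are not recoverable), and the helper names `visible_graph` and `common_neighbors` are illustrative only. Each viewer sees the public edges plus the private edges they have access to.

```python
def visible_graph(public_edges, private_edges, viewer):
    """Adjacency sets for the graph that `viewer` is allowed to see."""
    edges = set(public_edges) | set(private_edges.get(viewer, ()))
    adj = {}
    for a, b in edges:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    return adj


def common_neighbors(adj, x, y):
    """Common neighbors of x and y in the given adjacency structure."""
    return adj.get(x, set()) & adj.get(y, set())


# Hypothetical toy data: A's friendships are private.
public_edges = [("B", "D")]
private_edges = {
    "A": [("A", "B"), ("A", "C")],  # A sees both of its own edges
    "B": [("A", "B")],              # B only sees its own edge to A
    "C": [("A", "C")],              # C only sees its own edge to A
}

from_a = common_neighbors(visible_graph(public_edges, private_edges, "A"), "B", "C")
from_b = common_neighbors(visible_graph(public_edges, private_edges, "B"), "B", "C")
print(from_a, from_b)
```

From A's view, B and C share one neighbor (A itself); from B's view they share none, so the same query has a different answer per user.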

Naive approaches 1) Running the algorithms N times is infeasible. 2) Ignoring all private data is very ineffective! From user A's perspective there are interesting signals: E and D are good suggestions! My friends are private. [Figure: nodes A, B, C, D, E]

Naive approaches 1) Running the algorithms N times is infeasible. 2) Ignoring all private data is very ineffective! From the public data perspective there are no signals: no suggestions for the user! My friends are private. [Figure: nodes A, B, C, D, E]

Public-Private Graph Model

Private-Public model There is a public graph G

Private-Public model There is a public graph $G$; in addition, every node $u$ has access to a private graph $G_u$. We assume the private graph $G_u$ to be at $\le 2$ hops from $u$.

Private-Public model For each $u$ we would like to execute computation on $G \cup G_u$.

Private-Public model For each $u$ we would like to execute computation on $G \cup G_u$. This respects the privacy of each user. We want the computation to be efficient.

Two-Steps Approach Precompute a data structure for $G$ so that we can solve problems in $G \cup G_u$ efficiently. [Diagram: Public graph $G$ → Preprocessing → Synopsis of $G$; Synopsis of $G$ + Private graph $G_u$ → Query for user $u$ (fast computation) → Output for user $u$]

Private-Public problem Ideally: Preprocessing time: $\tilde{O}(|E_G|)$. Preprocessing space: $\tilde{O}(|V_G|)$. Query time: $\tilde{O}(|E_{G_u}|)$.

Warm-up: # connected components

Warm-up: # connected components Precompute component IDs in G. [Figure: public graph with each node labeled by its component ID: A, B, or C]

Warm-up: # connected components Add private edges and merge conn. components. [Figure: same graph with private edges added]

Warm-up: # connected components Add private edges and merge conn. components. [Figure: after merging, the remaining component IDs are A and B]
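A minimal sketch of this warm-up, assuming an undirected graph and that every endpoint of a private edge already appears in the public graph. The function names are illustrative, not from the talk. Preprocessing labels public components once; each query then runs a small union-find over component IDs and touches only the user's private edges.

```python
from collections import defaultdict


def public_component_ids(nodes, edges):
    """Preprocessing: label every node with the ID of its component in G."""
    adj = defaultdict(list)
    for a, b in edges:
        adj[a].append(b)
        adj[b].append(a)
    comp = {}
    next_id = 0
    for start in nodes:
        if start in comp:
            continue
        comp[start] = next_id
        stack = [start]
        while stack:  # iterative DFS over the public graph
            x = stack.pop()
            for y in adj[x]:
                if y not in comp:
                    comp[y] = next_id
                    stack.append(y)
        next_id += 1
    return comp, next_id


def count_components_for_user(comp, num_public, private_edges):
    """Query: merge public component IDs along the user's private edges."""
    parent = {}

    def find(c):
        parent.setdefault(c, c)
        while parent[c] != c:
            parent[c] = parent[parent[c]]  # path halving
            c = parent[c]
        return c

    merges = 0
    for a, b in private_edges:
        ra, rb = find(comp[a]), find(comp[b])
        if ra != rb:
            parent[ra] = rb
            merges += 1
    return num_public - merges


# Hypothetical toy graph: three public components, two private edges.
comp, n_public = public_component_ids(
    [1, 2, 3, 4, 5, 6], [(1, 2), (3, 4), (5, 6)]
)
n_user = count_components_for_user(comp, n_public, [(2, 3), (1, 4)])
print(n_public, n_user)
```

The second private edge joins two components that the first one already merged, so only one merge is counted.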

Results Algorithms: Reachability; Approximate all-pairs shortest paths; Correlation clustering; Social affinity. Heuristics: Personalized PageRank; Centrality measures.

Reachability How many nodes can I reach from u?

Reachability How many nodes can I reach from u? We have to handle overlaps.

Reachability Key idea: use size-estimation sketch [Cohen JCSS97]. Every node samples a random number in [0,1]. [Figure: nodes labeled with samples 0.5, 0.23, 0.33, 0.9, 0.2, 0.3, 0.1]

Reachability Key idea: use size-estimation sketch [Cohen JCSS97]. Every node samples a random number in [0,1]. Look at the k-th smallest value and use it to estimate the size of the set. [Figure: same samples, with sketch [0.1, 0.2]]

Reachability Key idea: use size-estimation sketch [Cohen JCSS97]. Every node samples a random number in [0,1]. Look at the k-th smallest value and use it to estimate the size of the set. Composable sketch of size k. [Figure: two overlapping sets with sketches [0.1, 0.2] and [0.15, 0.2]]

Reachability Key idea: use size-estimation sketch [Cohen JCSS97]. Every node samples a random number in [0,1]. Look at the k-th smallest value and use it to estimate the size of the set. Composable sketch of size k. [Figure: two overlapping sets with sketches [0.1, 0.2] and [0.15, 0.2]; their union has sketch [0.1, 0.15]]
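The sketch idea can be illustrated as follows. This is a minimal bottom-k sketch under the usual assumptions (independent uniform ranks in [0,1], and the standard (k-1)/(k-th smallest) estimator); the function names are illustrative. Because the sketch keeps distinct ranks, an element that appears in both sets is counted once after merging, which is how overlaps are handled.

```python
def bottom_k(ranks, k):
    """Sketch of a set: its k smallest distinct ranks, in increasing order."""
    return sorted(set(ranks))[:k]


def merge(s1, s2, k):
    """Sketch of the union of two sets, computed from their sketches alone."""
    return sorted(set(s1) | set(s2))[:k]


def estimate_size(sketch, k):
    """Estimate the set size from the k-th smallest rank."""
    if len(sketch) < k:
        return float(len(sketch))  # fewer than k elements: count is exact
    return (k - 1) / sketch[k - 1]


# Sketches taken from the slide's figure, with k = 2.
left, right = [0.1, 0.2], [0.15, 0.2]
union = merge(left, right, 2)
print(union, estimate_size(union, 2))
```

With k = 2 the estimate is very noisy; larger k reduces the variance at the cost of a larger sketch.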

Reachability How many nodes can I reach from u? Precompute sketches for each node in public graph. [Figure: node u with sketch [0.2, 0.3]; other nodes with sketches [0.7, 1.0], [0.8, 1.0], [0.1, 1.0]]

Reachability How many nodes can I reach from u? Compose sketches of nodes reachable in private graph. [Figure: same sketches; composed sketch for u is [0.1, 0.2]]
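The two steps could be sketched as below, under simplifying assumptions that are not taken from the talk: the graph is undirected (so the set reachable from a node is its connected component), every private-edge endpoint is a public node, and the ranks in the toy example are picked to mirror the figure. All names are illustrative.

```python
from collections import defaultdict

K = 2  # sketch size; a hypothetical choice matching the figure


def merge(s1, s2, k=K):
    """Bottom-k sketch of a union, computed from two bottom-k sketches."""
    return sorted(set(s1) | set(s2))[:k]


def preprocess(comp_of, rank):
    """Preprocessing: one sketch per public component, built from node ranks."""
    sketches = defaultdict(list)
    for node, c in comp_of.items():
        sketches[c] = merge(sketches[c], [rank[node]])
    return dict(sketches)


def reachable_sketch(u, comp_of, sketches, private_edges):
    """Query: walk the private edges between public components and
    compose the sketches of every component reached from u."""
    adj = defaultdict(set)
    for a, b in private_edges:
        adj[comp_of[a]].add(comp_of[b])
        adj[comp_of[b]].add(comp_of[a])
    start = comp_of[u]
    seen, stack = {start}, [start]
    result = sketches[start]
    while stack:
        c = stack.pop()
        for d in adj[c]:
            if d not in seen:
                seen.add(d)
                stack.append(d)
                result = merge(result, sketches[d])
    return result


# Hypothetical toy instance: three public components, one private edge.
comp_of = {"u": 0, "a": 0, "b": 1, "c": 1, "d": 2}
rank = {"u": 0.2, "a": 0.3, "b": 0.1, "c": 1.0, "d": 0.7}
sketches = preprocess(comp_of, rank)
composed = reachable_sketch("u", comp_of, sketches, [("u", "b")])
print(composed)
```

The query never reads the public edges, only the precomputed sketches and the private edges of the user.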

Experiments Personalized PageRank Approximating the PPR stationary distribution. Up to 4 orders of magnitude faster than the naive approach.

Conclusions New model for practical problems; some algorithms designed using sampling and sketching techniques; large speed-up in practice.

Future work New algorithms for other problems; not only graph problems; study the limits of the model (lower bounds).

Thanks!

Personalized PageRank $\mathrm{PPR}(v, z)$ is the probability of visiting $z$ in the following lazy random walk: with probability $\alpha$ the walk jumps to $v$; with probability $1 - \alpha$ it jumps to a random neighbor of the current node.
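As an illustration of this definition, the stationary distribution of the walk can be approximated by power iteration. This is a generic sketch, not the method used in the talk; it assumes an undirected graph given as adjacency lists in which every node has at least one neighbor, and the function name and default parameters are illustrative.

```python
def personalized_pagerank(adj, v, alpha=0.15, iterations=100):
    """Approximate PPR(v, .) by repeatedly applying one step of the walk."""
    p = {x: 0.0 for x in adj}
    p[v] = 1.0  # the walk starts at v
    for _ in range(iterations):
        nxt = {x: 0.0 for x in adj}
        nxt[v] += alpha  # with probability alpha, jump back to v
        for y, mass in p.items():
            # with probability 1 - alpha, move to a uniform random neighbor
            share = (1 - alpha) * mass / len(adj[y])
            for z in adj[y]:
                nxt[z] += share
        p = nxt
    return p


# Hypothetical toy graph: a triangle.
triangle = {"a": ["b", "c"], "b": ["a", "c"], "c": ["a", "b"]}
ppr = personalized_pagerank(triangle, "a")
print(ppr)
```

Each iteration preserves total probability mass, and the source node ends up with the largest score.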

Personalized PageRank Nice property [Jeh and Widom WWW03]:
$$\mathrm{PPR}_{G \cup G_u}(v, z) = (1 - \alpha) \sum_{y \in N(z)} d_{G \cup G_u}(y)^{-1}\, \mathrm{PPR}_{G \cup G_u}(v, y) + \alpha \mathbf{1}_v$$

Personalized PageRank Nice property [Jeh and Widom WWW03]:
$$\mathrm{PPR}_{G \cup G_u}(v, z) = (1 - \alpha) \sum_{y \in N(z)} d_{G \cup G_u}(y)^{-1}\, \mathrm{PPR}_{G \cup G_u}(v, y) + \alpha \mathbf{1}_v$$
We don't have the term $\mathrm{PPR}_{G \cup G_u}(v, y)$ on the right-hand side.

Personalized PageRank Nice property [Jeh and Widom WWW03]:
$$\mathrm{PPR}_{G \cup G_u}(v, z) = (1 - \alpha) \sum_{y \in N(z)} d_{G \cup G_u}(y)^{-1}\, \mathrm{PPR}_{G \cup G_u}(v, y) + \alpha \mathbf{1}_v$$
Simple heuristic, using the public graph distribution:
$$\mathrm{PPR}_{G \cup G_u}(v, z) \approx (1 - \alpha) \sum_{y \in N(z)} d_{G \cup G_u}(y)^{-1}\, \mathrm{PPR}_{G}(v, y) + \alpha \mathbf{1}_v$$
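One way to read the heuristic is sketched below. This is an interpretation, not the authors' implementation: it assumes the PPR values for source v were precomputed on the public graph only, that the neighborhood N(z) and the degrees are taken in the combined graph, and that the indicator term contributes alpha only when z = v. The function name and all numbers are hypothetical.

```python
def heuristic_ppr(v, z, public_ppr_v, adj_combined, alpha=0.15):
    """Approximate PPR(v, z) on the combined graph from public PPR values."""
    total = sum(
        public_ppr_v.get(y, 0.0) / len(adj_combined[y])
        for y in adj_combined[z]
    )
    return (1 - alpha) * total + (alpha if z == v else 0.0)


# Hypothetical inputs: public PPR scores for source "v", and the adjacency
# lists of the combined graph, in which node "z" gains private edges.
public_ppr_v = {"v": 0.5, "a": 0.3, "b": 0.2}
adj_combined = {
    "v": ["a", "z"],
    "a": ["v", "z"],
    "b": ["z"],
    "z": ["v", "a", "b"],
}
score = heuristic_ppr("v", "z", public_ppr_v, adj_combined)
print(score)
```

The query cost is proportional to the degree of z, since only the neighbors' precomputed scores and degrees are read.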

Social affinity Which connection is stronger?

Social affinity Which connection is stronger? It is important to consider the number of paths and their lengths
