Detecting Network Effects: Randomizing Over Randomized Experiments - PowerPoint PPT Presentation



SLIDE 1

Detecting Network Effects

Randomizing Over Randomized Experiments

Martin Saveski

(@msaveski)

MIT

SLIDE 2

Martin Saveski

MIT

Guillaume Saint-Jacques

MIT

Jean Pouget-Abadie

Harvard

Weitao Duan

LinkedIn

Souvik Ghosh

LinkedIn

Ya Xu

LinkedIn

Edo Airoldi

Harvard

Detecting Network Effects

Randomizing Over Randomized Experiments

SLIDE 3

Treatment

New Feed Ranking Algorithm

Zi = 1

SLIDE 4

Treatment

New Feed Ranking Algorithm

Zi = 1

Old Feed Ranking Algorithm

Control

Zj = 0


SLIDE 6

Treatment

New Feed Ranking Algorithm

Zi = 1

Engagement

Yi

Old Feed Ranking Algorithm

Control

Zj = 0


SLIDE 8

Treatment

New Feed Ranking Algorithm

Zi = 1

Engagement

Yi

Old Feed Ranking Algorithm

Control

Zj = 0

Engagement

Yj

SLIDE 9

Completely-randomized Experiment

SLIDE 10

Treatment (B)

Completely-randomized Experiment

SLIDE 11

Control (A) Treatment (B)

Completely-randomized Experiment


SLIDE 13

Completely-randomized Experiment

Control (A) vs. Treatment (B)

μ̂completely-randomized = (1/|B|) Σ_{i ∈ B} Yi − (1/|A|) Σ_{j ∈ A} Yj

SUTVA: Stable Unit Treatment Value Assumption

Every user’s behavior is affected only by their treatment and NOT by the treatment of any other user
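Under SUTVA, the difference-in-means estimator above is straightforward to compute. A minimal sketch (the outcomes and assignment below are made-up numbers for illustration, not data from the talk):

```python
def difference_in_means(outcomes, assignment):
    """Estimate the average treatment effect as the mean outcome of
    treated users (Z=1) minus the mean outcome of control users (Z=0)."""
    treated = [y for y, z in zip(outcomes, assignment) if z == 1]
    control = [y for y, z in zip(outcomes, assignment) if z == 0]
    return sum(treated) / len(treated) - sum(control) / len(control)

# Hypothetical engagement outcomes Yi and a completely randomized
# assignment (exactly half the users treated).
outcomes = [3.0, 2.5, 4.0, 1.5, 3.5, 2.0]
assignment = [1, 0, 1, 0, 1, 0]
print(difference_in_means(outcomes, assignment))  # 3.5 - 2.0 = 1.5
```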

SLIDE 14

Cluster-based Randomized Experiment


SLIDE 18

Cluster-based Randomized Experiment

Control (A) Treatment (B)


SLIDE 20

Completely-randomized Experiment OR Cluster-based Randomized Experiment?

– Completely-randomized: lower variance, more spillovers
– Cluster-based: higher variance, fewer spillovers

SLIDE 21

Design for Detecting Network Effects

SLIDE 25

Completely Randomized Experiment

SLIDE 26

Completely Randomized Experiment vs. Cluster-based Randomized Experiment

SLIDE 27

Completely Randomized Experiment vs. Cluster-based Randomized Experiment

μ̂completely-randomized = μ̂cluster-based ?

SLIDE 28

Hypothesis Test

SLIDE 29

Hypothesis Test

H0: SUTVA Holds


SLIDE 34

Hypothesis Test

H0: SUTVA holds, i.e.

E_{W,Z}[μ̂cbr − μ̂cr] = 0,  with  var_{W,Z}[μ̂cr − μ̂cbr] ≤ E_{W,Z}[σ̂²]

Reject the null when:

|μ̂cr − μ̂cbr| / √σ̂² ≥ 1/√α

The Type I error is then no greater than α (by Chebyshev's inequality).
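The rejection rule is easy to encode. A sketch (the function name and inputs are mine, not from the paper's code): given the two point estimates and a variance estimate σ̂², reject when the standardized gap exceeds 1/√α.

```python
import math

def sutva_test(mu_cr, mu_cbr, sigma2_hat, alpha=0.05):
    """Reject H0 (SUTVA holds) when |mu_cr - mu_cbr| / sqrt(sigma2_hat)
    >= 1 / sqrt(alpha); by Chebyshev's inequality this keeps the
    Type I error at or below alpha."""
    statistic = abs(mu_cr - mu_cbr) / math.sqrt(sigma2_hat)
    return statistic >= 1.0 / math.sqrt(alpha)

# Hypothetical numbers: only a gap that is large relative to its noise
# counts as evidence against SUTVA.
print(sutva_test(0.056, 0.077, 0.0007))  # small standardized gap -> False
print(sutva_test(0.056, 0.657, 0.0007))  # large standardized gap -> True
```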

SLIDE 35

Nuts and Bolts of Running Cluster-based Randomized Experiments

SLIDE 36

Why Balanced Clustering?


SLIDE 40

Why Balanced Clustering?

  • Theoretical motivation
    – Constants vs. random variables

  • Practical motivations
    – Variance reduction
    – Balance on pre-treatment covariates (homophily ⇒ large homogeneous clusters)

SLIDE 41

Algorithms for Balanced Clustering

SLIDE 42

Most clustering methods find skewed distributions of cluster sizes

(Leskovec, 2009; Fortunato, 2010)

Algorithms for Balanced Clustering


SLIDE 45

Algorithms for Balanced Clustering

Most clustering methods find skewed distributions of cluster sizes (Leskovec, 2009; Fortunato, 2010)
⇒ algorithms that enforce equal cluster sizes

Restreaming Linear Deterministic Greedy (Nishimura & Ugander, 2013)
– Streaming
– Parallelizable
– Stable
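A simplified, single-machine sketch of the restreaming LDG idea (a toy illustration, not LinkedIn's Hadoop implementation): each pass streams the nodes and greedily places each node in the cluster holding the most of its neighbors, discounted by how full that cluster already is; later passes reuse the previous pass's assignment for neighbors not yet seen in the current pass.

```python
def restreaming_ldg(adj, k, n_passes=3, leniency=0.01):
    """Toy restreaming Linear Deterministic Greedy partitioner.
    adj: dict mapping node -> set of neighbors; k: number of clusters.
    Cluster sizes are kept within a (1 + leniency) factor of n/k."""
    nodes = list(adj)
    capacity = (1.0 + leniency) * len(nodes) / k
    assignment = {}
    for _ in range(n_passes):
        sizes = [0] * k
        new_assignment = {}
        for v in nodes:
            best, best_score = None, float("-inf")
            for c in range(k):
                if sizes[c] + 1 > capacity:  # cluster full, skip it
                    continue
                # Count neighbors already in cluster c: placed there in this
                # pass, or (the "restreaming" part) in the previous pass.
                neigh = sum(1 for u in adj[v]
                            if new_assignment.get(u, assignment.get(u)) == c)
                score = neigh * (1.0 - sizes[c] / capacity)
                if score > best_score:
                    best, best_score = c, score
            new_assignment[v] = best
            sizes[best] += 1
        assignment = new_assignment
    return assignment

# Two triangles joined by a single bridge edge; a balanced 2-way
# partition should keep each triangle (mostly) together.
adj = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3},
       3: {2, 4, 5}, 4: {3, 5}, 5: {3, 4}}
print(restreaming_ldg(adj, k=2))
```

The capacity penalty `1 - sizes[c]/capacity` is what distinguishes LDG from plain greedy modularity-style assignment: it trades a little edge locality for near-equal cluster sizes, which is exactly the balance the design needs.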

SLIDE 46

Clustering the LinkedIn Graph

– Graph: >100M nodes, >10B edges
– 350 Hadoop nodes
– 1% leniency

SLIDE 47

Clustering the LinkedIn Graph: % of edges within clusters, by number of clusters k

k = 1,000: 35.6%
k = 3,000: 28.5%
k = 5,000: 26.2%
k = 7,000: 22.8%
k = 10,000: 21.1%


SLIDE 50

Choosing the Number of Clusters


SLIDE 53

Choosing the Number of Clusters

– small k → large clusters → large network effect, large variance
– large k → small clusters → small network effect, small variance

SLIDE 54

Choosing the Number of Clusters

Understanding the Type II error


SLIDE 58

Choosing the Number of Clusters: Understanding the Type II Error

Assuming an interference model:

Yi = β0 + β1·Zi + β2·ρi + εi

ρi: fraction of user i's friends who are treated
ρ̄: average fraction of a unit's neighbors contained in the unit's own cluster

E[μ̂cbr − μ̂cr] ≈ ρ̄ · β2

Choose the number of clusters M and the clustering C such that:

max over M, C of  ρ̄ / √σ̂²
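The quantity ρ̄ is directly computable from the graph and a candidate clustering, which makes the criterion above easy to evaluate per clustering. A sketch (an illustrative helper of my own, using a node-to-neighbors dict as the graph):

```python
def average_neighbor_fraction(adj, clustering):
    """rho-bar: the average, over units with at least one neighbor, of
    the fraction of the unit's neighbors in the unit's own cluster."""
    fractions = []
    for v, neighbors in adj.items():
        if not neighbors:
            continue  # isolated units contribute no spillover term
        same = sum(1 for u in neighbors if clustering[u] == clustering[v])
        fractions.append(same / len(neighbors))
    return sum(fractions) / len(fractions)

# Two triangles joined by one bridge edge, clustered triangle-by-triangle:
# four nodes keep all neighbors in-cluster, the two bridge endpoints keep 2/3.
adj = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3},
       3: {2, 4, 5}, 4: {3, 5}, 5: {3, 4}}
clustering = {0: 0, 1: 0, 2: 0, 3: 1, 4: 1, 5: 1}
print(average_neighbor_fraction(adj, clustering))  # (4*1 + 2*(2/3)) / 6 = 8/9
```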

SLIDE 59

Experiments on LinkedIn

SLIDE 60

Bernoulli Randomized Experiment vs. Cluster-based Randomized Experiment
(the completely randomized arm is run as a Bernoulli randomized experiment)

μ̂bernoulli = μ̂cluster-based ?


SLIDE 65

Experiment 1

– Population: 20% of all LinkedIn users [Bernoulli: 10%, Cluster-based: 10%]
– Time period: 2 weeks
– Number of clusters: k = 3,000
– Outcome: feed engagement

SLIDE 66

Experiment 1 results:

                                     Treatment effect   Standard deviation
Bernoulli Randomization (BR)         0.0559             0.0050
Cluster-based Randomization (CBR)    0.0771             0.0260
Delta (CBR – BR)                     0.0211             0.0265

p-value: 0.4246 → fail to reject the SUTVA null



SLIDE 75

Experiment 2

– Population: 36% of all LinkedIn users [Bernoulli: 20%, Cluster-based: 16%]
– Time period: 4 weeks
– Number of clusters: k = 10,000
– Outcome: feed engagement

SLIDE 76

Experiment 2 results:

                                     Treatment effect   Standard deviation
Bernoulli Randomization (BR)         0.2108             0.2911
Cluster-based Randomization (CBR)    0.5390             0.5613
Delta (CBR – BR)                     0.3281             0.5712

p-value: 0.0483 → reject the SUTVA null at α = 0.05



SLIDE 87

Test the SUTVA null:
– Reject → use the cluster-based experiment to estimate treatment effects (higher variance)
– Fail to reject → use the Bernoulli experiment to estimate treatment effects (lower variance)
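The decision flow above can be summarized as a small procedure (the names are mine; the threshold is the Chebyshev rule from the hypothesis-test slides):

```python
import math

def choose_estimator(mu_bernoulli, mu_cluster, sigma2_hat, alpha=0.05):
    """Run the SUTVA test; on rejection report the cluster-based estimate
    (robust to interference, higher variance), otherwise the Bernoulli
    estimate (lower variance)."""
    reject = (abs(mu_bernoulli - mu_cluster) / math.sqrt(sigma2_hat)
              >= 1.0 / math.sqrt(alpha))
    if reject:
        return "cluster-based", mu_cluster
    return "bernoulli", mu_bernoulli

# Hypothetical inputs: a gap far larger than its noise triggers the
# cluster-based choice.
print(choose_estimator(0.21, 0.54, 0.0002))  # -> ('cluster-based', 0.54)
```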

SLIDE 88

Papers available online: KDD'17, arXiv

SLIDE 89

Detecting Network Effects

Randomizing Over Randomized Experiments

Martin Saveski

(@msaveski)

MIT