Estimating treatment effects in online experiments
Media in Context and the 2015 General Election: How Traditional and Social Media Shape Elections and Governing
(ES/M010775/1)
University of Exeter
1 / 26
Estimating treatment effects in online experiments Media in Context - - PowerPoint PPT Presentation
Estimating treatment effects in online experiments Media in Context and the 2015 General Election: How Traditional and Social Media Shape Elections and Governing (ES/M010775/1) University of Exeter 1 / 26 A brief intro to the potential
1 / 26
◮ (relatively) easy to generalize to more complex treatment regimes (see
2 / 26
◮ (relatively) easy to generalize to more complex treatment regimes (see
◮ fundamental problem of causal inference
2 / 26
◮ Again, random assignment to treatment is important here: on
3 / 26
◮ e.g., media consumption habits, partisan affiliation, interest in politics,
4 / 26
◮ CATE: conditional average treatment effects ◮ i.e., average treatment effects among subgroups defined by baseline
5 / 26
ATE ^ and CATE ^ −5 −2.5 Non−UKIP ID ATE UKIP ID 6 / 26
7 / 26
◮ Difficult to interpret & understand beyond 2-way interactions ⋆ many interactions also lower statistical power and lead to imprecise
◮ So we typically use a few relevant mediators that need to be selected a
⋆ bypassing alternative explanations ◮ Model mis-specification and sensitivity to functional forms (especially
◮ Assumes a deterministic relationship between mediator and treatment
1
2
7 / 26
◮ Within each class, treatment effects are simply given by βj ◮ Variations in βj across classes capture differences in responsiveness to
8 / 26
◮ Treatment: EU referendum was just a campaign promise to attract
⋆ Control: government will not renege on its promise ◮ Outcome: Approve or disapprove of government action ◮ Possible moderators: Identification with UKIP, political interest and
9 / 26
◮ does heterogeneity exist? (i.e., do we distinguish classes of
◮ how many classes? ◮ what is driving heterogeneity?
◮ no asymptotic approximations: suitable for typical experimental
◮ flexibility to explore posterior distribution of parameters
10 / 26
1
2
3
4
⋆ check using standard Bayesian convergence diagnostics (e.g.,
11 / 26
Classes CATE ^ 1 2 −1.5 −1 −0.5 0.5 1 1.5
Estimates Intercept Prior Exposure Political Knowledge Media Use Media Trust Interest Politics Partisan: Conservative Partisan: Labour Partisan:Libdem Partisan:UKIP Independents University Education
12 / 26
◮ and categorical outcomes ◮ not so easy to accomplish using some of the other approaches we will
◮ treatment: media report on the “decisiveness” of the majority ◮ control: business news piece ◮ outcomes: several attitudes about governments’ ability to exert power
⋆ The government will be able to fulfill its campaign promises ⋆ It it important to command a majority in parliament to govern ⋆ The government has little effect on economic performance ⋆ The government’s ability to improve life in Britain depends on the
⋆ Accountability depends that the majority party governs by itself 13 / 26
1 Subjects are classified into “classes” based on Wi and the
2 Within each class j, for each outcomes k = 1, . . . , 5, the
3 Heterogeneity in responsiveness to treatment can be gauged
15 / 26
◮ useful for high-dimensional data ◮ less sensitive to specification of functional forms than parametric
◮ more robust to the choice of tuning parameters than other statistical
◮ existing off the shelf software (in R) minimizes the need for
16 / 26
◮ Yes: Node 1; No: Node 2 ◮ Repeat this process for each variable until each unit of analysis is
17 / 26
◮ e.g., compare the difference in ˆ
18 / 26
19 / 26
CATE estimate for AGE
CATE 21 31 41 51 61 71 81 −4 −2 2 4
21 / 26
◮ sounds Chinese, I know! But don’t worry too much ◮ The bottom line is: the method simultaneously selects a subset of the
◮ Imai and Ratkovic’s method “does it for you” in a single step 22 / 26
CATE ^
ATE ^
−2.5 −1.5 1 2 3 4 5 6
CATE ^
ATE ^
−4 −2.5 −1.5 1 2 3 4 5
CATE ^
ATE ^
−2 −1.5 1
‘The Telegraph'
CATE ^
ATE ^
−1 −0.5 1
23 / 26
◮ we have typically 4 treatment conditions in our experiments ◮ we could apply these tools for pairwise comparisons ◮ however, no easy way out in the case of - say - ordered treatment
24 / 26
25 / 26
◮ “flexmix: Flexible Mixture Modeling.” https:
◮ “FindIt: Finding Heterogeneous Treatment Effects.” https:
◮ “BayesTree: Bayesian Additive Regression Trees.” https:
26 / 26