Experiments and Causal Inference Erik Gahner Larsen Advanced - - PowerPoint PPT Presentation

experiments and causal inference
SMART_READER_LITE
LIVE PREVIEW

Experiments and Causal Inference Erik Gahner Larsen Advanced - - PowerPoint PPT Presentation

Experiments and Causal Inference Erik Gahner Larsen Advanced applied statistics, 2015 1 / 67 Articles published in APSR 10 15 0 5 Experiments in political science research 190609 191014 191519 192024 Political Science


slide-1
SLIDE 1

Experiments and Causal Inference

Erik Gahner Larsen Advanced applied statistics, 2015

1 / 67

slide-2
SLIDE 2

Experiments in political science research

Cambridge Handbook of Experimental Political Science (Druckman et al. 2011, 5)

5 10 15 1906−09 1910−14 1915−19 1920−24 1925−29 1930−34 1935−39 1940−44 1945−49 1950−54 1955−59 1960−64 1964−69 1970−74 1975−79 1980−84 1985−89 1990−94 1995−99 2000−04 2005−09

Articles published in APSR

2 / 67

slide-3
SLIDE 3

Quotes #1

“Like it or not, social scientists rely on the logic of experimentation even when analyzing nonexperimental data.” (Green and Gerber 2003, 110) “In some sense every empirical researcher is reporting the results of an

  • experiment. Every researcher who behaves as if an exogenous variable

varies independently of an error term effectively views their data as coming from an experiment.” (Harrison and List 2004, 1009)

3 / 67

slide-4
SLIDE 4

Quotes #2

“[T]here is no reason to suppose that case study research follows a divergent logic of inquiry relative to experimental research.” (Gerring and McDermott 2007, 689) “If you can’t devise an experiment that answers your question in a world where anything goes, then the odds of generating useful results with a modest budget and nonexperimental survey data seem pretty slim. The description of an ideal experiment also helps you formulate causal questions precisely.” (Angrist and Pischke 2009, 5)

4 / 67

slide-5
SLIDE 5

Agenda

▸ Causal effects ▸ Experiments and assumptions ▸ Issues in experimental research ▸ Types of experiments

5 / 67

slide-6
SLIDE 6

Example: The effect of facebook use on life satisfaction

▸ How would we test this in an observational setting? ▸ What is the problem?

6 / 67

slide-7
SLIDE 7

What is it all about?

▸ We need strong designs in order to make causal inferences

▸ Remember: Science is all about causality

▸ The issue it not the data we have . . . but the data we do not have. ▸ “Easy” to measure the factual world

▸ What about the counterfactual world?

▸ We need theoretical and statistical tools to make valid counterfactuals

7 / 67

slide-8
SLIDE 8

What is it all about?

▸ From estimation strategies (statistics) to identification strategies

(design):

▸ “Without an experiment, a natural experiment, a discontinuity, or

some other strong design, no amount of econometric or statistical modeling can make the move from correlation to causation persuasive.” (Sekhon 2009, 503)

8 / 67

slide-9
SLIDE 9

Neyman-Rubin causal model

▸ We are interested in potential outcomes to define causal effects ▸ For individual i, we have a potential outcome: Yi ▸ Treatment: Wi ▸ Potential outcome given treatment treatment status: Yi(Wi) ▸ Two potential outcomes: Yi(1), Yi(0) ▸ Unit causal effect: The difference between a unit’s potential outcome

under treatment and the unit’s potential outcome under control. τi = Yi(1) − Yi(0)

9 / 67

slide-10
SLIDE 10

Example: facebook and life satisfaction

User i Yi(0) (no facebook) 1 60 2 20 3 80 4 30 5 40 6 75 7 40 8 20 9 60 10 75 Average 50

10 / 67

slide-11
SLIDE 11

Example: facebook and life satisfaction

User i Yi(0) (no facebook) Yi(1) (facebook) 1 60 70 2 20 50 3 80 80 4 30 45 5 40 50 6 75 60 7 40 45 8 20 30 9 60 85 10 75 85 Average 50 60

11 / 67

slide-12
SLIDE 12

Example: facebook and life satisfaction

User i Yi(0) (no facebook) Yi(1) (facebook) τi 1 60 70 10 2 20 50 30 3 80 80 4 30 45 15 5 40 50 10 6 75 60

  • 15

7 40 45 5 8 20 30 10 9 60 85 25 10 75 85 10 Average 50 60 10

12 / 67

slide-13
SLIDE 13

FPCI

13 / 67

slide-14
SLIDE 14

The Fundamental Problem of Causal Inference (FPCI)

▸ “It is impossible to observe the value of Yi(1) and Yi(0) on the same

unit and, therefore, it is impossible to observe the effect of Wi on i.” (Holland 1986, 947)

▸ We observe one outcome: the realised outcome

Ri = WiYi(1) + (1 − Wi)Yi(0)

14 / 67

slide-15
SLIDE 15

Example: facebook and life satisfaction

User i Yi(0) Yi(1) Wi 1 60 ? 2 ? 50 1 3 80 ? 4 ? 45 1 5 40 ? 6 ? 60 1 7 ? 45 1 8 20 ? 9 ? 85 1 10 75 ?

15 / 67

slide-16
SLIDE 16

Example: facebook and life satisfaction

User i Yi(0) Yi(1) Wi Ri (observed outcome) 1 60 ? 60 2 ? 50 1 50 3 80 ? 80 4 ? 45 1 45 5 40 ? 40 6 ? 60 1 60 7 ? 45 1 45 8 20 ? 20 9 ? 85 1 85 10 75 ? 75

16 / 67

slide-17
SLIDE 17

So, you cannot prove causality with statistics?

17 / 67

slide-18
SLIDE 18

Well, you can only prove causality with statistics.

▸ Rosenbaum (2010, 35) ▸ The FPCI is a missing data problem. What is the solution?

18 / 67

slide-19
SLIDE 19

Random assignment

Figur 1: Random selection

19 / 67

slide-20
SLIDE 20

Random assignment

▸ Create two groups of observations that are, in expectation, identical

prior to application of the treatment (Green and Gerber 2012, 31)

▸ Create a counterfactual group. ▸ Guarantees that the treatment is prior to the outcome, avoiding

posttreatment and simultaneity biases.

▸ P(W ) = 0.5 (coin flip)

20 / 67

slide-21
SLIDE 21

Assumption I: Ignorability of Treatment Assignment

▸ Pretreatment covariates, X ▸ Unconfoundedness (Y (1), Y (0), X) ⊥ W ▸ What about (Y (1), Y (0)) ⊥ W ∣X?

▸ We will address this issue next week 21 / 67

slide-22
SLIDE 22

Average treatment effect

▸ What most scholars are interested in ▸ Average treatment effect:

τATE = E[Y (1) − Y (0)] = E[Y (1)] − E[Y (0)]

22 / 67

slide-23
SLIDE 23

Assumption II: Stable Unit Treatment Value Assumption (SUTVA)

▸ A collection of implied assumptions about the effect of treatments on

individuals

▸ “The potential outcomes for any unit do not vary with the treatments

assigned to other units, and, for each unit, there are no different forms or versions of each treatment level, which lead to different potential outcomes.” (Imbens and Rubin 2015, 10)

23 / 67

slide-24
SLIDE 24

Assumption II: Stable Unit Treatment Value Assumption (SUTVA)

  • 1. Noninterference: Potential outcomes for unit i depend only on the

treatment assignment of unit i (no interference or spillover effect): (Y (1), Y (0)) ⊥ Wj, ∀i ≠ j

  • 2. Exclusion restriction: Only one version of each treatment possible for

each unit

24 / 67

slide-25
SLIDE 25

Assumption II: Stable Unit Treatment Value Assumption (SUTVA)

Two implications (from Heckman 2005, 11):

▸ Rules out social interactions and general equilibrium effects. ▸ Rules out any effect of the assignment mechanism on potential

  • utcomes.

25 / 67

slide-26
SLIDE 26

Assumption III: Compliance

▸ Wi is assignment to treatment ▸ Subjects can - in many cases - decide not to comply ▸ Di: treatment status (1 if treated, 0 if not)

26 / 67

slide-27
SLIDE 27

Assumption III: Compliance, always-takers

▸ Always-takers will always be treated ▸ Wi = 1, Di = 1 ▸ Wi = 0, Di = 1 ▸ Facebook example: Will use facebook independent of treatment

assignment

27 / 67

slide-28
SLIDE 28

Assumption III: Compliance, never-takers

▸ Never-takers will never be treated ▸ Wi = 1, Di = 0 ▸ Wi = 0, Di = 0 ▸ Facebook example: Will not use facebook independent of treatment

assignment

28 / 67

slide-29
SLIDE 29

Assumption III: Compliance, cooperators

▸ Cooperators will. . . cooperate ▸ Wi = 1, Di = 1 ▸ Wi = 0, Di = 0 ▸ Facebook example: Will only use facebook if assigned to treatment

29 / 67

slide-30
SLIDE 30

Assumption III: Compliance, defiers

▸ Defiers will. . . do the opposite ▸ Wi = 1, Di = 0 ▸ Wi = 0, Di = 1 ▸ Facebook example: Will use facebook if not assigned to treatment

and not use facebook if assigned to treatment

30 / 67

slide-31
SLIDE 31

So, which cases inform causal inference?

▸ The cases whose treatment status can be changed (hint: cooperators)

31 / 67

slide-32
SLIDE 32

Assumption III: Compliance

How do we know that Wi = 1, Di = 1 is a cooperator and not an always-taker? How do we know that Wi = 0, Di = 0 is a cooperator and not a never-taker?

32 / 67

slide-33
SLIDE 33

Assumption III: Compliance

▸ We only have realised outcomes (we need a counterfactual) ▸ Hard to say whether we are dealing with compliance or noncompliance ▸ Remember: Try to measure compliance!

33 / 67

slide-34
SLIDE 34

Intention-to-treat

▸ Our effects are often intention-to-treat (ITT) estimates. ▸ Mean difference on Y between subjects assigned to treatment and

subjects not assigned to treatment.

34 / 67

slide-35
SLIDE 35

Example: Noncompliance with Encouragement Wi to Exercise Di

▸ From Table 5.5 in Rosenbaum (2002, 182). ▸ Y = forced expiratory volume (higher numbers signifying better lung

function)

▸ Will subject exercice with encouragement? (di(1)) ▸ Will subject exercice without encouragement? (di(0))

35 / 67

slide-36
SLIDE 36

Example: Noncompliance with Encouragement Wi to Exercise Di

User i di(1) di(0) 1 1 1 2 1 1 3 1 4 1 5 1 6 1 7 1 8 1 9 10

36 / 67

slide-37
SLIDE 37

What are the potential outcomes?

37 / 67

slide-38
SLIDE 38

Example: Noncompliance with Encouragement Wi to Exercise Di

User i di(1) di(0) Yi(1) Yi(0) 1 1 1 71 71 2 1 1 68 68 3 1 64 59 4 1 62 57 5 1 59 54 6 1 58 53 7 1 56 51 8 1 56 51 9 42 42 10 39 39

38 / 67

slide-39
SLIDE 39

Let’s assign some treatments and see the realised

  • utcomes.

39 / 67

slide-40
SLIDE 40

Example: Noncompliance with Encouragement Wi to Exercise Di

User i di(1) di(0) Yi(1) Yi(0) Wi Di Ri 1 1 1 71 71 1 1 71 2 1 1 68 68 1 68 3 1 64 59 1 1 64 4 1 62 57 57 5 1 59 54 54 6 1 57 52 1 1 57 7 1 56 51 1 1 56 8 1 56 51 51 9 42 42 42 10 39 39 1 39

40 / 67

slide-41
SLIDE 41

Let’s create a data frame

di1 <- c(1, 1, 1, 1, 1, 1, 1, 1, 0, 0) di0 <- c(1, 1, 0, 0, 0, 0 ,0, 0, 0, 0) Yi1 <- c(71, 68, 64, 62, 59, 57, 56, 56, 42, 39) Yi0 <- c(71, 68, 59, 57, 54, 52, 51, 51, 42, 39) Wi <- c(1, 0, 1, 0, 0, 1, 1, 0, 0, 1) Di <- c(1, 1, 1, 0, 0, 1, 1, 0, 0, 0) Ri <- c(71, 68, 64, 57, 54, 57, 56, 51, 42, 39) lung <- data.frame(di1, di0, Yi1, Yi0, Wi, Di, Ri)

41 / 67

slide-42
SLIDE 42

What is the (average) causal effect?

mean( lung[lung$Wi == 1 & lung$di1 - lung$di0 == 1,]$Ri - lung[lung$Wi == 0 & lung$di1 - lung$di0 == 1,]$Ri ) ## [1] 5

42 / 67

slide-43
SLIDE 43

What is the naive average treatment effect?

mean(lung[lung$Di == 1,]$Ri - lung[lung$Di == 0,]$Ri) ## [1] 14.6

43 / 67

slide-44
SLIDE 44

summary(lm(Ri~Di, data=lung)) ## ## Call: ## lm(formula = Ri ~ Di, data = lung) ## ## Residuals: ## Min 1Q Median 3Q Max ##

  • 9.60
  • 6.50

1.60 5.25 8.40 ## ## Coefficients: ## Estimate Std. Error t value Pr(>|t|) ## (Intercept) 48.600 3.225 15.070 3.72e-07 *** ## Di 14.600 4.561 3.201 0.0126 * ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ## ## Residual standard error: 7.211 on 8 degrees of freedom ## Multiple R-squared: 0.5616, Adjusted R-squared: 0.5068 ## F-statistic: 10.25 on 1 and 8 DF, p-value: 0.01259

44 / 67

slide-45
SLIDE 45

Problem

ATE is confounded by endogenous selection into treatment

45 / 67

slide-46
SLIDE 46

What is the ITT?

mean(lung[lung$Wi == 1,]$Ri - lung[lung$Wi == 0,]$Ri) ## [1] 3

46 / 67

slide-47
SLIDE 47

summary(lm(Ri~Wi, data=lung)) ## ## Call: ## lm(formula = Ri ~ Wi, data = lung) ## ## Residuals: ## Min 1Q Median 3Q Max ##

  • 18.4
  • 2.9
  • 0.4

5.6 13.6 ## ## Coefficients: ## Estimate Std. Error t value Pr(>|t|) ## (Intercept) 54.400 4.812 11.304 3.38e-06 *** ## Wi 3.000 6.806 0.441 0.671 ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ## ## Residual standard error: 10.76 on 8 degrees of freedom ## Multiple R-squared: 0.02371, Adjusted R-squared:

  • 0.09832

## F-statistic: 0.1943 on 1 and 8 DF, p-value: 0.671

47 / 67

slide-48
SLIDE 48

Problem

You might say: “Seriously, I can think of a billion cases where all this might be problematic.”

48 / 67

slide-49
SLIDE 49

Welcome to the social sciences.

49 / 67

slide-50
SLIDE 50

Types of experiments

▸ Lab experiments ▸ Survey experiments ▸ Field experiments ▸ Natural experiments ▸ Quasi-experiments

50 / 67

slide-51
SLIDE 51

Types of designs

Experiment Comparison of experiment and control group Exogenous or as-if exogenous intervention Groupings are randomized or as-if randomized Researcher manipulates intervention Lab Yes Yes Yes Yes Field Yes Yes Yes Yes Survey Yes Yes Yes Yes Natural Yes Yes Yes No Quasi Yes Yes No No Observational Yes No No No

51 / 67

slide-52
SLIDE 52

What type of experiment is the facebook and life satisfaction study?

52 / 67

slide-53
SLIDE 53

Facebook and life satisfaction

▸ Lab: Get subjects into the lab, randomize, treatment group use

facebook.

▸ Survey: Get subjects to answer a survey, randomize, treatment group

see material from facebook.

▸ Field: Get subjects to sign up, randomize, treatment group use

facebook.

▸ Natural/quasi: Utilize (as-if) random variation in the access to

facebook.

▸ Observational: Ask people about facebook use and life satisfaction :-(

53 / 67

slide-54
SLIDE 54

Example: Study design, facebook and life satisfaction

Figur 2: Study design, facebook

54 / 67

slide-55
SLIDE 55

Example: Study effects, facebook and life satisfaction

Figur 3: Effects, facebook

55 / 67

slide-56
SLIDE 56

Example: Study effects, facebook and life satisfaction (0-9)

Control Treatment 2 4 6 Before Now Before Now

56 / 67

slide-57
SLIDE 57

What if the randomization failed?

Figur 4: Random numbers

57 / 67

slide-58
SLIDE 58

What about covariates?

▸ Always report the unadjusted treatment effect: “If an estimated

treatment effect is insignificant in the absence of controls, this should clearly shape our interpretation of the effect being estimated” (Dunning 2012, 268)

▸ We use experiments so we don’t have to care about covariates: “Yet,

the whole point of a natural experiment is that such concerns about confounding should be limited by the research design.” (Dunning 2012, 118)

▸ Covariates reduce noise, increases the chance that we reach statistical

significance (Mutz 2011, 123f)

▸ Positive view: Variables measured before the variables of interest was

determined are generally good controls (Angrist and Pischke 2009, 64ff)

58 / 67

slide-59
SLIDE 59

How to conduct an experiment

▸ Remember theory. ▸ Consider – from a practical perspective – whether an experiment is

feasible

▸ Specify hypothesis/hypotheses prior to the data collection

▸ Prespecification (if you plan to publish in academic journals:

preregister)

▸ What is your dependent variable? 59 / 67

slide-60
SLIDE 60

How to report an experiment

▸ We have specific guidelines for reporting experimental research in

political science

▸ See Gerber et al. (2014): Reporting Guidelines for Experimental

Research: A Report from the Experimental Research Section Standards Committee. Journal of Experimental Political Science 1(1): 81-98.

60 / 67

slide-61
SLIDE 61

What about external validity?

▸ External validity is all about your theory ▸ And remember: “It makes no sense to say that some empirical

research is low on internal validity but high on external validity.” (Morton and Williams 2010, 275)

61 / 67

slide-62
SLIDE 62

Replication, reproduction and transparency

▸ One of the biggest issues with experiments (and all research) today:

lack of replications

▸ Novelty bias (especially in political science!) ▸ We need more replications of existing experiments ▸ Sadly, only few examples of direct replications in political science ▸ “Indeed, few experimental literatures have generated repicable

interactions between two variables.” (Green and Gerber 2012, 310)

62 / 67

slide-63
SLIDE 63

Replication, reproduction and transparency

▸ Make sure that your research is reproducible (STATA do-file and/or R

scripts)

▸ Share your data ▸ Reproduce and replicate existing studies (great way to “learn science”) ▸ Be transparent (what did you do, how did you do it etc.)

63 / 67

slide-64
SLIDE 64

Observational research

▸ Cochran’s Basic Advice: “The planner of an observational study

should always ask himself the question, ‘How would the study be conducted if it were possible to do it by controlled experimentation?’ ” (from Rosenbaum 2010, 16)

64 / 67

slide-65
SLIDE 65

Conclusion: Words to live by

▸ Think of experiments as observational studies ▸ Think of observational studies as experiments

65 / 67

slide-66
SLIDE 66

Install R and RStudio

R: cran.rstudio.com RStudio: rstudio.com/products/rstudio/download/

66 / 67

slide-67
SLIDE 67

Schedule

▸ Today: Experiments ▸ Next: Matching ▸ Lecture 13 and 14: Natural experiments

▸ Regression-Discontinuity Designs ▸ Instrumental Variable Regression

▸ Lab session 6 and 7: R and matching

67 / 67