Session I Survey Experiments in Context Thomas J. Leeper - - PowerPoint PPT Presentation

session i survey experiments in context
SMART_READER_LITE
LIVE PREVIEW

Session I Survey Experiments in Context Thomas J. Leeper - - PowerPoint PPT Presentation

Introductions Course Outline History/Logic Session I Survey Experiments in Context Thomas J. Leeper Government Department London School of Economics and Political Science Introductions Course Outline History/Logic 1 Introductions 2 Course


slide-1
SLIDE 1

Introductions Course Outline History/Logic

Session I Survey Experiments in Context

Thomas J. Leeper

Government Department London School of Economics and Political Science

slide-2
SLIDE 2

Introductions Course Outline History/Logic

1 Introductions 2 Course Outline 3 History and Logic

slide-3
SLIDE 3

Introductions Course Outline History/Logic

Activity!

slide-4
SLIDE 4

Introductions Course Outline History/Logic

Activity!

1 Ask you to guess a number

slide-5
SLIDE 5

Introductions Course Outline History/Logic

Activity!

1 Ask you to guess a number 2 Number off 1 and 2 across the room

slide-6
SLIDE 6

Introductions Course Outline History/Logic

Activity!

1 Ask you to guess a number 2 Number off 1 and 2 across the room 3 Group 2, close your eyes

slide-7
SLIDE 7

Introductions Course Outline History/Logic

Activity! Group 1 Think about whether the population

  • f Chicago is more or less than

500,000 people. What do you think the population of Chicago is?

slide-8
SLIDE 8

Introductions Course Outline History/Logic

Activity!

1 Ask you to guess a number 2 Number off 1 and 2 across the room 3 Group 2, close your eyes 4 Group 1, close your eyes

slide-9
SLIDE 9

Introductions Course Outline History/Logic

Activity! Group 2 Think about whether the population

  • f Chicago is more or less than

10,000,000 people. What do you think the population of Chicago is?

slide-10
SLIDE 10

Introductions Course Outline History/Logic

slide-11
SLIDE 11

Introductions Course Outline History/Logic

Enter your data

Go here: http://bit.ly/297vEdd Enter your guess and your group number

slide-12
SLIDE 12

Introductions Course Outline History/Logic

Results

True population: 2.79 million

slide-13
SLIDE 13

Introductions Course Outline History/Logic

Results

True population: 2.79 million What did you guess? (See Responses)

slide-14
SLIDE 14

Introductions Course Outline History/Logic

Results

True population: 2.79 million What did you guess? (See Responses) What’s going on here?

An experiment! Demonstrates “anchoring” heuristic

slide-15
SLIDE 15

Introductions Course Outline History/Logic

Results

True population: 2.79 million What did you guess? (See Responses) What’s going on here?

An experiment! Demonstrates “anchoring” heuristic

Experiments are easy to analyze, but only if designed and implemented well

slide-16
SLIDE 16

Introductions Course Outline History/Logic

1 Introductions 2 Course Outline 3 History and Logic

slide-17
SLIDE 17

Introductions Course Outline History/Logic

Who am I?

Thomas Leeper Associate Professor in Political Behaviour at London School of Economics 2013–15: Aarhus University (Denmark) 2008–12: PhD from Northwestern University (Chicago, USA) Birth–2008: Minnesota, USA Interested in public opinion and political psychology Email: t.leeper@lse.ac.uk

slide-18
SLIDE 18

Introductions Course Outline History/Logic

Who are you?

Introduce yourself to a neighbour Where are you from? What do you hope to learn from the course?

slide-19
SLIDE 19

Introductions Course Outline History/Logic

Quick Survey

slide-20
SLIDE 20

Introductions Course Outline History/Logic

Quick Survey

1 How many of you have worked with survey

data before?

slide-21
SLIDE 21

Introductions Course Outline History/Logic

Quick Survey

1 How many of you have worked with survey

data before?

2 Of those, how many of you have performed a

survey before?

slide-22
SLIDE 22

Introductions Course Outline History/Logic

Quick Survey

1 How many of you have worked with survey

data before?

2 Of those, how many of you have performed a

survey before?

3 How many of you have worked with

experimental data before?

slide-23
SLIDE 23

Introductions Course Outline History/Logic

Quick Survey

1 How many of you have worked with survey

data before?

2 Of those, how many of you have performed a

survey before?

3 How many of you have worked with

experimental data before?

4 Of those, how many of you have performed an

experiment before?

slide-24
SLIDE 24

Introductions Course Outline History/Logic

1 Introductions 2 Course Outline 3 History and Logic

slide-25
SLIDE 25

Introductions Course Outline History/Logic

Course Materials

All material for the course is available at:

http://www.thomasleeper.com/ surveyexpcourse/

slide-26
SLIDE 26

Introductions Course Outline History/Logic

Learning Outcomes

By the end of the week, you should be able to. . .

slide-27
SLIDE 27

Introductions Course Outline History/Logic

Learning Outcomes

By the end of the week, you should be able to. . .

1 Explain how to analyze experiments quantitatively.

slide-28
SLIDE 28

Introductions Course Outline History/Logic

Learning Outcomes

By the end of the week, you should be able to. . .

1 Explain how to analyze experiments quantitatively. 2 Explain how to design experiments that speak to

relevant research questions and theories.

slide-29
SLIDE 29

Introductions Course Outline History/Logic

Learning Outcomes

By the end of the week, you should be able to. . .

1 Explain how to analyze experiments quantitatively. 2 Explain how to design experiments that speak to

relevant research questions and theories.

3 Evaluate the uses and limitations of several common

survey experimental paradigms.

slide-30
SLIDE 30

Introductions Course Outline History/Logic

Learning Outcomes

By the end of the week, you should be able to. . .

1 Explain how to analyze experiments quantitatively. 2 Explain how to design experiments that speak to

relevant research questions and theories.

3 Evaluate the uses and limitations of several common

survey experimental paradigms.

4 Identify practical issues that arise in the implementation

  • f experiments and evaluate how to anticipate and

respond to them.

slide-31
SLIDE 31

Introductions Course Outline History/Logic

Schedule of Four Sessions

1 Survey Experiments in Context 2 Examples and Paradigms 3 Hands-on Session 4 Practical Issues

slide-32
SLIDE 32

Introductions Course Outline History/Logic

Questions?

slide-33
SLIDE 33

Introductions Course Outline History/Logic

1 Introductions 2 Course Outline 3 History and Logic

slide-34
SLIDE 34

Introductions Course Outline History/Logic

Experiments: History I

Oxford English Dictionary defines “experiment” as:

1 A scientific procedure undertaken to make a

discovery, test a hypothesis, or demonstrate a known fact

2 A course of action tentatively adopted without

being sure of the outcome

slide-35
SLIDE 35

Introductions Course Outline History/Logic

Experiments: History II

“Experiments” have a very long history Major advances in design and analysis of experiments based on agricultural and later biostatistical research in the 19th century (Fisher, Neyman, Pearson, etc.)

slide-36
SLIDE 36

Introductions Course Outline History/Logic

Experiments: History II

“Experiments” have a very long history Major advances in design and analysis of experiments based on agricultural and later biostatistical research in the 19th century (Fisher, Neyman, Pearson, etc.) Multiple origins in the social sciences

slide-37
SLIDE 37

Introductions Course Outline History/Logic

Experiments: History II

“Experiments” have a very long history Major advances in design and analysis of experiments based on agricultural and later biostatistical research in the 19th century (Fisher, Neyman, Pearson, etc.) Multiple origins in the social sciences

First randomized experiment by Peirce and Jastrow (1884) Gosnell (1924) LaLonde (1986) Gerber and Green (2000)

slide-38
SLIDE 38

Introductions Course Outline History/Logic

Experiments: History III

“Question testing” split ballots (e.g., Cantril) Rise of surveys in the behavioral revolution Split ballots (e.g., Schuman & Presser; Bishop)

slide-39
SLIDE 39

Introductions Course Outline History/Logic

Experiments: History III

“Question testing” split ballots (e.g., Cantril) Rise of surveys in the behavioral revolution Split ballots (e.g., Schuman & Presser; Bishop) 1983: Merrill Shanks and the Berkeley Survey Research Center develop CATI

slide-40
SLIDE 40

Introductions Course Outline History/Logic

Experiments: History III

“Question testing” split ballots (e.g., Cantril) Rise of surveys in the behavioral revolution Split ballots (e.g., Schuman & Presser; Bishop) 1983: Merrill Shanks and the Berkeley Survey Research Center develop CATI Mid-1980s: Paul Sniderman & Tom Piazza performed the first modern survey experiment1

Then: the “first multi-investigator” Later: Skip Lupia and Diana Mutz created TESS

1Sniderman, Paul M., and Thomas Piazza. 1993. The Scar of Race. Cambridge, MA: Harvard University Press.

slide-41
SLIDE 41

Introductions Course Outline History/Logic

TESS

Time-Sharing Experiments for the Social Sciences Multi-disciplinary initiative that provides infrastructure for survey experiments on nationally representative samples of the United States population Great resource for survey experimental materials, designs, and data Funded by the U.S. National Science Foundation Anyone anywhere in the world can apply See also: LISS, Bergen’s Citizen Panel, Gothenburg’s Citizen Panel

slide-42
SLIDE 42

Introductions Course Outline History/Logic

The First Survey Experiment?

Hadley Cantril (1940) asks 3000 Americans either:

slide-43
SLIDE 43

Introductions Course Outline History/Logic

The First Survey Experiment?

Hadley Cantril (1940) asks 3000 Americans either: Do you think the U.S. should do more than it is now doing to help England and France? Yes No

slide-44
SLIDE 44

Introductions Course Outline History/Logic

The First Survey Experiment?

Hadley Cantril (1940) asks 3000 Americans either: Do you think the U.S. should do more than it is now doing to help England and France? Yes No Do you think the U.S. should do more than it is now doing to help England and France in their fight against Hitler? Yes No

slide-45
SLIDE 45

Introductions Course Outline History/Logic

The First Survey Experiment?

Hadley Cantril (1940) asks 3000 Americans either: Do you think the U.S. should do more than it is now doing to help England and France? Yes: 13% No Do you think the U.S. should do more than it is now doing to help England and France in their fight against Hitler? Yes No

slide-46
SLIDE 46

Introductions Course Outline History/Logic

The First Survey Experiment?

Hadley Cantril (1940) asks 3000 Americans either: Do you think the U.S. should do more than it is now doing to help England and France? Yes: 13% No Do you think the U.S. should do more than it is now doing to help England and France in their fight against Hitler? Yes: 22% No

slide-47
SLIDE 47

Introductions Course Outline History/Logic

The First Survey Experiment?

Hadley Cantril (1940) asks 3000 Americans either: Do you think the U.S. should do more than it is now doing to help England and France? Yes: 13% No Do you think the U.S. should do more than it is now doing to help England and France in their fight against Hitler? Yes: 22% No The “Hitler effect” was 22% - 13% = 9%

slide-48
SLIDE 48

Introductions Course Outline History/Logic

Definitions I

A randomized experiment is:

The observation of units after, and possibly before, a randomly assigned intervention in a controlled set- ting, which tests one or more precise causal expec- tations

slide-49
SLIDE 49

Introductions Course Outline History/Logic

Definitions I

A randomized experiment is:

The observation of units after, and possibly before, a randomly assigned intervention in a controlled set- ting, which tests one or more precise causal expec- tations

If we manipulate the thing we want to know the effect of (X), and control (i.e., hold constant) everything we do not want to know the effect of (Z), the only thing that can affect the outcome (Y ) is X.

slide-50
SLIDE 50

Introductions Course Outline History/Logic

Definitions II

slide-51
SLIDE 51

Introductions Course Outline History/Logic

Definitions II

A survey experiment is just an experiment that occurs in a survey context As opposed to in the field or in a laboratory

slide-52
SLIDE 52

Introductions Course Outline History/Logic

Definitions II

A survey experiment is just an experiment that occurs in a survey context As opposed to in the field or in a laboratory Can be in any mode (face-to-face, CATI, IVR, CASI, etc.)

slide-53
SLIDE 53

Introductions Course Outline History/Logic

Definitions II

A survey experiment is just an experiment that occurs in a survey context As opposed to in the field or in a laboratory Can be in any mode (face-to-face, CATI, IVR, CASI, etc.) May or may not involve a representative population Mutz (2011): “population-based survey experiments”

slide-54
SLIDE 54

Introductions Course Outline History/Logic

Definitions II

slide-55
SLIDE 55

Introductions Course Outline History/Logic

Definitions II

Unit: A physical object at a particular point in time

slide-56
SLIDE 56

Introductions Course Outline History/Logic

Definitions II

Treatment: An intervention, whose effect(s) we wish to assess relative to some other (non-)intervention Synonyms: manipulation, intervention, factor, condition, cell

slide-57
SLIDE 57

Introductions Course Outline History/Logic

Definitions II

Outcome: The variable we are trying to explain

slide-58
SLIDE 58

Introductions Course Outline History/Logic

Definitions II

Potential outcomes: The outcome value for each unit that we would observe if that unit received each treatment Multiple potential outcomes for each unit, but we

  • nly observe one of them
slide-59
SLIDE 59

Introductions Course Outline History/Logic

Definitions II

Causal effect: The comparisons between the unit-level potential outcomes under each intervention This is what we want to know!

slide-60
SLIDE 60

Introductions Course Outline History/Logic

Definitions II

Average causal effect: Difference in mean

  • utcomes between treatment groups

This is almost what we want to know!

slide-61
SLIDE 61

Introductions Course Outline History/Logic

Example

slide-62
SLIDE 62

Introductions Course Outline History/Logic

Example

Unit: Americans in 1940

slide-63
SLIDE 63

Introductions Course Outline History/Logic

Example

Unit: Americans in 1940 Outcome: Support for military intervention

slide-64
SLIDE 64

Introductions Course Outline History/Logic

Example

Unit: Americans in 1940 Outcome: Support for military intervention Treatment: Mentioning Hitler versus not

slide-65
SLIDE 65

Introductions Course Outline History/Logic

Example

Unit: Americans in 1940 Outcome: Support for military intervention Treatment: Mentioning Hitler versus not Potential outcomes:

1 Support in “Hitler” condition 2 Support in control condition

slide-66
SLIDE 66

Introductions Course Outline History/Logic

Example

Unit: Americans in 1940 Outcome: Support for military intervention Treatment: Mentioning Hitler versus not Potential outcomes:

1 Support in “Hitler” condition 2 Support in control condition

Causal effect: Difference in support between the two question wordings for each respondent

slide-67
SLIDE 67

Introductions Course Outline History/Logic

Example

Unit: Americans in 1940 Outcome: Support for military intervention Treatment: Mentioning Hitler versus not Potential outcomes:

1 Support in “Hitler” condition 2 Support in control condition

Causal effect: Difference in support between the two question wordings for each respondent Individual treatment effect not observable!

slide-68
SLIDE 68

Introductions Course Outline History/Logic

Example

Unit: Americans in 1940 Outcome: Support for military intervention Treatment: Mentioning Hitler versus not Potential outcomes:

1 Support in “Hitler” condition 2 Support in control condition

Causal effect: Difference in support between the two question wordings for each respondent Individual treatment effect not observable! Average effect (ATE) is the mean-difference

slide-69
SLIDE 69

Introductions Course Outline History/Logic

Questions?

slide-70
SLIDE 70

Introductions Course Outline History/Logic

Why are experiments useful?

slide-71
SLIDE 71

Introductions Course Outline History/Logic

Why are experiments useful? Causal inference!

slide-72
SLIDE 72

Introductions Course Outline History/Logic

Addressing Confounding

In observational research. . .

slide-73
SLIDE 73

Introductions Course Outline History/Logic

Addressing Confounding

In observational research. . .

1 Correlate a “putative” cause (X) and an

  • utcome (Y ), where X temporally precedes Y
slide-74
SLIDE 74

Introductions Course Outline History/Logic

Addressing Confounding

In observational research. . .

1 Correlate a “putative” cause (X) and an

  • utcome (Y ), where X temporally precedes Y

2 Identify all possible confounds (Z)

slide-75
SLIDE 75

Introductions Course Outline History/Logic

Addressing Confounding

In observational research. . .

1 Correlate a “putative” cause (X) and an

  • utcome (Y ), where X temporally precedes Y

2 Identify all possible confounds (Z) 3 “Condition” on all confounds

Calculate correlation between X and Y at each combination of levels of Z

slide-76
SLIDE 76

Introductions Course Outline History/Logic

Addressing Confounding

In observational research. . .

1 Correlate a “putative” cause (X) and an

  • utcome (Y ), where X temporally precedes Y

2 Identify all possible confounds (Z) 3 “Condition” on all confounds

Calculate correlation between X and Y at each combination of levels of Z

4 Basically: Y = β0 + β1X + β2−kZ + ǫ

slide-77
SLIDE 77

Introductions Course Outline History/Logic

Salience of Hitler Support for Military Intervention Media Coverage Demographics Ideology Political Sophistication

slide-78
SLIDE 78

Introductions Course Outline History/Logic

Salience of Hitler Support for Military Intervention Media Coverage Demographics Ideology Political Sophistication

slide-79
SLIDE 79

Introductions Course Outline History/Logic

Salience of Hitler Support for Military Intervention Media Coverage Demographics Ideology Political Sophistication

slide-80
SLIDE 80

Introductions Course Outline History/Logic

Experiments are different

slide-81
SLIDE 81

Introductions Course Outline History/Logic

Experiments are different

1 Causal inferences from design not analysis

slide-82
SLIDE 82

Introductions Course Outline History/Logic

Experiments are different

1 Causal inferences from design not analysis 2 Solves both temporal ordering and confounding

Treatment (X) applied by researcher before

  • utcome (Y )

Randomization eliminates confounding (Z) We don’t need to “control” for anything

slide-83
SLIDE 83

Introductions Course Outline History/Logic

Experiments are different

1 Causal inferences from design not analysis 2 Solves both temporal ordering and confounding

Treatment (X) applied by researcher before

  • utcome (Y )

Randomization eliminates confounding (Z) We don’t need to “control” for anything

3 Basically: Y = β0 + β1X + ǫ

slide-84
SLIDE 84

Introductions Course Outline History/Logic

Experiments are different

1 Causal inferences from design not analysis 2 Solves both temporal ordering and confounding

Treatment (X) applied by researcher before

  • utcome (Y )

Randomization eliminates confounding (Z) We don’t need to “control” for anything

3 Basically: Y = β0 + β1X + ǫ 4 Thus experiments are a “gold standard”

slide-85
SLIDE 85

Introductions Course Outline History/Logic

Mill’s Method of Difference

If an instance in which the phenomenon under investigation

  • ccurs, and an instance in which it does not occur, have every

circumstance save one in common, that one occurring only in the former; the circumstance in which alone the two instances differ, is the effect, or cause, or an necessary part of the cause,

  • f the phenomenon.
slide-86
SLIDE 86

Introductions Course Outline History/Logic

Mill’s Method of Difference

If an instance in which the phenomenon under investigation

  • ccurs, and an instance in which it does not occur, have

every circumstance save one in common, that one

  • ccurring only in the former; the circumstance in which

alone the two instances differ, is the effect, or cause, or an necessary part of the cause, of the phenomenon.

slide-87
SLIDE 87

Introductions Course Outline History/Logic

Questions?

slide-88
SLIDE 88

Introductions Course Outline History/Logic

Neyman-Rubin Potential Outcomes Framework

If we are interested in some outcome Y , then for every unit i, there are numerous “potential

  • utcomes” Y ∗ only one of which is visible in a given
  • reality. Comparisons of (partially unobservable)

potential outcomes indicate causality.

slide-89
SLIDE 89

Introductions Course Outline History/Logic

Neyman-Rubin Potential Outcomes Framework

Concisely, we typically discuss two potential

  • utcomes:

Y0i, the potential outcome realized if Xi = 0 (b/c Di = 0, assigned to control) Y1i, the potential outcome realized if Xi = 1 (b/c Di = 1, assigned to treatment)

slide-90
SLIDE 90

Introductions Course Outline History/Logic

Experimental Inference I

Each unit has multiple potential outcomes, but we only

  • bserve one of them, randomly
slide-91
SLIDE 91

Introductions Course Outline History/Logic

Experimental Inference I

Each unit has multiple potential outcomes, but we only

  • bserve one of them, randomly

In this sense, we are sampling potential outcomes from each unit’s population of potential outcomes unit low high 1 ? ? 2 ? ? 3 ? ? 4 ? ?

slide-92
SLIDE 92

Introductions Course Outline History/Logic

Experimental Inference I

Each unit has multiple potential outcomes, but we only

  • bserve one of them, randomly

In this sense, we are sampling potential outcomes from each unit’s population of potential outcomes unit low high control 1 ? ? ? 2 ? ? ? 3 ? ? ? 4 ? ? ?

slide-93
SLIDE 93

Introductions Course Outline History/Logic

Experimental Inference I

Each unit has multiple potential outcomes, but we only

  • bserve one of them, randomly

In this sense, we are sampling potential outcomes from each unit’s population of potential outcomes unit low high control etc. 1 ? ? ? . . . 2 ? ? ? . . . 3 ? ? ? . . . 4 ? ? ? . . .

slide-94
SLIDE 94

Introductions Course Outline History/Logic

Experimental Inference II

We cannot see individual-level causal effects

slide-95
SLIDE 95

Introductions Course Outline History/Logic

Experimental Inference II

We cannot see individual-level causal effects We can see average causal effects Ex.: Average difference in military support among those thinking of Hitler versus not

slide-96
SLIDE 96

Introductions Course Outline History/Logic

Experimental Inference II

We cannot see individual-level causal effects We can see average causal effects Ex.: Average difference in military support among those thinking of Hitler versus not We want to know: TEi = Y1i − Y0i

slide-97
SLIDE 97

Introductions Course Outline History/Logic

Experimental Inference III

We want to know: TEi = Y1i − Y0i for every i in the population

slide-98
SLIDE 98

Introductions Course Outline History/Logic

Experimental Inference III

We want to know: TEi = Y1i − Y0i for every i in the population We can average: E[TE] = E[Y1 − Y0] = E[Y1] − E[Y0]

slide-99
SLIDE 99

Introductions Course Outline History/Logic

Experimental Inference III

We want to know: TEi = Y1i − Y0i for every i in the population We can average: E[TE] = E[Y1 − Y0] = E[Y1] − E[Y0] But we still only see one potential outcome for each unit: ATEnaive = E[Y1|X = 1] − E[Y0|X = 0]

slide-100
SLIDE 100

Introductions Course Outline History/Logic

Experimental Inference III

We want to know: TEi = Y1i − Y0i for every i in the population We can average: E[TE] = E[Y1 − Y0] = E[Y1] − E[Y0] But we still only see one potential outcome for each unit: ATEnaive = E[Y1|X = 1] − E[Y0|X = 0] Is this what we want to know?

slide-101
SLIDE 101

Introductions Course Outline History/Logic

Experimental Inference IV

What we want and what we have: ATE = E[Y1] − E[Y0] (1) ATEnaive = E[Y1|X = 1] − E[Y0|X = 0] (2)

slide-102
SLIDE 102

Introductions Course Outline History/Logic

Experimental Inference IV

What we want and what we have: ATE = E[Y1] − E[Y0] (1) ATEnaive = E[Y1|X = 1] − E[Y0|X = 0] (2) Are the following statements true? E[Y1] = E[Y1|X = 1] E[Y0] = E[Y0|X = 0]

slide-103
SLIDE 103

Introductions Course Outline History/Logic

Experimental Inference IV

What we want and what we have: ATE = E[Y1] − E[Y0] (1) ATEnaive = E[Y1|X = 1] − E[Y0|X = 0] (2) Are the following statements true? E[Y1] = E[Y1|X = 1] E[Y0] = E[Y0|X = 0] Not in general!

slide-104
SLIDE 104

Introductions Course Outline History/Logic

Experimental Inference V

Only true when both of the following hold: E[Y1] = E[Y1|X = 1] = E[Y1|X = 0] (3) E[Y0] = E[Y0|X = 1] = E[Y0|X = 0] (4) In that case, potential outcomes are independent of treatment assignment If true (e.g., due to randomization of X), then: ATEnaive = E[Y1|X = 1] − E[Y0|X = 0] (5) = E[Y1] − E[Y0] = ATE

slide-105
SLIDE 105

Introductions Course Outline History/Logic

Experimental Inference VI

This holds in experiments because of a physical process of randomization2

2Random means “known probability of treatment” not “haphazard”.

slide-106
SLIDE 106

Introductions Course Outline History/Logic

Experimental Inference VI

This holds in experiments because of a physical process of randomization2 Units differ only in side of coin that was up

Xi = 1 only because Di = 1

2Random means “known probability of treatment” not “haphazard”.

slide-107
SLIDE 107

Introductions Course Outline History/Logic

Experimental Inference VI

This holds in experiments because of a physical process of randomization2 Units differ only in side of coin that was up

Xi = 1 only because Di = 1

Implications:

Covariate balance Potential outcomes balanced and independent of treatment assignment No confounding (selection bias)

2Random means “known probability of treatment” not “haphazard”.

slide-108
SLIDE 108

Introductions Course Outline History/Logic

Salience of Hitler Support for Military Intervention Media Coverage Demographics Ideology Political Sophistication

slide-109
SLIDE 109

Introductions Course Outline History/Logic

Salience of Hitler Support for Military Intervention Media Coverage Demographics Ideology Political Sophistication Randomly Assigned Prime

slide-110
SLIDE 110

Introductions Course Outline History/Logic

Questions?

slide-111
SLIDE 111

Introductions Course Outline History/Logic

Experimental Analysis I

The statistic of interest in an experiment is the sample average treatment effect (SATE) If our sample is representative, then this provides an estimate of the population average treatment (PATE) Design-based random sampling Model-based re-weighting

slide-112
SLIDE 112

Introductions Course Outline History/Logic

Experimental Analysis I

The statistic of interest in an experiment is the sample average treatment effect (SATE) If our sample is representative, then this provides an estimate of the population average treatment (PATE) Design-based random sampling Model-based re-weighting This boils down to being a mean-difference between two groups: SATE = 1 n1

  • Y1i − 1

n0

  • Y0i

(5)

slide-113
SLIDE 113

Introductions Course Outline History/Logic

Tidy Experimental Data

An experimental data structure looks like:

unit treatment

  • utcome

1 13 2 6 3 4 4 5 5 1 3 6 1 1 7 1 10 8 1 9

slide-114
SLIDE 114

Introductions Course Outline History/Logic

Tidy Experimental Data

Sometimes it looks like this instead, which is bad:

unit treatment

  • utcome0
  • utcome1

1 13 NA 2 6 NA 3 4 NA 4 5 NA 5 1 NA 3 6 1 NA 1 7 1 NA 10 8 1 NA 9

slide-115
SLIDE 115

Introductions Course Outline History/Logic

Tidy Experimental Data

An experimental data structure looks like:

unit treatment

  • utcome

1 13 2 6 3 4 4 5 5 1 3 6 1 1 7 1 10 8 1 9

slide-116
SLIDE 116

Introductions Course Outline History/Logic

Computation of Effects I

In practice we often estimate SATE using t-tests, ANOVA, or OLS regression These are all basically equivalent

slide-117
SLIDE 117

Introductions Course Outline History/Logic

Computation of Effects I

In practice we often estimate SATE using t-tests, ANOVA, or OLS regression These are all basically equivalent Reasons to choose one procedure over another:

Disciplinary norms

slide-118
SLIDE 118

Introductions Course Outline History/Logic

Computation of Effects I

In practice we often estimate SATE using t-tests, ANOVA, or OLS regression These are all basically equivalent Reasons to choose one procedure over another:

Disciplinary norms Ease of interpretation

slide-119
SLIDE 119

Introductions Course Outline History/Logic

Computation of Effects I

In practice we often estimate SATE using t-tests, ANOVA, or OLS regression These are all basically equivalent Reasons to choose one procedure over another:

Disciplinary norms Ease of interpretation Flexibility for >2 treatment conditions

slide-120
SLIDE 120

Introductions Course Outline History/Logic

Computation of Effects II

R:

t.test(outcome ~ treatment, data = data) lm(outcome ~ factor(treatment), data = data) Stata: ttest outcome, by(treatment) reg outcome i.treatment

slide-121
SLIDE 121

Introductions Course Outline History/Logic

Questions?

slide-122
SLIDE 122

Introductions Course Outline History/Logic

Experimental Analysis II

We don’t just care about the size of the SATE. We also want to know whether it is significantly different from zero (i.e., different from no effect/difference) Thus we need to estimate the variance of the SATE The variance is influenced by: Total sample size Element variance of the outcome, Y Relative size of each treatment group (Some other factors)

slide-123
SLIDE 123

Introductions Course Outline History/Logic

Experimental Analysis III

Formula for the variance of the SATE is:

  • Var(SATE) =

Var

¯

Y0

  • +

Var

¯

Y1

  • Var( ¯

Y0) is control group variance

  • Var( ¯

Y1) is treatment group variance We often express this as the standard error of the estimate:

  • SE SATE =
  • Var

¯

Y0

  • +

Var

¯

Y1

slide-124
SLIDE 124

Introductions Course Outline History/Logic

Intuition about Variance

Bigger sample → smaller SEs Smaller variance → smaller SEs Efficient use of sample size:

When treatment group variances equal, equal sample sizes are most efficient When variances differ, sample units are better allocated to the group with higher variance in Y

slide-125
SLIDE 125

Introductions Course Outline History/Logic

Statistical Power

Power analysis is used to determine sample size before conducting an experiment Type I and Type II Errors H0 False H0 True (|ATE| > 0) (ATE = 0) Reject H0 True positive Type I Error Accept H0 Type II Error True zero

True positive rate (1 − κ) is power False positive rate is the significance threshold (α)

slide-126
SLIDE 126

Introductions Course Outline History/Logic

Doing a Power Analysis

µ, Treatment group mean outcomes N, Sample size σ, Outcome variance α Statistical significance threshold φ, a sampling distribution Power = φ

  • |µ1−µ0|

√ N 2σ

− φ−1 1 − α

2

slide-127
SLIDE 127

Introductions Course Outline History/Logic

Intuition about Power

Minimum detectable effect is the smallest effect we could detect given sample size, “true” ATE, variance of outcome measure, power (1 − κ), and α.

slide-128
SLIDE 128

Introductions Course Outline History/Logic

Intuition about Power

Minimum detectable effect is the smallest effect we could detect given sample size, “true” ATE, variance of outcome measure, power (1 − κ), and α. In essence: some non-zero effect sizes are not detectable by a study of a given sample size.

slide-129
SLIDE 129

Introductions Course Outline History/Logic

Intuition about Power

Minimum detectable effect is the smallest effect we could detect given sample size, “true” ATE, variance of outcome measure, power (1 − κ), and α. In essence: some non-zero effect sizes are not detectable by a study of a given sample size. In underpowered study, we will be unlikely to detect true small effects. And most effects are small! 3

3Gelman, A. and Weakliem, D. 2009. “Of Beauty, Sex and Power.” American Scientist 97(4): 310–16

slide-130
SLIDE 130

Introductions Course Outline History/Logic

Intuition about Power

It can help to think in terms of “standardized effect sizes” Intuition: How large is the effect in standard deviations of the outcome?

Know if effects are large or small Compare effects across studies

slide-131
SLIDE 131

Introductions Course Outline History/Logic

Intuition about Power

It can help to think in terms of “standardized effect sizes” Intuition: How large is the effect in standard deviations of the outcome?

Know if effects are large or small Compare effects across studies

Cohen’s d: d = ¯

x1−¯ x0 s

, where s =

  • (n1−1)s2

1+(n0−1)s2

n1+n0−2

slide-132
SLIDE 132

Introductions Course Outline History/Logic

Intuition about Power

It can help to think in terms of “standardized effect sizes” Intuition: How large is the effect in standard deviations of the outcome?

Know if effects are large or small Compare effects across studies

Cohen’s d: d = ¯

x1−¯ x0 s

, where s =

  • (n1−1)s2

1+(n0−1)s2

n1+n0−2

Small: 0.2; Medium: 0.5; Large: 0.8

slide-133
SLIDE 133

Introductions Course Outline History/Logic

Intuition about Power

slide-134
SLIDE 134

Introductions Course Outline History/Logic

Power analysis in R

power.t.test( # sample size (leave blank!) n = , # minimum detectable effect size delta = 0.4, sd = 1, # alpha and power (1-kappa) sig.level = 0.05, power = 0.8, # two-tailed vs. one-tailed test alternative = "two.sided" )

slide-135
SLIDE 135

Introductions Course Outline History/Logic

Power analysis in Stata

power twomeans 0, diff(0.2) // for multiple values of forvalues i = 0.1 (0.1) 1.0 { power twomeans 0, diff(‘i’) } // using raw effect sizes and standard deviations power twomeans 0 0.5, sd1(.5) sd2(.7) // adjusting alpha or power power twomeans 0, diff(0.2) alpha(0.10) power(0.7)

slide-136
SLIDE 136

Introductions Course Outline History/Logic

Increasing/Decreasing Power

Increases Power

Bigger sample Precise measures Covariates?

Decreases Power

Attrition Noncompliance Clustering

slide-137
SLIDE 137

Introductions Course Outline History/Logic

slide-138
SLIDE 138

Introductions Course Outline History/Logic

Factorial Designs

The two-condition experiment is a stylized ideal An experiment can have any number of conditions

Up to the limits of sample size More than 8–10 conditions is typically unwieldy

Three “flavors”:

Multiple conditions in a single factor Multiple fully crossed factors Partially crossed (“fractional factorial”) designs

Regression methods provide a generalizable tool for causal inference in such designs

slide-139
SLIDE 139

Introductions Course Outline History/Logic

Policy Beneficiaries Policy Opinion Ideology Etc. Identity Salience

slide-140
SLIDE 140

Introductions Course Outline History/Logic

Policy Beneficiaries Policy Opinion Ideology Etc. Identity Salience Treatment 1 Treatment 2

slide-141
SLIDE 141

Introductions Course Outline History/Logic

Example4 How close do you feel to your ethnic or racial group? Some people have said that taxes need to be raised to take care of pressing national needs. How willing would you be to have your taxes raised to improve education in public schools?

  • 4Transue. 2007. “Identity Salience, Identity Acceptance, and Racial Policy Attitudes: American National

Identity as a Uniting Force.” American Journal of Political Science 51(1): 78–91.

slide-142
SLIDE 142

Introductions Course Outline History/Logic

Example4 How close do you feel to other Americans? Some people have said that taxes need to be raised to take care of pressing national needs. How willing would you be to have your taxes raised to improve education in public schools?

  • 4Transue. 2007. “Identity Salience, Identity Acceptance, and Racial Policy Attitudes: American National

Identity as a Uniting Force.” American Journal of Political Science 51(1): 78–91.

slide-143
SLIDE 143

Introductions Course Outline History/Logic

Example4 How close do you feel to your ethnic or racial group? Some people have said that taxes need to be raised to take care of pressing national needs. How willing would you be to have your taxes raised to improve educational opportunities for minorities?

  • 4Transue. 2007. “Identity Salience, Identity Acceptance, and Racial Policy Attitudes: American National

Identity as a Uniting Force.” American Journal of Political Science 51(1): 78–91.

slide-144
SLIDE 144

Introductions Course Outline History/Logic

Example4 How close do you feel to other Americans? Some people have said that taxes need to be raised to take care of pressing national needs. How willing would you be to have your taxes raised to improve educational opportunities for minorities?

  • 4Transue. 2007. “Identity Salience, Identity Acceptance, and Racial Policy Attitudes: American National

Identity as a Uniting Force.” American Journal of Political Science 51(1): 78–91.

slide-145
SLIDE 145

Introductions Course Outline History/Logic

2x2 Factorial Design

Condition

  • Educ. for Minorities

Y1 Schools Y0

slide-146
SLIDE 146

Introductions Course Outline History/Logic

2x2 Factorial Design

Condition Americans Own Race

  • Educ. for Minorities

Y1,0 Y1,1 Schools Y0,0 Y0,1

slide-147
SLIDE 147

Introductions Course Outline History/Logic

Two ways to parameterize this

Dummy variable regression (i.e., treatment–control CATEs): Y = β0 + β1X0,1 + β2X1,0 + β3X1,1 + ǫ Interaction effects (i.e., treatment–treatment CATEs): Y = β0 + β1X11 + β2X21 + β3X11 ∗ X21 + ǫ Use margins to extract marginal effects

slide-148
SLIDE 148

Introductions Course Outline History/Logic

Considerations

Factorial designs can quickly become unwieldy and expensive

slide-149
SLIDE 149

Introductions Course Outline History/Logic

Probably obvious, but. . .

Factors Conditions per factor Total Conditions n 1 2 2 400 1 3 3 600 1 4 4 800 2 2 4 800 2 3 6 1200 2 4 8 1600 3 3 9 1800 3 4 12 2400 4 4 16 3200 Assumes power to detect a relatively small effect, but no consideration of multiple comparisons.

slide-150
SLIDE 150

Introductions Course Outline History/Logic

Considerations

Factorial designs can quickly become unwieldy and expensive

slide-151
SLIDE 151

Introductions Course Outline History/Logic

Considerations

Factorial designs can quickly become unwieldy and expensive Need to consider what CATEs are of theoretical interest

Treatment–control, pairwise Treatment–treatment, pairwise Marginal effects, averaging across other factors Comparison of merged conditions

slide-152
SLIDE 152

Introductions Course Outline History/Logic

Questions?