Sample Size and Power Calculations IPA/JPAL/CMF Training Limuru, - - PowerPoint PPT Presentation

sample size and power calculations
SMART_READER_LITE
LIVE PREVIEW

Sample Size and Power Calculations IPA/JPAL/CMF Training Limuru, - - PowerPoint PPT Presentation

Motivation Probability basics Power calculation Exercises Sample Size and Power Calculations IPA/JPAL/CMF Training Limuru, Kenya 28 July 2010 Owen Ozier Department of Economics University of California at Berkeley Slides revised 14


slide-1
SLIDE 1

Motivation Probability basics Power calculation Exercises

Sample Size and Power Calculations

IPA/JPAL/CMF Training

Limuru, Kenya 28 July 2010

Owen Ozier Department of Economics University of California at Berkeley

Slides revised 14 September 2010

Owen Ozier Sample Size and Power Calculations

slide-2
SLIDE 2

Motivation Probability basics Power calculation Exercises

Thanks and Introduction

Thanks to everyone from JPAL/IPA who made this happen!

Owen Ozier Sample Size and Power Calculations

slide-3
SLIDE 3

Motivation Probability basics Power calculation Exercises

Thanks and Introduction

Thanks to everyone from JPAL/IPA who made this happen! My background: randomized evaluations in Busia, Kenya.

Owen Ozier Sample Size and Power Calculations

slide-4
SLIDE 4

Motivation Probability basics Power calculation Exercises

Motivation

Owen Ozier Sample Size and Power Calculations

slide-5
SLIDE 5

Motivation Probability basics Power calculation Exercises

Motivation

Program evaluation: bringing the scientific method to social science

Owen Ozier Sample Size and Power Calculations

slide-6
SLIDE 6

Motivation Probability basics Power calculation Exercises

Motivation

Program evaluation: bringing the scientific method to social science First steps: Propose a hypothesis

Owen Ozier Sample Size and Power Calculations

slide-7
SLIDE 7

Motivation Probability basics Power calculation Exercises

Motivation

Program evaluation: bringing the scientific method to social science First steps: Propose a hypothesis Design an experiment to test the hypothesis

Owen Ozier Sample Size and Power Calculations

slide-8
SLIDE 8

Motivation Probability basics Power calculation Exercises

Motivation

Program evaluation: bringing the scientific method to social science First steps: Propose a hypothesis Design an experiment to test the hypothesis This involves gathering data...

Owen Ozier Sample Size and Power Calculations

slide-9
SLIDE 9

Motivation Probability basics Power calculation Exercises

Motivation

Program evaluation: bringing the scientific method to social science First steps: Propose a hypothesis Design an experiment to test the hypothesis This involves gathering data... ...but how much data will we need?

Owen Ozier Sample Size and Power Calculations

slide-10
SLIDE 10

Motivation Probability basics Power calculation Exercises

Usually a lot

Owen Ozier Sample Size and Power Calculations

slide-11
SLIDE 11

Motivation Probability basics Power calculation Exercises

How this will work

“Numerical data should be kept for eternity; it’s great stuff.”

  • Glenn Stevens, Boston University

Owen Ozier Sample Size and Power Calculations

slide-12
SLIDE 12

Motivation Probability basics Power calculation Exercises

Outline

1

Motivation

Owen Ozier Sample Size and Power Calculations

slide-13
SLIDE 13

Motivation Probability basics Power calculation Exercises

Outline

1

Motivation

2

Probability basics Coin tossing

Owen Ozier Sample Size and Power Calculations

slide-14
SLIDE 14

Motivation Probability basics Power calculation Exercises

Outline

1

Motivation

2

Probability basics Coin tossing

3

Power calculation Terminology/Concepts The Basic Calculation Clusters Covariates Details

Owen Ozier Sample Size and Power Calculations

slide-15
SLIDE 15

Motivation Probability basics Power calculation Exercises

Outline

1

Motivation

2

Probability basics Coin tossing

3

Power calculation Terminology/Concepts The Basic Calculation Clusters Covariates Details

4

Exercises

Owen Ozier Sample Size and Power Calculations

slide-16
SLIDE 16

Motivation Probability basics Power calculation Exercises Coin tossing

A hypothesis and a kind of test

Owen Ozier Sample Size and Power Calculations

slide-17
SLIDE 17

Motivation Probability basics Power calculation Exercises Coin tossing

A hypothesis and a kind of test

“Null” Hypothesis: the coin is fair 50% chance of heads, 50% chance of tails.

Owen Ozier Sample Size and Power Calculations

slide-18
SLIDE 18

Motivation Probability basics Power calculation Exercises Coin tossing

A hypothesis and a kind of test

“Null” Hypothesis: the coin is fair 50% chance of heads, 50% chance of tails. Structure of the data: Toss the coin a number of times, count heads.

Owen Ozier Sample Size and Power Calculations

slide-19
SLIDE 19

Motivation Probability basics Power calculation Exercises Coin tossing

A hypothesis and a kind of test

“Null” Hypothesis: the coin is fair 50% chance of heads, 50% chance of tails. Structure of the data: Toss the coin a number of times, count heads. The test: “Accept” hypothesis if within some distance of the mean under the null; “Reject” otherwise.

Owen Ozier Sample Size and Power Calculations

slide-20
SLIDE 20

Motivation Probability basics Power calculation Exercises Coin tossing

A hypothesis and a kind of test

“Null” Hypothesis: the coin is fair 50% chance of heads, 50% chance of tails. Structure of the data: Toss the coin a number of times, count heads. The test: “Accept” hypothesis if within some distance of the mean under the null; “Reject” otherwise. If we only had 4 tosses of the coin, what distance cutoffs could we use?

Owen Ozier Sample Size and Power Calculations

slide-21
SLIDE 21

Motivation Probability basics Power calculation Exercises Coin tossing

A hypothesis and a kind of test

“Null” Hypothesis: the coin is fair 50% chance of heads, 50% chance of tails. Structure of the data: Toss the coin a number of times, count heads. The test: “Accept” hypothesis if within some distance of the mean under the null; “Reject” otherwise. If we only had 4 tosses of the coin, what distance cutoffs could we use? Could accept (A) never, (B) when exactly the mean (2 heads), (C) when within 1 (1, 2, or 3 heads),

  • r (D) always.

Owen Ozier Sample Size and Power Calculations

slide-22
SLIDE 22

Motivation Probability basics Power calculation Exercises Coin tossing

A hypothesis and a kind of test

“Null” Hypothesis: the coin is fair 50% chance of heads, 50% chance of tails. Structure of the data: Toss the coin a number of times, count heads. The test: “Accept” hypothesis if within some distance of the mean under the null; “Reject” otherwise. If we only had 4 tosses of the coin, what distance cutoffs could we use? Could accept (A) never, (B) when exactly the mean (2 heads), (C) when within 1 (1, 2, or 3 heads),

  • r (D) always.

We don’t want to reject the null when it is true, though; How much accidental rejection would each possible cutoff give us?

Owen Ozier Sample Size and Power Calculations

slide-23
SLIDE 23

Motivation Probability basics Power calculation Exercises Coin tossing

Distribution of possible results

0.06 0.25 0.38 0.25 0.06

.1 .2 .3 .4 Probability 1 2 3 4

P(2)=.38; P(1...3)=.88; P(0...4)=1

Distribution of numbers of heads in 4 tosses of a fair coin

Owen Ozier Sample Size and Power Calculations

slide-24
SLIDE 24

Motivation Probability basics Power calculation Exercises Coin tossing

Not enough data.

Owen Ozier Sample Size and Power Calculations

slide-25
SLIDE 25

Motivation Probability basics Power calculation Exercises Coin tossing

Not enough data.

There is no way* to create such a test with four coin tosses so that the chance of accidental rejection under the “null” hypothesis (sometimes written H0) is less than 5%, a standard in social science.

Owen Ozier Sample Size and Power Calculations

slide-26
SLIDE 26

Motivation Probability basics Power calculation Exercises Coin tossing

Not enough data.

There is no way* to create such a test with four coin tosses so that the chance of accidental rejection under the “null” hypothesis (sometimes written H0) is less than 5%, a standard in social science.

* (Except the “never reject, no matter what” rule. Not very useful.)

Owen Ozier Sample Size and Power Calculations

slide-27
SLIDE 27

Motivation Probability basics Power calculation Exercises Coin tossing

Not enough data.

There is no way* to create such a test with four coin tosses so that the chance of accidental rejection under the “null” hypothesis (sometimes written H0) is less than 5%, a standard in social science.

* (Except the “never reject, no matter what” rule. Not very useful.)

What about 20 coin tosses?

Owen Ozier Sample Size and Power Calculations

slide-28
SLIDE 28

Motivation Probability basics Power calculation Exercises Coin tossing

Distribution of possible results

0.000.000.000.000.00 0.01 0.04 0.07 0.12 0.16 0.18 0.16 0.12 0.07 0.04 0.01 0.000.000.000.000.00

.05 .1 .15 .2 Probability 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

P(10)=.18; P(9...11)=.5; P(8...12)=.74; P(7...13)=.88; P(6...14)=.96

Distribution of numbers of heads in 20 tosses of a fair coin

Owen Ozier Sample Size and Power Calculations

slide-29
SLIDE 29

Motivation Probability basics Power calculation Exercises Coin tossing

The normal distribution

.1 .2 .3 .4 Probability density −4 −2 2 4 Deviations from mean

Distribution (limiting) of any well−behaved residual error

Owen Ozier Sample Size and Power Calculations

slide-30
SLIDE 30

Motivation Probability basics Power calculation Exercises Coin tossing

As sample size increases more:

.02 .04 .06 .08 Probability

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100

Distribution of numbers of heads in 100 tosses of a fair coin

Owen Ozier Sample Size and Power Calculations

slide-31
SLIDE 31

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Types of error

Test result “Reject Null,” “Accept Null,” Find an effect! Conclude no effect. Truth: There is an effect Great! “Type II Error” (low power) Truth: There is NO effect “Type I Error” Great! (test size)

Owen Ozier Sample Size and Power Calculations

slide-32
SLIDE 32

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Types of error

Test result “Reject Null,” “Accept Null,” Find an effect! Conclude no effect. Truth: There is an effect Great! “Type II Error” (low power) Truth: There is NO effect “Type I Error” Great! (test size) The probability of Type I error is what we just discussed: the “size” of the test. By convention, we are usually interested in tests of “size” 0.05.

Owen Ozier Sample Size and Power Calculations

slide-33
SLIDE 33

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Types of error

Test result “Reject Null,” “Accept Null,” Find an effect! Conclude no effect. Truth: There is an effect Great! “Type II Error” (low power) Truth: There is NO effect “Type I Error” Great! (test size) The probability of Type I error is what we just discussed: the “size” of the test. By convention, we are usually interested in tests of “size” 0.05. The probability of Type II error is also very important; If P(failure to detect an effect) = 1 − κ, then the power of the test is κ.

Owen Ozier Sample Size and Power Calculations

slide-34
SLIDE 34

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Types of error

Test result “Reject Null,” “Accept Null,” Find an effect! Conclude no effect. Truth: There is an effect Great! “Type II Error” (low power) Truth: There is NO effect “Type I Error” Great! (test size) The probability of Type I error is what we just discussed: the “size” of the test. By convention, we are usually interested in tests of “size” 0.05. The probability of Type II error is also very important; If P(failure to detect an effect) = 1 − κ, then the power of the test is κ. Power depends on anticipated effect size; typical desired power is 80% or higher.

Owen Ozier Sample Size and Power Calculations

slide-35
SLIDE 35

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Rejecting H0 in critical region

Area: α/2 Area: α/2 Probability density

Null distn. False rejection probability α

Significance level (test size) α

Note: two-sided test.

Owen Ozier Sample Size and Power Calculations

slide-36
SLIDE 36

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Under an alternative:

Probability density

Null Under 1 SE effect

Suppose true effect were 1 SE (Standard Error):

Note: two-sided test.

Owen Ozier Sample Size and Power Calculations

slide-37
SLIDE 37

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Under an alternative:

Probability density

Null Under 1 SE effect

Power would only be approximately 0.17

Note: two-sided test.

Owen Ozier Sample Size and Power Calculations

slide-38
SLIDE 38

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Under an alternative:

Probability density 1 2 3

Null Under 3 SE effect

Suppose true effect were 3 SE’s (Standard Errors):

Note: this example follows slides by Marc Shotland (JPAL).

Owen Ozier Sample Size and Power Calculations

slide-39
SLIDE 39

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Under an alternative:

Probability density 1 2 3

Null Under 3 SE effect

Power would be approximately 0.85

Note: this example follows slides by Marc Shotland (JPAL).

Owen Ozier Sample Size and Power Calculations

slide-40
SLIDE 40

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Power calculation, visually

Effect Probability density

Null distn. Effect distn. t_α/2 size t_1−κ power

How the power calculation formula works

Note: see the related figure in the Toolkit paper.

Owen Ozier Sample Size and Power Calculations

slide-41
SLIDE 41

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

The formula: for power κ and size α,

Effect > (t1−κ + tα/2)SE(ˆ β) Notation: t1−p = pth percentile of the t dist’n.

Owen Ozier Sample Size and Power Calculations

slide-42
SLIDE 42

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

The formula: for power κ and size α,

Effect > (t1−κ + tα/2)SE(ˆ β) Notation: t1−p = pth percentile of the t dist’n. MDE = (t1−κ + tα/2)

  • 1

P(1 − P)

  • σ2

N

Owen Ozier Sample Size and Power Calculations

slide-43
SLIDE 43

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

The formula: for power κ and size α,

Effect > (t1−κ + tα/2)SE(ˆ β) Notation: t1−p = pth percentile of the t dist’n. MDE = (t1−κ + tα/2)

  • 1

P(1 − P)

  • σ2

N ≈ (z1−κ + zα/2)

  • 1

P(1 − P)

  • σ2

N In practice (Stata): sampsi Note: Stata uses the normal rather than t distribution (avoiding the D.O.F. issue).

Owen Ozier Sample Size and Power Calculations

slide-44
SLIDE 44

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

The formula: for power κ and size α,

Effect > (t1−κ + tα/2)SE(ˆ β) Notation: t1−p = pth percentile of the t dist’n. MDE = (t1−κ + tα/2)

  • 1

P(1 − P)

  • σ2

N ≈ (z1−κ + zα/2)

  • 1

P(1 − P)

  • σ2

N In practice (Stata): sampsi Note: Stata uses the normal rather than t distribution (avoiding the D.O.F. issue). Where do these numbers come from, σ2 and the effect size? Two basic options:

Owen Ozier Sample Size and Power Calculations

slide-45
SLIDE 45

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

The formula: for power κ and size α,

Effect > (t1−κ + tα/2)SE(ˆ β) Notation: t1−p = pth percentile of the t dist’n. MDE = (t1−κ + tα/2)

  • 1

P(1 − P)

  • σ2

N ≈ (z1−κ + zα/2)

  • 1

P(1 − P)

  • σ2

N In practice (Stata): sampsi Note: Stata uses the normal rather than t distribution (avoiding the D.O.F. issue). Where do these numbers come from, σ2 and the effect size? Two basic options: Consider standardized effect sizes in terms of standard deviations

Owen Ozier Sample Size and Power Calculations

slide-46
SLIDE 46

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

The formula: for power κ and size α,

Effect > (t1−κ + tα/2)SE(ˆ β) Notation: t1−p = pth percentile of the t dist’n. MDE = (t1−κ + tα/2)

  • 1

P(1 − P)

  • σ2

N ≈ (z1−κ + zα/2)

  • 1

P(1 − P)

  • σ2

N In practice (Stata): sampsi Note: Stata uses the normal rather than t distribution (avoiding the D.O.F. issue). Where do these numbers come from, σ2 and the effect size? Two basic options: Consider standardized effect sizes in terms of standard deviations Draw on existing data: What is available that could inform your project?

Owen Ozier Sample Size and Power Calculations

slide-47
SLIDE 47

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

What if treatment is assigned by groups?

We have been thinking here of randomizing at the individual level. But in practice, we often randomize larger units.

Owen Ozier Sample Size and Power Calculations

slide-48
SLIDE 48

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

What if treatment is assigned by groups?

We have been thinking here of randomizing at the individual level. But in practice, we often randomize larger units. Examples: Entire schools are assigned to treatment or comparison groups, and we observe outcomes at the level of the individual pupil Classes within a school are assigned to treatment or comparison groups, and we observe outcomes at the level of the individual pupil Households are assigned to treatment or comparison groups, and we observe outcomes at the level of the individual family member Sub-district locations are assigned to treatment or comparison groups, and we observe outcomes at the level of the individual road Microfinance branch offices are assigned to treatment or comparison groups, and we observe outcomes at the level of the individual borrower

Owen Ozier Sample Size and Power Calculations

slide-49
SLIDE 49

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

What if treatment is assigned by groups?

We have been thinking here of randomizing at the individual level. But in practice, we often randomize larger units. Examples: Entire schools are assigned to treatment or comparison groups, and we observe outcomes at the level of the individual pupil Classes within a school are assigned to treatment or comparison groups, and we observe outcomes at the level of the individual pupil Households are assigned to treatment or comparison groups, and we observe outcomes at the level of the individual family member Sub-district locations are assigned to treatment or comparison groups, and we observe outcomes at the level of the individual road Microfinance branch offices are assigned to treatment or comparison groups, and we observe outcomes at the level of the individual borrower What does this do? It depends on how much variation is explained by the group each individual is in.

Owen Ozier Sample Size and Power Calculations

slide-50
SLIDE 50

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Intuitive example

You want to know how close the upcoming Kenyan referendum will be.

Owen Ozier Sample Size and Power Calculations

slide-51
SLIDE 51

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Intuitive example

You want to know how close the upcoming Kenyan referendum will be. Method 1: Randomly select 50 people from entire Kenyan population

Owen Ozier Sample Size and Power Calculations

slide-52
SLIDE 52

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Intuitive example

You want to know how close the upcoming Kenyan referendum will be. Method 1: Randomly select 50 people from entire Kenyan population Method 2: Randomly select 5 families, and ask ten members of each extended family their opinion

Owen Ozier Sample Size and Power Calculations

slide-53
SLIDE 53

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

The formula

Scale the effective standard error by: DesignEffect =

  • 1 + (ngroupsize − 1)ρ

ρ (“rho”) is the intra-class correlation.

Owen Ozier Sample Size and Power Calculations

slide-54
SLIDE 54

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

The formula

Scale the effective standard error by: DesignEffect =

  • 1 + (ngroupsize − 1)ρ

ρ (“rho”) is the intra-class correlation. In practice (Stata): loneway and sampclus

Owen Ozier Sample Size and Power Calculations

slide-55
SLIDE 55

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

The formula

Scale the effective standard error by: DesignEffect =

  • 1 + (ngroupsize − 1)ρ

ρ (“rho”) is the intra-class correlation. In practice (Stata): loneway and sampclus Recall earlier formula: MDE = (t1−κ + tα/2)

  • 1

P(1 − P)

  • σ2

N

Owen Ozier Sample Size and Power Calculations

slide-56
SLIDE 56

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

The formula

Scale the effective standard error by: DesignEffect =

  • 1 + (ngroupsize − 1)ρ

ρ (“rho”) is the intra-class correlation. In practice (Stata): loneway and sampclus Recall earlier formula: MDE = (t1−κ + tα/2)

  • 1

P(1 − P)

  • σ2

N But where does this ρ number come from? Two basic options:

Owen Ozier Sample Size and Power Calculations

slide-57
SLIDE 57

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

The formula

Scale the effective standard error by: DesignEffect =

  • 1 + (ngroupsize − 1)ρ

ρ (“rho”) is the intra-class correlation. In practice (Stata): loneway and sampclus Recall earlier formula: MDE = (t1−κ + tα/2)

  • 1

P(1 − P)

  • σ2

N But where does this ρ number come from? Two basic options: Consider what might be reasonable assumptions

Owen Ozier Sample Size and Power Calculations

slide-58
SLIDE 58

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

The formula

Scale the effective standard error by: DesignEffect =

  • 1 + (ngroupsize − 1)ρ

ρ (“rho”) is the intra-class correlation. In practice (Stata): loneway and sampclus Recall earlier formula: MDE = (t1−κ + tα/2)

  • 1

P(1 − P)

  • σ2

N But where does this ρ number come from? Two basic options: Consider what might be reasonable assumptions Draw on existing data (again): What is available that could inform your project?

Owen Ozier Sample Size and Power Calculations

slide-59
SLIDE 59

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Intra-class correlations we have known

Data source ICC (ρ) Madagascar Math + Language 0.5 Busia, Kenya Math + Language 0.22 Udaipur, India Math + Language 0.23 Mumbai, India Math + Language 0.29 Vadodara, India Math + Language 0.28 Busia, Kenya Math 0.62 Source: Marc Shotland (JPAL) slides

Owen Ozier Sample Size and Power Calculations

slide-60
SLIDE 60

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Baseline and other covariate data

Controlling for covariates reduces the SE, so increases power.

Owen Ozier Sample Size and Power Calculations

slide-61
SLIDE 61

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Baseline and other covariate data

Controlling for covariates reduces the SE, so increases power. Baseline survey data on the outcome of interest is especially useful when the within-individual correlation of outcomes (in the absence of treatment) is

  • high. Consider academic test scores, for example.

Owen Ozier Sample Size and Power Calculations

slide-62
SLIDE 62

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Baseline and other covariate data

Controlling for covariates reduces the SE, so increases power. Baseline survey data on the outcome of interest is especially useful when the within-individual correlation of outcomes (in the absence of treatment) is

  • high. Consider academic test scores, for example.

However, it may risk contaminating an otherwise clean randomized design. (randomizing was supposed to avoid omitted variable biases in OLS!)

Owen Ozier Sample Size and Power Calculations

slide-63
SLIDE 63

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Baseline and other covariate data

Controlling for covariates reduces the SE, so increases power. Baseline survey data on the outcome of interest is especially useful when the within-individual correlation of outcomes (in the absence of treatment) is

  • high. Consider academic test scores, for example.

However, it may risk contaminating an otherwise clean randomized design. (randomizing was supposed to avoid omitted variable biases in OLS!) Stratifying is a good option:

Owen Ozier Sample Size and Power Calculations

slide-64
SLIDE 64

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Baseline and other covariate data

Controlling for covariates reduces the SE, so increases power. Baseline survey data on the outcome of interest is especially useful when the within-individual correlation of outcomes (in the absence of treatment) is

  • high. Consider academic test scores, for example.

However, it may risk contaminating an otherwise clean randomized design. (randomizing was supposed to avoid omitted variable biases in OLS!) Stratifying is a good option:

On baseline outcome values

Owen Ozier Sample Size and Power Calculations

slide-65
SLIDE 65

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Baseline and other covariate data

Controlling for covariates reduces the SE, so increases power. Baseline survey data on the outcome of interest is especially useful when the within-individual correlation of outcomes (in the absence of treatment) is

  • high. Consider academic test scores, for example.

However, it may risk contaminating an otherwise clean randomized design. (randomizing was supposed to avoid omitted variable biases in OLS!) Stratifying is a good option:

On baseline outcome values On fixed observables that have high predictive power (parents’ education, age, ...)

Owen Ozier Sample Size and Power Calculations

slide-66
SLIDE 66

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Baseline and other covariate data

Controlling for covariates reduces the SE, so increases power. Baseline survey data on the outcome of interest is especially useful when the within-individual correlation of outcomes (in the absence of treatment) is

  • high. Consider academic test scores, for example.

However, it may risk contaminating an otherwise clean randomized design. (randomizing was supposed to avoid omitted variable biases in OLS!) Stratifying is a good option:

On baseline outcome values On fixed observables that have high predictive power (parents’ education, age, ...) On observables that delineate subpopulations you may want to test within (you might want to do another power calculation on this subsample)

Owen Ozier Sample Size and Power Calculations

slide-67
SLIDE 67

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Power calculation, visually

.01 .02 .03 Kernel density estimate 100 120 140 160 180 Height (cm)

5.1 cm average per additional year in age; overall SD=12.1cm;

Kernel density estimate

Owen Ozier Sample Size and Power Calculations

slide-68
SLIDE 68

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Power calculation, visually

.02 .04 .06 .08 Kernel density estimate 100 120 140 160 180 Height (cm) Age 8 Age 9 Age 10 Age 11 Age 12 Age 13 Age 14

5.1 cm average per additional year in age; overall SD=12.1cm; within SD=7.21cm Owen Ozier Sample Size and Power Calculations

slide-69
SLIDE 69

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Some things to watch out for

Imperfect compliance with treatment:

Owen Ozier Sample Size and Power Calculations

slide-70
SLIDE 70

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Some things to watch out for

Imperfect compliance with treatment:

The most straightforward approach is to decrease the effect size.

Owen Ozier Sample Size and Power Calculations

slide-71
SLIDE 71

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Some things to watch out for

Imperfect compliance with treatment:

The most straightforward approach is to decrease the effect size.

Multiple treatments:

Owen Ozier Sample Size and Power Calculations

slide-72
SLIDE 72

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Some things to watch out for

Imperfect compliance with treatment:

The most straightforward approach is to decrease the effect size.

Multiple treatments:

Optimal treatment structure depends on several things; May be better to have control group larger than any of the treatment groups.

Owen Ozier Sample Size and Power Calculations

slide-73
SLIDE 73

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Some things to watch out for

Imperfect compliance with treatment:

The most straightforward approach is to decrease the effect size.

Multiple treatments:

Optimal treatment structure depends on several things; May be better to have control group larger than any of the treatment groups. To get started, calculate power for the smallest effect you hope to test.

Owen Ozier Sample Size and Power Calculations

slide-74
SLIDE 74

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Some things to watch out for

Imperfect compliance with treatment:

The most straightforward approach is to decrease the effect size.

Multiple treatments:

Optimal treatment structure depends on several things; May be better to have control group larger than any of the treatment groups. To get started, calculate power for the smallest effect you hope to test. Multiple test correction may be needed later. (recently: Michael Anderson)

Owen Ozier Sample Size and Power Calculations

slide-75
SLIDE 75

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Some things to watch out for

Imperfect compliance with treatment:

The most straightforward approach is to decrease the effect size.

Multiple treatments:

Optimal treatment structure depends on several things; May be better to have control group larger than any of the treatment groups. To get started, calculate power for the smallest effect you hope to test. Multiple test correction may be needed later. (recently: Michael Anderson)

How randomization is done:

Owen Ozier Sample Size and Power Calculations

slide-76
SLIDE 76

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Some things to watch out for

Imperfect compliance with treatment:

The most straightforward approach is to decrease the effect size.

Multiple treatments:

Optimal treatment structure depends on several things; May be better to have control group larger than any of the treatment groups. To get started, calculate power for the smallest effect you hope to test. Multiple test correction may be needed later. (recently: Michael Anderson)

How randomization is done:

Alternative test based on statistical work of Fisher in early 20th century

Owen Ozier Sample Size and Power Calculations

slide-77
SLIDE 77

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Some things to watch out for

Imperfect compliance with treatment:

The most straightforward approach is to decrease the effect size.

Multiple treatments:

Optimal treatment structure depends on several things; May be better to have control group larger than any of the treatment groups. To get started, calculate power for the smallest effect you hope to test. Multiple test correction may be needed later. (recently: Michael Anderson)

How randomization is done:

Alternative test based on statistical work of Fisher in early 20th century Issues with 1,2,3,1,2,3... if alphabetical, not a problem if order is random

Owen Ozier Sample Size and Power Calculations

slide-78
SLIDE 78

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Some things to watch out for

Imperfect compliance with treatment:

The most straightforward approach is to decrease the effect size.

Multiple treatments:

Optimal treatment structure depends on several things; May be better to have control group larger than any of the treatment groups. To get started, calculate power for the smallest effect you hope to test. Multiple test correction may be needed later. (recently: Michael Anderson)

How randomization is done:

Alternative test based on statistical work of Fisher in early 20th century Issues with 1,2,3,1,2,3... if alphabetical, not a problem if order is random (recently: McKenzie and Bruhn)

Owen Ozier Sample Size and Power Calculations

slide-79
SLIDE 79

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Some things to watch out for

Imperfect compliance with treatment:

The most straightforward approach is to decrease the effect size.

Multiple treatments:

Optimal treatment structure depends on several things; May be better to have control group larger than any of the treatment groups. To get started, calculate power for the smallest effect you hope to test. Multiple test correction may be needed later. (recently: Michael Anderson)

How randomization is done:

Alternative test based on statistical work of Fisher in early 20th century Issues with 1,2,3,1,2,3... if alphabetical, not a problem if order is random (recently: McKenzie and Bruhn)

Much more: check out Duflo, Glennerster, and Kremer: Using randomization in development economics research: a toolkit

Owen Ozier Sample Size and Power Calculations

slide-80
SLIDE 80

Motivation Probability basics Power calculation Exercises Terminology/Concepts The Basic Calculation Clusters Covariates Details

Some things to watch out for

Imperfect compliance with treatment:

The most straightforward approach is to decrease the effect size.

Multiple treatments:

Optimal treatment structure depends on several things; May be better to have control group larger than any of the treatment groups. To get started, calculate power for the smallest effect you hope to test. Multiple test correction may be needed later. (recently: Michael Anderson)

How randomization is done:

Alternative test based on statistical work of Fisher in early 20th century Issues with 1,2,3,1,2,3... if alphabetical, not a problem if order is random (recently: McKenzie and Bruhn)

Much more: check out Duflo, Glennerster, and Kremer: Using randomization in development economics research: a toolkit “A first comment is that, despite all the precision of these formulas, power calculations involve substantial guess work in practice.”

Owen Ozier Sample Size and Power Calculations

slide-81
SLIDE 81

Motivation Probability basics Power calculation Exercises

Exercises

First make sure you have the files: Make sure sampclus is installed; either by using findit sampclus

  • r by copying the two files into C:/ado/plus/s/ (typically).

Various .do and .dta files type-I-error-reject-null-when-true.do type-II-error-fail-to-reject-when-alt-is-true.do sampsi-syntax.do icc-example.do SampsiExerciseB.do - here we need to adjust the directory in the .do file before running.

Owen Ozier Sample Size and Power Calculations

slide-82
SLIDE 82

Motivation Probability basics Power calculation Exercises

Exercises

First make sure you have the files: Make sure sampclus is installed; either by using findit sampclus

  • r by copying the two files into C:/ado/plus/s/ (typically).

Various .do and .dta files type-I-error-reject-null-when-true.do type-II-error-fail-to-reject-when-alt-is-true.do sampsi-syntax.do icc-example.do SampsiExerciseB.do - here we need to adjust the directory in the .do file before running.

Owen Ozier Sample Size and Power Calculations

slide-83
SLIDE 83

Motivation Probability basics Power calculation Exercises

Exercises

First make sure you have the files: Make sure sampclus is installed; either by using findit sampclus

  • r by copying the two files into C:/ado/plus/s/ (typically).

Various .do and .dta files type-I-error-reject-null-when-true.do type-II-error-fail-to-reject-when-alt-is-true.do sampsi-syntax.do icc-example.do SampsiExerciseB.do - here we need to adjust the directory in the .do file before running.

Owen Ozier Sample Size and Power Calculations

slide-84
SLIDE 84

Motivation Probability basics Power calculation Exercises

Exercises

First make sure you have the files: Make sure sampclus is installed; either by using findit sampclus

  • r by copying the two files into C:/ado/plus/s/ (typically).

Various .do and .dta files type-I-error-reject-null-when-true.do type-II-error-fail-to-reject-when-alt-is-true.do sampsi-syntax.do icc-example.do SampsiExerciseB.do - here we need to adjust the directory in the .do file before running.

Owen Ozier Sample Size and Power Calculations

slide-85
SLIDE 85

Motivation Probability basics Power calculation Exercises

Exercises

First make sure you have the files: Make sure sampclus is installed; either by using findit sampclus

  • r by copying the two files into C:/ado/plus/s/ (typically).

Various .do and .dta files type-I-error-reject-null-when-true.do type-II-error-fail-to-reject-when-alt-is-true.do sampsi-syntax.do icc-example.do SampsiExerciseB.do - here we need to adjust the directory in the .do file before running.

Owen Ozier Sample Size and Power Calculations

slide-86
SLIDE 86

Motivation Probability basics Power calculation Exercises

Exercises

First make sure you have the files: Make sure sampclus is installed; either by using findit sampclus

  • r by copying the two files into C:/ado/plus/s/ (typically).

Various .do and .dta files type-I-error-reject-null-when-true.do type-II-error-fail-to-reject-when-alt-is-true.do sampsi-syntax.do icc-example.do SampsiExerciseB.do - here we need to adjust the directory in the .do file before running.

Owen Ozier Sample Size and Power Calculations