Section 3: Permutation Inference
Yotam Shem-Tov Fall 2015
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 1 / 47
Section 3: Permutation Inference Yotam Shem-Tov Fall 2015 Yotam - - PowerPoint PPT Presentation
Section 3: Permutation Inference Yotam Shem-Tov Fall 2015 Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 1 / 47 Introduction Throughout this slides we will focus only on randomized experiments, i.e the treatment is assigned at random.
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 1 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 2 / 47
1
2
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 3 / 47
1 Data 2 Null hypothesis 3 Test statistic 4 The distribution of the test statistic under the null hypothesis Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 4 / 47
s=1 ns
i=1 Zsi, and 0 ≤ ms ≤ ns
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 5 / 47
i=1 Zsi
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 6 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 7 / 47
m
2
1
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 8 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 9 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 10 / 47
N
i=1 Ziri
i=1(1 − Zi)ri
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 11 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 12 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 13 / 47
1 |Ω| and,
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 14 / 47
1 Use a Monte-Carlo approximation 2 Use an asymptotic approximation for the distribution of the test
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 15 / 47
1 Draw a SRS (simple random sample) of size m from the data and call
2 Compute the test statistic, t(Z, r), as you would if X and Y would
3 Repeat this procedure B times (many times), saving the results, so
4 The distribution of tb(Z, r) approximates the true distribution of
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 16 / 47
B
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 17 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 18 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 19 / 47
Value of test statistic Frequency −0.4 −0.2 0.0 0.2 0.4 500 1000 1500
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 20 / 47
Permutation distribution
Value of test statistic Frequency −0.4 −0.2 0.0 0.2 0.4 500 1000 1500
Observed value Hypothetical
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 21 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 22 / 47
Z T 1 − (1−Z)T r (1−Z)T 1
Permutation distribution
Value of test statistic Frequency 0.0 0.1 0.2 0.3 0.4 0.5 200 400 600 800 1000
Observed value
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 23 / 47
N
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 24 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 25 / 47
Value of test statistic Frequency 940000 960000 980000 1000000 1020000 1040000 50 100 150
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 26 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 27 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 28 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 29 / 47
Y −2 2 4 6 8
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 30 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 31 / 47
2 4 6 8 0.0 0.2 0.4 0.6 0.8 1.0
w F(w)
D = 0.25 Y CDF X CDF
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 32 / 47
ks.permutation1 Frequency 0.0 0.1 0.2 0.3 0.4 50 100 150 One sided P−value: 0.02
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 33 / 47
ks.permutation1 Frequency 0.0 0.1 0.2 0.3 0.4 50 100 150 One sided P−value: 0.02
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 34 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 35 / 47
−1.0 −0.5 0.0 0.5 1.0 0.0 0.2 0.4 0.6 0.8 1.0
t F(t)
Y CDF X CDF
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 36 / 47
ks.permutation2 Frequency 0.0 0.1 0.2 0.3 0.4 20 40 60 80 100 One sided P−value: 0.402
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 37 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 38 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 39 / 47
1 Assign rank 1 to the smallest observation, rank 2 to the highest
2 The ST test statistic is the sum of the ranks in treated group,
N
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 40 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 41 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 42 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 43 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 44 / 47
Siegla−Tukey P−vlaue under the null Frequency
0.0 0.2 0.4 0.6 0.8 1.0 5 10 15 20 25 30
P−value: 0.0963
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 45 / 47
Levene's P−vlaue under the null Frequency
0.0 0.2 0.4 0.6 0.8 1.0 5 10 15 20 25 30
P−value: 0.0033
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 46 / 47
Yotam Shem-Tov STAT 239/ PS 236A September 26, 2015 47 / 47