1/39
Section 3 : Permutation Inference
Yotam Shem-Tov Fall 2014
Yotam Shem-Tov STAT 239/ PS 236A
Section 3 : Permutation Inference Yotam Shem-Tov Fall 2014 1/39 - - PowerPoint PPT Presentation
Section 3 : Permutation Inference Yotam Shem-Tov Fall 2014 1/39 Yotam Shem-Tov STAT 239/ PS 236A Introduction Throughout this slides we will focus only on randomized experiments, i.e the treatment is assigned at random We will follow the
1/39
Yotam Shem-Tov STAT 239/ PS 236A
2/39
Yotam Shem-Tov STAT 239/ PS 236A
3/39
1
2
Yotam Shem-Tov STAT 239/ PS 236A
4/39
1 Data 2 Null hypothesis 3 Test statistic 4 The distribution of the test statistic under the null hypothesis Yotam Shem-Tov STAT 239/ PS 236A
5/39
s=1 ns
i=1 Zsi, and 0 ≤ ms ≤ ns
Yotam Shem-Tov STAT 239/ PS 236A
6/39
i=1 Zsi
Yotam Shem-Tov STAT 239/ PS 236A
7/39
Yotam Shem-Tov STAT 239/ PS 236A
8/39
m
Yotam Shem-Tov STAT 239/ PS 236A
9/39
Yotam Shem-Tov STAT 239/ PS 236A
10/39
Yotam Shem-Tov STAT 239/ PS 236A
11/39
N
i=1 Ziri
i=1(1 − Zi)ri
Yotam Shem-Tov STAT 239/ PS 236A
12/39
Yotam Shem-Tov STAT 239/ PS 236A
13/39
Yotam Shem-Tov STAT 239/ PS 236A
14/39
1 |Ω| and,
Yotam Shem-Tov STAT 239/ PS 236A
15/39
1 Use a Monte-Carlo approximation 2 Use an asymptotic approximation for the distribution of the
Yotam Shem-Tov STAT 239/ PS 236A
16/39
1 Draw a SRS (simple random sample) of size m from the data
2 Compute the test statistic, t(Z, r), as you would if X and Y
3 Repeat this procedure B times (many times), saving the
4 The distribution of tb(Z, r) approximates the true distribution
Yotam Shem-Tov STAT 239/ PS 236A
17/39
B
Yotam Shem-Tov STAT 239/ PS 236A
18/39
Yotam Shem-Tov STAT 239/ PS 236A
19/39
Yotam Shem-Tov STAT 239/ PS 236A
20/39
Value of test statistic Frequency −0.4 −0.2 0.0 0.2 0.4 500 1000 1500
Yotam Shem-Tov STAT 239/ PS 236A
21/39
Permutation distribution
Value of test statistic Frequency −0.4 −0.2 0.0 0.2 0.4 500 1000 1500
Observed value Hypothetical
Yotam Shem-Tov STAT 239/ PS 236A
22/39
Yotam Shem-Tov STAT 239/ PS 236A
23/39
Z T 1 − (1−Z)T r (1−Z)T 1
Permutation distribution
Value of test statistic Frequency 0.0 0.1 0.2 0.3 0.4 0.5 200 400 600 800 1000
Observed value
Yotam Shem-Tov STAT 239/ PS 236A
24/39
N
Yotam Shem-Tov STAT 239/ PS 236A
25/39
Yotam Shem-Tov STAT 239/ PS 236A
26/39
Value of test statistic Frequency 940000 960000 980000 1000000 1020000 1040000 50 100 150
Yotam Shem-Tov STAT 239/ PS 236A
27/39
Yotam Shem-Tov STAT 239/ PS 236A
28/39
Yotam Shem-Tov STAT 239/ PS 236A
29/39
Yotam Shem-Tov STAT 239/ PS 236A
30/39
Y −2 2 4 6 8
Yotam Shem-Tov STAT 239/ PS 236A
31/39
Yotam Shem-Tov STAT 239/ PS 236A
32/39
2 4 6 8 0.0 0.2 0.4 0.6 0.8 1.0
w F(w)
D = 0.25 Y CDF X CDF
Yotam Shem-Tov STAT 239/ PS 236A
33/39
ks.permutation1 Frequency 0.0 0.1 0.2 0.3 0.4 50 100 150 One sided P−value: 0.02
Yotam Shem-Tov STAT 239/ PS 236A
34/39
ks.permutation1 Frequency 0.0 0.1 0.2 0.3 0.4 50 100 150 One sided P−value: 0.02
Yotam Shem-Tov STAT 239/ PS 236A
35/39
Yotam Shem-Tov STAT 239/ PS 236A
36/39
−1.0 −0.5 0.0 0.5 1.0 0.0 0.2 0.4 0.6 0.8 1.0
t F(t)
Y CDF X CDF
Yotam Shem-Tov STAT 239/ PS 236A
37/39
ks.permutation2 Frequency 0.0 0.1 0.2 0.3 0.4 20 40 60 80 100 One sided P−value: 0.402
Yotam Shem-Tov STAT 239/ PS 236A
38/39
Yotam Shem-Tov STAT 239/ PS 236A
39/39
Yotam Shem-Tov STAT 239/ PS 236A