Two-Sample Instrumental Variable Analysis: Challenges and Some Progress
Qingyuan Zhao Department of Statistics, The Wharton School, University
- f Pennsylvania
Two-Sample Instrumental Variable Analysis: Challenges and Some - - PowerPoint PPT Presentation
Two-Sample Instrumental Variable Analysis: Challenges and Some Progress Qingyuan Zhao Department of Statistics, The Wharton School, University of Pennsylvania November 28, 2017 Outline Two-Sample Some interesting history IV Qingyuan
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 1/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 2/42
1 Condition on all common causes of X and Y . 2 Study all causal mechanisms by which X influences Y . 3 Use instrumental variables (IV) or natural experiments.
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 3/42
1 IV causes the exposure (X). 2 IV is independent of the unmeasured confounder (C). 3 IV cannot have any direct effect on the outcome (Y ).
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 4/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 5/42
1 Using good instrument(s).
2 Statistical inference.
3 Robustness.
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 6/42
1 Need to find SNPs that are associated with the exposure. 2 Independence of unmeasured confounder is self-evident.
3 Direct effect on the outcome is possible (pleiotropy).
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 7/42
1 Two-sample IV: don’t need the full data (Z, X, Y ) for all
2 MR with GWAS summary statistics: don’t need individual
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 8/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 9/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 10/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 11/42
i = gs(xs i , us i )
i = f s(zs i , vs i )
i ⊥
i , vs i )
i d
i
z (z) + f s v (v)
LATE βab LATE
LATE is the local average treatment
LATE = βb LATE × Pb(complier)/Pa(complier).
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 12/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 13/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 14/42
5 10 15 20 −2 2 4
y sample
a b 0.0 0.1 0.2 0.3 0.4 −2 2 4
x density sample
a b
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 15/42
−2.5 0.0 2.5 5.0 7.5 −4 −2 2 4
y sample
a b 0.0 0.2 0.4 0.6 −4 −2 2 4
x density sample
a b
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 16/42
1 In the one-sample setting [Wooldridge, 2010], and 2 In the homogeneous two-sample setting [Inoue and Solon,
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 17/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 18/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 19/42
j1), available from GWAS.
j2), available from GWAS.
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 20/42
1 Measurement error: ˆ
2 Linkage disequilibrium: ˆ
3 How many SNPs should we use?
4 Pleiotropy: the equation Γj = βγj might not always be
5 ...
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 21/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 22/42
j1, σ2 j2 ≤ C/n for all j.
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 23/42
p
j + 1)1+δ → 0.
p
j σ2 j2 + Γ2 j σ2 j1 + σ2 j1σ2 j2
j2 + σ2 j1β2)2
p
j σ2 j2 + Γ2 j σ2 j1
j2 + σ2 j1β2)2 = O(n).
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 24/42
p
j σ2 j2 + Γ2 j σ2 j1+σ2 j1σ2 j2
j2 + σ2 j1β2)2
p
j σ2 j2 + Γ2 j σ2 j1
j2 + σ2 j1β2)2 .
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 25/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 26/42
p
j1
j2 + σ2 j1β2)2 −
j2+σ2 j1β2
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 27/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 28/42
1 No pleiotropy: PL estimator (compare to IVW). 2 Systematic pleiotropy: modified PL score equation
3 Systematic and idiosyncratic pleiotropy: robustified score
1
j2 + σ2 j1 ˆ
2
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 29/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 30/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 31/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 32/42
1
2
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 33/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 34/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 35/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 36/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 37/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 38/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 39/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 40/42
1 Sample splitting is very important to obtain unbiased
2 Pleiotropy (systematic and idiosyncratic) can be handled
3 Theoretical guarantees: statistical consistency and
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 41/42
Two-Sample IV Qingyuan Zhao Introduction Part 1 Part 2 References 42/42