SLIDE 1
Removing Hidden Confounding by Experimental Grounding Nathan Kallus - - PowerPoint PPT Presentation
Removing Hidden Confounding by Experimental Grounding Nathan Kallus - - PowerPoint PPT Presentation
NeurIPS 2018 Spotlight Presentation Removing Hidden Confounding by Experimental Grounding Nathan Kallus Aahlad Manas Puli Uri Shalit Cornell NYU Technion ( me) Poster: Today 10:45AM12:45PM @ Room 210 & 230 AB #2 Individual-level
SLIDE 2
SLIDE 3
Large-scale observational data can help
Age Weight BMI A1C LDL T Y 49 106 31 Insulin 9 54 89 26 Metformin 7 43 130 38 Metformin 10 . . . . . . . . . . . . . . . . . . . . .
SLIDE 4
Large-scale observational data can help
Age Weight BMI A1C LDL T Y 49 106 31 Insulin 9 54 89 26 Metformin 7 43 130 38 Metformin 10 . . . . . . . . . . . . . . . . . . . . .
Fit ω(X) = E[Y | X, T = 1] − E[Y | X, T = 0] to the data
Outcome under metformin Outcome under insulin
E.g.: Wager & Athey ’17 (CF), Shalit et al. ’17 (TARNet), ... usually assume ω = τ (no hidden confounding)
SLIDE 5
The problem with hidden confounding
◮ Hidden confounding = hidden correlations between treatments and outcome idiosyncrasies
◮ E.g.: healthier patients tend to get metformin ◮ Confounding = ⇒ ω = τ ◮ To some extent always unavoidable in observational data
SLIDE 6
The problem with hidden confounding
◮ Hidden confounding = hidden correlations between treatments and outcome idiosyncrasies
◮ E.g.: healthier patients tend to get metformin ◮ Confounding = ⇒ ω = τ ◮ To some extent always unavoidable in observational data
Experimental data Observational data Unconfounded by design Confounded by default Limited generalizability Covers population of interest Small samples Large samples
SLIDE 7
Experimental grounding
◮ Our Q: How to use a small & limited experimental dataset to remove confounding errors in individual-level treatment effect estimates from a large observational dataset?
SLIDE 8
Experimental grounding
◮ Our Q: How to use a small & limited experimental dataset to remove confounding errors in individual-level treatment effect estimates from a large observational dataset? ◮ Outline of our method:
◮ Fit ˆ ω(X) using blackbox on observational data (e.g., causal forest, TARNet, ...) ◮ A new way to fit η(X) = ω(X) − τ(X) across the
- bservational and experimental datasets
◮ Return the grounded estimate ˆ τ(X) = ˆ ω(X) − ˆ η(X)
SLIDE 9
Experimental grounding
◮ Our Q: How to use a small & limited experimental dataset to remove confounding errors in individual-level treatment effect estimates from a large observational dataset? ◮ Outline of our method:
◮ Fit ˆ ω(X) using blackbox on observational data (e.g., causal forest, TARNet, ...) ◮ A new way to fit η(X) = ω(X) − τ(X) across the
- bservational and experimental datasets
◮ Return the grounded estimate ˆ τ(X) = ˆ ω(X) − ˆ η(X)
◮ Our theoretical guarantee: if η is parametric and ˆ ω is consistent then ˆ τ is consistent!
◮ Strictly weaker than assuming no confounding (η = 0)
SLIDE 10
Empirical results
◮ Estimate the effect of large vs small classrooms on first graders’ test scores
◮ Data from STAR experiment (Word et al. 1990)
0.1 0.2 0.3 0.4 0.5 size of UNCONF as fraction of rural 73 76 79 82 85 88 RMSE on EVAL set 2 step RF 2 step ridge ridge YGT (UNC) ridge DIFF(UNC) RF YGT(UNC) RF DIFF(UNC) ridge DIFF(CONF) RF DIFF(CONF)
SLIDE 11