Removing Hidden Confounding by Experimental Grounding Nathan Kallus - - PowerPoint PPT Presentation

removing hidden confounding by experimental grounding
SMART_READER_LITE
LIVE PREVIEW

Removing Hidden Confounding by Experimental Grounding Nathan Kallus - - PowerPoint PPT Presentation

NeurIPS 2018 Spotlight Presentation Removing Hidden Confounding by Experimental Grounding Nathan Kallus Aahlad Manas Puli Uri Shalit Cornell NYU Technion ( me) Poster: Today 10:45AM12:45PM @ Room 210 & 230 AB #2 Individual-level


slide-1
SLIDE 1

Removing Hidden Confounding by Experimental Grounding

Nathan Kallus

Cornell

(↑ me) Aahlad Manas Puli

NYU

Uri Shalit

Technion

NeurIPS 2018 Spotlight Presentation Poster: Today 10:45AM–12:45PM @ Room 210 & 230 AB #2

slide-2
SLIDE 2

Individual-level causal effects

◮ Patient Anna is diagnosed with type-II diabetes

◮ Blood sugar: 8.7% A1C ◮ Age: 51 ◮ Weight: 102kg ◮ BMI: 35.3 ◮ ...

◮ Q: What first-line glucose control treatment to give? Insulin (t = 1) or Metformin (t = 0)?

◮ Want to know the individual-level causal effect of treatment, i.e., the conditional average treatment effect (CATE) Baseline covariates X τ(X) = E[Y (1) − Y (0) | X]

◮ Y (t) = Anna’s potential outcome under treatment t

◮ Same Q in targeted advertising, public policy, ...

slide-3
SLIDE 3

Large-scale observational data can help

Age Weight BMI A1C LDL T Y 49 106 31 Insulin 9 54 89 26 Metformin 7 43 130 38 Metformin 10 . . . . . . . . . . . . . . . . . . . . .

slide-4
SLIDE 4

Large-scale observational data can help

Age Weight BMI A1C LDL T Y 49 106 31 Insulin 9 54 89 26 Metformin 7 43 130 38 Metformin 10 . . . . . . . . . . . . . . . . . . . . .

Fit ω(X) = E[Y | X, T = 1] − E[Y | X, T = 0] to the data

Outcome under metformin Outcome under insulin

E.g.: Wager & Athey ’17 (CF), Shalit et al. ’17 (TARNet), ... usually assume ω = τ (no hidden confounding)

slide-5
SLIDE 5

The problem with hidden confounding

◮ Hidden confounding = hidden correlations between treatments and outcome idiosyncrasies

◮ E.g.: healthier patients tend to get metformin ◮ Confounding = ⇒ ω = τ ◮ To some extent always unavoidable in observational data

slide-6
SLIDE 6

The problem with hidden confounding

◮ Hidden confounding = hidden correlations between treatments and outcome idiosyncrasies

◮ E.g.: healthier patients tend to get metformin ◮ Confounding = ⇒ ω = τ ◮ To some extent always unavoidable in observational data

Experimental data Observational data Unconfounded by design Confounded by default Limited generalizability Covers population of interest Small samples Large samples

slide-7
SLIDE 7

Experimental grounding

◮ Our Q: How to use a small & limited experimental dataset to remove confounding errors in individual-level treatment effect estimates from a large observational dataset?

slide-8
SLIDE 8

Experimental grounding

◮ Our Q: How to use a small & limited experimental dataset to remove confounding errors in individual-level treatment effect estimates from a large observational dataset? ◮ Outline of our method:

◮ Fit ˆ ω(X) using blackbox on observational data (e.g., causal forest, TARNet, ...) ◮ A new way to fit η(X) = ω(X) − τ(X) across the

  • bservational and experimental datasets

◮ Return the grounded estimate ˆ τ(X) = ˆ ω(X) − ˆ η(X)

slide-9
SLIDE 9

Experimental grounding

◮ Our Q: How to use a small & limited experimental dataset to remove confounding errors in individual-level treatment effect estimates from a large observational dataset? ◮ Outline of our method:

◮ Fit ˆ ω(X) using blackbox on observational data (e.g., causal forest, TARNet, ...) ◮ A new way to fit η(X) = ω(X) − τ(X) across the

  • bservational and experimental datasets

◮ Return the grounded estimate ˆ τ(X) = ˆ ω(X) − ˆ η(X)

◮ Our theoretical guarantee: if η is parametric and ˆ ω is consistent then ˆ τ is consistent!

◮ Strictly weaker than assuming no confounding (η = 0)

slide-10
SLIDE 10

Empirical results

◮ Estimate the effect of large vs small classrooms on first graders’ test scores

◮ Data from STAR experiment (Word et al. 1990)

0.1 0.2 0.3 0.4 0.5 size of UNCONF as fraction of rural 73 76 79 82 85 88 RMSE on EVAL set 2 step RF 2 step ridge ridge YGT (UNC) ridge DIFF(UNC) RF YGT(UNC) RF DIFF(UNC) ridge DIFF(CONF) RF DIFF(CONF)

slide-11
SLIDE 11

Thank you!

Poster: Today 10:45AM–12:45PM @ Room 210 & 230 AB #2