For Objec*ve Causal Inference, Design Trumps Analysis - PowerPoint PPT Presentation

For ¡Objec*ve ¡Causal ¡Inference, ¡ Design ¡Trumps ¡Analysis ¡ Donald ¡B. ¡Rubin ¡ Department ¡of ¡Sta*s*cs ¡ Harvard ¡University ¡ 16 ¡March ¡2012 ¡ 1 ¡

Prologue ¡to ¡Objec*ve ¡Causal ¡Inference ¡ in ¡Observa*onal ¡Studies ¡ • My ¡Introduc*on ¡ – Physics ¡– ¡Wheeler ¡1961 ¡ – Experimental ¡Design ¡– ¡Cochran ¡1968 ¡ • Clear ¡Separa*on ¡Between ¡ – Science ¡= ¡object ¡of ¡inference: ¡DEFINE ¡QUESTION ¡FIRST ¡ – What ¡is ¡done ¡to ¡learn ¡about ¡the ¡science ¡ • Intervene ¡to ¡measure ¡aspects ¡at ¡a ¡point ¡in ¡*me ¡ • Same ¡nota*on/representa*on ¡of ¡science ¡no ¡maXer ¡ how ¡we ¡try ¡to ¡learn ¡about ¡or ¡measure ¡ • Missing ¡data ¡always ¡exist ¡ – Cannot ¡go ¡back ¡in ¡*me ¡ 2 ¡

Poten*al ¡Outcomes ¡Approach ¡to ¡Causal ¡ Inference ¡– ¡Simplest ¡SeZng ¡ Y(1) ¡ Y(0) ¡ 1 ¡ . ¡ . ¡ Units ¡ . ¡ N ¡ 3 ¡

Poten*al ¡Outcomes ¡Approach ¡to ¡Causal ¡ Inference ¡– ¡Simplest ¡SeZng ¡ • Fundamental ¡problem ¡of ¡causal ¡inference ¡ • For ¡each ¡ i , ¡only ¡Y i (1) ¡or ¡Y i (0) ¡can ¡be ¡observed ¡ Y(1) ¡ Y(0) ¡ T ¡ 1 ¡ ✓ ¡ ? ¡ 1 ¡ . ¡ ✓ ¡ ? ¡ 1 ¡ . ¡ ✓ ¡ ? ¡ 1 ¡ Units ¡ . ¡ ? ¡ ✓ ¡ 0 ¡ . ¡ ? ¡ ✓ ¡ 0 ¡ N ¡ ? ¡ ✓ ¡ 0 ¡ • Random ¡assignment ¡of ¡ac*ve ¡versus ¡control ¡ ¡ representa*ve ¡sample ¡of ¡Y i (1) ¡will ¡be ¡compared ¡ to ¡representa*ve ¡sample ¡of ¡Y i (0) ¡ ¡ 4 ¡

Poten*al ¡Outcomes ¡Approach ¡to ¡Causal ¡ Inference ¡– ¡Simplest ¡SeZng ¡with ¡Covariates ¡ X ¡ Y(1) ¡ Y(0) ¡ 1 ¡ . ¡ . ¡ Units ¡ . ¡ N ¡ • Same ¡as ¡before, ¡except ¡includes ¡pretreatment ¡ covariates, ¡e.g., ¡age, ¡sex, ¡background ¡educa*on ¡ • Randomiza*on ¡s*ll ¡works ¡for ¡females ¡ 5 ¡

Randomized ¡Trials ¡That ¡Are ¡Designed ¡ Oden ¡Using ¡Covariates ¡ • Randomized ¡blocks ¡(e.g., ¡males, ¡females) ¡ • Forces ¡balance ¡on ¡blocking ¡variables ¡ • Probability ¡of ¡treatment ¡versus ¡control ¡can ¡depend ¡on ¡ covariates’ ¡values, ¡some*mes ¡in ¡complicated ¡ways ¡ • This ¡is ¡the ¡template ¡for ¡the ¡design ¡and ¡analysis ¡of ¡ nonrandomized ¡(e.g., ¡observa*onal) ¡data ¡ • Assignment-‑based ¡approaches ¡use ¡assignment ¡ mechanism ¡for ¡inference ¡– ¡Fisherian ¡& ¡Neymanian ¡ • Predic*ve ¡approach ¡predicts ¡from ¡observed ¡values ¡– ¡ Bayesian ¡posterior ¡predic*ve ¡(Rubin, ¡1978) ¡ ¡ ¡ 6 ¡

Design ¡Observa-onal ¡Studies ¡to ¡Approximate ¡ Randomized ¡Trials ¡ 1. Hide ¡outcome ¡data ¡un*l ¡the ¡design ¡phase ¡is ¡complete ¡ 2. Think ¡very ¡carefully ¡about ¡decision ¡makers ¡and ¡the ¡key ¡covariates ¡ that ¡were ¡used ¡to ¡make ¡treatment ¡decisions ¡ 3. If ¡key ¡covariates ¡are ¡not ¡observed ¡or ¡very ¡noisy, ¡usually ¡best ¡to ¡give ¡ up ¡and ¡seek ¡beXer ¡data ¡source ¡ 4. Find ¡subgroups ¡(subclasses ¡or ¡matched ¡pairs) ¡in ¡which ¡the ¡treatment ¡ and ¡control ¡groups ¡have ¡balance ¡– ¡essen*ally ¡the ¡same ¡distribu*on ¡ of ¡observed ¡covariates ¡ ¡ Not ¡always ¡possible ¡to ¡achieve ¡balance ¡ • Inferences ¡are ¡limited ¡to ¡subgroups ¡where ¡balance ¡is ¡achieved ¡ • 5. Protocol ¡specified ¡analysis ¡ #1 ¡-‑ ¡#5 ¡combine ¡to ¡create ¡an ¡objec*ve ¡design ¡that ¡approximates ¡a ¡ • randomized ¡trial ¡in ¡each ¡subclass ¡that ¡is ¡balanced ¡with ¡respect ¡to ¡ observed ¡covariates ¡ 7 ¡

Illustra*ve ¡Example ¡with ¡One ¡Key ¡Covariate ¡ (Cochran, ¡1968) ¡ • Popula*on: ¡ ¡Male ¡smokers ¡in ¡U.S. ¡ • Treatment ¡= ¡cigar/pipe ¡smoking ¡ • Control ¡= ¡cigareXe ¡smoking ¡ • Outcome ¡= ¡death ¡rate/1000 ¡person ¡years ¡ • Decision ¡maker ¡is ¡the ¡individual ¡male ¡smoker ¡ • Reason ¡for ¡a ¡smoking ¡male ¡to ¡choose ¡cigareXes ¡ versus ¡cigar/pipe? ¡ • Age ¡is ¡a ¡key ¡covariate ¡for ¡selec*on ¡of ¡smoking ¡ type ¡for ¡males ¡ ¡ 8 ¡

Subclassifica*on ¡to ¡Balance ¡Age ¡ • To ¡achieve ¡balance ¡on ¡age, ¡compare: ¡ – “young” ¡cigar/pipe ¡smokers ¡with ¡“young” ¡cigareXe ¡ smokers ¡ – “old” ¡cigar/pipe ¡smokers ¡with ¡“old” ¡cigareXe ¡smokers ¡ • Or ¡beXer, ¡compare: ¡ – Young, ¡middle ¡aged, ¡old ¡ – Even ¡more ¡age ¡subclasses ¡ • Design ¡phase, ¡no ¡outcome ¡data, ¡objec*ve: ¡ – Approximates ¡a ¡randomized ¡trial ¡within ¡subclasses ¡ • Now ¡look ¡at ¡outcome ¡data ¡ Reference: ¡Rubin ¡DB. ¡The ¡Design ¡Versus ¡the ¡Analysis ¡of ¡Observa*onal ¡Studies ¡for ¡Causal ¡ ¡ Effects: ¡ ¡Parallels ¡With ¡The ¡Design ¡of ¡Randomized ¡Trials. ¡Sta*s*cs ¡in ¡Medicine ¡2007 ¡ 9 ¡

Comparison ¡of ¡Mortality ¡Rates ¡for ¡Two ¡ Smoking ¡Groups ¡in ¡U.S. ¡ CigareXe ¡ Cigar/Pipe ¡ Smokers ¡ Smokers ¡ Variable ¡ Mortality ¡Rates ¡per ¡1000 ¡ 13.5 ¡ 17.4 ¡ person-‑years, ¡% ¡ Adjusted ¡Mortality ¡Rates ¡ using ¡subclasses, ¡% ¡ ¡ ¡ ¡2 ¡age ¡subclasses ¡ 16.4 ¡ 14.9 ¡ ¡ ¡ ¡3 ¡age ¡subclasses ¡ 17.7 ¡ 14.2 ¡ ¡ ¡ ¡9-‑11 ¡age ¡subclasses ¡ 21.2 ¡ 13.7 ¡ Source: ¡ ¡Cochran ¡WG. ¡ ¡The ¡effec*veness ¡of ¡adjustment ¡of ¡subclassifica*on ¡in ¡ ¡ removing ¡bias ¡in ¡observa*onal ¡studies. ¡ ¡Biometrics ¡1968; ¡24:295-‑313. ¡ Note: ¡ ¡20 ¡four-‑level ¡covariates ¡ ⇒ ¡over ¡million ¡million ¡subclasses ¡ 10 ¡

Propensity ¡Score ¡Methods ¡ • Rosenbaum ¡and ¡Rubin. ¡“The ¡Central ¡Role ¡of ¡the ¡Propensity ¡Score ¡in ¡ Observa*onal ¡Studies.” ¡Biometrika ¡1983. ¡ • Observa*onal ¡study ¡analogue ¡of ¡randomiza*on ¡ • The ¡propensity ¡score ¡is ¡the ¡probability ¡of ¡treatment ¡versus ¡control ¡ as ¡a ¡func*on ¡of ¡observed ¡covariates ¡ – Model ¡the ¡reasons ¡for ¡treatment ¡versus ¡control ¡at ¡the ¡level ¡of ¡the ¡ decision ¡makers ¡ – For ¡example, ¡logis*c ¡regression ¡model ¡to ¡predict ¡cigareXe ¡versus ¡ cigar/pipe ¡smoking ¡with ¡age, ¡educa*on, ¡income, ¡etc. ¡as ¡predictors ¡ • Then ¡subclassify ¡(or ¡match) ¡on ¡the ¡propensity ¡score ¡as ¡if ¡it ¡were ¡the ¡ only ¡covariate, ¡e.g., ¡5-‑10 ¡subclasses ¡ • If ¡correctly ¡done, ¡this ¡creates ¡balance ¡within ¡each ¡subclass ¡on ¡ ALL ¡ covariates ¡used ¡in ¡es*ma*ng ¡the ¡propensity ¡score ¡ • Using ¡diagnos*cs ¡to ¡assess ¡and ¡to ¡document ¡balance ¡is ¡cri*cal ¡ 11 ¡

Example: ¡ ¡GAO ¡Study ¡of ¡Breast ¡ Conserva*on ¡versus ¡Mastectomy ¡ • Six ¡large ¡and ¡expensive ¡randomized ¡clinical ¡trials ¡had ¡been ¡ completed ¡showing ¡liXle ¡difference ¡for ¡the ¡type ¡of ¡women ¡ randomized ¡in ¡the ¡trials ¡and ¡par*cipa*ng ¡clinics ¡ • Ques*on: ¡ ¡Same ¡results ¡in ¡general ¡prac*ce? ¡ • Observa*onal ¡data ¡available ¡ – SEER ¡Database: ¡covariates, ¡treatments, ¡post-‑surgery ¡outcomes ¡ ¡ • Design ¡phase ¡ – Hide ¡outcomes ¡ – Balance ¡covariates ¡between ¡treatment ¡and ¡control ¡ • Reasons ¡for ¡mastectomy ¡versus ¡breast ¡conserva*on ¡ – Age, ¡marital ¡status, ¡region ¡of ¡country, ¡urbaniza*on, ¡race, ¡size ¡of ¡tumor, ¡ etc. ¡ Reference: ¡ ¡Rubin ¡DB. ¡Es*mated ¡Causal ¡Effects ¡from ¡Large ¡Datasets ¡Using ¡Propensity ¡ ¡ Scores. ¡Annals ¡of ¡Internal ¡Medicine ¡1997; ¡127, ¡8(II):757-‑763. ¡ 12 ¡

For Objec*ve Causal Inference, Design Trumps Analysis - PowerPoint PPT Presentation

For Objecve Causal Inference, Design Trumps Analysis Donald B. Rubin Department of Stascs Harvard University 16 March 2012 1 Prologue to Objecve

Political Science 209 - Fall 2018 Causal Inference Florian Hollenbach 7th September 2018 Causal

Causal Effect Evaluation and Causal Network Learning Zhi Geng Peking University, China June

Causal Inference By: Miguel A. Hern an and James M. Robins Part I: Causal inference without

A Brief Introduction to Causal Inference Brady Neal causalcourse.com What is causal inference?

Introduction to Causal Inference Lan Liu University of Minnesota at Twin Cities liux3771@umn.edu

Foundations of Causal Discovery Frederick Eberhardt KDD Causality Workshop 2016 Causal Discovery

Modes of Statistical Inference for Causal Efgects Plus an overview of the testing based approach

Geographic Data Science - Lecture IX Causal Inference Dani Arribas-Bel Today Correlation Vs

Causal inference Gary Goertz Kroc Institute for International Peace Studies University of Notre

Causal Inference An introduction based on S. Wagers course on Causal Inference (OIT 661) Imke

Geographic Data Science - Lecture IX Causal Inference Dani Arribas-Bel Today Correlation Vs

Causal Inference Theory and Applications Dr. Matthias Uflacker, Johannes Huegle, Christopher

Geographic Data Science - Lecture IX Causal Inference Dani Arribas-Bel Today Correlation Vs

Causal Inference and Response Surface Modeling Inference and

Causal Programming Causal Programming Joshua Brul Joshua Brul

Few-shot Domain Adaptation 1/12 by Causal Mechanism Transfer Domain adaptation Causal mechanism

ERICA A cloud orchestration meta-framework for secure health data analytics Tim Churches SW

Chapter 6 Inference for categorical data Huamei Dong 03/15/2016 1. Quick Summary 2. Sample

Section 8: Statics - Basics Section 8: Statics - Basics 8-1 Fundamental Concepts Fundamental

Dugald Seely, ND, MSc, FABNO World Cancer Congress, August 28, 2012 Laura Weeks An

Breast MRI: Friend or Foe? Comments: 0 ASSOCIATED PRESS 8/19/2008 Applegate's cancer was

3/7/2017 Assistant to Chair 17 th Multidisciplinary Management of Cancers: A Case based

Breast Cancer Epidemiology Issues pertaining to Screening Approximately 12.3 % of women

Building Applications with Protg: An Overview Protg Conference July 23, 2006 Outline

Sambuz

Useful Links

Newsletter

Mail Us

For Objec*ve Causal Inference, Design Trumps Analysis - PowerPoint PPT Presentation

For Objec*ve Causal Inference, Design Trumps Analysis Donald B. Rubin Department of Sta*s*cs Harvard University 16 March 2012 1 Prologue to Objec*ve

Political Science 209 - Fall 2018 Causal Inference Florian Hollenbach 7th September 2018 Causal

Causal Effect Evaluation and Causal Network Learning Zhi Geng Peking University, China June

Causal Inference By: Miguel A. Hern an and James M. Robins Part I: Causal inference without

A Brief Introduction to Causal Inference Brady Neal causalcourse.com What is causal inference?

Introduction to Causal Inference Lan Liu University of Minnesota at Twin Cities liux3771@umn.edu

Foundations of Causal Discovery Frederick Eberhardt KDD Causality Workshop 2016 Causal Discovery

Modes of Statistical Inference for Causal Efgects Plus an overview of the testing based approach

Geographic Data Science - Lecture IX Causal Inference Dani Arribas-Bel Today Correlation Vs

Causal inference Gary Goertz Kroc Institute for International Peace Studies University of Notre

Causal Inference An introduction based on S. Wagers course on Causal Inference (OIT 661) Imke

Geographic Data Science - Lecture IX Causal Inference Dani Arribas-Bel Today Correlation Vs

Causal Inference Theory and Applications Dr. Matthias Uflacker, Johannes Huegle, Christopher

Geographic Data Science - Lecture IX Causal Inference Dani Arribas-Bel Today Correlation Vs

Causal Inference and Response Surface Modeling Inference and

Causal Programming Causal Programming Joshua Brul Joshua Brul

Few-shot Domain Adaptation 1/12 by Causal Mechanism Transfer Domain adaptation Causal mechanism

ERICA A cloud orchestration meta-framework for secure health data analytics Tim Churches SW

Chapter 6 Inference for categorical data Huamei Dong 03/15/2016 1. Quick Summary 2. Sample

Section 8: Statics - Basics Section 8: Statics - Basics 8-1 Fundamental Concepts Fundamental

Dugald Seely, ND, MSc, FABNO World Cancer Congress, August 28, 2012 Laura Weeks An

Breast MRI: Friend or Foe? Comments: 0 ASSOCIATED PRESS 8/19/2008 Applegate's cancer was

3/7/2017 Assistant to Chair 17 th Multidisciplinary Management of Cancers: A Case based

Breast Cancer Epidemiology Issues pertaining to Screening Approximately 12.3 % of women

Building Applications with Protg: An Overview Protg Conference July 23, 2006 Outline

Sambuz

Useful Links

Newsletter

Mail Us

For Objecve Causal Inference, Design Trumps Analysis Donald B. Rubin Department of Stascs Harvard University 16 March 2012 1 Prologue to Objecve