

SLIDE 1

Bayesian Causal Inference in High Dimensional Data Settings

Jacob Spertus and Sharon-Lise Normand

Harvard Medical School & School of Public Health
Funded by R01-GM111339 and U01-FDA004493
Thanks to Organizer: Mariel Finucane, Mathematica Policy Research

June 2017

sharon@hcp.med.harvard.edu (HMS) AcademyHealth 2017 June 2017 1 / 14

SLIDE 2

Introduction

OUTLINE

Motivating problem
What has been done
What we add to the literature
Revisit motivating problem
Concluding remarks
Thanks


SLIDE 3

Introduction Motivating problem

DRUG ELUTING (DES) VERSUS BARE METAL (BMS) CORONARY STENTS

DES (approved 2003) and BMS (approved in the 1990s) are frequently implanted to keep treated arteries clear & supported
DES improves target-vessel revascularization (TVR) more than BMS
DES associated with late stent thrombosis (death)
Have 9000 patients and 500 confounders
Do DES cause fewer revascularizations compared to BMS?

MASSACHUSETTS, 2011

Characteristic               BMS     DES
Outcomes, %
  1-Year Mortality           10.2    3.3
  1-Year TVR                  9.0    6.5
Confounders
  Age, yrs                   66.4   63.7
  STEMI, %                   35.7   18.2
  Cardiomyopathy or LVSD, %  11.1    8.4
  Emergent, %                38.3   20.3
  Shock, %                    3.8    0.8

STEMI = ST-elevated myocardial infarction; LVSD = left ventricular systolic dysfunction
Spertus and Normand, 2017 (under review)

SLIDE 4

Introduction Motivating problem

CONSIDERATIONS

Observational setting: no randomization
Combination of clinical and claims data

Clinical data obtained by trained data managers in each hospital
Approximately 131 confounders from clinical data (demographics, pre-existing conditions, presentation severity, procedural characteristics)
Approximately 500 confounders in claims data (Present on Admission codes)
Sparsity: considered only claims diagnoses coded as present for at least 10 patients

Goals: avoid strong parametric specifications, adhere to the ignorable treatment assignment assumption, and adopt a design-based approach
Final dataset: 8718 patients and 495 potential confounders
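The sparsity screen described above can be sketched in a few lines. Everything here (the 0/1 indicator matrix, the dimensions, the rarity of the codes) is simulated purely for illustration; this is not the authors' preprocessing code.

```python
import numpy as np

# Minimal sketch of the sparsity screen: keep a claims diagnosis code
# as a candidate confounder only if it is coded as present for at
# least 10 patients. Data and dimensions are made up.
rng = np.random.default_rng(0)
n_patients, n_codes = 9000, 500

# Simulated 0/1 Present-on-Admission indicator matrix (most codes rare).
X_claims = (rng.random((n_patients, n_codes)) < 0.002).astype(int)

min_count = 10
keep = X_claims.sum(axis=0) >= min_count   # boolean mask over columns
X_kept = X_claims[:, keep]

print(f"retained {keep.sum()} of {n_codes} claims codes")
```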


SLIDE 5

What has been done

PROPENSITY SCORES

With high-dimensional data

Schneeweiss et al. (2009) proposed an algorithm for the high-dimensional setting

But there is no accounting for uncertainty in variable selection, & the outcome is used to inform the propensity score model

Bayesian?

McCandless et al., 2009: uses the propensity score as a latent variable and jointly models the latent score and the outcome model
Kaplan and Chen, 2012: 2-step approach (propensity score model, then a parametric outcome model)
Saarela et al., 2015: marginal model specification coupled with inverse probability of treatment weighting; 2-step procedure

†Zigler and Dominici, 2014: include the propensity score as a linear predictor in the outcome model & use Bayesian model averaging

†Only Bayesian method proposed in the high-dimensional setting

SLIDE 6

What we add

APPROACH

WANT: Δ = E(Y_DES) − E(Y_BMS)

Two-step approach:
1. Estimate the propensity score model via regularization
2. Assume a binomial likelihood for the binary outcome, weighted to generate a pseudo-population

Not Bayesian per se
Incorporates uncertainty from propensity score estimation
Addresses the large-k problem
Simple diagnostic tools to assess balancing properties
Maintains separation between treatment and outcome
Outcome model does not assume a parametric function of treatment


SLIDE 7

What we add Models

MODELS (T Binary Treatment, Y Binary Outcome, X Confounders, k Large)

Step 1: Treatment Model

T_i ~ Bern(π(X_i))
π(X_i) = logit⁻¹( β_0 + Σ_{j=1}^{k} β_j X_{ij} )
π(X_i) = propensity score

Priors required for β_j
Typically centered at 0
Horseshoe prior (Carvalho et al., 2010):
  β_j ~ N(0, λ_j² τ²)
  λ_j, τ ~ Cauchy⁺(0, 1)
Mimics Bayesian Model Averaging (with heavy-tailed discrete mixtures)
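The horseshoe specification can be illustrated by sampling from the prior itself. This is a minimal sketch with invented seed and dimensions, not the slide's fitting code; it only shows the prior's characteristic shape: most coefficients are shrunk near zero, while the heavy Cauchy tails occasionally produce large draws that escape shrinkage.

```python
import numpy as np

# Sample from the horseshoe prior:
#   beta_j | lambda_j, tau ~ N(0, lambda_j^2 * tau^2)
#   lambda_j, tau ~ half-Cauchy(0, 1)
rng = np.random.default_rng(1)
k = 100_000

tau = abs(rng.standard_cauchy())         # global shrinkage scale
lam = np.abs(rng.standard_cauchy(k))     # local scales, one per coefficient
beta = rng.normal(0.0, lam * tau)        # conditionally normal coefficients

share_small = np.mean(np.abs(beta) < 0.1 * tau)
print(f"fraction of draws with |beta_j| < 0.1 * tau: {share_small:.2f}")
```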


SLIDE 8

What we add Models

MODELS (T Binary Treatment, Y Binary Outcome, X Confounders, k Large)

Step 2: Outcome Model

Y_T | n_T, p_T, π(X) ~ Binomial(n_T, p_T)
p_T | α_T0, α_T1 ~ Beta(α_T0, α_T1)

n_T = number of subjects receiving treatment T
p_T = probability of outcome under treatment T
Independent samples in the treatment groups

A posteriori: p_T | Y, π(X) ~ Beta(a_T, b_T)


SLIDE 9

What we add Models

MODEL (T Binary Treatment, Y Binary Outcome, X Confounders, k Large)

Posterior Distribution: p_T | Y, π(X) ~ Beta(a_T, b_T)

a_1 = α_11 + γ_1 Σ_{i=1}^{n} T_i Y_i / π_i
b_1 = α_10 + γ_1 Σ_{i=1}^{n} T_i (1 − Y_i) / π_i
a_0 = α_00 + γ_0 Σ_{i=1}^{n} (1 − T_i) Y_i / (1 − π_i)
b_0 = α_01 + γ_0 Σ_{i=1}^{n} (1 − T_i) (1 − Y_i) / (1 − π_i)

(each summand, e.g. T_i Y_i / π_i, is an inverse-probability weight applied to the outcome indicator)

Renormalization terms:
γ_1 = Σ_{i=1}^{n} T_i / Σ_{i=1}^{n} (T_i / π_i)
γ_0 = Σ_{i=1}^{n} (1 − T_i) / Σ_{i=1}^{n} ((1 − T_i) / (1 − π_i))
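The closed-form Beta posteriors lend themselves to a direct numerical sketch. Everything below is an assumption for illustration: the data are simulated with a null treatment effect, the priors are taken flat (all α's = 1), and the propensity scores are stand-ins; in the actual analysis π would be a posterior draw from the Step-1 treatment model.

```python
import numpy as np

# Sketch of Step 2: compute a_T, b_T from the formulas above and draw
# from the resulting Beta posteriors for p_1, p_0, and Delta.
rng = np.random.default_rng(2)
n = 1000
pi = rng.uniform(0.2, 0.8, n)    # stand-in propensity scores
T = rng.binomial(1, pi)          # binary treatment
Y = rng.binomial(1, 0.1, n)      # binary outcome (null effect)

# Renormalization terms gamma_1 and gamma_0.
g1 = T.sum() / (T / pi).sum()
g0 = (1 - T).sum() / ((1 - T) / (1 - pi)).sum()

# Beta posterior parameters, with flat Beta(1, 1) priors.
a1 = 1 + g1 * (T * Y / pi).sum()
b1 = 1 + g1 * (T * (1 - Y) / pi).sum()
a0 = 1 + g0 * ((1 - T) * Y / (1 - pi)).sum()
b0 = 1 + g0 * ((1 - T) * (1 - Y) / (1 - pi)).sum()

# Posterior draws of p_1, p_0, and the causal contrast Delta.
p1 = rng.beta(a1, b1, 5000)
p0 = rng.beta(a0, b0, 5000)
delta = p1 - p0
print(f"posterior mean of Delta: {delta.mean():.3f}")
```

Note that the renormalization makes the pseudo-counts sum to the actual group sizes: a_1 + b_1 equals n_1 plus the prior mass, which is what justifies the Binomial(n_T, p_T) likelihood on the weighted pseudo-population.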

SLIDE 10

What we add Operating characteristics

DOES UNCERTAINTY IN STEP 1 MATTER?

500 simulations, n = 1000, k = 100 (18 β_j = 0), P(Y_i = 1) ≈ 0.10
BART = Bayesian Additive Regression Trees

                              Δ̂ Bias   95% CI Width   95% CI Coverage
Integrated Propensity Score
  Student-t_3(0, 3²)           −.011       .220            95.2%
  Horseshoe Priors              .016       .110            93.0%
  BART                          .011       .123            96.8%
Mean Propensity Score
  Student-t_3(0, 3²)           −.001       .095            79.2%
  Horseshoe Priors              .018       .092            86.0%
  BART                          .015       .093            87.2%
Other Methods
  Naive Estimate                .030       .092            73.0%
  IPW                          −.001       .151            92.8%
  TMLE                          .006       .075            81.4%

Bottom Line: useful to integrate over the propensity score distribution for large k
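The bottom line can be illustrated with a toy simulation. Here the "posterior draws" of the propensity score are faked as noisy logit perturbations and the data are simulated with a null effect, so this is a sketch of the mechanism (integrating over propensity draws captures variability that a plug-in at the posterior-mean score ignores), not a replication of the table.

```python
import numpy as np

# Compare the estimand integrated over propensity-score draws against
# a plug-in at the mean score. All data and "draws" are invented.
rng = np.random.default_rng(3)
n, n_draws = 1000, 500

true_pi = rng.uniform(0.2, 0.8, n)
T = rng.binomial(1, true_pi)
Y = rng.binomial(1, 0.1, n)      # same outcome rate in both arms

# Fake "posterior draws" of each propensity score on the logit scale.
logit = np.log(true_pi / (1 - true_pi))
pi_draws = 1 / (1 + np.exp(-(logit + rng.normal(0, 0.5, (n_draws, n)))))

def delta_hat(pi):
    """Renormalized (Hajek-style) IPW estimate of E(Y_1) - E(Y_0)."""
    p1 = (T * Y / pi).sum() / (T / pi).sum()
    p0 = ((1 - T) * Y / (1 - pi)).sum() / ((1 - T) / (1 - pi)).sum()
    return p1 - p0

deltas = np.array([delta_hat(p) for p in pi_draws])   # integrated
delta_mean_pi = delta_hat(pi_draws.mean(axis=0))      # plug-in at mean score

print(f"spread of Delta across propensity draws: {deltas.std():.4f}")
print(f"plug-in Delta at posterior-mean score:   {delta_mean_pi:.4f}")
```

The plug-in value is a single number, so an interval built around it omits the spread visible in `deltas`, which is the mechanism behind the undercoverage in the Mean Propensity Score rows.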


SLIDE 11

What we add Causal estimates for BMS vs DES

WEIGHTED STANDARDIZED MEAN DIFFERENCES IN CONFOUNDERS


SLIDE 12

What we add Causal estimates for BMS vs DES

1-YEAR TARGET VESSEL REVASCULARIZATION

[Forest plot: 1-year target vessel revascularization difference (%), axis from −8 to 2; frequentist estimators (Naive, IPW, TMLE) and Bayesian estimators (t(0,9), HShoe, BART); left of zero favours DES, right favours BMS]



SLIDE 14

Conclusions

SUMMARY OF OUR APPROACH

Advantages
  Not fully Bayesian
  Propensity score is simple, widely used, and familiar
  Easy to assess balancing properties
  Fixing the propensity score at its posterior mean:
    Good coverage for small k
    Underestimates variance for large k
    Some bias
  Maintains separation between treatment and outcome models
  Honestly reflects uncertainty

Disadvantages
  Not fully Bayesian
  Robustness?
  Binomial likelihood
  Balancing property may be lost when incorporating uncertainty

Thank you!
