Causal Effect Moderation (Modification) When Treatment or Exposure - PowerPoint PPT Presentation

1 Causal Effect Moderation (Modification) When Treatment or Exposure is Time-Varying Daniel Almirall Health Services Research in Primary Care, Durham VA MC Dept of Biostatistics & Bioinformatics, Duke University MC Collaborators: Beth Ann Griffin, Rajeev Ramchand, Andrew R. Morral, Daniel F. McCaffrey, Thomas R. Ten Have, Susan A. Murphy, September 14-15, 2009 Federal Interagency Subgroups Analysis Meeting Washington, DC

Contents 2 Contents 1 Warm-up: Suppose we want A → Y . 4 2 Effect Moderation in One Time Point 7 3 Mean Model in One Time Point 11 4 The Time-Varying Setting 12 5 Robins’ Marginal Structural Model 16 6 Robins’ Structural Nested Mean Model 17

Contents 3 7 Estimation (in the Ole Days) 21 8 Conclusions 30 9 References 31

1 Warm-up: Suppose we want A → Y . 4 1 Warm-up: Suppose we want A → Y . A Y ? S Examples S = pre- A covt A = txt/expsr Y = outcome Suicidal? Medication? Depression Gender,SES SAT Coaching? SAT Math Score Social Support Inpatient vs. Outpatient Substance Abuse Why condition on (“adjust for”) pre-exposure covariables S ?

1 Warm-up: Suppose we want A → Y . 5 Suppose we want the effect of A on Y . Why condition on (adjust for) pre-treatment (or pre-exposure) variables S ? 1. Confounding : S is correlated with both A and Y . In this case, S is known as a “confounder” of the effect of A on Y . 2. Precision : S may be a pre-treatment measure of Y, or any other variable highly correlated with Y . 3. Missing Data : The outcome Y is missing for some units, S and A predict missingness, and S is associated with Y . 4. Effect Heterogeneity : S may moderate, temper, or specify the effect of A on Y . In this case, S is known as a “moderator” of the effect of A on Y .

1 Warm-up: Suppose we want A → Y . 6 Suppose we want the effect of A on Y . Why condition on (adjust for) pre-treatment (or pre-exposure) variables S ? A S Y 4. Effect Heterogeneity : S may moderate, temper, or specify the effect of A on Y . In this case, S is known as a “moderator” of the effect of A on Y . Formalized in next slide.

2 Effect Moderation in One Time Point 7 2 Effect Moderation in One Time Point µ ( s, a ) ≡ E ( Y ( a ) − Y (0) | S = s ) µ ( s ) = E( Y(inpat) − Y(outpat) | S=s ) Y(a) = Substance Use: Low is better a = 1 = residential a = 0 = outpatient µ = 0 = No Effect S = Social Support: High is better S = Social Support: High is better Outpatient substance abuse treatment is better than residential treatment for individuals with higher levels of social support.

2 Effect Moderation in One Time Point 8 Causal Effect Moderation in Context: Relevance? Theoretical Implication: Understanding the heterogeneity of the effects of treatments or exposures enhances our understanding of various (competing) scientific theories; and it may suggest new scientific hypotheses to be tested. Elaboration of Yu Xie’s Social Grouping Principle: We really want Y i ( a ) − Y i (0) ∀ i . We settle for “groupings” of effects (here, groupings by S ); µ ( s, a ) “comes closer” than E ( Y ( a ) − Y (0)) . Practical Implication: Identifying types, or subgroups, of individuals for which treatment or exposure is not effective may suggest altering the treatment to suit the needs of those types of individuals.

2 Effect Moderation in One Time Point 9 ** On Tailoring: Personalized Social, Behavioral, and Medical Treatments Programs ** The causal effect of interest (for most of us in this room) is µ ( s, a ) ≡ E ( Y ( a ) − Y (0) | S = s ) This is the Causal Effect Moderation Function . Developing tailored treatments for personalized medicine or tailored social programs is intimately tied to understanding µ ( s, a ) . This is, in fact, the driving practical motivation for what we have been working on here over the last 2 days.

2 Effect Moderation in One Time Point 10 ** On Language: Homogeneity? ** The causal effect of interest (for most of us in this room) is µ ( s, a ) ≡ E ( Y ( a ) − Y (0) | S = s ) This is the Causal Effect Moderation Function . The word homogenous is misleading even if we find that S is not a moderator. It is unlikely that the effect of treatment is homogenous (constant across the population) even if we find that the average treatment effect does not differ by S ; that is, even if we find that µ ( s, a ) is constant in S . Let’s use the phrase homogenous with respect to S .

3 Mean Model in One Time Point 11 3 Mean Model in One Time Point Decomposition of the conditional mean E ( Y ( a ) | S ) ; and the prototypical linear model: E ( Y ( a ) | S = s ) = E ( Y (0) | S = 0) � � + E ( Y (0) | S = s ) − E ( Y (0) | S = 0) + E ( Y ( a ) − Y (0) | S = s ) = η 0 + φ ( s ) + µ ( s, a ) e.g. = η 0 + η 1 s + β 1 a + β 2 as. This is precisely what I would do, too.

4 The Time-Varying Setting 12 4 The Time-Varying Setting The data structure in the time-varying setting is: S 1 a 1 a 2 Y ( a 1 , a 2 ) S 2 ( a 1 ) PROSPECT (Prevention of Suicide in Primary Care Elderly: CT) ( a 1 , a 2 ) Time-varying treatment pattern; a t is binary (0,1) Y ( a 1 , a 2 ) Depression at the end of the study; continuous Suicidal Ideation at baseline visit; continuous S 1 S 2 ( a 1 ) Suicidal Ideation at second visit; continuous We were interested in assessing the causal effect of time-varying treatment for depression, as a function of other variables that may lessen or increase this effect (ie, effect moderation).

4 The Time-Varying Setting 13 The Scientific Question Dictates Model Choice This is especially important in the time-varying setting. There are two types of scientific questions involving causal effect moderation in the time-varying setting: Type A : What is the effect of switching off treatment for depression early versus later, as a function of only baseline suicidal ideation (or age, race, etc.)? Type B : What is the effect of switching off treatment for depression early versus later, as a function of baseline and time-varying suicidal ideation?

4 The Time-Varying Setting 14 What is the effect of switching off treatment for depression early versus later, as a function of baseline suicidal ideation (or age, race, etc.)? Answering this type of question involves conditioning on baseline variables (putative moderators) thought to moderate the impact of different sequences of treatment (e.g., treatment duration) on outcomes. Importantly, because they are collected at baseline, the putative moderators are not outcomes of prior treatment. Marginal Structural Models are suitable for answering these types of questions.

4 The Time-Varying Setting 15 What is the effect of switching off treatment for depression early versus later, as a function of baseline and time-varying suicidal ideation? Answering this type of question involves conditioning on both baseline and time-varying variables thought to moderate the impact of different sequences of treatment on outcomes. The issue here is that the intermediate time-varying moderators are themselves likely impacted by prior treatment. This has conceptual as well as statistical implications (more on this later). Structural Nested Mean Models are suitable for answering these types of questions.

5 Robins’ Marginal Structural Model 16 5 Robins’ Marginal Structural Model The MSM for the conditional mean of Y ( a 1 , a 2 ) given S 1 is: � � E Y ( a 1 , a 2 ) | S 1 = E ( Y (0 , 0) | S 1 ) � � + E Y ( a 1 , 0 ) − Y ( 0 , 0 ) | S 1 � � + E Y ( a 1 , a 2 ) − Y ( a 1 , 0 ) | S 1 = µ 0 ( s 1 ) + µ 1 ( s 1 , a 1 ) + µ 2 ( s 1 , a 1 , a 2 ) e.g. = β 01 + β 02 s 1 + β 10 a 1 + β 11 a 1 s 1 + β 20 a 2 + β 21 a 2 s 1

6 Robins’ Structural Nested Mean Model 17 6 Robins’ Structural Nested Mean Model The SNMM for the conditional mean of Y ( a 1 , a 2 ) given ¯ S 2 ( a 1 ) is: � � Y ( a 1 , a 2 ) | S 1 , S 2 ( a 1 ) E � � = E ( Y (0 , 0)) + E ( Y (0 , 0) | S 1 ) − E ( Y (0 , 0)) � �� + Y ( a 1 , 0 ) − Y ( 0 , 0 ) | S 1 E � � E ( Y ( a 1 , 0) | ¯ + S 2 ( a 1 )) − E ( Y ( a 1 , 0) | S 1 ) � �� Y ( a 1 , a 2 ) − Y ( a 1 , 0 ) | ¯ � + S 2 ( a 1 ) E = µ 0 + ǫ 1 ( s 1 ) + µ 1 ( s 1 , a 1 ) + ǫ 2 (¯ s 2 , a 1 ) + µ 2 ( ¯ a 2 ) s 2 , ¯ e.g. = µ 0 + ǫ 1 ( s 1 ) + β 10 a 1 + β 11 a 1 s 1 + ǫ 2 (¯ s 2 , a 1 ) + β 20 a 2 + β 21 a 2 s 1 + β 22 a 2 s 2

6 Robins’ Structural Nested Mean Model 18 Constraints on the Causal and Nuisance Portions Y ( a 1 , a 2 ) | ¯ � � E S 2 ( a 1 ) = ¯ s 2 = µ 0 + ǫ 1 ( s 1 ) + µ 1 ( s 1 , a 1 ) + ǫ 2 (¯ s 2 , a 1 ) + µ 2 ( ¯ a 2 ) , where s 2 , ¯ · µ 2 (¯ s 2 , a 2 , 0) = 0 and µ 1 ( s 1 , 0) = 0 , s 2 , a 1 ) = E ( Y ( a 1 , 0) | ¯ · ǫ 2 (¯ S 2 ( a 1 ) = ¯ s 2 ) − E ( Y ( a 1 , 0) | S 1 = s 1 ) , · ǫ 1 ( s 1 ) = E ( Y (0 , 0) | S 1 = s 1 ) − E ( Y (0 , 0)) , · E S 2 | S 1 ( ǫ 2 (¯ s 2 , a 1 ) | S 1 = s 1 ) = 0 , and E S 1 ( ǫ 1 ( s 1 )) = 0 . The ǫ t ’s make the SNMM a non-standard regression model.

6 Robins’ Structural Nested Mean Model 19 Time-Varying Causal Effects of the SNMM Conditional Intermediate Causal Effect at t = 2 : µ 2 ( ¯ a 2 ) = E [ Y ( a 1 , a 2 ) − Y ( a 1 , 0) | S 1 = s 1 , S 2 ( a 1 ) = s 2 ] s 2 , ¯ a 2 a 1 Y ( a 1 , a 2 ) S 1 S 2 ( a 1 ) Conditional Intermediate Causal Effect at t = 1 : µ 1 ( s 1 , a 1 ) = E [ Y ( a 1 , 0) − Y (0 , 0) | S 1 = s 1 ] Set a 1 a 2 = 0 Y ( a 1 , 0) S 1

Causal Effect Moderation (Modification) When Treatment or Exposure - PowerPoint PPT Presentation

1 Causal Effect Moderation (Modification) When Treatment or Exposure is Time-Varying Daniel Almirall Health Services Research in Primary Care, Durham VA MC Dept of Biostatistics & Bioinformatics, Duke University MC Collaborators: Beth Ann

Causal Effect Evaluation and Causal Network Learning Zhi Geng Peking University, China June

Estimating time-varying causal effect Introduction moderation in mobile health with binary

Political Science 209 - Fall 2018 Causal Inference Florian Hollenbach 7th September 2018 Causal

Foundations of Causal Discovery Frederick Eberhardt KDD Causality Workshop 2016 Causal Discovery

Causal Inference By: Miguel A. Hern an and James M. Robins Part I: Causal inference without

On the Sources of the Great Moderation Jordi Gal and Luca Gambetti May 2008 Jordi Gal and

Content Moderation In Core Content Moderation In Core A Presentation By A Presentation By Eric

Structural Nested Mean Models for Assessing Time-Varying Causal Effect Moderation Daniel Almirall

Randomized Experiments The goal of randomized experiments is to identify The causal

Causal Programming Causal Programming Joshua Brul Joshua Brul

Few-shot Domain Adaptation 1/12 by Causal Mechanism Transfer Domain adaptation Causal mechanism

Causal Discovery from Observational Data Brady Neal causalcourse.com What if we dont have

A Brief Introduction to Causal Inference Brady Neal causalcourse.com What is causal inference?

Classifying Treatment Responders Under Causal Effect Monotonicity Nathan Kallus ORIE and Cornell

Data-efficient causal effect estimation Adith Swaminathan adswamin@microsoft.com Joint work with

Can one extract causal information from high-dimensional observational data? Applied Multivariate

MERIT (A4001026) Trial Maraviroc versus Efavirenz, both with Zidovudine-Lamivudine MERIT

Guiding stars for physics beyond SM: Higgs boson and dark matter , Shou-hua Zhu ITP,

EnterpriseandDesktopSearch Lecture2:SearchingtheEnterprise Web

a On the Management of Vehicular Traffic HYP2012 Massimiliano D. Rosini mrosini@icm.edu.pl

Development of Mogamulizumab, a defucosylated anti-CCR4 humanized monoclonal antibody Michinori

Teleconference on Zika for Title V Leaders May 26, 2016 Agenda: I. Introduction and Overview

What is happening to the world? Professor Sir John Beddington Oxford Martin School, Oxford

Semantic Question Answering on Big Data Tatiana Erekhinskaya July, 2016 The Goal Challenge:

Sambuz

Useful Links

Newsletter

Mail Us