What works in Boston may not work in Los Angeles: Understanding site - - PowerPoint PPT Presentation

what works in boston may not work in los angeles
SMART_READER_LITE
LIVE PREVIEW

What works in Boston may not work in Los Angeles: Understanding site - - PowerPoint PPT Presentation

What works in Boston may not work in Los Angeles: Understanding site di ff erences and generalizing e ff ects from one site to another. Kara Rudolph with Mark van der Laan RWJF Health and Society Scholar UC Berkeley / UC San Francisco Kara


slide-1
SLIDE 1

What works in Boston may not work in Los Angeles: Understanding site differences and generalizing effects from one site to another.

Kara Rudolph with Mark van der Laan

RWJF Health and Society Scholar UC Berkeley / UC San Francisco

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 1 / 39

slide-2
SLIDE 2

Outline

1

Motivation Motivating example

2

Methodologic Challenges

3

Approach

4

Results

5

Future directions

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 2 / 39

slide-3
SLIDE 3

Motivation

Should we expect that a policy/program/intervention implemented in

  • ne place will have the same effect when implemented in another

place?

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 3 / 39

slide-4
SLIDE 4

Motivation

Should we expect that a policy/program/intervention implemented in

  • ne place will have the same effect when implemented in another

place? Not always.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 3 / 39

slide-5
SLIDE 5

Motivation

Should we expect that a policy/program/intervention implemented in

  • ne place will have the same effect when implemented in another

place? Not always.

1

Differences in site-level variables (e.g., implementation, economy) that modify intervention effectiveness, AND/OR

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 3 / 39

slide-6
SLIDE 6

Motivation

Should we expect that a policy/program/intervention implemented in

  • ne place will have the same effect when implemented in another

place? Not always.

1

Differences in site-level variables (e.g., implementation, economy) that modify intervention effectiveness, AND/OR

2

Differences in person-level variables (i.e, population composition) that modify intervention effectiveness.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 3 / 39

slide-7
SLIDE 7

Motivation

Budgets are limited.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 4 / 39

slide-8
SLIDE 8

Motivation

Budgets are limited. Need to target the policy/intervention to those places that stand to benefit most.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 4 / 39

slide-9
SLIDE 9

Motivation

Budgets are limited. Need to target the policy/intervention to those places that stand to benefit most.

E.g., planned expansion of intervention. Where should it be expanded to have the largest effect? How is success defined?

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 4 / 39

slide-10
SLIDE 10

Motivation

Budgets are limited. Need to target the policy/intervention to those places that stand to benefit most.

E.g., planned expansion of intervention. Where should it be expanded to have the largest effect? How is success defined? Can you think of any practical examples of this?

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 4 / 39

slide-11
SLIDE 11

Motivation

Budgets are limited. Need to target the policy/intervention to those places that stand to benefit most.

E.g., planned expansion of intervention. Where should it be expanded to have the largest effect? How is success defined? Can you think of any practical examples of this?

Research question: What do we expect the effect of an intervention to be in a new place, accounting for population composition?

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 4 / 39

slide-12
SLIDE 12

Outline

1

Motivation Motivating example

2

Methodologic Challenges

3

Approach

4

Results

5

Future directions

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 5 / 39

slide-13
SLIDE 13

Motivating Example

Moving To Opportunity (MTO)1

https://upload.wikimedia.org/ http://www.chicagomag.com

1Kling, J. R. et al. Experimental analysis of neighborhood effects.

Econometrica 75, 83–119 (2007). Kara Rudolph (UCB/UCSF) Generalizing effects across sites 6 / 39

slide-14
SLIDE 14

Motivating Example

In discussing differences in effects across sites, MTO researchers concluded: Of course, if it had been possible to attribute differences in impacts across sites to differences in site characteristics, that would have been very valuable information. Unfortunately, that was not possible. With only five sites, which differ in innumerable potentially relevant ways, it was simply not possible to disentangle the underlying factors that cause impacts to vary across sites. (This is true for both the quantitative analysis and for any qualitative analysis of the impacts that might be undertaken.)2 Why are the researchers saying this? Do you agree?

2Orr, L. et al. Moving to opportunity: Interim impacts evaluation.

(2003), p.B11. Kara Rudolph (UCB/UCSF) Generalizing effects across sites 7 / 39

slide-15
SLIDE 15

Motivating Example

Research Question (MTO-specific): Are differences in intervention effects across cities due to differences in implementation? City-level differences (e.g, the economy)? Or differences in population composition?

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 8 / 39

slide-16
SLIDE 16

Outline

1

Motivation Motivating example

2

Methodologic Challenges

3

Approach

4

Results

5

Future directions

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 9 / 39

slide-17
SLIDE 17

Methodologic Challenges

Typically, multi-site data are analyzed using fixed effects.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 10 / 39

slide-18
SLIDE 18

Methodologic Challenges

Typically, multi-site data are analyzed using fixed effects.

Usually assumes that we answered “Yes” to whether we expect the intervention effect in one site is the same as the other site.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 10 / 39

slide-19
SLIDE 19

Methodologic Challenges

Typically, multi-site data are analyzed using fixed effects.

Usually assumes that we answered “Yes” to whether we expect the intervention effect in one site is the same as the other site. Why is that the case?

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 10 / 39

slide-20
SLIDE 20

Methodologic Challenges

Typically, multi-site data are analyzed using fixed effects.

Usually assumes that we answered “Yes” to whether we expect the intervention effect in one site is the same as the other site. Why is that the case? Dummy variables for site changes the intercept but not the treatment effect coefficient. Assume that the conditional effect (regression coefficient) of the intervention in one site is the same as in another site.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 10 / 39

slide-21
SLIDE 21

Methodologic Challenges

Typically, multi-site data are analyzed using fixed effects.

Usually assumes that we answered “Yes” to whether we expect the intervention effect in one site is the same as the other site. Why is that the case? Dummy variables for site changes the intercept but not the treatment effect coefficient. Assume that the conditional effect (regression coefficient) of the intervention in one site is the same as in another site.

Need to apply the results from one city/site to a target city/site based on the observed population composition.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 10 / 39

slide-22
SLIDE 22

Methodologic Challenges

Typically, multi-site data are analyzed using fixed effects.

Usually assumes that we answered “Yes” to whether we expect the intervention effect in one site is the same as the other site. Why is that the case? Dummy variables for site changes the intercept but not the treatment effect coefficient. Assume that the conditional effect (regression coefficient) of the intervention in one site is the same as in another site.

Need to apply the results from one city/site to a target city/site based on the observed population composition.

Transportability/ generalizability/ external validity.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 10 / 39

slide-23
SLIDE 23

What’s been done

Most common: Use fixed effects for site.

3Miettinen, O. S. Standardization of risk ratios.

American Journal of Epidemiology 96, 383–388 (1972). Kara Rudolph (UCB/UCSF) Generalizing effects across sites 11 / 39

slide-24
SLIDE 24

What’s been done

Most common: Use fixed effects for site.

  • Conditional effect is not as policy relevant as marginal effect

3Miettinen, O. S. Standardization of risk ratios.

American Journal of Epidemiology 96, 383–388 (1972). Kara Rudolph (UCB/UCSF) Generalizing effects across sites 11 / 39

slide-25
SLIDE 25

What’s been done

Most common: Use fixed effects for site.

  • Conditional effect is not as policy relevant as marginal effect
  • We usually don’t believe the assumption

3Miettinen, O. S. Standardization of risk ratios.

American Journal of Epidemiology 96, 383–388 (1972). Kara Rudolph (UCB/UCSF) Generalizing effects across sites 11 / 39

slide-26
SLIDE 26

What’s been done

Most common: Use fixed effects for site.

  • Conditional effect is not as policy relevant as marginal effect
  • We usually don’t believe the assumption

Common-ish: Post-stratification/ direct standardization.3E.g., age-adjusted rates of disease for comparisons between populations.

3Miettinen, O. S. Standardization of risk ratios.

American Journal of Epidemiology 96, 383–388 (1972). Kara Rudolph (UCB/UCSF) Generalizing effects across sites 11 / 39

slide-27
SLIDE 27

What’s been done

Most common: Use fixed effects for site.

  • Conditional effect is not as policy relevant as marginal effect
  • We usually don’t believe the assumption

Common-ish: Post-stratification/ direct standardization.3E.g., age-adjusted rates of disease for comparisons between populations.

  • Breaks down with continuous characteristics or multiple characterstics

because of small cell sizes

3Miettinen, O. S. Standardization of risk ratios.

American Journal of Epidemiology 96, 383–388 (1972). Kara Rudolph (UCB/UCSF) Generalizing effects across sites 11 / 39

slide-28
SLIDE 28

What’s been done

Most common: Use fixed effects for site.

  • Conditional effect is not as policy relevant as marginal effect
  • We usually don’t believe the assumption

Common-ish: Post-stratification/ direct standardization.3E.g., age-adjusted rates of disease for comparisons between populations.

  • Breaks down with continuous characteristics or multiple characterstics

because of small cell sizes

  • No standard errors/ no inference

3Miettinen, O. S. Standardization of risk ratios.

American Journal of Epidemiology 96, 383–388 (1972). Kara Rudolph (UCB/UCSF) Generalizing effects across sites 11 / 39

slide-29
SLIDE 29

What’s been done

Less common, rare: Model-based approaches: Horvitz-Thompson weighting (model-based standardization),4propensity score matching,5and principal stratification6

4Cole, S. R. & Stuart, E. A. Generalizing Evidence From Randomized Clinical Trials to Target Populations The ACTG 320

Trial. American journal of epidemiology 172, 107–115 (2010).

5Stuart, E. A. et al. The use of propensity scores to assess the generalizability of results from randomized trials.

Journal of the Royal Statistical Society: Series A (Statistics in Society) 174, 369–386 (2011).

6Frangakis, C. The calibration of treatment effects from clinical trials to target populations.

Clinical trials (London, England) 6, 136 (2009).

7Pearl, J. & Bareinboim, E. Transportability across studies: A formal approach tech. rep. (DTIC Document, 2011).

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 12 / 39

slide-30
SLIDE 30

What’s been done

Less common, rare: Model-based approaches: Horvitz-Thompson weighting (model-based standardization),4propensity score matching,5and principal stratification6

  • Relies on correct model specification

4Cole, S. R. & Stuart, E. A. Generalizing Evidence From Randomized Clinical Trials to Target Populations The ACTG 320

Trial. American journal of epidemiology 172, 107–115 (2010).

5Stuart, E. A. et al. The use of propensity scores to assess the generalizability of results from randomized trials.

Journal of the Royal Statistical Society: Series A (Statistics in Society) 174, 369–386 (2011).

6Frangakis, C. The calibration of treatment effects from clinical trials to target populations.

Clinical trials (London, England) 6, 136 (2009).

7Pearl, J. & Bareinboim, E. Transportability across studies: A formal approach tech. rep. (DTIC Document, 2011).

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 12 / 39

slide-31
SLIDE 31

What’s been done

Less common, rare: Model-based approaches: Horvitz-Thompson weighting (model-based standardization),4propensity score matching,5and principal stratification6

  • Relies on correct model specification
  • Inference with machine learning is unclear

4Cole, S. R. & Stuart, E. A. Generalizing Evidence From Randomized Clinical Trials to Target Populations The ACTG 320

Trial. American journal of epidemiology 172, 107–115 (2010).

5Stuart, E. A. et al. The use of propensity scores to assess the generalizability of results from randomized trials.

Journal of the Royal Statistical Society: Series A (Statistics in Society) 174, 369–386 (2011).

6Frangakis, C. The calibration of treatment effects from clinical trials to target populations.

Clinical trials (London, England) 6, 136 (2009).

7Pearl, J. & Bareinboim, E. Transportability across studies: A formal approach tech. rep. (DTIC Document, 2011).

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 12 / 39

slide-32
SLIDE 32

What’s been done

Less common, rare: Model-based approaches: Horvitz-Thompson weighting (model-based standardization),4propensity score matching,5and principal stratification6

  • Relies on correct model specification
  • Inference with machine learning is unclear
  • With exception of principal stratification, have not been extended to

encouragement-design interventions

4Cole, S. R. & Stuart, E. A. Generalizing Evidence From Randomized Clinical Trials to Target Populations The ACTG 320

Trial. American journal of epidemiology 172, 107–115 (2010).

5Stuart, E. A. et al. The use of propensity scores to assess the generalizability of results from randomized trials.

Journal of the Royal Statistical Society: Series A (Statistics in Society) 174, 369–386 (2011).

6Frangakis, C. The calibration of treatment effects from clinical trials to target populations.

Clinical trials (London, England) 6, 136 (2009).

7Pearl, J. & Bareinboim, E. Transportability across studies: A formal approach tech. rep. (DTIC Document, 2011).

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 12 / 39

slide-33
SLIDE 33

What’s been done

Less common, rare: Model-based approaches: Horvitz-Thompson weighting (model-based standardization),4propensity score matching,5and principal stratification6

  • Relies on correct model specification
  • Inference with machine learning is unclear
  • With exception of principal stratification, have not been extended to

encouragement-design interventions

Pearl and Bareinbom: formalized theory and assumptions for transportability7

4Cole, S. R. & Stuart, E. A. Generalizing Evidence From Randomized Clinical Trials to Target Populations The ACTG 320

Trial. American journal of epidemiology 172, 107–115 (2010).

5Stuart, E. A. et al. The use of propensity scores to assess the generalizability of results from randomized trials.

Journal of the Royal Statistical Society: Series A (Statistics in Society) 174, 369–386 (2011).

6Frangakis, C. The calibration of treatment effects from clinical trials to target populations.

Clinical trials (London, England) 6, 136 (2009).

7Pearl, J. & Bareinboim, E. Transportability across studies: A formal approach tech. rep. (DTIC Document, 2011).

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 12 / 39

slide-34
SLIDE 34

Outline

1

Motivation Motivating example

2

Methodologic Challenges

3

Approach

4

Results

5

Future directions

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 13 / 39

slide-35
SLIDE 35

Our contribution

New statistical method for “transporting” effects from one population to another8 Transport formula for multi-site encouragement-design interventions (extending Pearl and Bareinboim’s work).

8Rudolph, K. E. & van der Laan, M. J. Double Robust Estimation of Encouragement-design Intervention Effects

Transported Across Sites. Under Review (2015). Kara Rudolph (UCB/UCSF) Generalizing effects across sites 14 / 39

slide-36
SLIDE 36

Our contribution

New statistical method for “transporting” effects from one population to another8 Transport formula for multi-site encouragement-design interventions (extending Pearl and Bareinboim’s work). Estimation using transport formulas addressing previous gaps:

8Rudolph, K. E. & van der Laan, M. J. Double Robust Estimation of Encouragement-design Intervention Effects

Transported Across Sites. Under Review (2015). Kara Rudolph (UCB/UCSF) Generalizing effects across sites 14 / 39

slide-37
SLIDE 37

Our contribution

New statistical method for “transporting” effects from one population to another8 Transport formula for multi-site encouragement-design interventions (extending Pearl and Bareinboim’s work). Estimation using transport formulas addressing previous gaps:

+ Inference based on theory (even when using machine learning)

8Rudolph, K. E. & van der Laan, M. J. Double Robust Estimation of Encouragement-design Intervention Effects

Transported Across Sites. Under Review (2015). Kara Rudolph (UCB/UCSF) Generalizing effects across sites 14 / 39

slide-38
SLIDE 38

Our contribution

New statistical method for “transporting” effects from one population to another8 Transport formula for multi-site encouragement-design interventions (extending Pearl and Bareinboim’s work). Estimation using transport formulas addressing previous gaps:

+ Inference based on theory (even when using machine learning) + Double robust: can misspecify multiple models and still get unbiased estimates

8Rudolph, K. E. & van der Laan, M. J. Double Robust Estimation of Encouragement-design Intervention Effects

Transported Across Sites. Under Review (2015). Kara Rudolph (UCB/UCSF) Generalizing effects across sites 14 / 39

slide-39
SLIDE 39

Problem: there are a lot of relationships to specify and we don’t know the truth!

  • S ∼ W

Z ∼ A, W , A ∗ W , S Y ∼ Z, W , Z ∗ W , S Can you guess the correct models when W is high dimensional? All interactions? Correct form (e.g, linear, quadratic, spline)? Note: A = instrument/encouragement, Z = exposure, Y = outcome, S = site, W = covariates/characteristics

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 15 / 39

slide-40
SLIDE 40

Transport estimators

Targeted maximum likelihood estimators (TMLE) for the following estimands: Effect of A on Y (intent-to-treat) Effect of Z on Y using randomization of the instrument (complier average treatment effect) Effect of Z on Y ignoring randomization Note: A = instrument/encouragement, Z = exposure, Y = outcome, S = site, W = covariates/characteristics

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 16 / 39

slide-41
SLIDE 41

In everyday language, what does TMLE do?

1 Start with identifying the parameter you’re interested in estimating.

E.g., the ITTATE, ψ.

2 Get initial estimate for ψ. E.g., run a regression of the Y model

setting A = 1 and A = 0. The difference will be the initial estimate.

3 The Y model may not be perfect. (If it is, you’re done.) This initial

estimate is then adjusted by something called the clever covariate, C, which is derived from the efficient influence curve. It uses information from the other models improve upon the initial estimate.

4 This fluctuation can be iterated until convergence. Kara Rudolph (UCB/UCSF) Generalizing effects across sites 17 / 39

slide-42
SLIDE 42

In everyday language, what does TMLE do?

  • Kara Rudolph (UCB/UCSF)

Generalizing effects across sites 18 / 39

slide-43
SLIDE 43

Outline

1

Motivation Motivating example

2

Methodologic Challenges

3

Approach

4

Results

5

Future directions

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 19 / 39

slide-44
SLIDE 44

Performance

Results for intent-to-treat effect of A on Y. Results are similar for the two other estimators. Model specification % Bias Variance Coverage MSE All models correct

  • 0.67

0.0004 95.01 0.0004 S model misspecified

  • 0.49

0.0004 95.34 0.0004 Z model misspecified

  • 0.67

0.0004 95.00 0.0004 Y model misspecified

  • 0.71

0.0005 95.36 0.0005 S,Z models misspecified

  • 0.49

0.0004 95.29 0.0004 S,Z,Y models misspecified 6.05 0.0004 94.84 0.0004 Note: A = instrument/encouragement, Z = exposure, Y = outcome, S = site, W = covariates/characteristics

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 20 / 39

slide-45
SLIDE 45

Sensitivity to positivity violations

Structural positivity violations: Person with some set of covariate values in one treatment/selection group has a zero probability of being in another treatment/selection group.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 21 / 39

slide-46
SLIDE 46

Sensitivity to positivity violations

Structural positivity violations: Person with some set of covariate values in one treatment/selection group has a zero probability of being in another treatment/selection group. Practical positivity violations: This probability isn’t strictly zero, but it’s close.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 21 / 39

slide-47
SLIDE 47

Sensitivity to positivity violations

Structural positivity violations: Person with some set of covariate values in one treatment/selection group has a zero probability of being in another treatment/selection group. Practical positivity violations: This probability isn’t strictly zero, but it’s close. Why is this a problem?

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 21 / 39

slide-48
SLIDE 48

Sensitivity to positivity violations

Practical positivity violations are a substantial issue in real world data.

Pre−Matching

2 4 0.00 0.25 0.50 0.75 1.00 Predicted probability of job strain density Less job strain More job strain

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 22 / 39

slide-49
SLIDE 49

Sensitivity to positivity violations

Practical positivity violations are a substantial issue in real world data. Why might we expect it in the example below?

Pre−Matching

2 4 0.00 0.25 0.50 0.75 1.00 Predicted probability of job strain density Less job strain More job strain

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 22 / 39

slide-50
SLIDE 50

Sensitivity to positivity violations

Which of the 3 estimands is most vulnerable to these violations using the MTO data?

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 23 / 39

slide-51
SLIDE 51

Sensitivity to positivity violations

Which of the 3 estimands is most vulnerable to these violations using the MTO data? What are some other real-world examples that might be vulnerable to positivity violations?

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 23 / 39

slide-52
SLIDE 52

Sensitivity to positivity violations

CY (A = 1) Mean (SD) Min Max EATE Data-generating mechanism 1 0.49(0.38) 0.05 2.46 Data-generating mechanism 2 1.07(1.62) 0.15×10−2 26.26 Application 2.05(2.76) 4.54×10−2 13.11

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 24 / 39

slide-53
SLIDE 53

Sensitivity to positivity violations

Specification %Bias SE×√n Cov MSE (1.60) EATE: Without Positivity Violations All models correct

  • 0.31

1.60 94.94 0.0005 S model misspecified

  • 0.38

1.46 93.68 0.0005 Z model misspecified

  • 0.31

1.48 93.01 0.0005 Y model misspecified

  • 0.29

1.62 95.09 0.0005 S,Z models misspecified

  • 0.43

1.36 92.95 0.0004 S,Z,Y models misspecified 14.46 1.37 76.27 0.0009 EATE: With Positivity Violations All models correct 0.18 3.60 91.36 0.0029 S model misspecified 1.98 1.96 86.33 0.0012 Z model misspecified 0.18 2.67 82.93 0.0029 Y model misspecified 2.09 4.17 96.05 0.0027 S,Z models misspecified 2.18 1.38 79.27 0.0009 S,Z,Y models misspecified

  • 52.11

1.41 2.49 0.0065

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 25 / 39

slide-54
SLIDE 54

Strategies for addressing positivity violations

Limit the sample to the area of support

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 26 / 39

slide-55
SLIDE 55

Strategies for addressing positivity violations

Limit the sample to the area of support Truncate weights

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 26 / 39

slide-56
SLIDE 56

Strategies for addressing positivity violations

Limit the sample to the area of support Truncate weights Exclude covariates that are neither 1) confounders of the exposure-outcome relationship, nor 2) affect transportability.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 26 / 39

slide-57
SLIDE 57

Strategies for addressing positivity violations

Limit the sample to the area of support Truncate weights Exclude covariates that are neither 1) confounders of the exposure-outcome relationship, nor 2) affect transportability. Moving the weights from the clever covariate into the model fitting step

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 26 / 39

slide-58
SLIDE 58

Strategies for addressing positivity violations

Truncation Level %Bias SE×√n Cov MSE EATE No modification 0.18 3.60 91.36 0.0029 Truncation at 0.01/100 2.29 3.23 92.50 0.0024 Truncation at 0.05/20 2.71 2.40 89.78 0.0016 Truncation at 0.1/10 2.60 1.90 84.96 0.0013

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 27 / 39

slide-59
SLIDE 59

Results

Can our new statistical method shed light on the previously intractable problem of not knowing why there are differences in effects across sites?

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 28 / 39

slide-60
SLIDE 60

Results

Can our new statistical method shed light on the previously intractable problem of not knowing why there are differences in effects across sites? We take two of the sites: LA and Boston.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 28 / 39

slide-61
SLIDE 61

Results

Can our new statistical method shed light on the previously intractable problem of not knowing why there are differences in effects across sites? We take two of the sites: LA and Boston. Outcome: adolescent school drop out at follow-up.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 28 / 39

slide-62
SLIDE 62

Results

Can our new statistical method shed light on the previously intractable problem of not knowing why there are differences in effects across sites? We take two of the sites: LA and Boston. Outcome: adolescent school drop out at follow-up. We use full data from Boston. We ignore the outcome data from LA. Using the outcome model from Boston, we predict the intervention effect in LA, accounting for differences in population composition between the two cities.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 28 / 39

slide-63
SLIDE 63

Results

Real results: Boston

  • ITTATE

CATE −1.0 −0.5 0.0 0.5 1.0 Boston LA Transported LA, TMLE Boston LA Transported LA, TMLE

Intervention Effect on Risk of School Drop Out

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 29 / 39

slide-64
SLIDE 64

Results

Predicted results: LA

  • ITTATE

CATE −1.0 −0.5 0.0 0.5 1.0 Boston LA Transported LA, TMLE Boston LA Transported LA, TMLE

Intervention Effect on Risk of School Drop Out

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 30 / 39

slide-65
SLIDE 65

Results

Predicted vs. real results: LA

  • ITTATE

CATE −1.0 −0.5 0.0 0.5 1.0 Boston LA Transported LA, TMLE Boston LA Transported LA, TMLE

Intervention Effect on Risk of School Drop Out

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 31 / 39

slide-66
SLIDE 66

Results

The transported estimates for LA are similar to true LA estimates.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 32 / 39

slide-67
SLIDE 67

Results

The transported estimates for LA are similar to true LA estimates. Using population composition, we can predict the effect for LA → intervention effect on school dropout is transportable.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 32 / 39

slide-68
SLIDE 68

Results

The transported estimates for LA are similar to true LA estimates. Using population composition, we can predict the effect for LA → intervention effect on school dropout is transportable. This means that the difference in effects between Boston and LA can be largely explained by population composition.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 32 / 39

slide-69
SLIDE 69

Aside: the importance of incorporating machine learning

  • ITTATE

CATE −3 −2 −1 1 Boston LA Transported LA, TMLE Boston LA Transported LA, TMLE

difference of probability of staying in school

model

  • none

parametric superlearner

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 33 / 39

slide-70
SLIDE 70

Superlearner9

Ensemble machine learning Weights multiple machine learning algorithms to get best prediction Guaranteed to perform at least as well as best algorithm included in the weighting

9Van der Laan, M. J. et al. Super learner.

Statistical applications in genetics and molecular biology 6 (2007). Kara Rudolph (UCB/UCSF) Generalizing effects across sites 34 / 39

slide-71
SLIDE 71

Policy implications

We should not expect an intervention/program/policy to have the same effect in one city as in another city.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 35 / 39

slide-72
SLIDE 72

Policy implications

We should not expect an intervention/program/policy to have the same effect in one city as in another city. In an era of shrinking budgets, important to recognize that what works in Boston may not work in LA, so resources can be targeted

  • ptimally.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 35 / 39

slide-73
SLIDE 73

Policy implications

We should not expect an intervention/program/policy to have the same effect in one city as in another city. In an era of shrinking budgets, important to recognize that what works in Boston may not work in LA, so resources can be targeted

  • ptimally.

Broadly useful: multi-site epidemiologic studies, large-scale policy or program interventions, clinical trials.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 35 / 39

slide-74
SLIDE 74

Outline

1

Motivation Motivating example

2

Methodologic Challenges

3

Approach

4

Results

5

Future directions

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 36 / 39

slide-75
SLIDE 75

Future Directions

Examine other strategies to reduce sensitivity to practical positivity violations, especially excluding covariates and moving the weights.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 37 / 39

slide-76
SLIDE 76

Future Directions

Examine other strategies to reduce sensitivity to practical positivity violations, especially excluding covariates and moving the weights. In-depth application of transportability to MTO to understand the relationship between neighborhood poverty and exposure to violence and violent behaviors.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 37 / 39

slide-77
SLIDE 77

Future Directions

Examine other strategies to reduce sensitivity to practical positivity violations, especially excluding covariates and moving the weights. In-depth application of transportability to MTO to understand the relationship between neighborhood poverty and exposure to violence and violent behaviors. Grant application to extend the transportability method to mediation

  • mechanisms. Examine mediation of the relationship between

neighborhood poverty on adolescent risk behaviors by the school environment.

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 37 / 39

slide-78
SLIDE 78

Future Directions

Examine other strategies to reduce sensitivity to practical positivity violations, especially excluding covariates and moving the weights. In-depth application of transportability to MTO to understand the relationship between neighborhood poverty and exposure to violence and violent behaviors. Grant application to extend the transportability method to mediation

  • mechanisms. Examine mediation of the relationship between

neighborhood poverty on adolescent risk behaviors by the school environment. Other ideas? Suggestions?

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 37 / 39

slide-79
SLIDE 79

How can I do this?

Use the R functions that I wrote

Parametric or semiparametric options

i t t a t e t m l e < −f u n c t i o n ( a , z , y , s i t e ,w, truncate , lbound ) catetmle < −f u n c t i o n ( ca , cz , cy , c s i t e , cw , ctruncate , clbound ) n o i n s t r a t e t m l e < −f u n c t i o n ( a , z , y , s i t e ,w, truncate , lbound )

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 38 / 39

slide-80
SLIDE 80

Thanks!

www.biostat.jhsph.edu/∼krudolph kara.rudolph@berkeley.edu Robert Wood Johnson Foundation Health & Society Scholars program, UCSF/UCB Collaborators Mark van der Laan, UC Berkeley Jennifer Ahern, UC Berkeley, Maria Glymour, UCSF Theresa Osypuk, University of Minnesota

Kara Rudolph (UCB/UCSF) Generalizing effects across sites 39 / 39