Counterfactual-based mediation analysis Workshop 1 Rhian Daniel - - PowerPoint PPT Presentation

counterfactual based mediation analysis workshop 1
SMART_READER_LITE
LIVE PREVIEW

Counterfactual-based mediation analysis Workshop 1 Rhian Daniel - - PowerPoint PPT Presentation

Counterfactual-based mediation analysis Workshop 1 Rhian Daniel London School of Hygiene and Tropical Medicine CIMPOD 27th February, 2017 Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 1/51 Setting the scene Case study


slide-1
SLIDE 1

Counterfactual-based mediation analysis Workshop 1

Rhian Daniel London School of Hygiene and Tropical Medicine CIMPOD 27th February, 2017

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 1/51

slide-2
SLIDE 2

Setting the scene Case study Q&A Wrapping up References

Outline

1

Setting the scene Introduction Traditional approach Causal inference gets involved —Estimands —Assumptions —Identification Interventional effects

2

Case study

3

Q&A

4

Wrapping up

5

References

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 2/51

slide-3
SLIDE 3

Setting the scene Case study Q&A Wrapping up References

Outline

1

Setting the scene Introduction Traditional approach Causal inference gets involved —Estimands —Assumptions —Identification Interventional effects

2

Case study

3

Q&A

4

Wrapping up

5

References

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 3/51

slide-4
SLIDE 4

Setting the scene Case study Q&A Wrapping up References

Outline

1

Setting the scene Introduction Traditional approach Causal inference gets involved —Estimands —Assumptions —Identification Interventional effects

2

Case study

3

Q&A

4

Wrapping up

5

References

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 4/51

slide-5
SLIDE 5

Setting the scene Case study Q&A Wrapping up References

Mediation

  • For nearly a century, statisticians, and researchers in many different

substantive disciplines, have been attempting to address questions concerning mediation.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 5/51

slide-6
SLIDE 6

Setting the scene Case study Q&A Wrapping up References

Mediation

  • For nearly a century, statisticians, and researchers in many different

substantive disciplines, have been attempting to address questions concerning mediation.

[Wright 1921, 1934; Baron and Kenny 1986; Robins and Greenland 1992; Pearl 2001; Cole and Hern´ an 2002; VanderWeele and Vansteelandt 2009; VanderWeele 2015.]

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 5/51

slide-7
SLIDE 7

Setting the scene Case study Q&A Wrapping up References

Mediation

  • For nearly a century, statisticians, and researchers in many different

substantive disciplines, have been attempting to address questions concerning mediation.

[Wright 1921, 1934; Baron and Kenny 1986; Robins and Greenland 1992; Pearl 2001; Cole and Hern´ an 2002; VanderWeele and Vansteelandt 2009; VanderWeele 2015.]

Alcohol intake GGT SBP

  • For example (today’s case study), how much of the effect of alcohol

consumption on systolic blood pressure is via GGT (gamma-glutamyl transpeptidase), a blood enzyme?

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 5/51

slide-8
SLIDE 8

Setting the scene Case study Q&A Wrapping up References

(Of course, things are rarely this simple. . . )

Alc1 Alc2 Alc3 GGT1 GGT2 GGT3 SBP1 SBP2 SBP3

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 6/51

slide-9
SLIDE 9

Setting the scene Case study Q&A Wrapping up References

(Of course, things are rarely this simple. . . )

Alc1 Alc2 Alc3 GGT1 GGT2 GGT3 SBP1 SBP2 SBP3 Aalen OO, RK, Gran JM, Kouyos R, Lange T (2014) Can we believe the DAGs? A comment on the relationship between causal DAGs and mechanisms SMMR, 25(5):2294–314. VanderWeele TJ, Tchetgen Tchetgen EJ Mediation analysis with time-varying exposures and mediators JRSS B, in press.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 6/51

slide-10
SLIDE 10

Setting the scene Case study Q&A Wrapping up References

Outline

1

Setting the scene Introduction Traditional approach Causal inference gets involved —Estimands —Assumptions —Identification Interventional effects

2

Case study

3

Q&A

4

Wrapping up

5

References

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 7/51

slide-11
SLIDE 11

Setting the scene Case study Q&A Wrapping up References

Traditional approach

Path tracing rules [Wright 1934]

X M Y

  • Originally, mediation analysis was only attempted using linear models.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 8/51

slide-12
SLIDE 12

Setting the scene Case study Q&A Wrapping up References

Traditional approach

Path tracing rules [Wright 1934]

X M Y

  • Originally, mediation analysis was only attempted using linear models.
  • Two models would be fitted:

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 8/51

slide-13
SLIDE 13

Setting the scene Case study Q&A Wrapping up References

Traditional approach

Path tracing rules [Wright 1934]

X M Y α1

  • Originally, mediation analysis was only attempted using linear models.
  • Two models would be fitted:

E(M|X) = α0 + α1X

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 8/51

slide-14
SLIDE 14

Setting the scene Case study Q&A Wrapping up References

Traditional approach

Path tracing rules [Wright 1934]

X M Y α1 β2 β1

  • Originally, mediation analysis was only attempted using linear models.
  • Two models would be fitted:

E(M|X) = α0 + α1X E(Y|X, M) = β0 + β1X + β2M

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 8/51

slide-15
SLIDE 15

Setting the scene Case study Q&A Wrapping up References

Traditional approach

Path tracing rules [Wright 1934]

X M Y α1 β2 β1

  • Originally, mediation analysis was only attempted using linear models.
  • Two models would be fitted:

E(M|X) = α0 + α1X E(Y|X, M) = β0 + β1X + β2M

  • β1 would then be labelled the direct effect.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 8/51

slide-16
SLIDE 16

Setting the scene Case study Q&A Wrapping up References

Traditional approach

Path tracing rules [Wright 1934]

X M Y α1 β2 β1

  • Originally, mediation analysis was only attempted using linear models.
  • Two models would be fitted:

E(M|X) = α0 + α1X E(Y|X, M) = β0 + β1X + β2M

  • β1 would then be labelled the direct effect.
  • And α1β2 the indirect effect.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 8/51

slide-17
SLIDE 17

Setting the scene Case study Q&A Wrapping up References

More complex diagrams

Path tracing rules [Wright 1934]

X M1 M2 M3 M4 M5 M6 M7 Y

  • This simple method extends to arbitrarily complex diagrams, as long

as all models are simple linear regressions (with no interaction terms).

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 9/51

slide-18
SLIDE 18

Setting the scene Case study Q&A Wrapping up References

More complex diagrams

Path tracing rules [Wright 1934]

X M1 M2 M3 M4 M5 M6 M7 Y

  • This simple method extends to arbitrarily complex diagrams, as long

as all models are simple linear regressions (with no interaction terms).

  • The path-specific effect along a particular pathway is equal to the

product of the coefficients along that path.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 9/51

slide-19
SLIDE 19

Setting the scene Case study Q&A Wrapping up References

Outline

1

Setting the scene Introduction Traditional approach Causal inference gets involved —Estimands —Assumptions —Identification Interventional effects

2

Case study

3

Q&A

4

Wrapping up

5

References

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 10/51

slide-20
SLIDE 20

Setting the scene Case study Q&A Wrapping up References

Causal inference ‘investigates’

  • In the early 1990s, the ‘causal inference’ school became interested in

this area [Robins and Greenland 1992].

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 11/51

slide-21
SLIDE 21

Setting the scene Case study Q&A Wrapping up References

Causal inference ‘investigates’

  • In the early 1990s, the ‘causal inference’ school became interested in

this area [Robins and Greenland 1992].

  • Mediation is a causal concept: associations are symmetric, but

mediation implies an ordered sequence.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 11/51

slide-22
SLIDE 22

Setting the scene Case study Q&A Wrapping up References

Causal inference ‘investigates’

  • In the early 1990s, the ‘causal inference’ school became interested in

this area [Robins and Greenland 1992].

  • Mediation is a causal concept: associations are symmetric, but

mediation implies an ordered sequence.

  • Core principles of causal inference: (1) what is the estimand? (2)

under what assumptions can it be identified? (3) are there more flexible estimation methods than currently used?

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 11/51

slide-23
SLIDE 23

Setting the scene Case study Q&A Wrapping up References

Potential outcomes and mediators

  • Let Y (x) be the value that Y would take if we intervened on X and

set it (possibly counter to fact) to the value x.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 12/51

slide-24
SLIDE 24

Setting the scene Case study Q&A Wrapping up References

Potential outcomes and mediators

  • Let Y (x) be the value that Y would take if we intervened on X and

set it (possibly counter to fact) to the value x.

  • Let Y (x, m) be the value that Y would take if we intervened

simultaneously on both X and M and set them to the values x and m.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 12/51

slide-25
SLIDE 25

Setting the scene Case study Q&A Wrapping up References

Potential outcomes and mediators

  • Let Y (x) be the value that Y would take if we intervened on X and

set it (possibly counter to fact) to the value x.

  • Let Y (x, m) be the value that Y would take if we intervened

simultaneously on both X and M and set them to the values x and m.

  • Let M (x) be the value that M would take if we intervened on X and

set it to x.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 12/51

slide-26
SLIDE 26

Setting the scene Case study Q&A Wrapping up References

Potential outcomes and mediators

  • Let Y (x) be the value that Y would take if we intervened on X and

set it (possibly counter to fact) to the value x.

  • Let Y (x, m) be the value that Y would take if we intervened

simultaneously on both X and M and set them to the values x and m.

  • Let M (x) be the value that M would take if we intervened on X and

set it to x.

  • Let Y {x, M (x∗)} be the value that Y would take if we intervened on

X and set it to x whilst simultaneously intervening on M and setting it to M (x∗), the value that M would take under an intervention setting X to x∗, where x and x∗ are not necessarily equal.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 12/51

slide-27
SLIDE 27

Setting the scene Case study Q&A Wrapping up References

Potential outcomes and mediators

  • Let Y (x) be the value that Y would take if we intervened on X and

set it (possibly counter to fact) to the value x.

  • Let Y (x, m) be the value that Y would take if we intervened

simultaneously on both X and M and set them to the values x and m.

  • Let M (x) be the value that M would take if we intervened on X and

set it to x.

  • Let Y {x, M (x∗)} be the value that Y would take if we intervened on

X and set it to x whilst simultaneously intervening on M and setting it to M (x∗), the value that M would take under an intervention setting X to x∗, where x and x∗ are not necessarily equal. These hypothetical quantities were used to create model-free definitions of direct/indirect effects that match our intuition.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 12/51

slide-28
SLIDE 28

Setting the scene Case study Q&A Wrapping up References

Outline

1

Setting the scene Introduction Traditional approach Causal inference gets involved —Estimands —Assumptions —Identification Interventional effects

2

Case study

3

Q&A

4

Wrapping up

5

References

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 13/51

slide-29
SLIDE 29

Setting the scene Case study Q&A Wrapping up References

Controlled direct effect

Pearl, 2001

  • The controlled direct effect of X on Y when M is controlled at m,

expressed as a marginal mean difference is CDE (m) = E {Y (1, m)} − E {Y (0, m)} .

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 14/51

slide-30
SLIDE 30

Setting the scene Case study Q&A Wrapping up References

Controlled direct effect

Pearl, 2001

  • The controlled direct effect of X on Y when M is controlled at m,

expressed as a marginal mean difference is CDE (m) = E {Y (1, m)} − E {Y (0, m)} .

  • This (as always with a causal contrast) is a comparison of two

(or more) hypothetical situations.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 14/51

slide-31
SLIDE 31

Setting the scene Case study Q&A Wrapping up References

Controlled direct effect

Pearl, 2001

  • The controlled direct effect of X on Y when M is controlled at m,

expressed as a marginal mean difference is CDE (m) = E {Y (1, m)} − E {Y (0, m)} .

  • This (as always with a causal contrast) is a comparison of two

(or more) hypothetical situations.

  • In the first, X is set to 1, and in the second X is set to 0. In

both situations, M is set to m.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 14/51

slide-32
SLIDE 32

Setting the scene Case study Q&A Wrapping up References

Controlled direct effect

Pearl, 2001

  • The controlled direct effect of X on Y when M is controlled at m,

expressed as a marginal mean difference is CDE (m) = E {Y (1, m)} − E {Y (0, m)} .

  • This (as always with a causal contrast) is a comparison of two

(or more) hypothetical situations.

  • In the first, X is set to 1, and in the second X is set to 0. In

both situations, M is set to m.

  • By keeping M fixed at m, we are getting at a direct effect of X,

unmediated by M.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 14/51

slide-33
SLIDE 33

Setting the scene Case study Q&A Wrapping up References

Controlled direct effect

Pearl, 2001

  • The controlled direct effect of X on Y when M is controlled at m,

expressed as a marginal mean difference is CDE (m) = E {Y (1, m)} − E {Y (0, m)} .

  • This (as always with a causal contrast) is a comparison of two

(or more) hypothetical situations.

  • In the first, X is set to 1, and in the second X is set to 0. In

both situations, M is set to m.

  • By keeping M fixed at m, we are getting at a direct effect of X,

unmediated by M.

  • In our example, it is the change in mean SBP if everyone vs

noone drinks, with everyone having their GGT fixed to a common value, m.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 14/51

slide-34
SLIDE 34

Setting the scene Case study Q&A Wrapping up References

Natural direct effect

Pearl 2001; Robins and Greenland 1992

  • The natural direct effect of X on Y expressed as a marginal mean

difference is NDE = E [Y {1, M (0)}] − E [Y {0, M (0)}] .

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 15/51

slide-35
SLIDE 35

Setting the scene Case study Q&A Wrapping up References

Natural direct effect

Pearl 2001; Robins and Greenland 1992

  • The natural direct effect of X on Y expressed as a marginal mean

difference is NDE = E [Y {1, M (0)}] − E [Y {0, M (0)}] .

  • This is again a comparison of two hypothetical situations.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 15/51

slide-36
SLIDE 36

Setting the scene Case study Q&A Wrapping up References

Natural direct effect

Pearl 2001; Robins and Greenland 1992

  • The natural direct effect of X on Y expressed as a marginal mean

difference is NDE = E [Y {1, M (0)}] − E [Y {0, M (0)}] .

  • This is again a comparison of two hypothetical situations.
  • In the first, X is set to 1, and in the second X is set to 0.

In both, M is set to M (0), its value if X were set to 0.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 15/51

slide-37
SLIDE 37

Setting the scene Case study Q&A Wrapping up References

Natural direct effect

Pearl 2001; Robins and Greenland 1992

  • The natural direct effect of X on Y expressed as a marginal mean

difference is NDE = E [Y {1, M (0)}] − E [Y {0, M (0)}] .

  • This is again a comparison of two hypothetical situations.
  • In the first, X is set to 1, and in the second X is set to 0.

In both, M is set to M (0), its value if X were set to 0.

  • Since M is the same (within subject) in both situations, we are

also intuitively getting at a direct effect of X.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 15/51

slide-38
SLIDE 38

Setting the scene Case study Q&A Wrapping up References

Natural direct effect

Pearl 2001; Robins and Greenland 1992

  • The natural direct effect of X on Y expressed as a marginal mean

difference is NDE = E [Y {1, M (0)}] − E [Y {0, M (0)}] .

  • This is again a comparison of two hypothetical situations.
  • In the first, X is set to 1, and in the second X is set to 0.

In both, M is set to M (0), its value if X were set to 0.

  • Since M is the same (within subject) in both situations, we are

also intuitively getting at a direct effect of X.

  • If no individual-level interaction between X and M,

CDE (m) = NDE ∀m.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 15/51

slide-39
SLIDE 39

Setting the scene Case study Q&A Wrapping up References

Natural direct effect

Pearl 2001; Robins and Greenland 1992

  • The natural direct effect of X on Y expressed as a marginal mean

difference is NDE = E [Y {1, M (0)}] − E [Y {0, M (0)}] .

  • This is again a comparison of two hypothetical situations.
  • In the first, X is set to 1, and in the second X is set to 0.

In both, M is set to M (0), its value if X were set to 0.

  • Since M is the same (within subject) in both situations, we are

also intuitively getting at a direct effect of X.

  • If no individual-level interaction between X and M,

CDE (m) = NDE ∀m.

  • It is the change in mean SBP if everyone vs noone drinks, with

each individual’s GGT fixed at what it would have been for that person under no drinking.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 15/51

slide-40
SLIDE 40

Setting the scene Case study Q&A Wrapping up References

Natural indirect effect

Pearl 2001; Robins and Greenland 1992

  • The natural indirect effect of X on Y is

NIE = E [Y {1, M (1)}] − E [Y {1, M (0)}] .

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 16/51

slide-41
SLIDE 41

Setting the scene Case study Q&A Wrapping up References

Natural indirect effect

Pearl 2001; Robins and Greenland 1992

  • The natural indirect effect of X on Y is

NIE = E [Y {1, M (1)}] − E [Y {1, M (0)}] .

  • This is a comparison of two hypothetical situations.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 16/51

slide-42
SLIDE 42

Setting the scene Case study Q&A Wrapping up References

Natural indirect effect

Pearl 2001; Robins and Greenland 1992

  • The natural indirect effect of X on Y is

NIE = E [Y {1, M (1)}] − E [Y {1, M (0)}] .

  • This is a comparison of two hypothetical situations.
  • In the first, M is set to M (1) and in the second M is set to

M (0). In both, X is set to 1.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 16/51

slide-43
SLIDE 43

Setting the scene Case study Q&A Wrapping up References

Natural indirect effect

Pearl 2001; Robins and Greenland 1992

  • The natural indirect effect of X on Y is

NIE = E [Y {1, M (1)}] − E [Y {1, M (0)}] .

  • This is a comparison of two hypothetical situations.
  • In the first, M is set to M (1) and in the second M is set to

M (0). In both, X is set to 1.

  • X is allowed to influence Y only through its influence on M.

Thus it intuitively corresponds to an indirect effect through M.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 16/51

slide-44
SLIDE 44

Setting the scene Case study Q&A Wrapping up References

Natural indirect effect

Pearl 2001; Robins and Greenland 1992

  • The natural indirect effect of X on Y is

NIE = E [Y {1, M (1)}] − E [Y {1, M (0)}] .

  • This is a comparison of two hypothetical situations.
  • In the first, M is set to M (1) and in the second M is set to

M (0). In both, X is set to 1.

  • X is allowed to influence Y only through its influence on M.

Thus it intuitively corresponds to an indirect effect through M.

  • It is the change in mean SBP we would see if we changed

everyone’s GGT from its non-drinking level to its drinking level, whilst fixing the exposure to ‘drinking’.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 16/51

slide-45
SLIDE 45

Setting the scene Case study Q&A Wrapping up References

Effect decomposition

The sum of the natural direct and indirect effects is NDE + NIE = E [Y {1, M (0)}] − E [Y {0, M (0)}] + E [Y {1, M (1)}] − E [Y {1, M (0)}] = E [Y {1, M (1)}] − E [Y {0, M (0)}] = TCE, the total causal effect of X on Y.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 17/51

slide-46
SLIDE 46

Setting the scene Case study Q&A Wrapping up References

Effect decomposition

The sum of the natural direct and indirect effects is NDE + NIE = E [Y {1, M (0)}] − E [Y {0, M (0)}] + E [Y {1, M (1)}] − E [Y {1, M (0)}] = E [Y {1, M (1)}] − E [Y {0, M (0)}] = TCE, the total causal effect of X on Y.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 17/51

slide-47
SLIDE 47

Setting the scene Case study Q&A Wrapping up References

Effect decomposition

The sum of the natural direct and indirect effects is NDE + NIE = E [Y {1, M (0)}] − E [Y {0, M (0)}] + E [Y {1, M (1)}] − E [Y {1, M (0)}] = E [Y {1, M (1)}] − E [Y {0, M (0)}] = TCE, the total causal effect of X on Y.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 17/51

slide-48
SLIDE 48

Setting the scene Case study Q&A Wrapping up References

Effect decomposition

The sum of the natural direct and indirect effects is NDE + NIE = E [Y {1, M (0)}] − E [Y {0, M (0)}] + E [Y {1, M (1)}] − E [Y {1, M (0)}] = E [Y {1, M (1)}] − E [Y {0, M (0)}] = TCE, the total causal effect of X on Y. Note that such a sensible decomposition is not possible using the CDE.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 17/51

slide-49
SLIDE 49

Setting the scene Case study Q&A Wrapping up References

Outline

1

Setting the scene Introduction Traditional approach Causal inference gets involved —Estimands —Assumptions —Identification Interventional effects

2

Case study

3

Q&A

4

Wrapping up

5

References

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 18/51

slide-50
SLIDE 50

Setting the scene Case study Q&A Wrapping up References

Assumptions for identification (1)

X M Y C L

  • Consider the setting with baseline confounders C and intermediate

confounders L.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 19/51

slide-51
SLIDE 51

Setting the scene Case study Q&A Wrapping up References

Assumptions for identification (1)

X M Y C L

  • Consider the setting with baseline confounders C and intermediate

confounders L.

  • Sufficient assumptions under which NDE and NIE can be identified:

first, technical assumptions of no interference and consistency.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 19/51

slide-52
SLIDE 52

Setting the scene Case study Q&A Wrapping up References

Assumptions for identification (1)

X M Y C L

  • Consider the setting with baseline confounders C and intermediate

confounders L.

  • Sufficient assumptions under which NDE and NIE can be identified:

first, technical assumptions of no interference and consistency.

  • Then there are sequential conditional exchangeability assumptions:

Y(x, m) ⊥ ⊥ X |C = c , ∀x, m, c Y(x, m) ⊥ ⊥ M |C = c, X = x, L = l , ∀x, m, c, l

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 19/51

slide-53
SLIDE 53

Setting the scene Case study Q&A Wrapping up References

Assumptions for identification (1)

X M Y C L U1

  • Consider the setting with baseline confounders C and intermediate

confounders L.

  • Sufficient assumptions under which NDE and NIE can be identified:

first, technical assumptions of no interference and consistency.

  • Then there are sequential conditional exchangeability assumptions:

Y(x, m) ⊥ ⊥ X |C = c , ∀x, m, c Y(x, m) ⊥ ⊥ M |C = c, X = x, L = l , ∀x, m, c, l

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 19/51

slide-54
SLIDE 54

Setting the scene Case study Q&A Wrapping up References

Assumptions for identification (1)

X M Y C L U1 U2

  • Consider the setting with baseline confounders C and intermediate

confounders L.

  • Sufficient assumptions under which NDE and NIE can be identified:

first, technical assumptions of no interference and consistency.

  • Then there are sequential conditional exchangeability assumptions:

Y(x, m) ⊥ ⊥ X |C = c , ∀x, m, c Y(x, m) ⊥ ⊥ M |C = c, X = x, L = l , ∀x, m, c, l

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 19/51

slide-55
SLIDE 55

Setting the scene Case study Q&A Wrapping up References

Assumptions for identification (2)

X M Y C L U1 U2

  • And:

M(x) ⊥ ⊥ X |C = c , ∀x, c

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 20/51

slide-56
SLIDE 56

Setting the scene Case study Q&A Wrapping up References

Assumptions for identification (2)

X M Y C L U1 U2 U3

  • And:

M(x) ⊥ ⊥ X |C = c , ∀x, c

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 20/51

slide-57
SLIDE 57

Setting the scene Case study Q&A Wrapping up References

Assumptions for identification (2)

X M Y C L U1 U2 U3

  • And:

M(x) ⊥ ⊥ X |C = c , ∀x, c This much, we would probably expect!

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 20/51

slide-58
SLIDE 58

Setting the scene Case study Q&A Wrapping up References

Assumptions for identification (3)

X M Y C L U1 U2 U3

  • Perhaps surprisingly, these assumptions (although sufficient for the

CDE) are not enough for NDE/NIE.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 21/51

slide-59
SLIDE 59

Setting the scene Case study Q&A Wrapping up References

Assumptions for identification (3)

X M Y C L U1 U2 U3

  • Perhaps surprisingly, these assumptions (although sufficient for the

CDE) are not enough for NDE/NIE.

  • In addition, we need something such as the cross-world

independence assumption: M(x∗) ⊥ ⊥ Y(x, m) |C = c , ∀x, m, x∗, c

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 21/51

slide-60
SLIDE 60

Setting the scene Case study Q&A Wrapping up References

Assumptions for identification (3)

X M Y C L U1 U2 U3

  • Perhaps surprisingly, these assumptions (although sufficient for the

CDE) are not enough for NDE/NIE.

  • In addition, we need something such as the cross-world

independence assumption: M(x∗) ⊥ ⊥ Y(x, m) |C = c , ∀x, m, x∗, c

  • This implies (but is not implied by, ie it is stronger than) no L.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 21/51

slide-61
SLIDE 61

Setting the scene Case study Q&A Wrapping up References

Relaxing the cross-world independence assumption

  • The cross-world independence assumption

M(x∗) ⊥ ⊥ Y(x, m) |C = c , ∀x, m, x∗, c rules out intermediate confounders L.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 22/51

slide-62
SLIDE 62

Setting the scene Case study Q&A Wrapping up References

Relaxing the cross-world independence assumption

  • The cross-world independence assumption

M(x∗) ⊥ ⊥ Y(x, m) |C = c , ∀x, m, x∗, c rules out intermediate confounders L.

  • In fact, a slightly weaker assumption, which does not rule out L is

sufficient: E{Y(1, m)−Y(0, m) |C = c, M(0) = m} = E{Y(1, m)−Y(0, m) |C = c}

[Petersen et al 2006]

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 22/51

slide-63
SLIDE 63

Setting the scene Case study Q&A Wrapping up References

Relaxing the cross-world independence assumption

  • The cross-world independence assumption

M(x∗) ⊥ ⊥ Y(x, m) |C = c , ∀x, m, x∗, c rules out intermediate confounders L.

  • In fact, a slightly weaker assumption, which does not rule out L is

sufficient: E{Y(1, m)−Y(0, m) |C = c, M(0) = m} = E{Y(1, m)−Y(0, m) |C = c}

[Petersen et al 2006]

  • Both assumptions are very strong, and not even a hypothetical

experiment exists in which they would hold by design.

[Richardson and Robins 2013]

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 22/51

slide-64
SLIDE 64

Setting the scene Case study Q&A Wrapping up References

Relaxing the cross-world independence assumption

  • The cross-world independence assumption

M(x∗) ⊥ ⊥ Y(x, m) |C = c , ∀x, m, x∗, c rules out intermediate confounders L.

  • In fact, a slightly weaker assumption, which does not rule out L is

sufficient: E{Y(1, m)−Y(0, m) |C = c, M(0) = m} = E{Y(1, m)−Y(0, m) |C = c}

[Petersen et al 2006]

  • Both assumptions are very strong, and not even a hypothetical

experiment exists in which they would hold by design.

[Richardson and Robins 2013]

  • Even the Petersen assumption places strong parametric restrictions
  • n the relationship between L and Y, which can essentially only hold

in linear models with no non-linearities involving L.

[De Stavola et al 2015]

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 22/51

slide-65
SLIDE 65

Setting the scene Case study Q&A Wrapping up References

Outline

1

Setting the scene Introduction Traditional approach Causal inference gets involved —Estimands —Assumptions —Identification Interventional effects

2

Case study

3

Q&A

4

Wrapping up

5

References

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 23/51

slide-66
SLIDE 66

Setting the scene Case study Q&A Wrapping up References

Identification (1)

Pearl 2001

  • Identifying E[Y{x, M(x∗)}] is sufficient for identifying the NDE and

NIE.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 24/51

slide-67
SLIDE 67

Setting the scene Case study Q&A Wrapping up References

Identification (1)

Pearl 2001

  • Identifying E[Y{x, M(x∗)}] is sufficient for identifying the NDE and

NIE.

  • First we write:

E[Y{x, M(x∗)}] =

  • c,m

E{Y(x, m) |C = c, M(x∗) = m}P{M(x∗) = m|C = c}P{C = c}

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 24/51

slide-68
SLIDE 68

Setting the scene Case study Q&A Wrapping up References

Identification (1)

Pearl 2001

  • Identifying E[Y{x, M(x∗)}] is sufficient for identifying the NDE and

NIE.

  • First we write:

E[Y{x, M(x∗)}] =

  • c,m

E{Y(x, m) |C = c, M(x∗) = m}P{M(x∗) = m|C = c}P{C = c}

  • By the cross-world independence assumption, this is equal to:
  • c,m

E{Y(x, m) |C = c}P{M(x∗) = m|C = c}P{C = c}

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 24/51

slide-69
SLIDE 69

Setting the scene Case study Q&A Wrapping up References

Identification (1)

Pearl 2001

  • Identifying E[Y{x, M(x∗)}] is sufficient for identifying the NDE and

NIE.

  • First we write:

E[Y{x, M(x∗)}] =

  • c,m

E{Y(x, m) |C = c, M(x∗) = m}P{M(x∗) = m|C = c}P{C = c}

  • By the cross-world independence assumption, this is equal to:
  • c,m

E{Y(x, m) |C = c}P{M(x∗) = m|C = c}P{C = c}

  • By conditional exchangeability, this is:
  • c,m

E{Y(x, m) |X = x, M = m, C = c}P{M(x∗) = m|X = x∗, C = c}P{C = c}

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 24/51

slide-70
SLIDE 70

Setting the scene Case study Q&A Wrapping up References

Identification (2)

Pearl 2001

  • c,m

E{Y(x, m) |X = x, M = m, C = c}P{M(x∗) = m|X = x∗, C = c}P{C = c}

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 25/51

slide-71
SLIDE 71

Setting the scene Case study Q&A Wrapping up References

Identification (2)

Pearl 2001

  • c,m

E{Y(x, m) |X = x, M = m, C = c}P{M(x∗) = m|X = x∗, C = c}P{C = c}

  • By consistency, this is:
  • c,m

E{Y |X = x, M = m, C = c}P{M = m|X = x∗, C = c}P{C = c}

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 25/51

slide-72
SLIDE 72

Setting the scene Case study Q&A Wrapping up References

Identification (2)

Pearl 2001

  • c,m

E{Y(x, m) |X = x, M = m, C = c}P{M(x∗) = m|X = x∗, C = c}P{C = c}

  • By consistency, this is:
  • c,m

E{Y |X = x, M = m, C = c}P{M = m|X = x∗, C = c}P{C = c}

  • The hard work is now done.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 25/51

slide-73
SLIDE 73

Setting the scene Case study Q&A Wrapping up References

Identification (2)

Pearl 2001

  • c,m

E{Y(x, m) |X = x, M = m, C = c}P{M(x∗) = m|X = x∗, C = c}P{C = c}

  • By consistency, this is:
  • c,m

E{Y |X = x, M = m, C = c}P{M = m|X = x∗, C = c}P{C = c}

  • The hard work is now done.
  • By substituting different values for x and x∗, we can re-write

the NDE and the NIE using only functions of aspects of the distribution of the observed data.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 25/51

slide-74
SLIDE 74

Setting the scene Case study Q&A Wrapping up References

Identification (2)

Pearl 2001

  • c,m

E{Y(x, m) |X = x, M = m, C = c}P{M(x∗) = m|X = x∗, C = c}P{C = c}

  • By consistency, this is:
  • c,m

E{Y |X = x, M = m, C = c}P{M = m|X = x∗, C = c}P{C = c}

  • The hard work is now done.
  • By substituting different values for x and x∗, we can re-write

the NDE and the NIE using only functions of aspects of the distribution of the observed data.

  • Plug-in or alternative (semiparametric) estimation could then

be used. Many many proposals have been made!

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 25/51

slide-75
SLIDE 75

Setting the scene Case study Q&A Wrapping up References

Summary so far

  • Mediation analysis is not new.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 26/51

slide-76
SLIDE 76

Setting the scene Case study Q&A Wrapping up References

Summary so far

  • Mediation analysis is not new.
  • When all models are linear (with no interactions) quite complicated

structures can be incorporated and path-specific effects estimated.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 26/51

slide-77
SLIDE 77

Setting the scene Case study Q&A Wrapping up References

Summary so far

  • Mediation analysis is not new.
  • When all models are linear (with no interactions) quite complicated

structures can be incorporated and path-specific effects estimated.

  • However, in the traditional approach, it was unclear what exactly was

being estimated, under what assumptions this was possible, and how things could be extended to non-linear settings.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 26/51

slide-78
SLIDE 78

Setting the scene Case study Q&A Wrapping up References

Summary so far

  • Mediation analysis is not new.
  • When all models are linear (with no interactions) quite complicated

structures can be incorporated and path-specific effects estimated.

  • However, in the traditional approach, it was unclear what exactly was

being estimated, under what assumptions this was possible, and how things could be extended to non-linear settings.

  • The causal inference literature has addressed many of these

concerns by giving unambiguous counterfactual definitions of direct and indirect effects that are independent of any model, and by deriving clear identification assumptions.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 26/51

slide-79
SLIDE 79

Setting the scene Case study Q&A Wrapping up References

Summary so far

  • Mediation analysis is not new.
  • When all models are linear (with no interactions) quite complicated

structures can be incorporated and path-specific effects estimated.

  • However, in the traditional approach, it was unclear what exactly was

being estimated, under what assumptions this was possible, and how things could be extended to non-linear settings.

  • The causal inference literature has addressed many of these

concerns by giving unambiguous counterfactual definitions of direct and indirect effects that are independent of any model, and by deriving clear identification assumptions.

  • The identification expressions can be used to estimate direct and

indirect effects in the presence of non-linearities, and thus have greatly increased the flexibility of mediation analysis.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 26/51

slide-80
SLIDE 80

Setting the scene Case study Q&A Wrapping up References

Summary so far

  • Mediation analysis is not new.
  • When all models are linear (with no interactions) quite complicated

structures can be incorporated and path-specific effects estimated.

  • However, in the traditional approach, it was unclear what exactly was

being estimated, under what assumptions this was possible, and how things could be extended to non-linear settings.

  • The causal inference literature has addressed many of these

concerns by giving unambiguous counterfactual definitions of direct and indirect effects that are independent of any model, and by deriving clear identification assumptions.

  • The identification expressions can be used to estimate direct and

indirect effects in the presence of non-linearities, and thus have greatly increased the flexibility of mediation analysis.

  • However, it is plagued by the strength of the cross-world/Petersen

assumptions; in particular, the fact that these assumptions almost rules out intermediate confounding even when measured.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 26/51

slide-81
SLIDE 81

Setting the scene Case study Q&A Wrapping up References

Consequences for multiple mediators

X M Y C L

  • For the same reason that in general we can’t have L . . .

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 27/51

slide-82
SLIDE 82

Setting the scene Case study Q&A Wrapping up References

Consequences for multiple mediators

X M2 Y C M1

  • For the same reason that in general we can’t have L . . .
  • . . . settings involving multiple mediators are also problematic.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 27/51

slide-83
SLIDE 83

Setting the scene Case study Q&A Wrapping up References

Consequences for multiple mediators

Alc GGT SBP C Adiposity

  • For the same reason that in general we can’t have L . . .
  • . . . settings involving multiple mediators are also problematic.
  • eg in our motivating example.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 27/51

slide-84
SLIDE 84

Setting the scene Case study Q&A Wrapping up References

Outline

1

Setting the scene Introduction Traditional approach Causal inference gets involved —Estimands —Assumptions —Identification Interventional effects

2

Case study

3

Q&A

4

Wrapping up

5

References

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 28/51

slide-85
SLIDE 85

Setting the scene Case study Q&A Wrapping up References

Randomised interventional analogues of NDE/NIE

VanderWeele et al 2014

  • The randomised interventional analogue of the NDE is

RIA-NDE = E

  • Y
  • 1, M∗

0|C

  • −E
  • Y
  • 0, M∗

0|C

  • where M∗

x|C is a random draw from the distribution of M among those

with X = x conditional on C.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 29/51

slide-86
SLIDE 86

Setting the scene Case study Q&A Wrapping up References

Randomised interventional analogues of NDE/NIE

VanderWeele et al 2014

  • The randomised interventional analogue of the NDE is

RIA-NDE = E

  • Y
  • 1, M∗

0|C

  • −E
  • Y
  • 0, M∗

0|C

  • where M∗

x|C is a random draw from the distribution of M among those

with X = x conditional on C.

  • The randomised interventional analogue of the NIE of X on Y

expressed as a marginal mean difference is RIA-NIE = E

  • Y
  • 1, M∗

1|C

  • −E
  • Y
  • 1, M∗

0|C

  • .

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 29/51

slide-87
SLIDE 87

Setting the scene Case study Q&A Wrapping up References

Randomised interventional analogues of NDE/NIE

VanderWeele et al 2014

  • The randomised interventional analogue of the NDE is

RIA-NDE = E

  • Y
  • 1, M∗

0|C

  • −E
  • Y
  • 0, M∗

0|C

  • where M∗

x|C is a random draw from the distribution of M among those

with X = x conditional on C.

  • The randomised interventional analogue of the NIE of X on Y

expressed as a marginal mean difference is RIA-NIE = E

  • Y
  • 1, M∗

1|C

  • −E
  • Y
  • 1, M∗

0|C

  • .
  • The RIA-NDE is the effect on the mean SBP of changing everyone’s

drinking status, whilst leaving each subject’s GGT at a random draw from the distribution of GGT given that subject’s background confounder levels, amongst the non-drinkers.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 29/51

slide-88
SLIDE 88

Setting the scene Case study Q&A Wrapping up References

Randomised interventional analogues of NDE/NIE

VanderWeele et al 2014

  • The randomised interventional analogue of the NDE is

RIA-NDE = E

  • Y
  • 1, M∗

0|C

  • −E
  • Y
  • 0, M∗

0|C

  • where M∗

x|C is a random draw from the distribution of M among those

with X = x conditional on C.

  • The randomised interventional analogue of the NIE of X on Y

expressed as a marginal mean difference is RIA-NIE = E

  • Y
  • 1, M∗

1|C

  • −E
  • Y
  • 1, M∗

0|C

  • .
  • The RIA-NDE is the effect on the mean SBP of changing everyone’s

drinking status, whilst leaving each subject’s GGT at a random draw from the distribution of GGT given that subject’s background confounder levels, amongst the non-drinkers.

  • The RIA-NIE is the effect on mean SBP of shifting the GGT

distribution given confounders from that seen in non-drinkers to that seen in drinkers, whilst setting everyone’s exposure to ‘drinking’.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 29/51

slide-89
SLIDE 89

Setting the scene Case study Q&A Wrapping up References

Advantages and disadvantages

  • The RIA-NDE and RIA-NIE can be identified under the no

interference, consistency and conditional exchangeability assumptions mentioned earlier, but without the additional cross-world (or Petersen) assumption.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 30/51

slide-90
SLIDE 90

Setting the scene Case study Q&A Wrapping up References

Advantages and disadvantages

  • The RIA-NDE and RIA-NIE can be identified under the no

interference, consistency and conditional exchangeability assumptions mentioned earlier, but without the additional cross-world (or Petersen) assumption.

  • Intuitively, the 1st identification step (which is where the cross-world

assumption came in) is removed, and the estimand is changed to the quantity in the 2nd line of the identification.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 30/51

slide-91
SLIDE 91

Setting the scene Case study Q&A Wrapping up References

Advantages and disadvantages

  • The RIA-NDE and RIA-NIE can be identified under the no

interference, consistency and conditional exchangeability assumptions mentioned earlier, but without the additional cross-world (or Petersen) assumption.

  • Intuitively, the 1st identification step (which is where the cross-world

assumption came in) is removed, and the estimand is changed to the quantity in the 2nd line of the identification.

  • If the cross-world assumption does hold, then NDE=RIA-NDE.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 30/51

slide-92
SLIDE 92

Setting the scene Case study Q&A Wrapping up References

Advantages and disadvantages

  • The RIA-NDE and RIA-NIE can be identified under the no

interference, consistency and conditional exchangeability assumptions mentioned earlier, but without the additional cross-world (or Petersen) assumption.

  • Intuitively, the 1st identification step (which is where the cross-world

assumption came in) is removed, and the estimand is changed to the quantity in the 2nd line of the identification.

  • If the cross-world assumption does hold, then NDE=RIA-NDE.
  • If not, then the stronger C predicts M, the smaller the difference

between NDE and RIA-NDE.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 30/51

slide-93
SLIDE 93

Setting the scene Case study Q&A Wrapping up References

Advantages and disadvantages

  • The RIA-NDE and RIA-NIE can be identified under the no

interference, consistency and conditional exchangeability assumptions mentioned earlier, but without the additional cross-world (or Petersen) assumption.

  • Intuitively, the 1st identification step (which is where the cross-world

assumption came in) is removed, and the estimand is changed to the quantity in the 2nd line of the identification.

  • If the cross-world assumption does hold, then NDE=RIA-NDE.
  • If not, then the stronger C predicts M, the smaller the difference

between NDE and RIA-NDE.

  • RIA effects correspond to interventions that could in principle be

done.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 30/51

slide-94
SLIDE 94

Setting the scene Case study Q&A Wrapping up References

Advantages and disadvantages

  • The RIA-NDE and RIA-NIE can be identified under the no

interference, consistency and conditional exchangeability assumptions mentioned earlier, but without the additional cross-world (or Petersen) assumption.

  • Intuitively, the 1st identification step (which is where the cross-world

assumption came in) is removed, and the estimand is changed to the quantity in the 2nd line of the identification.

  • If the cross-world assumption does hold, then NDE=RIA-NDE.
  • If not, then the stronger C predicts M, the smaller the difference

between NDE and RIA-NDE.

  • RIA effects correspond to interventions that could in principle be

done.

  • However, RIA-NDE + RIA-NIE =

E

  • Y
  • 1, M∗

1|C

  • −E
  • Y
  • 0, M∗

0|C

  • which is NOT in general equal to the total causal effect!

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 30/51

slide-95
SLIDE 95

Setting the scene Case study Q&A Wrapping up References

Summary

  • Mediation analysis, although intuitive and with a long history, is a

surprisingly subtle business as soon as there are any non-linearities in the picture.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 31/51

slide-96
SLIDE 96

Setting the scene Case study Q&A Wrapping up References

Summary

  • Mediation analysis, although intuitive and with a long history, is a

surprisingly subtle business as soon as there are any non-linearities in the picture.

  • Advances thanks to the field of causal inference have greatly clarified

these subtleties, giving rise to clear estimands that capture the notions of direct and indirect effects, clear assumptions under which these can be identified, and flexible estimation methods.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 31/51

slide-97
SLIDE 97

Setting the scene Case study Q&A Wrapping up References

Summary

  • Mediation analysis, although intuitive and with a long history, is a

surprisingly subtle business as soon as there are any non-linearities in the picture.

  • Advances thanks to the field of causal inference have greatly clarified

these subtleties, giving rise to clear estimands that capture the notions of direct and indirect effects, clear assumptions under which these can be identified, and flexible estimation methods.

  • However, this endeavour has been limited by the extremely strong

and untestable cross-world assumption.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 31/51

slide-98
SLIDE 98

Setting the scene Case study Q&A Wrapping up References

Summary

  • Mediation analysis, although intuitive and with a long history, is a

surprisingly subtle business as soon as there are any non-linearities in the picture.

  • Advances thanks to the field of causal inference have greatly clarified

these subtleties, giving rise to clear estimands that capture the notions of direct and indirect effects, clear assumptions under which these can be identified, and flexible estimation methods.

  • However, this endeavour has been limited by the extremely strong

and untestable cross-world assumption.

  • This has effectively prohibited flexible multiple mediation analyses,

even though applied problems frequently involve multiple mediators.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 31/51

slide-99
SLIDE 99

Setting the scene Case study Q&A Wrapping up References

Summary

  • Mediation analysis, although intuitive and with a long history, is a

surprisingly subtle business as soon as there are any non-linearities in the picture.

  • Advances thanks to the field of causal inference have greatly clarified

these subtleties, giving rise to clear estimands that capture the notions of direct and indirect effects, clear assumptions under which these can be identified, and flexible estimation methods.

  • However, this endeavour has been limited by the extremely strong

and untestable cross-world assumption.

  • This has effectively prohibited flexible multiple mediation analyses,

even though applied problems frequently involve multiple mediators.

  • Interventional effects are perhaps the way forward, since they don’t

require this cross-world assumption.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 31/51

slide-100
SLIDE 100

Setting the scene Case study Q&A Wrapping up References

Outline

1

Setting the scene Introduction Traditional approach Causal inference gets involved —Estimands —Assumptions —Identification Interventional effects

2

Case study

3

Q&A

4

Wrapping up

5

References

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 32/51

slide-101
SLIDE 101

Setting the scene Case study Q&A Wrapping up References

Data description

  • We now turn to the case study.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 33/51

slide-102
SLIDE 102

Setting the scene Case study Q&A Wrapping up References

Data description

  • We now turn to the case study.
  • The dataset for this case study (Pseudo Izhevsk.dta) has been

simulated to be similar to, but a simplified version of, the data from the Izhevsk Family Study.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 33/51

slide-103
SLIDE 103

Setting the scene Case study Q&A Wrapping up References

Data description

  • We now turn to the case study.
  • The dataset for this case study (Pseudo Izhevsk.dta) has been

simulated to be similar to, but a simplified version of, the data from the Izhevsk Family Study.

  • A case–control study to study the effects of extreme alcohol

consumption on mortality in men in Izhevsk, Russia.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 33/51

slide-104
SLIDE 104

Setting the scene Case study Q&A Wrapping up References

Data description

  • We now turn to the case study.
  • The dataset for this case study (Pseudo Izhevsk.dta) has been

simulated to be similar to, but a simplified version of, the data from the Izhevsk Family Study.

  • A case–control study to study the effects of extreme alcohol

consumption on mortality in men in Izhevsk, Russia.

  • We’ll analyse simulated data that mimic the population-based

controls, and use these men to estimate the effect of drinking more than 10L of ethanol in the previous year on SBP , and the extent to which this effect is mediated by GGT.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 33/51

slide-105
SLIDE 105

Setting the scene Case study Q&A Wrapping up References

Data description

  • We now turn to the case study.
  • The dataset for this case study (Pseudo Izhevsk.dta) has been

simulated to be similar to, but a simplified version of, the data from the Izhevsk Family Study.

  • A case–control study to study the effects of extreme alcohol

consumption on mortality in men in Izhevsk, Russia.

  • We’ll analyse simulated data that mimic the population-based

controls, and use these men to estimate the effect of drinking more than 10L of ethanol in the previous year on SBP , and the extent to which this effect is mediated by GGT.

  • Background confounders: age, SES, smoking status (never, ex,

current). Intermediate confounder: BMI.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 33/51

slide-106
SLIDE 106

Setting the scene Case study Q&A Wrapping up References

Data description

  • We now turn to the case study.
  • The dataset for this case study (Pseudo Izhevsk.dta) has been

simulated to be similar to, but a simplified version of, the data from the Izhevsk Family Study.

  • A case–control study to study the effects of extreme alcohol

consumption on mortality in men in Izhevsk, Russia.

  • We’ll analyse simulated data that mimic the population-based

controls, and use these men to estimate the effect of drinking more than 10L of ethanol in the previous year on SBP , and the extent to which this effect is mediated by GGT.

  • Background confounders: age, SES, smoking status (never, ex,

current). Intermediate confounder: BMI.

  • For simplicity for this workshop, we have dropped the variable

containing the number of cigarettes smoked per day, and we haven’t simulated any data to be missing (whereas in the paper, we used single stochastic imputation for the missing values).

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 33/51

slide-107
SLIDE 107

Setting the scene Case study Q&A Wrapping up References

Tasks

Question 1 Familiarise yourselves with the dataset and check the distribu- tion of BMI and GGT. Might log transformations be sensible?

For help with Stata syntax, see CaseStudy1 Q1.do.

Question 2 Investigate, using traditional mediation analysis, the extent to which the effect of alcohol on SBP is mediated by GGT.

You should take into account the background confounders age, SES and smoking, but you should ignore BMI for now, since it is an inter- mediate confounder (we will come back to it in Question 4). For help with Stata syntax, see CaseStudy1 Q2.do.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 34/51

slide-108
SLIDE 108

Setting the scene Case study Q&A Wrapping up References

Tasks

Question 3 (a) Now repeat the same analysis using the paramed com- mand in Stata. You may need to start by installing paramed: findit paramed The syntax for continuous outcome y, continuous mediator m, binary exposure x, and background confounders c1 and c2, with both models simple linear regression, is: paramed y, avar(x) mvar(m) a0(0) a1(1) m(3) yreg(linear) mreg(linear) cvars(c1 c2) nointeraction

For more help with the Stata syntax, see CaseStudy1 Q3.do.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 35/51

slide-109
SLIDE 109

Setting the scene Case study Q&A Wrapping up References

Tasks

Question 3 (cont’d) (b) Now repeat the same analysis, but this time allowing there to be an exposure–mediator interaction. This can be done sim- ply by removing nointeraction from the command in part (a). Do you understand the output? Does the interaction seem im- portant? Do you understand why the nde was not given in the

  • utput for part (a)?

For more help with the Stata syntax, see CaseStudy1 Q3.do.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 36/51

slide-110
SLIDE 110

Setting the scene Case study Q&A Wrapping up References

Tasks

Question 4 We now deal with BMI, the intermediate confounder (L). You may want to consult CaseStudy1 Q4.do from the begin- ning. Since things are getting a bit complex now, with 3 models, and since we wish to include interactions in some/all of these mod- els, we proceed now by Monte Carlo simulation, rather than analytically. The general idea is as follows:

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 37/51

slide-111
SLIDE 111

Setting the scene Case study Q&A Wrapping up References

Tasks

Question 4 (cont’d) (1) Fit a model for logBMI given alc, age, SES and smoke. (2) Simulate two values of logBMI for each individual: one had their exposure been 1, and one had their exposure been 0, i.e. L(1) and L(0). These simulations need to be stochastic, so remember to add e(rmse)*rnormal(). (3) Do the same for logGGT, so that you simulate M(1) and M(0) for each individual. [The model will include logBMI, and so when you simulate M(1), use L(1) in place of L, and when you simulate M(0), use L(0) in place pf L.] (4) Finally, fit a model for SBP given all other variables, and use this model to predict Y(1, M(1)), Y(1, M(0)) and Y(0, M(0)) for each individual. Eg when predicting Y(1, M(0)) you will use 1 in place of X, L(1) in place of L and M(0) in place of M.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 38/51

slide-112
SLIDE 112

Setting the scene Case study Q&A Wrapping up References

Tasks

Question 4 (cont’d) (5) Take differences of these three predicted potential out- comes for each individual as follows:

  • OEi = Y(1, M(1)) − Y(0, M(0))
  • NDEi = Y(1, M(0)) − Y(0, M(0))
  • NIEi = Y(1, M(1)) − Y(1, M(0))

(6) Finally, take the average of these individual differences over all individuals to obtain the MC estimates of the OE, NDE and NIE.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 39/51

slide-113
SLIDE 113

Setting the scene Case study Q&A Wrapping up References

Tasks

Question 4 (cont’d) A few additional things to note: (A) We can reduce the MC error in our estimates by increasing the sample size for which we predict all the potential outcomes. (B) For inference, we use the bootstrap; that is why we in- clude all our code into a ‘program’, which can then be called by Stata’s bootstrap command. (C) It might be sensible to start by trying the MC simulation procedure for the two analyses we’ve already carried out, i.e. ignoring BMI, first without the XM interaction, and then with it. Then, in a third step, try adding the intermediate confounder.

For more help with the Stata syntax, see CaseStudy1 Q4.do.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 40/51

slide-114
SLIDE 114

Setting the scene Case study Q&A Wrapping up References

Outline

1

Setting the scene Introduction Traditional approach Causal inference gets involved —Estimands —Assumptions —Identification Interventional effects

2

Case study

3

Q&A

4

Wrapping up

5

References

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 41/51

slide-115
SLIDE 115

Setting the scene Case study Q&A Wrapping up References

Outline

1

Setting the scene Introduction Traditional approach Causal inference gets involved —Estimands —Assumptions —Identification Interventional effects

2

Case study

3

Q&A

4

Wrapping up

5

References

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 42/51

slide-116
SLIDE 116

Setting the scene Case study Q&A Wrapping up References

Summary of the afternoon

  • Questions concerning mediation are often posed and tie in with our

intuition on what it means to ‘understand mechanism’.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 43/51

slide-117
SLIDE 117

Setting the scene Case study Q&A Wrapping up References

Summary of the afternoon

  • Questions concerning mediation are often posed and tie in with our

intuition on what it means to ‘understand mechanism’.

  • Traditional mediation methods (‘product’ or ‘difference’) suffer from

the same vagueness that has plagued all informal statistical methods for causal inference. What exactly is being estimated? Under what assumptions is our estimation method successful?

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 43/51

slide-118
SLIDE 118

Setting the scene Case study Q&A Wrapping up References

Summary of the afternoon

  • Questions concerning mediation are often posed and tie in with our

intuition on what it means to ‘understand mechanism’.

  • Traditional mediation methods (‘product’ or ‘difference’) suffer from

the same vagueness that has plagued all informal statistical methods for causal inference. What exactly is being estimated? Under what assumptions is our estimation method successful?

  • Traditional mediation methods are also limited to simple linear

models.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 43/51

slide-119
SLIDE 119

Setting the scene Case study Q&A Wrapping up References

Summary of the afternoon

  • Questions concerning mediation are often posed and tie in with our

intuition on what it means to ‘understand mechanism’.

  • Traditional mediation methods (‘product’ or ‘difference’) suffer from

the same vagueness that has plagued all informal statistical methods for causal inference. What exactly is being estimated? Under what assumptions is our estimation method successful?

  • Traditional mediation methods are also limited to simple linear

models.

  • The causal inference literature, using counterfactuals, has clarified

what we might mean by ‘direct’ and ‘indirect’ effects, but there isn’t just one possibility.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 43/51

slide-120
SLIDE 120

Setting the scene Case study Q&A Wrapping up References

Summary of the afternoon

  • Questions concerning mediation are often posed and tie in with our

intuition on what it means to ‘understand mechanism’.

  • Traditional mediation methods (‘product’ or ‘difference’) suffer from

the same vagueness that has plagued all informal statistical methods for causal inference. What exactly is being estimated? Under what assumptions is our estimation method successful?

  • Traditional mediation methods are also limited to simple linear

models.

  • The causal inference literature, using counterfactuals, has clarified

what we might mean by ‘direct’ and ‘indirect’ effects, but there isn’t just one possibility.

  • It has led to clear assumptions under which these can be identified,

and a myriad methods for estimation, reaching far beyond two simple linear models.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 43/51

slide-121
SLIDE 121

Setting the scene Case study Q&A Wrapping up References

Summary of the afternoon (cont’d)

  • Today we have focussed on the fully-parametric approach, both

analytic and using MC simulation.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 44/51

slide-122
SLIDE 122

Setting the scene Case study Q&A Wrapping up References

Summary of the afternoon (cont’d)

  • Today we have focussed on the fully-parametric approach, both

analytic and using MC simulation.

  • Today we have focussed only on the setting with a continuous
  • utcome and mediator, and with a single mediator of interest.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 44/51

slide-123
SLIDE 123

Setting the scene Case study Q&A Wrapping up References

Summary of the afternoon (cont’d)

  • Today we have focussed on the fully-parametric approach, both

analytic and using MC simulation.

  • Today we have focussed only on the setting with a continuous
  • utcome and mediator, and with a single mediator of interest.
  • In tomorrow’s workshop, we turn to mediation analysis with multiple

mediators, and we’ll look at a setting with a binary outcome.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 44/51

slide-124
SLIDE 124

Setting the scene Case study Q&A Wrapping up References

Summary of the afternoon (cont’d)

  • Today we have focussed on the fully-parametric approach, both

analytic and using MC simulation.

  • Today we have focussed only on the setting with a continuous
  • utcome and mediator, and with a single mediator of interest.
  • In tomorrow’s workshop, we turn to mediation analysis with multiple

mediators, and we’ll look at a setting with a binary outcome.

  • See Tyler VanderWeele’s (2015) wonderful book for the many many

topics we have not been able to cover: semiparametric estimation methods, time-to-event outcomes, three- and four-way decompositions, etc.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 44/51

slide-125
SLIDE 125

Setting the scene Case study Q&A Wrapping up References

Outline

1

Setting the scene Introduction Traditional approach Causal inference gets involved —Estimands —Assumptions —Identification Interventional effects

2

Case study

3

Q&A

4

Wrapping up

5

References

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 45/51

slide-126
SLIDE 126

Setting the scene Case study Q&A Wrapping up References

References: traditional approach

Wright, S. (1921) Correlation and causation: Part I - Method of Path Coefficients Journal of Agriculture Research, 20:557–575. Wright, S. (1934) The method of path coefficients Annals of Mathematical Statistics, 5(3)161–215. Baron, R. M. and Kenny, D. A. (1986) The moderator-mediator variable distinction in social psychological research: conceptual, strategic and statistical considerations Journal of Personality and Social Psychology, 51:1173–1182.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 46/51

slide-127
SLIDE 127

Setting the scene Case study Q&A Wrapping up References

References: early investigations by causal inference

Robins, J.M. and Greenland, S. (1992) Identifiability and exchangeability for direct and indirect effects. Epidemiology, 3:143–155. Pearl, J. (2001) Direct and indirect effects. Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, 411–420. Cole, S.R. and Hern´ an, M.A. (2002) Fallibility in estimating direct effects. International Journal of Epidemiology, 31(1):163–165.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 47/51

slide-128
SLIDE 128

Setting the scene Case study Q&A Wrapping up References

References: more on identification and estimation

Robins, J. (1999) Testing and estimation of direct effects by reparameterizing directed acyclic graphs with structural nested models. In Computation, Causation, and Discovery, C. Glymour & G. Cooper, eds. Menlo Park, CA, Cambridge: AAAI Press/The MIT Press, pp. 349–405. Petersen, M.L., Sinisi, S.E. and van der Laan, M.J. (2006) Estimation of direct causal effects. Epidemiology, 17:276–284. VanderWeele, T.J. and Vansteelandt, S. (2009) Conceptual issues concerning mediation, interventions and composition. Statistics and its Interface, 2:457–468.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 48/51

slide-129
SLIDE 129

Setting the scene Case study Q&A Wrapping up References

References: more on identification and estimation

VanderWeele, T.J. (2009) Marginal structural models for the estimation of direct and indirect effects. Epidemiology, 20:18–26. Joffe, M. and Greene, T. (2009) Related causal frameworks for surrogate outcomes. Biometrics, 65:530–538. Tchetgen Tchetgen, E.J. (2013) Inverse odds ratio-weighted estimation for causal mediation. Statistics in Medicine, 32:4567–80.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 49/51

slide-130
SLIDE 130

Setting the scene Case study Q&A Wrapping up References

References: key textbook on causal mediation analysis

VanderWeele, T.J. (2015) Explanation in Causal Inference: Methods for Mediation and Interaction. Oxford University Press.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 50/51

slide-131
SLIDE 131

Setting the scene Case study Q&A Wrapping up References

References: on interventional effects

Vanderweele, T.J., Vansteelandt, S. and Robins, J.M. (2014) Effect decomposition in the presence of an exposure-induced mediator-outcome confounder. Epidemiology, 25:300–306. Vansteelandt, S. and Daniel, R.M. (2017) Interventional effects for mediation analysis with multiple mediators. Epidemiology, 28(2):258–265.

Rhian Daniel/Counterfactual-based mediation analysisWorkshop 1 51/51