Adjustment Criteria for Generalizing Experimental Findings
Juan D. Correa, Jin Tian and Elias Bareinboim
- 1
Long Beach, CA
Adjustment Criteria for Generalizing Experimental Findings Juan D. - - PowerPoint PPT Presentation
Adjustment Criteria for Generalizing Experimental Findings Juan D. Correa , Jin Tian and Elias Bareinboim Long Beach, CA 1 Causal Effects and Experiments 2 Causal Effects and Experiments Science is about understanding the laws
Juan D. Correa, Jin Tian and Elias Bareinboim
Long Beach, CA
2
2
2
the context of the empirical sciences (health sciences, economics, social sciences).
2
the context of the empirical sciences (health sciences, economics, social sciences).
explainability, and reinforcement learning.
2
3
[Greenhouse et al. 2008] In the context of pediatric patients treated with antidepressant:
3
[Greenhouse et al. 2008] In the context of pediatric patients treated with antidepressant:
3
[Greenhouse et al. 2008] In the context of pediatric patients treated with antidepressant:
information of the patients and the assessment of their baseline risk.
3
[Greenhouse et al. 2008] In the context of pediatric patients treated with antidepressant:
information of the patients and the assessment of their baseline risk.
controlled experiment is used to identify the unconfounded effect of the antidepressants.
3
Natural world (confounded) X Y B E
(antidep. use) (suicidality) (baseline risk) (background factors)
4
Natural world (confounded) X Y B E
(antidep. use) (suicidality) (baseline risk) (background factors)
4
do(x0)
Natural world (confounded) X Y B E
(antidep. use) (suicidality) (baseline risk) (background factors)
P(y, b, e | do(x0))
x0 Y B E Effect of getting no treatment
4
do(x0)
Natural world (confounded) X Y B E
(antidep. use) (suicidality) (baseline risk) (background factors)
P(y, b, e | do(x0))
x0 Y B E Effect of getting no treatment
4
do(x0) do(x1)
Natural world (confounded) X Y B E
(antidep. use) (suicidality) (baseline risk) (background factors)
P(y, b, e | do(x0))
x0 Y B E
P(y, b, e | do(x1))
x1 Y B E Effect of getting treatment Effect of getting no treatment
4
do(x0) do(x1)
5
[Greenhouse etal. 2008] On the risk of suicidality among pediatric antidepressant users:
5
[Greenhouse etal. 2008] On the risk of suicidality among pediatric antidepressant users:
approximately twice the amount of suicidal thoughts and behaviors compared to the control groups (do(x0)).
P(Y = 1|do(x1)) > P(Y = 1|do(x0))
5
[Greenhouse etal. 2008] On the risk of suicidality among pediatric antidepressant users:
approximately twice the amount of suicidal thoughts and behaviors compared to the control groups (do(x0)).
P(Y = 1|do(x1)) > P(Y = 1|do(x0))
5
[Greenhouse etal. 2008] On the risk of suicidality among pediatric antidepressant users:
approximately twice the amount of suicidal thoughts and behaviors compared to the control groups (do(x0)).
increase of suicidal events in the corresponding age groups.
P*(Y = 1|do(x1)) < P*(Y = 1|do(x0)) P(Y = 1|do(x1)) > P(Y = 1|do(x0))
5
[Greenhouse etal. 2008] On the risk of suicidality among pediatric antidepressant users:
approximately twice the amount of suicidal thoughts and behaviors compared to the control groups (do(x0)).
increase of suicidal events in the corresponding age groups.
antidepressants, even after accounting for access to mental health-care and other confounding factors.
P*(Y = 1|do(x1)) < P*(Y = 1|do(x0)) P(Y = 1|do(x1)) > P(Y = 1|do(x0))
5
˜ P*(Y = 1|do(x1)) < ˜ P*(Y = 1|do(x0))
[Greenhouse etal. 2008] On the risk of suicidality among pediatric antidepressant users:
approximately twice the amount of suicidal thoughts and behaviors compared to the control groups (do(x0)).
increase of suicidal events in the corresponding age groups.
antidepressants, even after accounting for access to mental health-care and other confounding factors.
What is going on here?
P*(Y = 1|do(x1)) < P*(Y = 1|do(x0)) P(Y = 1|do(x1)) > P(Y = 1|do(x0))
5
˜ P*(Y = 1|do(x1)) < ˜ P*(Y = 1|do(x0))
6
6
population that was studied.
6
population that was studied.
different, target domain (in other words, population, setting, environment).
6
7
There is a mismatch between the study population 𝛒 and the general clinical population 𝛒* regarding ethnicity, race, and income (covariates named E).
7
There is a mismatch between the study population 𝛒 and the general clinical population 𝛒* regarding ethnicity, race, and income (covariates named E).
7
👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥
entire/target population 𝛒*
There is a mismatch between the study population 𝛒 and the general clinical population 𝛒* regarding ethnicity, race, and income (covariates named E).
7
👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥
entire/target population 𝛒* study/source population 𝛒
There is a mismatch between the study population 𝛒 and the general clinical population 𝛒* regarding ethnicity, race, and income (covariates named E).
7
👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥
entire/target population 𝛒* study/source population 𝛒
P*(e) ≠ P(e)
8
👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥
entire/target population 𝛒* study/source population 𝛒
FDA's studies sampled from a distinct population by excluding youths with elevated baseline risk for suicide (B) from their cohorts.
8
👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥
entire/target population 𝛒* study/source population 𝛒
FDA's studies sampled from a distinct population by excluding youths with elevated baseline risk for suicide (B) from their cohorts.
8
👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥
entire/target population 𝛒* study/source population 𝛒 sampled individuals (S=1)
FDA's studies sampled from a distinct population by excluding youths with elevated baseline risk for suicide (B) from their cohorts.
8
👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥
entire/target population 𝛒* study/source population 𝛒 sampled individuals (S=1)
P(y, b, e|do(x), S = 1) ≠ P(y, b, e|do(x)) P(x, y, b, e|S = 1) ≠ P(x, y, b, e)
X Y B E
9
We use indicator named T to mark variables with differences between domains 𝛒 and 𝛒*.
X Y B E T
9
We use indicator named T to mark variables with differences between domains 𝛒 and 𝛒*. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise.
X Y B E S T
9
We use indicator named T to mark variables with differences between domains 𝛒 and 𝛒*. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise.
X Y B E S T
9
(called selection diagram D)
We use indicator named T to mark variables with differences between domains 𝛒 and 𝛒*. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. In this example, the causal effect can be estimated by recalibrating the experimental findings using observations from the target domain
X Y B E S T
9
P*(y|do(x)) = ∑
b,e
P(y|do(x), b, e, S = 1)P*(b, e)
(called selection diagram D)
We use indicator named T to mark variables with differences between domains 𝛒 and 𝛒*. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. In this example, the causal effect can be estimated by recalibrating the experimental findings using observations from the target domain
X Y B E S T
9
P*(y|do(x)) = ∑
b,e
P(y|do(x), b, e, S = 1)P*(b, e)
causal effect in target domain (called selection diagram D)
We use indicator named T to mark variables with differences between domains 𝛒 and 𝛒*. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. In this example, the causal effect can be estimated by recalibrating the experimental findings using observations from the target domain
X Y B E S T
9
P*(y|do(x)) = ∑
b,e
P(y|do(x), b, e, S = 1)P*(b, e)
experimental data from the source under selection bias causal effect in target domain (called selection diagram D)
We use indicator named T to mark variables with differences between domains 𝛒 and 𝛒*. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. In this example, the causal effect can be estimated by recalibrating the experimental findings using observations from the target domain
X Y B E S T
9
P*(y|do(x)) = ∑
b,e
P(y|do(x), b, e, S = 1)P*(b, e)
Observations from the target domain experimental data from the source under selection bias causal effect in target domain (called selection diagram D)
10
10
X Y B E S T
Selection Diagram D
10
X Y B E S T
Selection Diagram D
P(v|do(x), S = 1)
Selection-biased Exp. Distribution P1 from 𝛒
10
X Y B E S T
Selection Diagram D
P(v|do(x), S = 1)
Selection-biased Exp. Distribution P1 from 𝛒
P*(w)
Covariate Distribution P2 from 𝛒*
10
X Y B E S T
Selection Diagram D
Is there a function f such that
P*(y|do(x)) = f(P1, P2)
P(v|do(x), S = 1)
Selection-biased Exp. Distribution P1 from 𝛒
P*(w)
Covariate Distribution P2 from 𝛒*
10
X Y B E S T
Selection Diagram D
Is there a function f such that
P*(y|do(x)) = f(P1, P2)
P(v|do(x), S = 1)
Selection-biased Exp. Distribution P1 from 𝛒
P*(w)
Covariate Distribution P2 from 𝛒*
yes ( ) / no
f
😁 ☹
11
11
confounding type of input selection bias transportability complete
11
confounding type of input selection bias transportability complete
Backdoor Criterion [Pearl ’93] Extended Backdoor [Pearl and Paz ’10]
✔
11
confounding type of input selection bias transportability complete
Backdoor Criterion [Pearl ’93] Extended Backdoor [Pearl and Paz ’10]
✔
Adjustment Criterion [Shpitser et al. ’10; Perkovic et al. ’15,’18]
✔
✔
11
confounding type of input selection bias transportability complete
Backdoor Criterion [Pearl ’93] Extended Backdoor [Pearl and Paz ’10]
✔
Adjustment Criterion [Shpitser et al. ’10; Perkovic et al. ’15,’18]
✔
✔
Selection Backdoor [Bareinboim, Tian and Pearl ’14]
✔
✔
11
confounding type of input selection bias transportability complete
Backdoor Criterion [Pearl ’93] Extended Backdoor [Pearl and Paz ’10]
✔
Adjustment Criterion [Shpitser et al. ’10; Perkovic et al. ’15,’18]
✔
✔
Selection Backdoor [Bareinboim, Tian and Pearl ’14]
✔
✔
Generalized Adjustment Criterion [Correa, Tian and Bareinboim ’18]
✔
✔ ✔
11
confounding type of input selection bias transportability complete
Backdoor Criterion [Pearl ’93] Extended Backdoor [Pearl and Paz ’10]
✔
Adjustment Criterion [Shpitser et al. ’10; Perkovic et al. ’15,’18]
✔
✔
Selection Backdoor [Bareinboim, Tian and Pearl ’14]
✔
✔
Generalized Adjustment Criterion [Correa, Tian and Bareinboim ’18]
✔
✔ ✔
st-Adjustment Criterion [Correa, Tian and Bareinboim ’19]
— exp. ✔ ✔ ✔
12
P*(y|do(x))
12
unbiased target effect in 𝛒*
P*(y|do(x))
12
= ∑
z
P(y|do(x), z, S = 1)P*(z)
unbiased target effect in 𝛒*
P*(y|do(x))
12
= ∑
z
P(y|do(x), z, S = 1)P*(z)
unbiased target effect in 𝛒* experiment results in source domain 𝛒
P*(y|do(x))
12
= ∑
z
P(y|do(x), z, S = 1)P*(z)
unbiased target effect in 𝛒* experiment results in source domain 𝛒
the target domain 𝛒*
P*(y|do(x))
12
= ∑
z
P(y|do(x), z, S = 1)P*(z)
unbiased target effect in 𝛒* experiment results in source domain 𝛒
the target domain 𝛒*
P*(y|do(x))
12
= ∑
z
P(y|do(x), z, S = 1)P*(z)
unbiased target effect in 𝛒* experiment results in source domain 𝛒
the target domain 𝛒*
P*(y|do(x))
12
= ∑
z
P(y|do(x), z, S = 1)P*(z)
unbiased target effect in 𝛒* experiment results in source domain 𝛒
the target domain 𝛒*
P*(y|do(x))
12
13
more bias, instead of controlling for the current, existent ones.
13
more bias, instead of controlling for the current, existent ones.
by the treatment) that are correlated with the outcome given pre-treatment covariates.
13
more bias, instead of controlling for the current, existent ones.
by the treatment) that are correlated with the outcome given pre-treatment covariates.
13
more bias, instead of controlling for the current, existent ones.
by the treatment) that are correlated with the outcome given pre-treatment covariates.
X Y Z3 Z1 S T Z2
13
more bias, instead of controlling for the current, existent ones.
by the treatment) that are correlated with the outcome given pre-treatment covariates.
X Y Z3 Z1 S T Z2
13
Zp = {Z3}.
14
A set of covariates Z is admissible for st-adjustment in D relative to treatment X and
14
A set of covariates Z is admissible for st-adjustment in D relative to treatment X and
(i) Variables in Zp are independent of the treatment given all other covariates, and
14
A set of covariates Z is admissible for st-adjustment in D relative to treatment X and
(i) Variables in Zp are independent of the treatment given all other covariates, and (ii) The outcome Y is independent of all the transportability (T) and selection bias nodes (S) given the covariates Z and the treatment X.
14
A set of covariates Z is admissible for st-adjustment in D relative to treatment X and
(i) Variables in Zp are independent of the treatment given all other covariates, and (ii) The outcome Y is independent of all the transportability (T) and selection bias nodes (S) given the covariates Z and the treatment X.
14
15
Task: Compute P*(y | do(x))
X Y Z3 Z1 S T Z2
15
Task: Compute P*(y | do(x))
between the source and target domains.
X Y Z3 Z1 S T Z2
15
X Y Z3 Z1 S T Z2
Task: Compute P*(y | do(x))
between the source and target domains.
X Y Z3 Z1 S T Z2
15
X Y Z3 Z1 S T Z2 X Y Z3 Z1 S T Z2
Task: Compute P*(y | do(x))
between the source and target domains.
spurious correlation. Hence, we should also control for Z2.
X Y Z3 Z1 S T Z2
15
X Y Z3 Z1 S T Z2 X Y Z3 Z1 S T Z2 X Y Z3 Z1 S T Z2
Task: Compute P*(y | do(x))
between the source and target domains.
spurious correlation. Hence, we should also control for Z2.
X Y Z3 Z1 S T Z2
15
X Y Z3 Z1 S T Z2 X Y Z3 Z1 S T Z2 X Y Z3 Z1 S T Z2 X Y Z3 Z1 S T Z2
X Y Z3 Z1 S T Z2
16
By making Z ={Z1, Z2, Z3}, we can verify the st-adjustment conditions, i.e.:
X Y Z3 Z1 S T Z2
16
By making Z ={Z1, Z2, Z3}, we can verify the st-adjustment conditions, i.e.: (i) The variable in Zp={Z3} is independent of X given the other covariates {Z1, Z2}.
X Y Z3 Z1 S T Z2
16
By making Z ={Z1, Z2, Z3}, we can verify the st-adjustment conditions, i.e.: (i) The variable in Zp={Z3} is independent of X given the other covariates {Z1, Z2}. (ii) The outcome Y is independent of S and T given Z.
X Y Z3 Z1 S T Z2
16
By making Z ={Z1, Z2, Z3}, we can verify the st-adjustment conditions, i.e.: (i) The variable in Zp={Z3} is independent of X given the other covariates {Z1, Z2}. (ii) The outcome Y is independent of S and T given Z.
X Y Z3 Z1 S T Z2
16
P*(y|do(x)) = ∑
z1,z2,z3
P(y|do(x), z1, z2, z3, S = 1)P*(z1, z2, z3)
Hence, the st-adjustment is guaranteed to hold, i.e.:
By making Z ={Z1, Z2, Z3}, we can verify the st-adjustment conditions, i.e.: (i) The variable in Zp={Z3} is independent of X given the other covariates {Z1, Z2}. (ii) The outcome Y is independent of S and T given Z.
X Y Z3 Z1 S T Z2
16
P*(y|do(x)) = ∑
z1,z2,z3
P(y|do(x), z1, z2, z3, S = 1)P*(z1, z2, z3)
Hence, the st-adjustment is guaranteed to hold, i.e.:
measurements from the target domain experimental data from the source under selection bias causal effect in target domain
17
17
trial and error. There could be exponentially many candidates (and even valid ones!).
17
trial and error. There could be exponentially many candidates (and even valid ones!).
17
trial and error. There could be exponentially many candidates (and even valid ones!).
properties (e.g., cost, variance).
17
18
18
X Y B E S T
Selection Diagram D
18
X Y B E S T
Selection Diagram D
P(v|do(x), S = 1)
Selection-biased Exp. Distribution from 𝛒
18
X Y B E S T
Selection Diagram D
P(v|do(x), S = 1)
Selection-biased Exp. Distribution from 𝛒 Set W of covariates measurable in 𝛒*
18
X Y B E S T
Selection Diagram D
What are all the admissible sets satisfying st-adjustment?
P(v|do(x), S = 1)
Selection-biased Exp. Distribution from 𝛒 Set W of covariates measurable in 𝛒*
18
X Y B E S T
Selection Diagram D
What are all the admissible sets satisfying st-adjustment?
P(v|do(x), S = 1)
Selection-biased Exp. Distribution from 𝛒 Set W of covariates measurable in 𝛒*
List of of sets such that for each Zi:
Z1, Z2, … ⊆ W
P*(y|do(x)) = ∑
zi
P(y|do(x), zi, S = 1)P*(zi)
18
X Y B E S T
Selection Diagram D
What are all the admissible sets satisfying st-adjustment?
P(v|do(x), S = 1)
Selection-biased Exp. Distribution from 𝛒 Set W of covariates measurable in 𝛒*
List of of sets such that for each Zi:
Z1, Z2, … ⊆ W
P*(y|do(x)) = ∑
zi
P(y|do(x), zi, S = 1)P*(zi)
We provide an algorithm (Alg. 2) that works with polynomial delay (Thm. 6)
19
adjusting by a given set of covariates is admissible for the identification of causal effects from experimental results in a source domain and some observations from the target domain.
19
adjusting by a given set of covariates is admissible for the identification of causal effects from experimental results in a source domain and some observations from the target domain.
measured.
19
adjusting by a given set of covariates is admissible for the identification of causal effects from experimental results in a source domain and some observations from the target domain.
measured.
help researchers in health sciences, econometrics, reinforcement learning, marketing and others where extrapolating experimental results is crucial.
19
20
Poster
21
[Takata ’10]