Adjustment Criteria for Generalizing Experimental Findings Juan D. - - PowerPoint PPT Presentation

adjustment criteria for generalizing experimental findings
SMART_READER_LITE
LIVE PREVIEW

Adjustment Criteria for Generalizing Experimental Findings Juan D. - - PowerPoint PPT Presentation

Adjustment Criteria for Generalizing Experimental Findings Juan D. Correa , Jin Tian and Elias Bareinboim Long Beach, CA 1 Causal Effects and Experiments 2 Causal Effects and Experiments Science is about understanding the laws


slide-1
SLIDE 1

Adjustment Criteria for Generalizing Experimental Findings

Juan D. Correa, Jin Tian and Elias Bareinboim

  • 1

Long Beach, CA

slide-2
SLIDE 2

Causal Effects and Experiments

2

slide-3
SLIDE 3

Causal Effects and Experiments

  • Science is about understanding the laws of nature, which are usually expressed in terms
  • f cause and effect relationships.

2

slide-4
SLIDE 4

Causal Effects and Experiments

  • Science is about understanding the laws of nature, which are usually expressed in terms
  • f cause and effect relationships.
  • Controlled experimentation is the pillar on top of which empirical science is built upon.

2

slide-5
SLIDE 5

Causal Effects and Experiments

  • Science is about understanding the laws of nature, which are usually expressed in terms
  • f cause and effect relationships.
  • Controlled experimentation is the pillar on top of which empirical science is built upon.
  • Dozens of billions of dollars are spent every year in performing controlled experiments in

the context of the empirical sciences (health sciences, economics, social sciences).

2

slide-6
SLIDE 6

Causal Effects and Experiments

  • Science is about understanding the laws of nature, which are usually expressed in terms
  • f cause and effect relationships.
  • Controlled experimentation is the pillar on top of which empirical science is built upon.
  • Dozens of billions of dollars are spent every year in performing controlled experiments in

the context of the empirical sciences (health sciences, economics, social sciences).

  • Inferring and reasoning with causal relations are central for decision-making,

explainability, and reinforcement learning.

2

slide-7
SLIDE 7

Motivating Example (1) (Why is this problem non-trivial?)

3

slide-8
SLIDE 8

Motivating Example (1) (Why is this problem non-trivial?)

[Greenhouse et al. 2008] In the context of pediatric patients treated with antidepressant:

3

slide-9
SLIDE 9

Motivating Example (1) (Why is this problem non-trivial?)

[Greenhouse et al. 2008] In the context of pediatric patients treated with antidepressant:

  • The FDA was interested in assessing the effect of antidepressant drugs on suicidality.

3

slide-10
SLIDE 10

Motivating Example (1) (Why is this problem non-trivial?)

[Greenhouse et al. 2008] In the context of pediatric patients treated with antidepressant:

  • The FDA was interested in assessing the effect of antidepressant drugs on suicidality.
  • Historically, drugs had been prescribed by doctors taking into account background

information of the patients and the assessment of their baseline risk.

3

slide-11
SLIDE 11

Motivating Example (1) (Why is this problem non-trivial?)

[Greenhouse et al. 2008] In the context of pediatric patients treated with antidepressant:

  • The FDA was interested in assessing the effect of antidepressant drugs on suicidality.
  • Historically, drugs had been prescribed by doctors taking into account background

information of the patients and the assessment of their baseline risk.

  • Since the prescription and the outcome are both affected by the background factors, a

controlled experiment is used to identify the unconfounded effect of the antidepressants.

3

slide-12
SLIDE 12

Controlled Experimentation — Randomization

Natural world (confounded) X Y B E

(antidep. use) (suicidality) (baseline risk) (background factors)

4

slide-13
SLIDE 13

Controlled Experimentation — Randomization

Natural world (confounded) X Y B E

(antidep. use) (suicidality) (baseline risk) (background factors)

4

do(x0)

slide-14
SLIDE 14

Controlled Experimentation — Randomization

Natural world (confounded) X Y B E

(antidep. use) (suicidality) (baseline risk) (background factors)

P(y, b, e | do(x0))

x0 Y B E Effect of
 getting no
 treatment

4

do(x0)

slide-15
SLIDE 15

Controlled Experimentation — Randomization

Natural world (confounded) X Y B E

(antidep. use) (suicidality) (baseline risk) (background factors)

P(y, b, e | do(x0))

x0 Y B E Effect of
 getting no
 treatment

4

do(x0) do(x1)

slide-16
SLIDE 16

Controlled Experimentation — Randomization

Natural world (confounded) X Y B E

(antidep. use) (suicidality) (baseline risk) (background factors)

P(y, b, e | do(x0))

x0 Y B E

P(y, b, e | do(x1))

x1 Y B E Effect of 
 getting
 treatment Effect of
 getting no
 treatment

4

do(x0) do(x1)

slide-17
SLIDE 17

Randomization is not all there is!
 Motivating Example (2)

5

slide-18
SLIDE 18

[Greenhouse etal. 2008] On the risk of suicidality among pediatric antidepressant users:

Randomization is not all there is!
 Motivating Example (2)

5

slide-19
SLIDE 19

[Greenhouse etal. 2008] On the risk of suicidality among pediatric antidepressant users:

  • The FDA performed several RCTs finding that youths receiving antidepressants (do(x1)) had

approximately twice the amount of suicidal thoughts and behaviors compared to the control groups (do(x0)).

Randomization is not all there is!
 Motivating Example (2)

P(Y = 1|do(x1)) > P(Y = 1|do(x0))

5

slide-20
SLIDE 20

[Greenhouse etal. 2008] On the risk of suicidality among pediatric antidepressant users:

  • The FDA performed several RCTs finding that youths receiving antidepressants (do(x1)) had

approximately twice the amount of suicidal thoughts and behaviors compared to the control groups (do(x0)).

  • Results led to the addition of a strict warning to the drug’s label.

Randomization is not all there is!
 Motivating Example (2)

P(Y = 1|do(x1)) > P(Y = 1|do(x0))

5

slide-21
SLIDE 21

[Greenhouse etal. 2008] On the risk of suicidality among pediatric antidepressant users:

  • The FDA performed several RCTs finding that youths receiving antidepressants (do(x1)) had

approximately twice the amount of suicidal thoughts and behaviors compared to the control groups (do(x0)).

  • Results led to the addition of a strict warning to the drug’s label.
  • Surprisingly, following the warning, a decrease in prescription was reported together with an

increase of suicidal events in the corresponding age groups.

Randomization is not all there is!
 Motivating Example (2)

P*(Y = 1|do(x1)) < P*(Y = 1|do(x0)) P(Y = 1|do(x1)) > P(Y = 1|do(x0))

5

slide-22
SLIDE 22

[Greenhouse etal. 2008] On the risk of suicidality among pediatric antidepressant users:

  • The FDA performed several RCTs finding that youths receiving antidepressants (do(x1)) had

approximately twice the amount of suicidal thoughts and behaviors compared to the control groups (do(x0)).

  • Results led to the addition of a strict warning to the drug’s label.
  • Surprisingly, following the warning, a decrease in prescription was reported together with an

increase of suicidal events in the corresponding age groups.

  • Several observational studies reported positive results for patients using the same

antidepressants, even after accounting for access to mental health-care and other confounding factors.

Randomization is not all there is!
 Motivating Example (2)

P*(Y = 1|do(x1)) < P*(Y = 1|do(x0)) P(Y = 1|do(x1)) > P(Y = 1|do(x0))

5

˜ P*(Y = 1|do(x1)) < ˜ P*(Y = 1|do(x0))

slide-23
SLIDE 23

[Greenhouse etal. 2008] On the risk of suicidality among pediatric antidepressant users:

  • The FDA performed several RCTs finding that youths receiving antidepressants (do(x1)) had

approximately twice the amount of suicidal thoughts and behaviors compared to the control groups (do(x0)).

  • Results led to the addition of a strict warning to the drug’s label.
  • Surprisingly, following the warning, a decrease in prescription was reported together with an

increase of suicidal events in the corresponding age groups.

  • Several observational studies reported positive results for patients using the same

antidepressants, even after accounting for access to mental health-care and other confounding factors.

Randomization is not all there is!
 Motivating Example (2)

What is going on here?

P*(Y = 1|do(x1)) < P*(Y = 1|do(x0)) P(Y = 1|do(x1)) > P(Y = 1|do(x0))

5

˜ P*(Y = 1|do(x1)) < ˜ P*(Y = 1|do(x0))

slide-24
SLIDE 24

What are we missing?
 Motivating Example (3)

6

slide-25
SLIDE 25

What are we missing?
 Motivating Example (3)

  • Were the experiments conducted erroneously ?

6

slide-26
SLIDE 26

What are we missing?
 Motivating Example (3)

  • Were the experiments conducted erroneously ?
  • Randomization guarantees internal validity, that is, causal conclusions are true for the

population that was studied.

6

slide-27
SLIDE 27

What are we missing?
 Motivating Example (3)

  • Were the experiments conducted erroneously ?
  • Randomization guarantees internal validity, that is, causal conclusions are true for the

population that was studied.

  • Most experimental findings are intended to be generalized to a broader, or even

different, target domain (in other words, population, setting, environment).

6

slide-28
SLIDE 28

Two Challenges

  • Some possible explanations for the discrepancy in those results are:

7

slide-29
SLIDE 29

Two Challenges

  • Some possible explanations for the discrepancy in those results are:
  • 1. Transportability

There is a mismatch between the study population 𝛒 and the general clinical population 𝛒* regarding ethnicity, race, and income (covariates named E).

7

slide-30
SLIDE 30

Two Challenges

  • Some possible explanations for the discrepancy in those results are:
  • 1. Transportability

There is a mismatch between the study population 𝛒 and the general clinical population 𝛒* regarding ethnicity, race, and income (covariates named E).

7

👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥

entire/target 
 population 
 𝛒*

slide-31
SLIDE 31

Two Challenges

  • Some possible explanations for the discrepancy in those results are:
  • 1. Transportability

There is a mismatch between the study population 𝛒 and the general clinical population 𝛒* regarding ethnicity, race, and income (covariates named E).

7

👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥

entire/target 
 population 
 𝛒* study/source 
 population 
 𝛒

slide-32
SLIDE 32

Two Challenges

  • Some possible explanations for the discrepancy in those results are:
  • 1. Transportability

There is a mismatch between the study population 𝛒 and the general clinical population 𝛒* regarding ethnicity, race, and income (covariates named E).

7

👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥

entire/target 
 population 
 𝛒* study/source 
 population 
 𝛒

P*(e) ≠ P(e)

slide-33
SLIDE 33

Two Challenges

8

👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥

entire/target 
 population 
 𝛒* study/source 
 population 
 𝛒

  • Some possible explanations for the discrepancy in those results are:
slide-34
SLIDE 34

Two Challenges

  • 2. Selection Bias

FDA's studies sampled from a distinct population by excluding youths with elevated baseline risk for suicide (B) from their cohorts.

8

👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥

entire/target 
 population 
 𝛒* study/source 
 population 
 𝛒

  • Some possible explanations for the discrepancy in those results are:
slide-35
SLIDE 35

Two Challenges

  • 2. Selection Bias

FDA's studies sampled from a distinct population by excluding youths with elevated baseline risk for suicide (B) from their cohorts.

8

👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥

entire/target 
 population 
 𝛒* study/source 
 population 
 𝛒 sampled individuals 
 (S=1)

  • Some possible explanations for the discrepancy in those results are:
slide-36
SLIDE 36

Two Challenges

  • 2. Selection Bias

FDA's studies sampled from a distinct population by excluding youths with elevated baseline risk for suicide (B) from their cohorts.

8

👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥 👥

entire/target 
 population 
 𝛒* study/source 
 population 
 𝛒 sampled individuals 
 (S=1)

P(y, b, e|do(x), S = 1) ≠ P(y, b, e|do(x)) P(x, y, b, e|S = 1) ≠ P(x, y, b, e)

  • Some possible explanations for the discrepancy in those results are:
slide-37
SLIDE 37

Formalizing the Problem

X Y B E

9

slide-38
SLIDE 38

Formalizing the Problem

We use indicator named T to mark variables with differences between domains 𝛒 and 𝛒*.

X Y B E T

9

slide-39
SLIDE 39

Formalizing the Problem

We use indicator named T to mark variables with differences between domains 𝛒 and 𝛒*. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise.

X Y B E S T

9

slide-40
SLIDE 40

Formalizing the Problem

We use indicator named T to mark variables with differences between domains 𝛒 and 𝛒*. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise.

X Y B E S T

9

(called selection 
 diagram D)

slide-41
SLIDE 41

Formalizing the Problem

We use indicator named T to mark variables with differences between domains 𝛒 and 𝛒*. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. In this example, the causal effect can be estimated by recalibrating the experimental findings using observations from the target domain

X Y B E S T

9

P*(y|do(x)) = ∑

b,e

P(y|do(x), b, e, S = 1)P*(b, e)

(called selection 
 diagram D)

slide-42
SLIDE 42

Formalizing the Problem

We use indicator named T to mark variables with differences between domains 𝛒 and 𝛒*. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. In this example, the causal effect can be estimated by recalibrating the experimental findings using observations from the target domain

X Y B E S T

9

P*(y|do(x)) = ∑

b,e

P(y|do(x), b, e, S = 1)P*(b, e)

causal effect 
 in target domain (called selection 
 diagram D)

slide-43
SLIDE 43

Formalizing the Problem

We use indicator named T to mark variables with differences between domains 𝛒 and 𝛒*. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. In this example, the causal effect can be estimated by recalibrating the experimental findings using observations from the target domain

X Y B E S T

9

P*(y|do(x)) = ∑

b,e

P(y|do(x), b, e, S = 1)P*(b, e)

experimental data from the 
 source under selection bias causal effect 
 in target domain (called selection 
 diagram D)

slide-44
SLIDE 44

Formalizing the Problem

We use indicator named T to mark variables with differences between domains 𝛒 and 𝛒*. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. In this example, the causal effect can be estimated by recalibrating the experimental findings using observations from the target domain

X Y B E S T

9

P*(y|do(x)) = ∑

b,e

P(y|do(x), b, e, S = 1)P*(b, e)

Observations from 
 the target domain experimental data from the 
 source under selection bias causal effect 
 in target domain (called selection 
 diagram D)

slide-45
SLIDE 45

Problem Statement

10

slide-46
SLIDE 46

Problem Statement

10

X Y B E S T

Selection Diagram D

slide-47
SLIDE 47

Problem Statement

10

X Y B E S T

Selection Diagram D

P(v|do(x), S = 1)

Selection-biased Exp.
 Distribution P1 from 𝛒

slide-48
SLIDE 48

Problem Statement

10

X Y B E S T

Selection Diagram D

P(v|do(x), S = 1)

Selection-biased Exp.
 Distribution P1 from 𝛒

P*(w)

Covariate Distribution
 P2 from 𝛒*

slide-49
SLIDE 49

Problem Statement

10

X Y B E S T

Selection Diagram D

Is there a function f such that
 


P*(y|do(x)) = f(P1, P2)

P(v|do(x), S = 1)

Selection-biased Exp.
 Distribution P1 from 𝛒

P*(w)

Covariate Distribution
 P2 from 𝛒*

slide-50
SLIDE 50

Problem Statement

10

X Y B E S T

Selection Diagram D

Is there a function f such that
 


P*(y|do(x)) = f(P1, P2)

P(v|do(x), S = 1)

Selection-biased Exp.
 Distribution P1 from 𝛒

P*(w)

Covariate Distribution
 P2 from 𝛒*

yes ( ) / no

f

😁 ☹

slide-51
SLIDE 51

Related Work

11

slide-52
SLIDE 52

Related Work

11

confounding type of input selection bias transportability complete

slide-53
SLIDE 53

Related Work

11

confounding type of input selection bias transportability complete

Backdoor Criterion [Pearl ’93]
 Extended Backdoor [Pearl and Paz ’10]

  • bs.
slide-54
SLIDE 54

Related Work

11

confounding type of input selection bias transportability complete

Backdoor Criterion [Pearl ’93]
 Extended Backdoor [Pearl and Paz ’10]

  • bs.

Adjustment Criterion 
 [Shpitser et al. ’10; Perkovic et al. ’15,’18]

  • bs.

slide-55
SLIDE 55

Related Work

11

confounding type of input selection bias transportability complete

Backdoor Criterion [Pearl ’93]
 Extended Backdoor [Pearl and Paz ’10]

  • bs.

Adjustment Criterion 
 [Shpitser et al. ’10; Perkovic et al. ’15,’18]

  • bs.

Selection Backdoor 
 [Bareinboim, Tian and Pearl ’14]

  • bs.

slide-56
SLIDE 56

Related Work

11

confounding type of input selection bias transportability complete

Backdoor Criterion [Pearl ’93]
 Extended Backdoor [Pearl and Paz ’10]

  • bs.

Adjustment Criterion 
 [Shpitser et al. ’10; Perkovic et al. ’15,’18]

  • bs.

Selection Backdoor 
 [Bareinboim, Tian and Pearl ’14]

  • bs.

Generalized Adjustment Criterion 
 [Correa, Tian and Bareinboim ’18]

  • bs.

✔ ✔

slide-57
SLIDE 57

Related Work

11

confounding type of input selection bias transportability complete

Backdoor Criterion [Pearl ’93]
 Extended Backdoor [Pearl and Paz ’10]

  • bs.

Adjustment Criterion 
 [Shpitser et al. ’10; Perkovic et al. ’15,’18]

  • bs.

Selection Backdoor 
 [Bareinboim, Tian and Pearl ’14]

  • bs.

Generalized Adjustment Criterion 
 [Correa, Tian and Bareinboim ’18]

  • bs.

✔ ✔

st-Adjustment Criterion 
 [Correa, Tian and Bareinboim ’19]

— exp. ✔ ✔ ✔

slide-58
SLIDE 58

Solution: Covariate st-Adjustment

  • Strategy: Recalibrate the results from experiments in the the studied population using
  • bservations from the target population.

12

slide-59
SLIDE 59

Solution: Covariate st-Adjustment

  • Strategy: Recalibrate the results from experiments in the the studied population using
  • bservations from the target population.

P*(y|do(x))

12

slide-60
SLIDE 60

Solution: Covariate st-Adjustment

  • Strategy: Recalibrate the results from experiments in the the studied population using
  • bservations from the target population.

unbiased target 
 effect in 𝛒*

P*(y|do(x))

12

slide-61
SLIDE 61

Solution: Covariate st-Adjustment

  • Strategy: Recalibrate the results from experiments in the the studied population using
  • bservations from the target population.

= ∑

z

P(y|do(x), z, S = 1)P*(z)

unbiased target 
 effect in 𝛒*

P*(y|do(x))

12

slide-62
SLIDE 62

Solution: Covariate st-Adjustment

  • Strategy: Recalibrate the results from experiments in the the studied population using
  • bservations from the target population.

= ∑

z

P(y|do(x), z, S = 1)P*(z)

unbiased target 
 effect in 𝛒* experiment results
 in source domain 𝛒

P*(y|do(x))

12

slide-63
SLIDE 63

Solution: Covariate st-Adjustment

  • Strategy: Recalibrate the results from experiments in the the studied population using
  • bservations from the target population.

= ∑

z

P(y|do(x), z, S = 1)P*(z)

unbiased target 
 effect in 𝛒* experiment results
 in source domain 𝛒

  • bservations from 


the target domain 𝛒*

P*(y|do(x))

12

slide-64
SLIDE 64

Solution: Covariate st-Adjustment

  • Strategy: Recalibrate the results from experiments in the the studied population using
  • bservations from the target population.

= ∑

z

P(y|do(x), z, S = 1)P*(z)

unbiased target 
 effect in 𝛒* experiment results
 in source domain 𝛒

  • bservations from 


the target domain 𝛒*

P*(y|do(x))

12

  • Questions:
slide-65
SLIDE 65

Solution: Covariate st-Adjustment

  • Strategy: Recalibrate the results from experiments in the the studied population using
  • bservations from the target population.

= ∑

z

P(y|do(x), z, S = 1)P*(z)

unbiased target 
 effect in 𝛒* experiment results
 in source domain 𝛒

  • bservations from 


the target domain 𝛒*

P*(y|do(x))

12

  • Questions:
  • 1. How to determine if st-adjustment holds for a set of covariates Z?
slide-66
SLIDE 66

Solution: Covariate st-Adjustment

  • Strategy: Recalibrate the results from experiments in the the studied population using
  • bservations from the target population.

= ∑

z

P(y|do(x), z, S = 1)P*(z)

unbiased target 
 effect in 𝛒* experiment results
 in source domain 𝛒

  • bservations from 


the target domain 𝛒*

P*(y|do(x))

12

  • Questions:
  • 1. How to determine if st-adjustment holds for a set of covariates Z?
  • 2. How to find admissible covariate sets?
slide-67
SLIDE 67

Challenge I. Covariate Admissibility

13

slide-68
SLIDE 68

Challenge I. Covariate Admissibility

  • In general, adjusting for some variables that are affected by the treatment could introduce

more bias, instead of controlling for the current, existent ones.

13

slide-69
SLIDE 69

Challenge I. Covariate Admissibility

  • In general, adjusting for some variables that are affected by the treatment could introduce

more bias, instead of controlling for the current, existent ones.

  • In our setting, in particular, special attention needs to be paid to these covariates (affected

by the treatment) that are correlated with the outcome given pre-treatment covariates.

13

slide-70
SLIDE 70

Challenge I. Covariate Admissibility

  • In general, adjusting for some variables that are affected by the treatment could introduce

more bias, instead of controlling for the current, existent ones.

  • In our setting, in particular, special attention needs to be paid to these covariates (affected

by the treatment) that are correlated with the outcome given pre-treatment covariates.

13

  • Let’s call this set Zp.
slide-71
SLIDE 71

Challenge I. Covariate Admissibility

  • In general, adjusting for some variables that are affected by the treatment could introduce

more bias, instead of controlling for the current, existent ones.

  • In our setting, in particular, special attention needs to be paid to these covariates (affected

by the treatment) that are correlated with the outcome given pre-treatment covariates.

X Y Z3 Z1 S T Z2

13

  • Let’s call this set Zp.
slide-72
SLIDE 72

Challenge I. Covariate Admissibility

  • In general, adjusting for some variables that are affected by the treatment could introduce

more bias, instead of controlling for the current, existent ones.

  • In our setting, in particular, special attention needs to be paid to these covariates (affected

by the treatment) that are correlated with the outcome given pre-treatment covariates.

X Y Z3 Z1 S T Z2

13

  • Let’s call this set Zp.
  • For example if adjusting for Z = {Z1, Z2, Z3} in this model

Zp = {Z3}.

slide-73
SLIDE 73

Main Result I: 
 Complete Graphical Condition

14

slide-74
SLIDE 74

Main Result I: 
 Complete Graphical Condition

A set of covariates Z is admissible for st-adjustment in D relative to treatment X and

  • utcome Y if:

14

slide-75
SLIDE 75

Main Result I: 
 Complete Graphical Condition

A set of covariates Z is admissible for st-adjustment in D relative to treatment X and

  • utcome Y if:

(i) Variables in Zp are independent of the treatment given all other covariates, and

14

slide-76
SLIDE 76

Main Result I: 
 Complete Graphical Condition

A set of covariates Z is admissible for st-adjustment in D relative to treatment X and

  • utcome Y if:

(i) Variables in Zp are independent of the treatment given all other covariates, and (ii) The outcome Y is independent of all the transportability (T) and selection bias nodes (S) given the covariates Z and the treatment X.

14

slide-77
SLIDE 77

Main Result I: 
 Complete Graphical Condition

A set of covariates Z is admissible for st-adjustment in D relative to treatment X and

  • utcome Y if:

(i) Variables in Zp are independent of the treatment given all other covariates, and (ii) The outcome Y is independent of all the transportability (T) and selection bias nodes (S) given the covariates Z and the treatment X.

  • Thm. The causal effect P*(y | do(x)) is identifiable by st-adjustment on a set Z with D if and
  • nly if the conditions above hold for Z relative to X and Y.

14

slide-78
SLIDE 78

Understanding the criterion

15

slide-79
SLIDE 79

Understanding the criterion

Task: Compute P*(y | do(x))

X Y Z3 Z1 S T Z2

15

slide-80
SLIDE 80

Understanding the criterion

Task: Compute P*(y | do(x))

  • The outcome Y is affected by differences in the distribution of Z1

between the source and target domains.

X Y Z3 Z1 S T Z2

15

X Y Z3 Z1 S T Z2

slide-81
SLIDE 81

Understanding the criterion

Task: Compute P*(y | do(x))

  • The outcome Y is affected by differences in the distribution of Z1

between the source and target domains.

  • The variable Z3 affects the likelihood of units being sampled.

X Y Z3 Z1 S T Z2

15

X Y Z3 Z1 S T Z2 X Y Z3 Z1 S T Z2

slide-82
SLIDE 82

Understanding the criterion

Task: Compute P*(y | do(x))

  • The outcome Y is affected by differences in the distribution of Z1

between the source and target domains.

  • The variable Z3 affects the likelihood of units being sampled.
  • If we adjust for Z3 to control for selection bias, we introduce

spurious correlation. Hence, we should also control for Z2.

X Y Z3 Z1 S T Z2

15

X Y Z3 Z1 S T Z2 X Y Z3 Z1 S T Z2 X Y Z3 Z1 S T Z2

slide-83
SLIDE 83

Understanding the criterion

Task: Compute P*(y | do(x))

  • The outcome Y is affected by differences in the distribution of Z1

between the source and target domains.

  • The variable Z3 affects the likelihood of units being sampled.
  • If we adjust for Z3 to control for selection bias, we introduce

spurious correlation. Hence, we should also control for Z2.

X Y Z3 Z1 S T Z2

15

X Y Z3 Z1 S T Z2 X Y Z3 Z1 S T Z2 X Y Z3 Z1 S T Z2 X Y Z3 Z1 S T Z2

slide-84
SLIDE 84

Getting the intuition behind the rules
 Example

X Y Z3 Z1 S T Z2

16

slide-85
SLIDE 85

Getting the intuition behind the rules
 Example

By making Z ={Z1, Z2, Z3}, we can verify the st-adjustment conditions, i.e.:

X Y Z3 Z1 S T Z2

16

slide-86
SLIDE 86

Getting the intuition behind the rules
 Example

By making Z ={Z1, Z2, Z3}, we can verify the st-adjustment conditions, i.e.: (i) The variable in Zp={Z3} is independent of X given the other covariates {Z1, Z2}.

X Y Z3 Z1 S T Z2

16

slide-87
SLIDE 87

Getting the intuition behind the rules
 Example

By making Z ={Z1, Z2, Z3}, we can verify the st-adjustment conditions, i.e.: (i) The variable in Zp={Z3} is independent of X given the other covariates {Z1, Z2}. (ii) The outcome Y is independent of S and T given Z.

X Y Z3 Z1 S T Z2

16

slide-88
SLIDE 88

Getting the intuition behind the rules
 Example

By making Z ={Z1, Z2, Z3}, we can verify the st-adjustment conditions, i.e.: (i) The variable in Zp={Z3} is independent of X given the other covariates {Z1, Z2}. (ii) The outcome Y is independent of S and T given Z.

X Y Z3 Z1 S T Z2

16

P*(y|do(x)) = ∑

z1,z2,z3

P(y|do(x), z1, z2, z3, S = 1)P*(z1, z2, z3)

Hence, the st-adjustment is guaranteed to hold, i.e.:

slide-89
SLIDE 89

Getting the intuition behind the rules
 Example

By making Z ={Z1, Z2, Z3}, we can verify the st-adjustment conditions, i.e.: (i) The variable in Zp={Z3} is independent of X given the other covariates {Z1, Z2}. (ii) The outcome Y is independent of S and T given Z.

X Y Z3 Z1 S T Z2

16

P*(y|do(x)) = ∑

z1,z2,z3

P(y|do(x), z1, z2, z3, S = 1)P*(z1, z2, z3)

Hence, the st-adjustment is guaranteed to hold, i.e.:

measurements from 
 the target domain experimental data from the 
 source under selection bias causal effect 
 in target domain

slide-90
SLIDE 90

Challenge II. 
 Searching for Admissible Sets

17

slide-91
SLIDE 91

Challenge II. 
 Searching for Admissible Sets

  • Given a candidate set Z, we have a condition to determine if it is admissible or not.

17

slide-92
SLIDE 92

Challenge II. 
 Searching for Admissible Sets

  • Given a candidate set Z, we have a condition to determine if it is admissible or not.
  • The natural question that follows is how to find an admissible set without resorting to

trial and error. There could be exponentially many candidates (and even valid ones!).

17

slide-93
SLIDE 93

Challenge II. 
 Searching for Admissible Sets

  • Given a candidate set Z, we have a condition to determine if it is admissible or not.
  • The natural question that follows is how to find an admissible set without resorting to

trial and error. There could be exponentially many candidates (and even valid ones!).

  • How to determine the existence of at least one admissible set?

17

slide-94
SLIDE 94

Challenge II. 
 Searching for Admissible Sets

  • Given a candidate set Z, we have a condition to determine if it is admissible or not.
  • The natural question that follows is how to find an admissible set without resorting to

trial and error. There could be exponentially many candidates (and even valid ones!).

  • How to determine the existence of at least one admissible set?
  • There are sets that could be preferred among other admissible ones due to certain

properties (e.g., cost, variance).

17

slide-95
SLIDE 95

Main Result II: Listing Algorithm

18

slide-96
SLIDE 96

Main Result II: Listing Algorithm

18

X Y B E S T

Selection Diagram D

slide-97
SLIDE 97

Main Result II: Listing Algorithm

18

X Y B E S T

Selection Diagram D

P(v|do(x), S = 1)

Selection-biased Exp.
 Distribution from 𝛒

slide-98
SLIDE 98

Main Result II: Listing Algorithm

18

X Y B E S T

Selection Diagram D

P(v|do(x), S = 1)

Selection-biased Exp.
 Distribution from 𝛒 Set W of covariates 
 measurable in 𝛒*

slide-99
SLIDE 99

Main Result II: Listing Algorithm

18

X Y B E S T

Selection Diagram D

What are all the admissible sets satisfying st-adjustment?

P(v|do(x), S = 1)

Selection-biased Exp.
 Distribution from 𝛒 Set W of covariates 
 measurable in 𝛒*

slide-100
SLIDE 100

Main Result II: Listing Algorithm

18

X Y B E S T

Selection Diagram D

What are all the admissible sets satisfying st-adjustment?

P(v|do(x), S = 1)

Selection-biased Exp.
 Distribution from 𝛒 Set W of covariates 
 measurable in 𝛒*

List of of sets
 
 such that for 
 each Zi:

Z1, Z2, … ⊆ W

P*(y|do(x)) = ∑

zi

P(y|do(x), zi, S = 1)P*(zi)

slide-101
SLIDE 101

Main Result II: Listing Algorithm

18

X Y B E S T

Selection Diagram D

What are all the admissible sets satisfying st-adjustment?

P(v|do(x), S = 1)

Selection-biased Exp.
 Distribution from 𝛒 Set W of covariates 
 measurable in 𝛒*

List of of sets
 
 such that for 
 each Zi:

Z1, Z2, … ⊆ W

P*(y|do(x)) = ∑

zi

P(y|do(x), zi, S = 1)P*(zi)

We provide an algorithm (Alg. 2) that works with polynomial delay (Thm. 6)

slide-102
SLIDE 102

Conclusions

19

slide-103
SLIDE 103

Conclusions

  • Given a selection diagram, we describe complete conditions to determine whether

adjusting by a given set of covariates is admissible for the identification of causal effects from experimental results in a source domain and some observations from the target domain.

19

slide-104
SLIDE 104

Conclusions

  • Given a selection diagram, we describe complete conditions to determine whether

adjusting by a given set of covariates is admissible for the identification of causal effects from experimental results in a source domain and some observations from the target domain.

  • We provide a procedure to list valid adjustment sets given a set of variables that can be

measured.

19

slide-105
SLIDE 105

Conclusions

  • Given a selection diagram, we describe complete conditions to determine whether

adjusting by a given set of covariates is admissible for the identification of causal effects from experimental results in a source domain and some observations from the target domain.

  • We provide a procedure to list valid adjustment sets given a set of variables that can be

measured.

  • We hope the formal and transparent dressing given to the problem by our results can

help researchers in health sciences, econometrics, reinforcement learning, marketing and others where extrapolating experimental results is crucial.

19

slide-106
SLIDE 106

Thank you!

20

#76

Poster

slide-107
SLIDE 107

Polynomial delay

  • Time passing between the start of the execution and first output or failure is polynomial.
  • Time between outputs is also polynomial.

21

[Takata ’10]