A Calculus for Stochastic Interventions: Causal Effect - - PowerPoint PPT Presentation

a calculus for stochastic interventions causal effect
SMART_READER_LITE
LIVE PREVIEW

A Calculus for Stochastic Interventions: Causal Effect - - PowerPoint PPT Presentation

A Calculus for Stochastic Interventions: Causal Effect Identification and Surrogate Experiments Juan D. Correa and Elias Bareinboim {jdcorrea, eb}@cs.columbia.edu February, 2020, New York 1 Outline 2 Outline Hard/atomic interventions


slide-1
SLIDE 1

A Calculus for Stochastic Interventions: Causal Effect Identification and Surrogate Experiments

Juan D. Correa and Elias Bareinboim

{jdcorrea, eb}@cs.columbia.edu

1

February, 2020, New York

slide-2
SLIDE 2

Outline

2

slide-3
SLIDE 3

Outline

  • Hard/atomic interventions vs. Soft/non-atomic interventions
  • Graphical representation
  • Inferences rules for soft interventions (σ-calculus)
  • Imperfect surrogate experiments
  • Conclusions

2

slide-4
SLIDE 4

Motivating example

3

slide-5
SLIDE 5

Motivating example

  • Consider a tutoring program in place at a certain school.

3

slide-6
SLIDE 6

Motivating example

  • Consider a tutoring program in place at a certain school.
  • For each student, we observe the GPA at the beginning of the term, their

motivation (low, high), whether they got tutoring or not, and their GPA at the end.

3

slide-7
SLIDE 7

Motivating example

  • Consider a tutoring program in place at a certain school.
  • For each student, we observe the GPA at the beginning of the term, their

motivation (low, high), whether they got tutoring or not, and their GPA at the end.

3

W

(previous GPA)

Z

(motivation)

  • Motivation depends (among other not observed

factors) on the previous GPA.

slide-8
SLIDE 8

Motivating example

  • Consider a tutoring program in place at a certain school.
  • For each student, we observe the GPA at the beginning of the term, their

motivation (low, high), whether they got tutoring or not, and their GPA at the end.

3

X W

(tutoring) (previous GPA)

Z

(motivation)

  • Motivation depends (among other not observed

factors) on the previous GPA.

  • Students get tutoring depending on their

motivation.

slide-9
SLIDE 9

Motivating example

  • Consider a tutoring program in place at a certain school.
  • For each student, we observe the GPA at the beginning of the term, their

motivation (low, high), whether they got tutoring or not, and their GPA at the end.

3

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

  • Motivation depends (among other not observed

factors) on the previous GPA.

  • Students get tutoring depending on their

motivation.

  • The GPA at the end of the term is a function of the

previous GPA, student’s motivation and getting tutoring or not.

slide-10
SLIDE 10

Motivating example

  • Consider a tutoring program in place at a certain school.
  • For each student, we observe the GPA at the beginning of the term, their

motivation (low, high), whether they got tutoring or not, and their GPA at the end.

3

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) Natural (current) Regime

G

  • Motivation depends (among other not observed

factors) on the previous GPA.

  • Students get tutoring depending on their

motivation.

  • The GPA at the end of the term is a function of the

previous GPA, student’s motivation and getting tutoring or not.

slide-11
SLIDE 11

4

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) Natural (current) Regime

𝒣

Motivating example

slide-12
SLIDE 12

4

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) Natural (current) Regime

𝒣

  • Using machine learning, and with enough data, a

students GPA can be predicted with small error given other features i.e., P(y | w, z, x).

Motivating example

slide-13
SLIDE 13

4

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) Natural (current) Regime

𝒣

  • Using machine learning, and with enough data, a

students GPA can be predicted with small error given other features i.e., P(y | w, z, x).

  • This distribution is a model that reflects the current/

natural regime, but we are interested in taking decisions to improve the students GPA.

Motivating example

slide-14
SLIDE 14

4

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) Natural (current) Regime

𝒣

  • Using machine learning, and with enough data, a

students GPA can be predicted with small error given other features i.e., P(y | w, z, x).

  • This distribution is a model that reflects the current/

natural regime, but we are interested in taking decisions to improve the students GPA.

  • Taking decisions amount to intervening the current
  • regime. Hence, we are interested in predicting

student’s GPA receiving tutoring in a hypothetical (unrealized) reality.

Motivating example

slide-15
SLIDE 15

4

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) Natural (current) Regime

𝒣

  • Using machine learning, and with enough data, a

students GPA can be predicted with small error given other features i.e., P(y | w, z, x).

  • This distribution is a model that reflects the current/

natural regime, but we are interested in taking decisions to improve the students GPA.

  • Taking decisions amount to intervening the current
  • regime. Hence, we are interested in predicting

student’s GPA receiving tutoring in a hypothetical (unrealized) reality.

  • This is a causal inference question!

Motivating example

slide-16
SLIDE 16

Some types of Interventions

5

slide-17
SLIDE 17

Some types of Interventions

  • Hard/atomic: σX=do(X=x) set variable X to a constant value x. 


(Pearl’s original treatment considered mostly this intervention. )

5

slide-18
SLIDE 18

Some types of Interventions

  • Hard/atomic: σX=do(X=x) set variable X to a constant value x. 


(Pearl’s original treatment considered mostly this intervention. )

  • Every student gets tutoring.

5

slide-19
SLIDE 19

Some types of Interventions

  • Hard/atomic: σX=do(X=x) set variable X to a constant value x. 


(Pearl’s original treatment considered mostly this intervention. )

  • Every student gets tutoring.
  • Conditional: σX=g(w) sets the variable X to output the result of a function g that depends
  • n a set of observable variables W.

5

slide-20
SLIDE 20

Some types of Interventions

  • Hard/atomic: σX=do(X=x) set variable X to a constant value x. 


(Pearl’s original treatment considered mostly this intervention. )

  • Every student gets tutoring.
  • Conditional: σX=g(w) sets the variable X to output the result of a function g that depends
  • n a set of observable variables W.
  • Students get tutoring if and only if they have a low GPA.

5

slide-21
SLIDE 21

Some types of Interventions

  • Hard/atomic: σX=do(X=x) set variable X to a constant value x. 


(Pearl’s original treatment considered mostly this intervention. )

  • Every student gets tutoring.
  • Conditional: σX=g(w) sets the variable X to output the result of a function g that depends
  • n a set of observable variables W.
  • Students get tutoring if and only if they have a low GPA.
  • Stochastic: σX=P*(x|w) sets the variable X to follow a given probability distribution

conditional on a set of variables W.

5

slide-22
SLIDE 22

Some types of Interventions

  • Hard/atomic: σX=do(X=x) set variable X to a constant value x. 


(Pearl’s original treatment considered mostly this intervention. )

  • Every student gets tutoring.
  • Conditional: σX=g(w) sets the variable X to output the result of a function g that depends
  • n a set of observable variables W.
  • Students get tutoring if and only if they have a low GPA.
  • Stochastic: σX=P*(x|w) sets the variable X to follow a given probability distribution

conditional on a set of variables W.

  • Students with low GPA enter a raffle for 80% of the spots, other interested students

enter for the remaining 20%.

5

slide-23
SLIDE 23

Hard/Atomic Interventions

6

slide-24
SLIDE 24

Hard/Atomic Interventions

  • What if we make tutoring mandatory for every student?

6

slide-25
SLIDE 25

Hard/Atomic Interventions

  • What if we make tutoring mandatory for every student?

6

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

slide-26
SLIDE 26

Hard/Atomic Interventions

  • What if we make tutoring mandatory for every student?

6

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) Natural (current) Regime

𝒣

slide-27
SLIDE 27

Hard/Atomic Interventions

  • What if we make tutoring mandatory for every student?

6

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) Intervention do(X = 1)

Make tutoring mandatory for all students.

Natural (current) Regime

𝒣

slide-28
SLIDE 28

Hard/Atomic Interventions

  • What if we make tutoring mandatory for every student?

6

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

X=1 Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) Intervention do(X = 1)

Make tutoring mandatory for all students.

Natural (current) Regime

𝒣

slide-29
SLIDE 29

Hard/Atomic Interventions

  • What if we make tutoring mandatory for every student?

6

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

X=1 Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) Intervention do(X = 1)

Make tutoring mandatory for all students.

Natural (current) Regime Intervened (hypothesized) Regime

𝒣 𝒣X

slide-30
SLIDE 30

Hard/Atomic Interventions

  • What if we make tutoring mandatory for every student?

6

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

X=1 Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) Intervention do(X = 1)

Make tutoring mandatory for all students.

Instead of P(y | X=1) we are reasoning about P(y | do(X=1)), or, more generally, P(y; σX=do(X=1))

Natural (current) Regime Intervened (hypothesized) Regime

𝒣 𝒣X

slide-31
SLIDE 31

Soft Interventions

7

slide-32
SLIDE 32

Soft Interventions

  • A more realistic, or interesting, type of intervention is, for example, to consider the

effect of making tutoring mandatory for students with historically low GPA and

  • nly to them, on their current GPA.

7

slide-33
SLIDE 33

Soft Interventions

  • A more realistic, or interesting, type of intervention is, for example, to consider the

effect of making tutoring mandatory for students with historically low GPA and

  • nly to them, on their current GPA.

7

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

slide-34
SLIDE 34

Soft Interventions

  • A more realistic, or interesting, type of intervention is, for example, to consider the

effect of making tutoring mandatory for students with historically low GPA and

  • nly to them, on their current GPA.

7

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) Natural (current) Regime

𝒣

slide-35
SLIDE 35

Soft Interventions

  • A more realistic, or interesting, type of intervention is, for example, to consider the

effect of making tutoring mandatory for students with historically low GPA and

  • nly to them, on their current GPA.

7

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) Intervention σX = 1[W = 1]

Assign tutoring only to students with low GPA.

Natural (current) Regime

𝒣

slide-36
SLIDE 36

Soft Interventions

  • A more realistic, or interesting, type of intervention is, for example, to consider the

effect of making tutoring mandatory for students with historically low GPA and

  • nly to them, on their current GPA.

7

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) σX Intervention σX = 1[W = 1]

Assign tutoring only to students with low GPA.

Natural (current) Regime

𝒣

slide-37
SLIDE 37

Soft Interventions

  • A more realistic, or interesting, type of intervention is, for example, to consider the

effect of making tutoring mandatory for students with historically low GPA and

  • nly to them, on their current GPA.

7

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) σX Intervention σX = 1[W = 1]

Assign tutoring only to students with low GPA.

Natural (current) Regime Intervened (hypothesized) Regime

𝒣σX 𝒣

slide-38
SLIDE 38

Soft Interventions

  • A more realistic, or interesting, type of intervention is, for example, to consider the

effect of making tutoring mandatory for students with historically low GPA and

  • nly to them, on their current GPA.

7

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation) σX Intervention σX = 1[W = 1]

Assign tutoring only to students with low GPA.

P(y ; )

σX

Natural (current) Regime Intervened (hypothesized) Regime

𝒣σX 𝒣

slide-39
SLIDE 39

σ-calculus (simplified)

8

slide-40
SLIDE 40

σ-calculus (simplified)

  • Insertion/deletion of observations:

8

P(y ∣ w, t; σX) = P(y ∣ w; σX) (Y ⊥ T ∣ W) 𝒣σX

if in

slide-41
SLIDE 41

σ-calculus (simplified)

  • Insertion/deletion of observations:
  • Change of regimes under observation:

8

P(y ∣ w, t; σX) = P(y ∣ w; σX) (Y ⊥ T ∣ W) 𝒣σX

if in

P(y ∣ x, w; σX) = P(y ∣ x, w) (Y ⊥ Z ∣ W) 𝒣σXX 𝒣X

if in and

slide-42
SLIDE 42

σ-calculus (simplified)

  • Insertion/deletion of observations:
  • Change of regimes under observation:
  • Change of regimes without observations:

8

P(y ∣ w, t; σX) = P(y ∣ w; σX) (Y ⊥ T ∣ W) 𝒣σX

if in

P(y ∣ x, w; σX) = P(y ∣ x, w) (Y ⊥ Z ∣ W) 𝒣σXX 𝒣X

if in and

P(y ∣ w; σX) = P(y ∣ w) (Y ⊥ Z ∣ W) 𝒣σXX(W) 𝒣X(W)

if in and

slide-43
SLIDE 43

Using σ-calculus

9

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

𝒣

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

σX

𝒣σX

P(y; σX) = ∑

w,z

P(y|x, w, z; σX)P(x ∣ w, z; σX)P(w, z; σX)

slide-44
SLIDE 44

Using σ-calculus

9

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

𝒣

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

σX

𝒣σX

P(y; σX) = ∑

w,z

P(y|x, w, z; σX)P(x ∣ w, z; σX)P(w, z; σX) = ∑

w,z

P(y|w, z)P(x ∣ w; σX)P(w, z; σX) P(x ∣ w; σX)

Rule 1 (X ⊥ Z ∣ W)

𝒣σX

in

slide-45
SLIDE 45

Using σ-calculus

9

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

𝒣

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

σX

𝒣σX

P(y; σX) = ∑

w,z

P(y|x, w, z; σX)P(x ∣ w, z; σX)P(w, z; σX) = ∑

w,z

P(y|w, z)P(x ∣ w; σX)P(w, z; σX)

Rule 1 (X ⊥ Z ∣ W)

𝒣σX

in

slide-46
SLIDE 46

Using σ-calculus

9

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

𝒣

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

σX

𝒣σX

P(y; σX) = ∑

w,z

P(y|x, w, z; σX)P(x ∣ w, z; σX)P(w, z; σX) = ∑

w,z

P(y|w, z)P(x ∣ w; σX)P(w, z; σX)

Rule 1 (X ⊥ Z ∣ W)

𝒣σX

in

= ∑

w,z

P(y|x, w, z)P(x ∣ w, z; σX)P(w, z; σX) P(y|x, w, z)

Rule 2 (Y ⊥ X ∣ W, Z)

𝒣σXX 𝒣X

in and

slide-47
SLIDE 47

Using σ-calculus

9

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

𝒣

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

σX

𝒣σX

P(y; σX) = ∑

w,z

P(y|x, w, z; σX)P(x ∣ w, z; σX)P(w, z; σX) = ∑

w,z

P(y|w, z)P(x ∣ w; σX)P(w, z; σX)

Rule 1 (X ⊥ Z ∣ W)

𝒣σX

in

= ∑

w,z

P(y|x, w, z)P(x ∣ w, z; σX)P(w, z; σX)

Rule 2 (Y ⊥ X ∣ W, Z)

𝒣σXX 𝒣X

in and

slide-48
SLIDE 48

Using σ-calculus

9

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

𝒣

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

σX

𝒣σX

P(y; σX) = ∑

w,z

P(y|x, w, z; σX)P(x ∣ w, z; σX)P(w, z; σX) = ∑

w,z

P(y|w, z)P(x ∣ w; σX)P(w, z; σX)

Rule 1 (X ⊥ Z ∣ W)

𝒣σX

in

= ∑

w,z

P(y|x, w, z)P(x ∣ w, z; σX)P(w, z; σX)

Rule 2 (Y ⊥ X ∣ W, Z)

𝒣σXX 𝒣X

in and

= ∑

w,z

P(y|x, w, z)P(x ∣ w; σX)P(w, z) P(w, z)

Rule 3 (W, Z ⊥ X)

𝒣σXX 𝒣X

in and

slide-49
SLIDE 49

Using σ-calculus

9

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

𝒣

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

σX

𝒣σX

P(y; σX) = ∑

w,z

P(y|x, w, z; σX)P(x ∣ w, z; σX)P(w, z; σX) = ∑

w,z

P(y|w, z)P(x ∣ w; σX)P(w, z; σX)

Rule 1 (X ⊥ Z ∣ W)

𝒣σX

in

= ∑

w,z

P(y|x, w, z)P(x ∣ w, z; σX)P(w, z; σX)

Rule 2 (Y ⊥ X ∣ W, Z)

𝒣σXX 𝒣X

in and

= ∑

w,z

P(y|x, w, z)P(x ∣ w; σX)P(w, z)

Rule 3 (W, Z ⊥ X)

𝒣σXX 𝒣X

in and

slide-50
SLIDE 50

Using σ-calculus

9

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

𝒣

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

σX

𝒣σX

P(y; σX) = ∑

w,z

P(y|x, w, z; σX)P(x ∣ w, z; σX)P(w, z; σX) = ∑

w,z

P(y|w, z)P(x ∣ w; σX)P(w, z; σX)

Rule 1 (X ⊥ Z ∣ W)

𝒣σX

in

= ∑

w,z

P(y|x, w, z)P(x ∣ w, z; σX)P(w, z; σX)

Rule 2 (Y ⊥ X ∣ W, Z)

𝒣σXX 𝒣X

in and

= ∑

w,z

P(y|x, w, z)P(x ∣ w; σX)P(w, z)

Rule 3 (W, Z ⊥ X)

𝒣σXX 𝒣X

in and Defined by σX

slide-51
SLIDE 51

Using σ-calculus

9

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

𝒣

X Y W

(tutoring) (GPA) (previous GPA)

Z

(motivation)

σX

𝒣σX

P(y; σX) = ∑

w,z

P(y|x, w, z; σX)P(x ∣ w, z; σX)P(w, z; σX) = ∑

w,z

P(y|w, z)P(x ∣ w; σX)P(w, z; σX)

Rule 1 (X ⊥ Z ∣ W)

𝒣σX

in

= ∑

w,z

P(y|x, w, z)P(x ∣ w, z; σX)P(w, z; σX)

Rule 2 (Y ⊥ X ∣ W, Z)

𝒣σXX 𝒣X

in and

= ∑

w,z

P(y|x, w, z)P(x ∣ w; σX)P(w, z)

Rule 3 (W, Z ⊥ X)

𝒣σXX 𝒣X

in and Defined by σX Estimable from current regime

slide-52
SLIDE 52

Surrogate Experiments

10

slide-53
SLIDE 53

Surrogate Experiments

  • It’s not uncommon that the effect of a certain

intervention is not identifiable (not uniquely computable) from observational data alone whenever unobserved confounders are present.

10

Identifiable?

Input: { P(v) } Query: { P(y; σX) } No

𝒣

slide-54
SLIDE 54

Surrogate Experiments

  • It’s not uncommon that the effect of a certain

intervention is not identifiable (not uniquely computable) from observational data alone whenever unobserved confounders are present.

  • Experiments over a set of surrogate variables Z

may be more accessible to manipulation than the target effect σX, e.g., randomizing diet vs randomizing cholesterol.

10

Identifiable?

Input: { P(v) } Query: { P(y; σX) } No

𝒣

slide-55
SLIDE 55

Surrogate Experiments

  • It’s not uncommon that the effect of a certain

intervention is not identifiable (not uniquely computable) from observational data alone whenever unobserved confounders are present.

  • Experiments over a set of surrogate variables Z

may be more accessible to manipulation than the target effect σX, e.g., randomizing diet vs randomizing cholesterol.

  • Those surrogate experiments can be leveraged to

identify the effect of the interventions of interest.

10

Identifiable?

Input: { P(v) } Query: { P(y; σX) } No

𝒣

Identifiable?

Input: { P(v), P(v;σZ1), P(v;σZ2), … } Query: { P(y; σX) } Yes

𝒣

slide-56
SLIDE 56

Surrogate Experiments

11

W Y Z R X

slide-57
SLIDE 57

Surrogate Experiments

  • Input: {P(v), P(v | σZ=P*(Z|X))}

11

W Y Z R X

slide-58
SLIDE 58

Surrogate Experiments

  • Input: {P(v), P(v | σZ=P*(Z|X))}
  • Query: P(y | σX=P*(X|R))

11

W Y Z R X

slide-59
SLIDE 59

Surrogate Experiments

  • Input: {P(v), P(v | σZ=P*(Z|X))}
  • Query: P(y | σX=P*(X|R))

11

W Y Z R X

W Y Z R X

σX

W Y Z R X

σZ

slide-60
SLIDE 60

Surrogate Experiments

  • Input: {P(v), P(v | σZ=P*(Z|X))}
  • Query: P(y | σX=P*(X|R))
  • Not identifiable from P(v) alone, but

11

W Y Z R X

W Y Z R X

σX

W Y Z R X

σZ

slide-61
SLIDE 61

Surrogate Experiments

  • Input: {P(v), P(v | σZ=P*(Z|X))}
  • Query: P(y | σX=P*(X|R))
  • Not identifiable from P(v) alone, but

11

W Y Z R X

P(y; σX)

W Y Z R X

σX

W Y Z R X

σZ

slide-62
SLIDE 62

Surrogate Experiments

  • Input: {P(v), P(v | σZ=P*(Z|X))}
  • Query: P(y | σX=P*(X|R))
  • Not identifiable from P(v) alone, but

11

W Y Z R X

P(y; σX)

W Y Z R X

σX

W Y Z R X

σZ

= ∑

r,w,x,z

P(r)P(x ∣ r; σX)P(z ∣ r, x, w)P(w ∣ r)∑

x′

P(y ∣ r, x′ , z; σZ)P(x′ ∣ r)

slide-63
SLIDE 63

Surrogate Experiments

  • Input: {P(v), P(v | σZ=P*(Z|X))}
  • Query: P(y | σX=P*(X|R))
  • Not identifiable from P(v) alone, but

11

W Y Z R X

P(y; σX)

W Y Z R X

σX

W Y Z R X

σZ

From surrogate
 experiment

= ∑

r,w,x,z

P(r)P(x ∣ r; σX)P(z ∣ r, x, w)P(w ∣ r)∑

x′

P(y ∣ r, x′ , z; σZ)P(x′ ∣ r)

slide-64
SLIDE 64

Surrogate Experiments

  • Input: {P(v), P(v | σZ=P*(Z|X))}
  • Query: P(y | σX=P*(X|R))
  • Not identifiable from P(v) alone, but

11

W Y Z R X

P(y; σX)

W Y Z R X

σX

W Y Z R X

σZ

Natural regime From surrogate
 experiment

= ∑

r,w,x,z

P(r)P(x ∣ r; σX)P(z ∣ r, x, w)P(w ∣ r)∑

x′

P(y ∣ r, x′ , z; σZ)P(x′ ∣ r)

slide-65
SLIDE 65

Surrogate Experiments

  • Input: {P(v), P(v | σZ=P*(Z|X))}
  • Query: P(y | σX=P*(X|R))
  • Not identifiable from P(v) alone, but

11

W Y Z R X

P(y; σX)

W Y Z R X

σX

W Y Z R X

σZ

Natural regime From surrogate
 experiment

= ∑

r,w,x,z

P(r)P(x ∣ r; σX)P(z ∣ r, x, w)P(w ∣ r)∑

x′

P(y ∣ r, x′ , z; σZ)P(x′ ∣ r)

Defined by
 intervention

slide-66
SLIDE 66

Summary of the Results

12

slide-67
SLIDE 67

Summary of the Results

We introduce a set of inference rules called σ-calculus, which generalizes Pearl’s do-calculus, to reason about the effect of general types of

  • interventions. Further, we provide a syntactical method for deriving and

verifying claims about such interventions given a causal graph.

12

1

slide-68
SLIDE 68

Summary of the Results

We introduce a set of inference rules called σ-calculus, which generalizes Pearl’s do-calculus, to reason about the effect of general types of

  • interventions. Further, we provide a syntactical method for deriving and

verifying claims about such interventions given a causal graph. We develop an efficient procedure to determine the identifiability of the (conditional) effect of non-atomic interventions from a combination of

  • bservational and experimental data given a causal diagram.

12

1 2

slide-69
SLIDE 69

Proposed Strategy

13

slide-70
SLIDE 70

Proposed Strategy

Encode qualitative assumptions natural and intervened domain graphically.

13

1

slide-71
SLIDE 71

Proposed Strategy

Encode qualitative assumptions natural and intervened domain graphically.

13

Diagrams annotated with
 nodes. σX

1

slide-72
SLIDE 72

Proposed Strategy

Encode qualitative assumptions natural and intervened domain graphically. Find the mechanisms composing the effect of intervention.

13

Diagrams annotated with
 nodes. σX

1 2

slide-73
SLIDE 73

Proposed Strategy

Encode qualitative assumptions natural and intervened domain graphically. Find the mechanisms composing the effect of intervention. Derive the needed mechanisms from the given distributions.

13

Diagrams annotated with
 nodes. σX

1 2 3

slide-74
SLIDE 74

Proposed Strategy

Encode qualitative assumptions natural and intervened domain graphically. Find the mechanisms composing the effect of intervention. Derive the needed mechanisms from the given distributions. Construct an estimator from the available data.

13

Diagrams annotated with
 nodes. σX

1 2 3 4

slide-75
SLIDE 75

Proposed Strategy

Encode qualitative assumptions natural and intervened domain graphically. Find the mechanisms composing the effect of intervention. Derive the needed mechanisms from the given distributions. Construct an estimator from the available data.

13

Diagrams annotated with
 nodes. σX Use σ-calculus or equivalent algorithmic procedure.

1 2 3 4

slide-76
SLIDE 76

Conclusions

14

slide-77
SLIDE 77

Conclusions

  • σ-calculus allows one to discover and verify, from a causal graph, logical

statements about general interventions suitable to capture real-world situations.

14

slide-78
SLIDE 78

Conclusions

  • σ-calculus allows one to discover and verify, from a causal graph, logical

statements about general interventions suitable to capture real-world situations.

  • These rules can be used to identify the effect of interventions from a combination
  • f observational and experimental data.

14

slide-79
SLIDE 79

Conclusions

  • σ-calculus allows one to discover and verify, from a causal graph, logical

statements about general interventions suitable to capture real-world situations.

  • These rules can be used to identify the effect of interventions from a combination
  • f observational and experimental data.
  • Our algorithm searches for a reduction of the effect of interest to the set of
  • bserved distributions (observational and experimental); if found, it returns a

corresponding mapping expression.

14

slide-80
SLIDE 80

Thank you!

15