Identification of Causal Effect in the Presence of Selection Bias - - PowerPoint PPT Presentation

β–Ά
identification of causal effect in the presence of
SMART_READER_LITE
LIVE PREVIEW

Identification of Causal Effect in the Presence of Selection Bias - - PowerPoint PPT Presentation

Identification of Causal Effect in the Presence of Selection Bias Juan D. Correa Jin Tian Elias Bareinboim AAAI Honolulu, 2019 Challenge 1: Confounding Bias Age Whats the causal effect of Exercise on Cholesterol ? What about


slide-1
SLIDE 1

Identification of Causal Effect in the Presence of Selection Bias

Juan D. Correa Jin Tian Elias Bareinboim

AAAI Honolulu, 2019

slide-2
SLIDE 2

Challenge 1: Confounding Bias

Exercise (Hours) Cholesterol

Age Exercise Cholesterol What’s the causal effect of Exercise on Cholesterol? What about 𝑄 π‘‘β„Žπ‘π‘šπ‘“π‘‘π‘’π‘“π‘ π‘π‘š 𝑓𝑦𝑓𝑠𝑑𝑗𝑑𝑓) ?

2

slide-3
SLIDE 3

Challenge 1: Confounding Bias

Exercise (Hours) Cholesterol

Age 10 Age 20 Age 30 Age 40 Age 50 Age Exercise Cholesterol

3

slide-4
SLIDE 4

Challenge 1: Confounding Bias

Exercise (Hours) Cholesterol

Age 10 Age 20 Age 30 Age 40 Age 50 Age Exercise Cholesterol

𝑄 π‘‘β„Žπ‘π‘šπ‘“π‘‘π‘’π‘“π‘ π‘π‘š 𝑒𝑝(𝑓𝑦𝑓𝑠𝑑𝑗𝑑𝑓)) β‰  𝑄(π‘‘β„Žπ‘π‘šπ‘“π‘‘π‘’π‘“π‘ π‘π‘š | 𝑓𝑦𝑓𝑠𝑑𝑗𝑑𝑓)

This difference is called Confounding Bias

4

slide-5
SLIDE 5

Challenge 2: Selection Bias

Exercise (Hours) Cholesterol

S=1 S=0 Variables in the system affect the inclusion of units in the sample Age Exercise Cholesterol S Fitness

5

slide-6
SLIDE 6

Challenge 2: Selection Bias

Exercise (Hours) Cholesterol

S=1 S=0 Variables in the system affect the inclusion of units in the sample Age Exercise Cholesterol S Fitness

6

𝑄(𝑏𝑕𝑓, 𝑓𝑦, π‘‘β„Ž, 𝑔𝑗𝑒) β‰  𝑄 𝑏𝑕𝑓, 𝑓𝑦, π‘‘β„Ž, 𝑔𝑗𝑒 𝑇 = 1)

This difference is due to Selection Bias

slide-7
SLIDE 7

Current literature

Association = Causation No control

Complete Algorithms

[Tian and Pearl ’02; Huang and Valtorta ’06; Shpitser and Pearl ’06; Bareinboim and Pearl ’12] Controlling Selection Bias [Bareinboim and Pearl ’12] Recovering from Selection Bias in Causal and Statistical Inference [Bareinboim, Tian, Pearl ’14] RCE [Bareinboim, Tian, Pearl ’15] Generalized Adjustment

[Correa, Tian, Bareinboim ’18]

IDSB

[Correa, Tian, Bareinboim ’19]

No Confounding Confounding No Selection Selection

7

slide-8
SLIDE 8

Problem I

Given: Is there a function 𝑔 such that

𝑄 𝒛 𝑒𝑝 π’š = 𝑔(𝑄

;)

?

8

𝒣

𝑇 𝑄(π’˜|𝑇 = 1)

1 … 1 … 1 …

𝑄

Variables

𝒀, 𝒁

slide-9
SLIDE 9

Result 1

Theorem 1: Let 𝒀, 𝒁 βŠ‚ 𝑾 be two disjoint sets of variables and 𝒣 a causal diagram

  • ver 𝑾 and 𝑇. If 𝒁 βŠ₯ 𝑇 𝒣𝒀𝒁

CDE, then 𝑄

π’š(𝒛) is not recoverable from

𝑄(π’˜ | 𝑇 = 1) in 𝒣.

9

slide-10
SLIDE 10

Problem II

Given: Is there a function 𝑔 such that

𝑄 𝒛 𝑒𝑝 π’š = 𝑔(𝑄

;, 𝑄F)

?

10

𝒣

𝑇 𝑄(π’˜|𝑇 = 1)

1 … 1 … 1 …

𝑄(𝒖)

… … …

𝑄

;

𝑄F

Variables

𝒀, 𝒁

slide-11
SLIDE 11

Result II

Algorithm IDSB Given a causal diagram, a selection-biased distribution and external data

  • ver a subset of the variables and the variables of interest (𝒀, 𝒁);

returns an expression for 𝑄

π’š(𝒛) in terms of the input or failure.

Strictly more powerful than the best known algorithm that accepts both biased and unbiased data.

11

slide-12
SLIDE 12

Decomposing the Problem

12

X W1 W2 W3 Y S

Intervention

𝑄

H 𝑧 =

J 𝑄

H(𝑧, π‘₯L, π‘₯F, π‘₯;)

  • NO,NP,NQ

X W1 W2 W3 Y S

slide-13
SLIDE 13

Decomposing the Problem

13

W1 W3 S

C-Components

W2 Y

𝑄

H 𝑧 =

J 𝑄

H(𝑧, π‘₯L, π‘₯F, π‘₯;)

  • NO,NP,NQ

= J 𝑄

H,NO,NQ 𝑧, π‘₯F 𝑄 NP,R π‘₯L, π‘₯;

  • NO,NP,NQ

X W1 W2 W3 Y S

𝑄

H,NO,NQ 𝑧, π‘₯F

𝑄

NP,R π‘₯L, π‘₯;

slide-14
SLIDE 14

Summary

  • 1. Complete characterization recoverable causal effects from

the causal diagram and a selection-biased probability distribution.

  • 2. Sufficient procedure to recover causal effects from a causal

diagram, selection-biased distributions and auxiliary unbiased data which is strictly more powerful than state-of- the-art procedure.

14

Thanks!

slide-15
SLIDE 15

15