Applied Multivariate Statistics – Spring 2012 (not relevant for exam)
Can one extract causal information from high-dimensional observational data?
Can one extract causal information from high-dimensional - - PowerPoint PPT Presentation
Can one extract causal information from high-dimensional observational data? Applied Multivariate Statistics Spring 2012 (not relevant for exam) What is a causal effect? Markus Kalisch, ETH Zurich 2 What is a causal effect? Drowning
Applied Multivariate Statistics – Spring 2012 (not relevant for exam)
Can one extract causal information from high-dimensional observational data?
What is a causal effect?
2 Markus Kalisch, ETH ZurichWhat is a causal effect?
3 Markus Kalisch, ETH ZurichDrowning accidents
What is a causal effect?
4 Markus Kalisch, ETH ZurichDrowning accidents Ice cream sales
What is a causal effect?
5 Markus Kalisch, ETH ZurichDrowning accidents Ice cream sales
What is a causal effect?
6 Markus Kalisch, ETH ZurichDrowning accidents Ice cream sales
What is a causal effect?
7 Markus Kalisch, ETH ZurichDrowning accidents Ice cream sales
?
What is a causal effect?
8 Markus Kalisch, ETH ZurichDrowning accidents Ice cream sales
What is a causal effect?
9 Markus Kalisch, ETH ZurichDrowning accidents Ice cream sales
What is a causal effect?
10 Markus Kalisch, ETH ZurichDrowning accidents Ice cream sales
Another example: Smoking
11 Markus Kalisch, ETH ZurichScenario 1: Observe 1000 smoker and count the incidence of lung cancer
12 Markus Kalisch, ETH ZurichScenario 1: Observe 1000 smokers and count the incidence of lung cancer Scenario 2: Make 1000 random people smoke and count the incidence of lung cancer
13 Markus Kalisch, ETH ZurichScenario 1: Observe 1000 smokers and count the incidence of lung cancer Scenario 2: Make 1000 random people smoke and count the incidence of lung cancer are different.
14 Markus Kalisch, ETH ZurichWhat is a causal effect?
15 Markus Kalisch, ETH ZurichCHANGE BY INTERVENTION
How to find causal effects?
16 Markus Kalisch, ETH ZurichHow to find causal effects?
17 Markus Kalisch, ETH ZurichExperimental Data
?
How to find causal effects?
18 Markus Kalisch, ETH ZurichTwo groups of plots: Identical in all aspects (sunlight, water, soil quality, …)
Experimental Data
How to find causal effects?
19 Markus Kalisch, ETH ZurichTwo groups of plots: Identical in all aspects (sunlight, water, soil quality, …) Practice: Randomized assignment
Experimental Data
How to find causal effects?
20 Markus Kalisch, ETH ZurichExperimental Data
How to find causal effects?
21 Markus Kalisch, ETH ZurichExperimental Data
How to find causal effects?
22 Markus Kalisch, ETH ZurichExperimental Data Outcome due to fertilizer, since everything else was equal
How to find causal effects?
Sometimes, randomized controlled experiments are
If experiment is impossible…
24 Markus Kalisch, ETH ZurichObservational Data
… observe fields of two farmers.
25 Markus Kalisch, ETH ZurichObservational Data
… observe fields of two farmers.
26 Markus Kalisch, ETH ZurichObservational Data
Groups not guaranteed to be identical in all aspects (sunlight, water, soil quality, …)
… observe fields of two farmers.
27 Markus Kalisch, ETH ZurichObservational Data
… observe fields of two farmers.
28 Markus Kalisch, ETH ZurichObservational Data
Is outcome due to fertilizer? We can’t tell !
… observe fields of two farmers.
29 Markus Kalisch, ETH ZurichObservational Data
… observe fields of two farmers.
30 Markus Kalisch, ETH ZurichObservational Data
How to find causal effects?
Can one extract causal information from observational data alone?
31 Markus Kalisch, ETH ZurichGoal of this talk
bounds on set are useful
IDA
Example
Example
Example
the thousands of genes?
35 Markus Kalisch, ETH ZurichExample
the thousands of genes?
Model gene expression of each gene as a random variable. Can we use the joint distribution of gene expression to extract causal information?
36 Markus Kalisch, ETH ZurichHere is a distribution
Now find the causal effect!
Outline in Theory
38 Markus Kalisch, ETH ZurichCausal Structure do-calculus with known causal structure Causal effects Distribution
IDA
Pearl’s do-operator
P(Y=y | do(X=x)) “distribution of Y, if there is an intervention in variable X”
C(x’) = d/dx E[Y=y | do(X=x)]|x=x’ “change in expected value of Y, if there is an intervention in variable X”
39 Markus Kalisch, ETH Zurichdo-calculus with known causal structure
P(Y=y | X=x) ≠ P(Y=y | do(X=x))
40 Markus Kalisch, ETH ZurichP(rain | wet) = high P(rain | do(wet)) = = P(rain) = = low Pick a random day: do-calculus with known causal structure
Pearl’s do-calculus
41 Markus Kalisch, ETH ZurichCausal structure
X Y ZRules: Expression with “do” Expression without “do”
Judea Pearl, “Causality”, 2010, Cambridge University Pressdo-calculus with known causal structure
Example: Back-door Adjustment
42 Markus Kalisch, ETH ZurichCausal structure
X Y ZRules P(Y=y | do(X=x))
P(Y=y | X=x, Z=0) * P(Z=0) + P(Y=y | X=x, Z=1) * P(Z=1) Assume Z is binary (0/1)do-calculus with known causal structure
Example: Back-door Adjustment
43 Markus Kalisch, ETH ZurichCausal structure
X Y ZRules P(Y=y | do(X=x))
P(Y=y | X=x, Z=0) * P(Z=0) + P(Y=y | X=x, Z=1) * P(Z=1) Assume Z is binary (0/1)“do”
do-calculus with known causal structure
Example: Back-door Adjustment
44 Markus Kalisch, ETH ZurichCausal structure
X Y ZRules P(Y=y | do(X=x))
P(Y=y | X=x, Z=0) * P(Z=0) + P(Y=y | X=x, Z=1) * P(Z=1) Assume Z is binary (0/1)No “do”
do-calculus with known causal structure
Conclusion 1
45 Markus Kalisch, ETH ZurichIf causal structure is known, we can infer causal effects from observations
do-calculus with known causal structure
Outline in Theory
46 Markus Kalisch, ETH ZurichCausal Structure do-calculus with known causal structure Causal effects Distribution
IDA
Estimate Causal Structure
47 Markus Kalisch, ETH ZurichCausal Structure
Oftentimes, causal structure is unknown Estimate causal structure
Causal Directed Acyclic Graph (DAG)
X W Z YCausal Structure
Causal Directed Acyclic Graph (DAG)
X W Z YRandom Variables Direct cause Causal Structure
Causal Directed Acyclic Graph (DAG)
X W Z YRandom Variables Direct cause implies
Conditional independence relations among variables
Causal Structure
Estimate a DAG model
51 Markus Kalisch, ETH ZurichDAG encodes independence information
Independencies among variables given by oracle Reverse engineering DAGCausal Structure
Estimate a DAG model
52 Markus Kalisch, ETH ZurichDAG encodes independence information
Independencies among variables given by oracle Reverse engineering DAGPC Algorithm
Causal Structure
Ambiguity: Equivalence class
53 Markus Kalisch, ETH ZurichSeveral DAGs describe exactly the same list of independence relations
X W Z Y X W Z YCausal Structure
Ambiguity: Equivalence class
54 Markus Kalisch, ETH ZurichSeveral DAGs describe exactly the same list of independence relations
X W Z Y X W Z YCausal Structure
Ambiguity: Equivalence class
55 Markus Kalisch, ETH ZurichSeveral DAGs describe exactly the same list of independence relations
X W Z Y X W Z Y X W Z YEquivalence class: PARTIALLY Directed Acyclic Graph (PDAG)
Causal Structure
Ambiguity: Equivalence class
56 Markus Kalisch, ETH ZurichSeveral DAGs describe exactly the same list of independence relations
X W Z Y X W Z Y X W Z YEquivalence class: PARTIALLY Directed Acyclic Graph (PDAG)
Causal Structure
Ambiguity: Equivalence class
57 Markus Kalisch, ETH ZurichSome DAGs describe exactly the same list of independence relations
X W Z Y X W Z Y X W Z YEquivalence class: PARTIALLY Directed Acyclic Graph (PDAG) PC Algorithm finds equivalence class
Causal Structure
Outline in Theory
58 Markus Kalisch, ETH ZurichCausal Structure do-calculus with known causal structure Causal effects Distribution
IDA
Up to equivalence class
Putting everything together
59 Markus Kalisch, ETH Zurich DistributionPutting everything together
60 Markus Kalisch, ETH Zurich DistributionPutting everything together
61 Markus Kalisch, ETH Zurich DistributionPutting everything together
62 Markus Kalisch, ETH Zurich DistributionOutline in Theory
63 Markus Kalisch, ETH ZurichEquivalence class of Causal Structure Set of Causal effects Distribution
do-calculus with known causal structure
IDA
I’m busy! Find your own information on the distribution…
Outline in Theory Practice
65 Markus Kalisch, ETH ZurichEquivalence class of Causal Structure Set of Causal effects Observational data
IDA
do-calculus with known causal structure
Outline in Theory Practice
66 Markus Kalisch, ETH ZurichEquivalence class of Causal Structure Set of Causal effects Observational data
IDA
do-calculus with known causal structure
Conditional independence testsOutline in Theory Practice
67 Markus Kalisch, ETH ZurichEquivalence class of Causal Structure Set of Causal effects Observational data
IDA
do-calculus with known causal structure
Conditional independence tests Estimated propertiesOutline in Theory Practice
68 Markus Kalisch, ETH ZurichEquivalence class of Causal Structure Set of Causal effects Observational data
IDA
do-calculus with known causal structure
Conditional independence tests Estimated propertiesConsistency in high-dimensions: Gaussian case
Estimating graphical models with PC algorithm
69 Markus Kalisch, ETH ZurichDo-calculus in high dimensions
M.H. Maathuis, M. Kalisch, P. Bühlmann, “Estimating high-dimensional intervention effects from observational data”, 2009, Annals of Statistics 37, 3133 - 3164Consistency in high-dimensions: Gaussian case
Estimating graphical models with PC algorithm
70 Markus Kalisch, ETH ZurichDo-calculus in high dimensions
M.H. Maathuis, M. Kalisch, P. Bühlmann, “Estimating high-dimensional intervention effects from observational data”, 2009, Annals of Statistics 37, 3133 - 3164 Intervention effects if DAG is AbsentMain assumptions & requirements
71 Markus Kalisch, ETH ZurichExperimental validation
72 Markus Kalisch, ETH ZurichComplex system Experiment Top causal effects Observational data Top causal effects Agreement ?
IDA
Back to the beer: Experimental validation of IDA in Saccharomyces cerevisiae
73 Markus Kalisch, ETH ZurichSetting
IDA Lasso Elastic net Random guessing
M.H. Maathuis,IDA Lasso Elastic net Random guessing
M.H. Maathuis,Top 1000 estimated effects 100 900
IDA Lasso Elastic net Random guessing
M.H. Maathuis,Top 1000 estimated effects 130 870
IDA Lasso Elastic net Random guessing
M.H. Maathuis,Top 1000 estimated effects 400 600
Outline in Theory
84 Markus Kalisch, ETH ZurichEquivalence class of Causal Structure Set of Causal effects Distribution
do-calculus with known causal structure
IDA
Outline in Theory Practice
85 Markus Kalisch, ETH ZurichEquivalence class of Causal Structure Set of Causal effects Observational data
IDA
do-calculus with known causal structure
Summary of assumptions
86 Markus Kalisch, ETH ZurichR