Causal Structure Search: Philosophical Foundations and Problems - PowerPoint PPT Presentation

Causal Structure Search: Philosophical Foundations and Problems Richard Scheines & Peter Spirtes Carnegie Mellon University 1

Outline Causal Learning (vs. Predictive Learning) 1. 2. Recent Successes 3. Philosophical Foundations of Causal Learning: the Standard Set-up 4. Problems with the Standard Set-up 2

Causal Discovery - Goals 1) Policy, Law, and Science: How can we use data to answer a) subjunctive questions (effects of future policy interventions), or b) counterfactual questions (what would have happened had things been done differently (law)? c) scientific questions (what mechanisms run the world) 2) Rumsfeld Problem: Do we know what we don’t know: Can we tell when there is or is not enough information in the data to answer causal questions?

Causal Learning is Harder than Prediction Causal Prediction Prediction Data( X , Y ) Causal Structure Statistical Learning Algorithm Machine Learning Causal Structure(s) (Graph) P( Y,X ) P( Y | X set ) P( Y | X )

Causal Learning is Limited, but Rumsfeld Population (X 1 ,X 2 ,X 3 ) Population ( X , Y ) X 1 � X 2 � X 3 P(X,Y), Causal Graph(X,Y) P(X 1 ,X 2 ,X 3 ): X 1 _||_ X 3 | X 2 Background BK: X 2 prior to X 3 Data( X , Y ) Data(X 1 ,X 2 ,X 3 ) Knowledge No confounders Causal Structure Causal Structure Learning Algorithm Learning Algorithm Equivalence Class of Equivalence Class Causal Structures X1 – X2 � X3 ?? ?? No Yes P(X 1 | X 2set ) P(X 3 | X 2set ) P( Y | X set )

Recent Successes (Partial List!) • Do-Calculus • Identification • Bounding • Bayesian Search • Time-varying confounders and conditionally randomized treatment (Jamie Robins) • Dynamic Bayes Nets • Equivalence Classes (patterns, PAGs, Factor Analytic Measurement Models) 6

Recent Successes (Partial List!) • Pointwise Consistent Discovery Algorithms (patterns, PAGs, MMs, SEM with pure MM, Linear-Cyclic Models) • Discovery in Time Series (Granger & Swanson, Hoover, Bessler, Moneta) • Linear, non-Gaussian models (Shimizu, Hoyer, Hyvarinen) • Active Search (Cooper, Eberhardt, Tong, Kohler, Murphy, He & Gong) • Overlapping Sets of Variables (Tillman & Danks) • Applications (Ed. Research, Biology, Economics, Sociology, etc.) • Causality Challenge!! 7

Philosophical Foundations of Causal Structure Learning V = { M, L } M measured, L = unobserved (latent) Causal structure over V ⇒ Constraints in P( V ) • Assumption 1 : Weak Causal Markov Assumption V 1 ,V 2 causally disconnected ⇒ V 1 _||_ V 2 • Assumption 2b: Determinism, e.g., • Assumption 2a : Structural Equations Causal Markov Axiom For each V i ∈ V , V i := f(parents(V i )) 8

Philosophical Foundations of Causal Learning Causal structure over V ⇒ Constraints in P(V) Causal Markov Axiom: If G is a causal graph, and P a probability distribution over the variables in G, then in P: every variable V is independent of its non-effects, conditional on its immediate causes. 9

Faithfulness Constraints on a probability distribution P generated by a causal structure G hold for all parameterizations of G. Revenues = aRate + cEconomy + ε Rev. Tax Rate b Economy = bRate + ε Econ. a Economy c Faithfulness: a ? -bc Tax Revenues 10

Modularity of Intervention/Manipulation Causal Structural Equations: Education Education = ε ed Graph Longevity = f 1 ( Education) + ε Longevity Income Longevity Income = f 2 ( Education) + ε income Manipulated Structural Equations: Manipulated Education Education = ε ed Causal Longevity = f 1 ( Education) + ε Longevity Graph Income = f 3 ( M1) Income Longevity M1

Modularity of Intervention/Manipulation Causal Structural Equations: Education Education = ε ed Graph Longevity = f 1 ( Education) + ε Longevity Income Longevity Income = f 2 ( Education) + ε income Manipulated Structural Equations: Manipulated Education Education = ε ed Causal Longevity = f 1 ( Education) + ε Longevity Graph Income = f 3 ( M2,Education) + ε income Income Longevity M2

The Standard Set-up • Meausured Vars M given • V = { M, L } satisfy Markov, Faithfulness, Modularity • Tasks: • Discover structure (e.g., causal relations) among M • Estimate causal parameters • Less often: • Discover existence of L • Discover and estimate causal relations among L 13

Problems with the Standard Set-up • Faithfulness in Redundant or Thermostatic Mechanisms • Measurement • Classical Measurement Error • Coarsening • Aggregation • Ambiguous Manipulations • Modularity in Constraint Based, Reversible Systems • Variable Construction / Decision Theory 14

Faithfulness • Thermostatic Equilibrium • Redundant Mechanisms + Gene A Air Temp Target - Core Protein - + Sweat/Heatup Gene B Core Temp Gene A _||_ Protein Air Temp _||_ Core Temp 15

Classical Measurement Error X Y Z Z’ = Z + ε Measurement Error: Z’ ε ’ X _||_ Y | Z X _||_ Y | Z’ unless Var( ε ’) = 0 16

Coarsening Lung_Cancer By age 60 Smoking_coarse Smoking_precise Ever smoked before Exact amount age 50 smoked before age Tar_stains_precise [y,n] 50 Exact amount of tar- stains on fingers at age 50 Lung_Cancer _||_ Tar_stains_precise | Smoking_precise Lung_Cancer _||_ Tar_stains_precise | Smoking_coarse 17

TV � Obesity Proctor, et al. (2003). Television viewing and change in body fat from preschool to early adolescence: The Framingham Children’s Study International Journal of Obesity , 27, 827-833. Exercise Obesity TV (BMI) Diet Goals: • Estimate the influence of TV on BMI • Tease apart the mechanisms (diet, exercise) 18

Measures of Exercise, Diet Exercise_M [L,H] Exercise TV Obesity (age 4) (BMI) Age 11 Diet (Calories ) Diet_M [L,H] Exercise_M: L � Calories expended in exercise in bottom two tertiles Exercise_M: H � Calories expended in exercise in top tertile Diet_M: L � Calories consumed in bottom two tertiles Diet_M: H � Calories consumed in top tertile 19

Measures of Exercise, Diet Exercise_M [L,H] Exercise TV Obesity (age 4) (BMI) Age 11 Diet (Calories ) Diet_M [L,H] Findings: • TV and Obesity NOT screened off by Exercise_M & Diet_M • Bias in mechanism estimation unknown 20

Screening Off and Aggregation: Genetic Regulatory Network Discovery Cell N Cell 1 Cell 2 Z Z Z X X Y Y ...... X Y Microarrays: measured gene expressions are sums of gene expression across all cells in tissue sample ∀ Cells: X _||_ Y | Z Σ n X _||_ Σ n Y | Σ n Z unless P(X,Y,Z) is special, e.g., Gaussian 21

Causal Discovery in fMRI Brain Region Brain Region Brain Region Y X Z Y Z X 1 1 Z3 X2 1 Y3 Y2 X3 Z2 ∀ i,j : Xi _||_ Yj | {Z} fMRI measures aggregate activity in a voxel Variables aggregate activity over voxels Σ X _||_ Σ Y | Σ Z 22

Ambiguous Manipulations � 1960s : In RCTs, drugs that reduce TC (total cholesterol), reduce the risk of DH (Heart Disease). � P( DH | TC set ) identifiable. Heart Total Disease Cholesterol (DH) (TC) � TC ≡ def f(LDL,HDL), high-density & low-density cholesterol Heart Total Cholesterol (TC) Disease LDL HDL (DH)

Ambiguous Manipulations HDL Heart Total Disease Cholesterol (DH) TC HDL � TC [H,M,L], HDL [H,L], LDL [H,L], DH[Y,N] HDL =L, LDL =L → TC =L } HDL =L, LDL =H → TC=M HDL =H, LDL =L HDL =H, LDL =H → TC =H � arrows in boldface are definitional links

Ambiguous Manipulations LDL + Heart Total Disease Cholesterol ( DH ) TC HDL - � Suppose HDL, LDL unobserved � TC cannot be manipulated independently of both HDL and LDL � “Set TC to M” is ambiguous over: HDL = H and LDL = L HDL = L and HDL = H

Ambiguous Manipulations LDL + Heart Total Disease Cholesterol (DH) TC HDL - � Suppose HDL = H and LDL = L prevents H , and HDL = L and HDL = H promotes H ? � What is P( DH | TC set = M)? � Can ambiguity be detected? � Need additional assumptions? Yes, e.g., variability � From observational data? Sometimes � Will positive causal hypotheses be inferred involving variables whose effect is ambiguous? Probably not

Reversible/Constraint Systems • PV = nR T • Constraint persists, even with surgical interventions • “joint” part of P(V,T,P) remains unaltered by any intervention. • Is there a causal graph and parameterization thereof such that the constraint holds for any permissable set of surgically altered equations? • Can such systems be learned without intervention?

Decision Theory/Variable Construction for Causal Learning Raw Data • Voxels in fMRI Machine Learning • Online Learning Log Prediction Algorithm Features/Variables Causal Learning • Activity in a Brain Region Algorithm • Avg. Time after bottom out hint in hard probs Variable construction can be framed as a search problem, thus a decision problem Decision problem for prediction ? decision problem for causal learning 28

Variable Construction for Causal Learning Raw Data • Voxels in fMRI Features/Variables • Activity in a Brain Region 29

Causal Structure Search: Philosophical Foundations and Problems - PowerPoint PPT Presentation

Causal Structure Search: Philosophical Foundations and Problems Richard Scheines & Peter Spirtes Carnegie Mellon University 1 Outline Causal Learning (vs. Predictive Learning) 1. 2. Recent Successes 3. Philosophical Foundations of

Causal Effect Evaluation and Causal Network Learning Zhi Geng Peking University, China June

Foundations of Causal Discovery Frederick Eberhardt KDD Causality Workshop 2016 Causal Discovery

Political Science 209 - Fall 2018 Causal Inference Florian Hollenbach 7th September 2018 Causal

Causal Inference By: Miguel A. Hern an and James M. Robins Part I: Causal inference without

Causal Programming Causal Programming Joshua Brul Joshua Brul

Few-shot Domain Adaptation 1/12 by Causal Mechanism Transfer Domain adaptation Causal mechanism

Causal Discovery from Observational Data Brady Neal causalcourse.com What if we dont have

Foundations of Artificial Intelligence 9. State-Space Search: Tree Search and Graph Search Malte

Today Mind and Body Some philosophical issues about AI One of the oldest philosophical problems

From Philosophical Logic to Computer Science Aldo Antonelli Science and back again

Outline 1. What are philosophical assumptions and Philosophical Paradigms in why should we give

Philosophical Ethics Philosophical Ethics an introduction George Matthews 2020 1 / 17 The

CPSC 533 Philosophical Foundations of Artificial Intelligence Presented by: Arthur Fischer

Boosting: Foundations and Algorithms Boosting: Foundations and Algorithms Boosting: Foundations

recap to this point foundations foundations foundations foundations genetics =

A Finslerian notion of causal structure Omid Makhmali IMPAN, Warsaw February 21, 2019 IEMath,

Introduction to Graphs nodes. edges between pairs of nodes.

probe ion transport mechanisms in a membrane channel Maria Queralt-Martin , M. Lidn Lpez and

A cortical microcircuit model to study structure-activity relationships Rodrigo Felipe de

Welcome to CNV! Computational Neuroscience CNV is about using computational simulations to

The More You Know: Linkage of Public Health Datasets and All-Payer Claims to Further

Probabilistic Data Generation for Deduplication and Data Linkage Peter Christen Data Mining

Enhancing HIV Testing and Linkage to Care With Peer Recruitment, Financial Incentives, and

Using PROV-O to represent lineage in statistical processes: a record linkage example Flavio