
SLIDE 1

Bayesian Causal Induction

Pedro A. Ortega

Sensorimotor Learning and Decision-Making Group MPI for Biological Cybernetics/Intelligent Systems

17th December 2011

SLIDE 5

Introduction

Causal Induction (AKA Causal Discovery):

◮ One of the oldest philosophical problems:

◮ Aristotle, Kant, Hume, . . .

◮ The generalization from particular causal instances to abstract causal laws.

◮ Example:

◮ ‘I had a bad fall on a wet floor.’
◮ ‘Therefore, it is dangerous to ride a bike on ice.’
◮ (‘Because I learned that a slippery floor can cause a fall.’)

◮ Two important aspects:

◮ Infer the causal link from experience.
◮ Extrapolate it to future experience.

◮ We all do this in our everyday lives—but how?

SLIDE 9

Causal Graphical Model

◮ A pair of (binary) random variables X and Y

◮ Two candidate causal hypotheses {h, ¬h} (having identical joint distributions)

◮ How do we express the problem of causal induction using the language of graphical models alone?

◮ Do we have to introduce a meta-level for H?

SLIDE 10

Probability Trees

[Probability tree figure: the root resolves H (h, ¬h: each 1/2); under h, X is resolved first (x, ¬x: each 1/2), then Y given X (y|x = 3/4, y|¬x = 1/4); under ¬h, Y is resolved first (y, ¬y: each 1/2), then X given Y (x|y = 3/4, x|¬y = 1/4).]

◮ Node: a mechanism, possibly history dependent

◮ e.g. P(y|h, ¬x) = 1/4 and P(¬y|h, ¬x) = 3/4

◮ Path: a causal realization of the mechanisms
◮ Tree: the causal realizations, possibly heterogeneous
◮ All random variables are first class citizens!
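To make the tree concrete, here is a minimal Python sketch (the representation and names are my own, not from the slides): each node maps an outcome to a (probability, subtree) pair, and the probability of a root-to-leaf path is the product of the mechanism probabilities along it.

```python
# A probability tree as nested dicts: each node maps an outcome of the
# variable it resolves to a (probability, subtree) pair; leaves are None.
# Under h the mechanism order is H -> X -> Y; under ~h it is H -> Y -> X.
TREE = {
    "h": (0.5, {                     # P(h) = 1/2
        "x":  (0.5, {"y": (0.75, None), "~y": (0.25, None)}),
        "~x": (0.5, {"y": (0.25, None), "~y": (0.75, None)}),
    }),
    "~h": (0.5, {                    # P(~h) = 1/2
        "y":  (0.5, {"x": (0.75, None), "~x": (0.25, None)}),
        "~y": (0.5, {"x": (0.25, None), "~x": (0.75, None)}),
    }),
}

def path_prob(tree, path):
    """Multiply the mechanism probabilities along one root-to-leaf path."""
    p, node = 1.0, tree
    for outcome in path:
        q, node = node[outcome]
        p *= q
    return p

print(path_prob(TREE, ["h", "~x", "y"]))  # P(h, ~x, y) = 1/2 * 1/2 * 1/4 = 0.0625
```

Note that the two hypotheses induce identical joint distributions: for example P(x, y|h) = 1/2 · 3/4 = P(y|¬h) · P(x|¬h, y) = P(x, y|¬h), which is exactly why observation alone cannot tell them apart.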

SLIDE 14

Inferring the Causal Direction

◮ We observe X = x, then we observe Y = y.
◮ What is the probability of H = h?
◮ Calculate the posterior probability:

P(h|x, y) = P(y|h, x)P(x|h)P(h) / [P(y|h, x)P(x|h)P(h) + P(x|¬h, y)P(y|¬h)P(¬h)]
          = (3/4 · 1/2 · 1/2) / (3/4 · 1/2 · 1/2 + 3/4 · 1/2 · 1/2)
          = 1/2 = P(h)!

◮ We haven’t learned anything!
◮ To extract new causal information, we have to supply old causal information:

◮ “no causes in, no causes out”
◮ “to learn what happens if you kick the system, you have to kick the system”
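A quick numeric check of this observational posterior (plain Python, with the mechanism probabilities read off the probability tree):

```python
# Observational posterior P(h | x, y) via Bayes' rule.
p_h = 0.5              # prior P(h)
p_y_given_h_x = 0.75   # P(y | h, x): under h, X causes Y
p_x_given_h = 0.5      # P(x | h)
p_x_given_nh_y = 0.75  # P(x | ~h, y): under ~h, Y causes X
p_y_given_nh = 0.5     # P(y | ~h)

num = p_y_given_h_x * p_x_given_h * p_h
den = num + p_x_given_nh_y * p_y_given_nh * (1 - p_h)
posterior = num / den
print(posterior)  # 0.5, identical to the prior: observing alone teaches nothing
```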

SLIDE 15

Interventions in a Probability Tree

Set X = x:

[Probability tree figure, before the intervention: the tree from before, with the joint P(X, Y |H) written at the leaves: 3/8, 1/8, 1/8, 3/8 under h, and 3/8, 1/8, 1/8, 3/8 under ¬h.]

SLIDE 16

Interventions in a Probability Tree

Set X = x:

[Probability tree figure, after the intervention: each node resolving X is replaced by the delta “X = x” (probability 1); the leaf values of P(X, Y |H) become 3/4, 1/4 under h, and 1/2, 1/2 under ¬h.]

◮ Replace all mechanisms resolving X with the delta “X = x”.
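The rule above can be sketched in Python (a toy tree representation of my own, not the authors' code): walk the tree and replace every node that resolves X with the delta “X = x”, keeping the subtree under the chosen branch.

```python
# Toy probability tree: node = (var_name, {outcome: (prob, child)}), leaf = None.
# Under h the order is X then Y; under ~h it is Y then X.
tree = ("H", {
    "h":  (0.5, ("X", {"x":  (0.5, ("Y", {"y": (0.75, None), "~y": (0.25, None)})),
                       "~x": (0.5, ("Y", {"y": (0.25, None), "~y": (0.75, None)}))})),
    "~h": (0.5, ("Y", {"y":  (0.5, ("X", {"x": (0.75, None), "~x": (0.25, None)})),
                       "~y": (0.5, ("X", {"x": (0.25, None), "~x": (0.75, None)}))})),
})

def intervene(node, var, value):
    """Replace every mechanism resolving `var` with the delta `var = value`."""
    if node is None:
        return None
    name, branches = node
    if name == var:
        # Keep only the chosen branch, with probability 1.
        _, child = branches[value]
        return (name, {value: (1.0, intervene(child, var, value))})
    return (name, {o: (p, intervene(c, var, value)) for o, (p, c) in branches.items()})

intervened = intervene(tree, "X", "x")
```

After the call, the X node under h carries the single branch x with probability 1 (so the Y mechanism 3/4, 1/4 is reached with certainty), while under ¬h the Y mechanism is untouched and only the downstream X nodes become deltas, reproducing the leaf values on the slide.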

SLIDE 19

Inferring the Causal Direction—2nd Attempt

◮ We set X = x, then we observe Y = y.
◮ What is the probability of H = h?
◮ Calculate the posterior probability:

P(h|x̂, y) = P(y|h, x̂)P(x̂|h)P(h) / [P(y|h, x̂)P(x̂|h)P(h) + P(x̂|¬h, y)P(y|¬h)P(¬h)]
          = (3/4 · 1 · 1/2) / (3/4 · 1 · 1/2 + 1 · 1/2 · 1/2)
          = 3/5 ≠ P(h).

◮ We have acquired evidence for “X → Y ”!
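The same Bayes computation with the intervened probabilities (plain Python; the values are those on the slide):

```python
# Interventional posterior P(h | do(x), y).
# Under h (X causes Y), do(x) sets P(x^|h) = 1 and leaves P(y|h,x^) = 3/4.
# Under ~h (Y causes X), do(x) makes x certain: P(x^|~h,y) = 1, P(y|~h) = 1/2.
p_h = 0.5
num = 0.75 * 1.0 * p_h             # P(y|h,x^) P(x^|h) P(h)
den = num + 1.0 * 0.5 * (1 - p_h)  # + P(x^|~h,y) P(y|~h) P(~h)
posterior = num / den
print(posterior)  # 0.6, i.e. 3/5: the intervention yields evidence for X -> Y
```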

SLIDE 20

Conclusions

◮ Causal induction can be done using purely Bayesian techniques plus a description that allows multiple causal explanations of an experiment.

◮ Probability trees provide a clean & simple way to encode causal probabilistic information.

◮ The purpose of an intervention is to introduce statistical asymmetries.

◮ The causal information that we can acquire is limited by the interventions we can apply to the system.

◮ In this approach, the causal dependencies are not “in the data”; rather, they arise from the data and the hypotheses that the reasoner “imprints” on them.