Weakening Faithfulness: Some Heuristic Causal Discovery Algorithms - PowerPoint PPT Presentation

Weakening Faithfulness: Some Heuristic Causal Discovery Algorithms Zhalama 1 Jiji Zhang 2 · Wolfgang Mayer 1 1 University of South Australia 2 Lingnan University

Causa Ca sal DAG • Causal DAG 𝐻 = 𝑊, 𝐹 Each edge 𝑌 → 𝑍 represents a direct causal relation that 𝑌 is a direct cause of 𝑍 relative to 𝑊 • Assumption: 𝑊 is causally sufficient A B C D E A B Causal Distribution Causal Sufficiency DAG C D E

Causa Ca sal Inference Assu Assump mptions • Causal Markov Condition: Every conditional independence statement entailed by the causal DAG over 𝑊 is satisfied by the joint probability distribution of 𝑊 . i.e., 𝑌 and 𝑍 are (causally) d-separated by Z ⟹ 𝑌 ⊥ 𝑍 | 𝑎 . Causal Markov Assumption A B C D E A B Causal Distribution Causal Sufficiency DAG C D E

Ca Causa sal Inference Assu Assump mptions • Faithfulness assumption: Every conditional independence statement satisfied by the joint distribution of 𝑊 is entailed by the causal DAG over 𝑊 . i.e., 𝑌 ⊥ 𝑍 | 𝑎 ⟹ 𝑌 and 𝑍 are (causally) d-separated by 𝑎 . Causal Markov Assumption A B C D E A B Causal Distribution Causal Sufficiency DAG C D E Causal Faithfulness

Ca Causa sal Faithfu fulness ss Assu Assump mption • More dubious than Causal Markov assumption. • Even if Faithfulness is not exactly violated, the distribution may be sufficiently close to being unfaithful to make trouble with finite data. • Can we relax the Faithfulness assumption and adjust the causal discovery method to make it more robust against unfaithfulness? • Adjacency unfaithfulness • Orientation unfaithfulness

Adjacency Faithfulness Violation • Adjacency-Faithfulness: For every 𝑌, 𝑍 ∈ 𝑊 , if 𝑌 and 𝑍 are adjacent in the true causal DAG, then they are not independent conditional on any subset of 𝑊\{𝑌, 𝑍}. A B 𝐵 ⊥ 𝐸 | {C} The distribution satisfies 𝐷 ⊥ 𝐶 | {𝐵, 𝐸} one extra independence 𝐵 ⊥ 𝐶 | ∅ C D True Graph

PC under Adjacency Faithfulness Failure A B 1. Adjacency step : for every pair of variables 𝑌 and 𝑍 , search for a set of P : 𝐵 ⊥ 𝐶 | ∅ variables given which 𝑌 and 𝑍 are conditionally independent, and infer them to be adjacent if and only if no C D such set is found. PC • Justified by adjacency faithfulness True Graph assumption 2. Orientation step : for every unshielded triple (𝑌; 𝑍; 𝑎) , infer that it is a collider if and only if the A B set found in step 1 that renders 𝑌 and 𝑎 conditionally independent does not include 𝑍 • Justified by orientation faithfulness assumption C D

GES • Searches for a pattern that • GES seems to be robust against maximizes a score over the space Adjacency unfaithfulness of patterns • Proceeds from one pattern to a neighbor by adding or removing A B edges, one at a time • Forward phase: • Greedily add edges until the score C D cannot improve further • Backward phase: • Remove edges until the score cannot improve further

Orientation Faithfulness Violation • Orientation-Faithfulness: For every unshielded triple (𝑌, 𝑍, 𝑎) • If 𝑌 → 𝑍 ← 𝑎 is a collider, then X and Z are not conditionally independent given any subset of 𝑊\{𝑌, 𝑎} that includes 𝑍 . • Otherwise, X and Z are not conditionally independent given any subset of 𝑊\{𝑌, 𝑎} that excludes 𝑍 . A B 𝐵 ⊥ 𝐸 | {𝐶, 𝐷} The distribution satisfies 𝐶 ⊥ 𝐷 | {𝐵} one extra independence 𝐵 ⊥ 𝐸 | ∅ C D True Graph

GES under Orientation Faithfulness Violation A B The distribution satisfies 𝐵 ⊥ 𝐸|{𝐶, 𝐷} 𝐶 ⊥ 𝐷 | {𝐵} one extra independence 𝐵 ⊥ 𝐸|∅ C D GES True Graph A B C D

𝛽 − Conservative Orientation • Given a skeleton and a unshielded triple therein, consider all subsets of the variables adjacent to 𝑌 or of the variables that are adjacent to 𝑎 that render (𝑌, 𝑎) consitionally independent 𝑠 = 𝑜𝑣𝑛𝑐𝑓𝑠 𝑝𝑔 𝑡𝑓𝑢𝑡 𝑢ℎ𝑏𝑢 𝑗𝑜𝑑𝑚𝑣𝑒𝑓 𝑍 𝑜𝑣𝑛𝑐𝑓𝑠 𝑝𝑔 𝑡𝑓𝑢𝑡 • If 𝑠 ≤ 𝛽 , the triple is marked as a collider. • If 𝑠 ≥ 1 − 𝛽 , the triple is marked as a non-collider. • Otherwise it is ambiguous • CPC(Ramsey et al, 2006) : 𝛽 = 0 : too cautious • Majority rule orientation(Colombo and Maathuis, 2014) : 𝛽 = 0 .5 : not conservative enough • We use 𝛽 = 0.4

Proposed Hybrid Methods • PC+GES • Run PC first, use the output pattern as a starting point for GES • Mitigate PC’s vulnerability to adjacency faithfulness violations • GES+c • Run GES first, then apply the 𝛽 -conservative orientation rules and Meek’s orientation rules(Meek, 1996) • Mitigate GES’s vulnerability to orientation faithfulness violations • PC+GES+c • Run PC+GES first, then apply the 𝛽 -conservative orientation rules and Meek’s orientation rules(Meek, 1996) • Mitigate both vulnerabilities

Simulations – Examples of exact Faithfulness violations Adjacency unfaithfulness Orientation unfaithfulness A B A B C D C D PC PC- PC+GES GES MMHC stable GES GES+c PC CPC MMHC True adj. rate 0.75 0.75 0.95 0.93 0.76 0.35 0.96 0.49 0.99 0.56 False adj. rate 0.01 0.01 0.02 0.06 0.02 Mean Arrow Precision

More comprehensive simulations(without exact unfaithfulness) • Number of variables (dimension) ∈ {10, 20, 30, 40} • Expected vertex degree (sparsity) ∈ {2, 4} • Sample size ∈ {200, 500, 1000, 5000} • For each setting, 100 random DAGs are generated, and on each DAG a linear Gaussian model is randomly built: • Edge coefficients are uniformly drawn from [-1, -0.1] ∩ [0.1, 1] • Variances of error terms are uniformly drawn from [0.5, 1] • From each model, 50 datasets at each sample size are generated.

Adjacency on Random Graphs

Orientation on Random Graphs

Conclusion and Outlook • PC and GES are vulnerable to violations of Faithfulness • Heuristic hybrid algorithms shown to be able to mitigate some adjacency and orientation issues • even if faithfulness is not exactly violated • Try to develop efficient methods for causal inference under weaker faithfulness assumptions (e.g. triangle faithfulness)

Weakening Faithfulness: Some Heuristic Causal Discovery Algorithms - PowerPoint PPT Presentation

Weakening Faithfulness: Some Heuristic Causal Discovery Algorithms Zhalama 1 Jiji Zhang 2 Wolfgang Mayer 1 1 University of South Australia 2 Lingnan University Causa Ca sal DAG Causal DAG = , Each edge

Foundations of Causal Discovery Frederick Eberhardt KDD Causality Workshop 2016 Causal Discovery

Heuristic Search Lucia Moura Winter 2018 Heuristic Search Lucia Moura Heuristic Search Intro

Causal Effect Evaluation and Causal Network Learning Zhi Geng Peking University, China June

Weakening Aggregated Traffic of Weakening Aggregated Traffic of DHCP Discover Messages draft

Heuristic Search Heuristic Search Best-First A * Heuristic Functions Some material

Causal Discovery from Observational Data Brady Neal causalcourse.com What if we dont have

Political Science 209 - Fall 2018 Causal Inference Florian Hollenbach 7th September 2018 Causal

CAUSAL DISCOVERY CAUSAL DISCOVERY Beware of the DAG! Beware of the DAG! Philip Dawid

Causal Inference By: Miguel A. Hern an and James M. Robins Part I: Causal inference without

Causal Programming Causal Programming Joshua Brul Joshua Brul

Few-shot Domain Adaptation 1/12 by Causal Mechanism Transfer Domain adaptation Causal mechanism

Benchmarks, wikis, and open-source causal discovery Patrik O. Hoyer Univ. of Helsinki Finland

UNESCO Discovery Centre reference image of education space UNESCO Discovery Centre Discovery

Introduction to Causal Inference Lan Liu University of Minnesota at Twin Cities liux3771@umn.edu

Week 5 Video 2 Relationship Mining Causal Mining Causal Data Mining These slides developed in

A Brief Introduction to Causal Inference Brady Neal causalcourse.com What is causal inference?

Basic Concepts of Causal Mediation Analysis and Some Extensions Vanessa Didelez School of

Office for Security & Counter Terrorism Learning from the Skripal attack for CBRN

From LTL to Deterministic Parity Automata Javier Esparza 1 Jan K etnsk 1 Salomon Sickert 1

THE FUTURE OF BLOCKCHAIN IS NOT BLOCKCHAIN LUKE ANGELL Events and Partnerships Manager

2012 Inherited Corporate Control and Investment Performance, Co-authored with Johan Eklund and

WHATS HAPPENING TO THE ATTORNEY-CLIENT PRIVILEGE AND WORK PRODUCT DOCTRINE? PROPOSED FEDERAL

C LASSIFYING THE AR P RESENTATION S PACE ISMAR W ORKSHOP M ARCUS

NASDAQ: LOAN March 2019 Forward-Looking Statements This presentation includes forward-looking

Sambuz

Useful Links

Newsletter

Mail Us

Weakening Faithfulness: Some Heuristic Causal Discovery Algorithms - PowerPoint PPT Presentation

Weakening Faithfulness: Some Heuristic Causal Discovery Algorithms Zhalama 1 Jiji Zhang 2 Wolfgang Mayer 1 1 University of South Australia 2 Lingnan University Causa Ca sal DAG Causal DAG = , Each edge

Foundations of Causal Discovery Frederick Eberhardt KDD Causality Workshop 2016 Causal Discovery

Heuristic Search Lucia Moura Winter 2018 Heuristic Search Lucia Moura Heuristic Search Intro

Causal Effect Evaluation and Causal Network Learning Zhi Geng Peking University, China June

Weakening Aggregated Traffic of Weakening Aggregated Traffic of DHCP Discover Messages draft

Heuristic Search Heuristic Search Best-First A * Heuristic Functions Some material

Causal Discovery from Observational Data Brady Neal causalcourse.com What if we dont have

Political Science 209 - Fall 2018 Causal Inference Florian Hollenbach 7th September 2018 Causal

CAUSAL DISCOVERY CAUSAL DISCOVERY Beware of the DAG! Beware of the DAG! Philip Dawid

Causal Inference By: Miguel A. Hern an and James M. Robins Part I: Causal inference without

Causal Programming Causal Programming Joshua Brul Joshua Brul

Few-shot Domain Adaptation 1/12 by Causal Mechanism Transfer Domain adaptation Causal mechanism

Benchmarks, wikis, and open-source causal discovery Patrik O. Hoyer Univ. of Helsinki Finland

UNESCO Discovery Centre reference image of education space UNESCO Discovery Centre Discovery

Introduction to Causal Inference Lan Liu University of Minnesota at Twin Cities liux3771@umn.edu

Week 5 Video 2 Relationship Mining Causal Mining Causal Data Mining These slides developed in

A Brief Introduction to Causal Inference Brady Neal causalcourse.com What is causal inference?

Basic Concepts of Causal Mediation Analysis and Some Extensions Vanessa Didelez School of

Office for Security &amp; Counter Terrorism Learning from the Skripal attack for CBRN

From LTL to Deterministic Parity Automata Javier Esparza 1 Jan K etnsk 1 Salomon Sickert 1

THE FUTURE OF BLOCKCHAIN IS NOT BLOCKCHAIN LUKE ANGELL Events and Partnerships Manager

2012 Inherited Corporate Control and Investment Performance, Co-authored with Johan Eklund and

WHATS HAPPENING TO THE ATTORNEY-CLIENT PRIVILEGE AND WORK PRODUCT DOCTRINE? PROPOSED FEDERAL

C LASSIFYING THE AR P RESENTATION S PACE ISMAR W ORKSHOP M ARCUS

NASDAQ: LOAN March 2019 Forward-Looking Statements This presentation includes forward-looking

Sambuz

Useful Links

Newsletter

Mail Us

Office for Security & Counter Terrorism Learning from the Skripal attack for CBRN