Probabilistic Graphical Models - PowerPoint PPT Presentation


SLIDE 1

Probabilistic Graphical Models

Structure learning in Bayesian networks

Siamak Ravanbakhsh, Fall 2019

SLIDE 2

Learning objectives

  • why is structure learning hard?
  • two approaches to structure learning: constraint-based methods, score-based methods
  • MLE vs. Bayesian score

SLIDES 3-6

Structure learning in BayesNets

family of methods:

  • constraint-based methods: estimate conditional independencies from the data, then find compatible BayesNets
  • score-based methods: search over the combinatorial space of 2^{O(n^2)} structures, maximizing a score
  • Bayesian model averaging: integrate over all possible structures
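To see why the search space is called combinatorial, the number of labeled DAGs on n nodes can be computed with Robinson's recurrence (conditioning on the set of nodes with no incoming edge). This small sketch is an illustration, not part of the original slides:

```python
from math import comb
from functools import lru_cache

@lru_cache(maxsize=None)
def num_dags(n):
    """Robinson's recurrence for the number of labeled DAGs on n nodes."""
    if n == 0:
        return 1
    # inclusion-exclusion over the k nodes that have no incoming edge
    return sum((-1) ** (k + 1) * comb(n, k) * 2 ** (k * (n - k)) * num_dags(n - k)
               for k in range(1, n + 1))
```

Already num_dags(4) = 543, and the count grows as 2^{Θ(n²)}, which is why exhaustive search is hopeless.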

SLIDES 7-11

Structure learning in BayesNets

constraint-based methods estimate conditional independencies from the data and find compatible BayesNets.

the structure is identifiable only up to I-equivalence: the goal is a DAG with the same set of conditional independencies (CIs) as the data distribution, I(G) = I(p_D), called a perfect map (P-map)

the CIs are estimated by hypothesis testing: X ⊥ Y ∣ Z?

first attempt: a DAG that is an I-map for p_D, that is, I(G) ⊆ I(p_D)

SLIDE 12

minimal I-map from CI test

input: CI test oracle; an ordering X_1, …, X_n
output: a minimal I-map G

for i = 1, …, n: find a minimal U ⊆ {X_1, …, X_{i−1}} such that X_i ⊥ ({X_1, …, X_{i−1}} − U) ∣ U, and set Pa_{X_i} = U

this is justified by the local Markov property X_i ⊥ NonDesc_{X_i} ∣ Pa_{X_i}

a minimal I-map is a DAG where removing any edge violates the I-map property
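The procedure above can be sketched in a few lines. This is a minimal illustration, assuming a hypothetical oracle ci_oracle(x, rest, cond) that answers "is x independent of the set rest given the set cond?"; scanning subsets smallest-first makes the first hit a minimal U:

```python
from itertools import combinations

def minimal_imap(variables, ci_oracle):
    """Build a minimal I-map for a fixed variable ordering (sketch)."""
    parents = {}
    for i, x in enumerate(variables):
        preds = variables[:i]
        chosen = set(preds)  # fallback: all predecessors
        done = False
        # scan candidate parent sets smallest-first: first hit is minimal
        for size in range(len(preds) + 1):
            for u in combinations(preds, size):
                rest = [v for v in preds if v not in u]
                if ci_oracle(x, rest, list(u)):
                    chosen, done = set(u), True
                    break
            if done:
                break
        parents[x] = chosen
    return parents

# toy oracle for the chain A -> B -> C (the only CI is C ⊥ A | B)
def chain_ci(x, rest, cond):
    if not rest:
        return True  # anything is independent of the empty set
    return x == "C" and set(rest) == {"A"} and "B" in cond

parents = minimal_imap(["A", "B", "C"], chain_ci)
```

With the topological ordering A, B, C this recovers the chain itself: parents A: {}, B: {A}, C: {B}.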

SLIDES 13-14

minimal I-map from CI test

Problems:

  • CI tests involve many variables
  • the number of CI tests is exponential
  • a minimal I-map may be far from a P-map
  • different orderings give different graphs

Example: the orderings D, I, S, G, L (a topological ordering) vs. L, S, G, I, D vs. L, D, S, I, G give different minimal I-maps

SLIDE 15

Structure learning in BayesNets

constraint-based methods: estimate conditional independencies from the data, find compatible BayesNets; the goal is a DAG with the same set of conditional independencies (CIs), I(G) = I(p_D)

first attempt: a DAG that is an I-map for p_D, that is, I(G) ⊆ I(p_D)

can we find a perfect map with fewer CI tests involving fewer variables?

second attempt: a DAG that is a P-map for p_D

SLIDES 16-17

Perfect map from CI test

identifiable only up to I-equivalence: the same set of CIs, the same skeleton, the same immoralities

procedure:

  • 1. find the undirected skeleton using CI tests
  • 2. identify immoralities in the undirected graph
SLIDES 18-22

Perfect map from CI test

  • 1. finding the undirected skeleton

observation: if X and Y are not adjacent, then X ⊥ Y ∣ Pa_X or X ⊥ Y ∣ Pa_Y
assumption: the max number of parents is d
idea: search over all conditioning subsets of size at most d, and check the CI above

input: CI oracle; bound on #parents d
output: undirected skeleton

initialize H as a complete undirected graph
for all pairs X_i, X_j:
  for all subsets U of size ≤ d (within the current neighbors of X_i, X_j):
    if X_i ⊥ X_j ∣ U, then remove X_i − X_j from H
return H

cost: O(n^2) pairs × O((n − 2)^d) subsets = O(n^{d+2}) CI tests
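The skeleton-recovery loop above can be sketched as follows. This is an illustration under the slide's assumptions, with a hypothetical oracle ci_oracle(x, y, cond); it also records the separating set found for each removed pair, which step 2 will need:

```python
from itertools import combinations

def find_skeleton(variables, ci_oracle, d):
    """Recover the undirected skeleton with CI tests of size <= d (sketch)."""
    neighbors = {v: set(variables) - {v} for v in variables}
    witness = {}  # separating set U found for each removed pair
    for size in range(d + 1):
        for x in variables:
            for y in sorted(neighbors[x]):
                if y not in neighbors[x]:
                    continue  # already removed by an earlier test
                candidates = sorted(neighbors[x] - {y})
                for u in combinations(candidates, size):
                    if ci_oracle(x, y, set(u)):
                        neighbors[x].discard(y)
                        neighbors[y].discard(x)
                        witness[frozenset((x, y))] = set(u)
                        break
    return neighbors, witness

# toy oracle for the chain A - B - C, where the only CI is A ⊥ C | B
def chain_ci(x, y, cond):
    return {x, y} == {"A", "C"} and "B" in cond

skeleton, witness = find_skeleton(["A", "B", "C"], chain_ci, d=1)
```

On the chain example this removes only the edge A − C, with separating set {B}.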

SLIDES 23-26

Perfect map from CI test

  • 2. finding the immoralities

potential immorality: X − Z, Y − Z ∈ H and X − Y ∉ H

X − Z − Y is not an immorality only if the separating sets satisfy X ⊥ Y ∣ U ⇒ Z ∈ U

so: save the set U used when removing X − Y from the skeleton and check whether Z ∈ U; if not, then X → Z ← Y is an immorality
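The "save the U, check for Z" rule translates directly to code. A minimal sketch, assuming the skeleton step has stored a separating set per removed pair (as in the slide):

```python
def find_immoralities(neighbors, witness):
    """Orient X -> Z <- Y for each potential immorality X - Z - Y
    (X, Y non-adjacent) whose saved separating set does not contain Z."""
    directed = set()
    for z in sorted(neighbors):
        adj = sorted(neighbors[z])
        for i in range(len(adj)):
            for j in range(i + 1, len(adj)):
                x, y = adj[i], adj[j]
                if y in neighbors[x]:
                    continue  # adjacent pair: not a potential immorality
                if z not in witness.get(frozenset((x, y)), set()):
                    directed.add((x, z))
                    directed.add((y, z))
    return directed

# skeleton of the collider A -> C <- B: the pair (A, B) was separated by U = {}
neighbors = {"A": {"C"}, "B": {"C"}, "C": {"A", "B"}}
witness = {frozenset(("A", "B")): set()}
vstructs = find_immoralities(neighbors, witness)
```

Since C is not in the separating set of (A, B), both edges get oriented towards C.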

SLIDES 27-31

Perfect map from CI test

  • 3. propagate the constraints

at this point we have a mix of directed and undirected edges; add directions using rules R1, R2, R3 (needed to preserve the immoralities and the DAG structure) until convergence

for exact CI tests, this recovers the exact I-equivalence family

Example (figure): ground truth DAG; its undirected skeleton + immoralities; orientations added using rules R1, R2, R3

SLIDES 32-39

conditional independence (CI) test

how to decide X ⊥ Y ∣ Z from the dataset D?

measure the deviance of p_D(X, Y ∣ Z) from p_D(X ∣ Z) p_D(Y ∣ Z):

  • conditional mutual information statistic:
    d_I(D) = E_Z[ D( p_D(X, Y ∣ Z) ∥ p_D(X ∣ Z) p_D(Y ∣ Z) ) ]
  • χ² statistic:
    d_{χ²}(D) = ∣D∣ ∑_{x,y,z} (p_D(x, y, z) − p_D(z) p_D(x ∣ z) p_D(y ∣ z))² / (p_D(z) p_D(x ∣ z) p_D(y ∣ z))

both are computed using frequencies in the dataset

a large deviance rejects the null hypothesis (of conditional independence): pick a threshold t and reject when d(D) > t

the p-value is the probability of false rejection, over all possible datasets:

pvalue(t) = P({D : d(D) > t} ∣ X ⊥ Y ∣ Z)

it is possible to derive the distribution of the deviance measure (e.g., the χ² distribution) and reject the hypothesis (CI) for small p-values (e.g., .05)
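The χ² deviance above can be computed directly from counts. A small sketch from the slide's formula (for a p-value one would then compare against the χ² distribution with the appropriate degrees of freedom, omitted here):

```python
from collections import Counter

def chi2_deviance(samples):
    """d(D) = |D| * sum (p(x,y,z) - p(z)p(x|z)p(y|z))^2 / (p(z)p(x|z)p(y|z)).

    samples: list of (x, y, z) observations; all probabilities are
    empirical frequencies from the dataset."""
    n = len(samples)
    joint = Counter(samples)
    cz = Counter(z for _, _, z in samples)
    cxz = Counter((x, z) for x, _, z in samples)
    cyz = Counter((y, z) for _, y, z in samples)
    stat = 0.0
    for (x, z) in cxz:
        for (y, z2) in cyz:
            if z2 != z:
                continue
            expected = (cz[z] / n) * (cxz[(x, z)] / cz[z]) * (cyz[(y, z)] / cz[z])
            observed = joint.get((x, y, z), 0) / n
            stat += (observed - expected) ** 2 / expected
    return n * stat

# X and Y independent given Z gives zero deviance; X = Y gives a large one
indep = [(0, 0, 0), (0, 1, 0), (1, 0, 0), (1, 1, 0)]
dep = [(0, 0, 0), (1, 1, 0), (0, 0, 0), (1, 1, 0)]
```

chi2_deviance(indep) is 0, while chi2_deviance(dep) is 4, so a threshold between them separates the two cases.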

SLIDE 40

Structure learning in BayesNets (recap of the family of methods)

  • constraint-based methods: estimate conditional independencies from the data, find compatible BayesNets
  • score-based methods: search over the combinatorial space, maximizing a score
  • Bayesian model averaging: integrate over all possible structures

SLIDES 41-45

Mutual information

how much information does X encode about Y? the reduction in the uncertainty of X after observing Y:

I(X, Y) = H(X) − H(X ∣ Y) = H(Y) − H(Y ∣ X)

where the conditional entropy is H(Y ∣ X) = ∑_x p(x) H(p(y ∣ x))

symmetric: I(X, Y) = I(Y, X)

I(X, Y) = ∑_{x,y} p(x, y) log( p(x, y) / (p(x) p(y)) ) = D_KL( p(x, y) ∥ p(x) p(y) ) ≥ 0 (non-negative)
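The summation form of I(X, Y) is a one-liner over a joint table. A minimal sketch (probabilities in nats):

```python
from math import log

def mutual_information(joint):
    """I(X, Y) = sum_{x,y} p(x,y) log( p(x,y) / (p(x) p(y)) ).

    joint: dict mapping (x, y) -> probability."""
    px, py = {}, {}
    for (x, y), p in joint.items():
        px[x] = px.get(x, 0.0) + p  # marginal p(x)
        py[y] = py.get(y, 0.0) + p  # marginal p(y)
    return sum(p * log(p / (px[x] * py[y]))
               for (x, y), p in joint.items() if p > 0)

# independent variables carry zero information; a perfect copy carries H(X)
indep = {(0, 0): 0.25, (0, 1): 0.25, (1, 0): 0.25, (1, 1): 0.25}
copy = {(0, 0): 0.5, (1, 1): 0.5}
```

For the independent table I(X, Y) = 0, and for the copy I(X, Y) = log 2, the full entropy of a fair bit.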

SLIDES 46-51

MLE in Bayes-nets: mutual information form

log-likelihood:

ℓ(D; θ) = ∑_{x∈D} ∑_i log p(x_i ∣ pa_i; θ_{i∣Pa_i})

= ∑_i ∑_{(x_i, pa_i)∈D} log p(x_i ∣ pa_i; θ_{i∣Pa_i})

= N ∑_i ∑_{x_i, pa_i} p_D(x_i, pa_i) log p(x_i ∣ pa_i; θ_{i∣Pa_i})   (using the empirical distribution p_D)

substituting the MLE estimate θ*:

ℓ(D; θ*) = N ∑_i ∑_{x_i, pa_i} p_D(x_i, pa_i) log p_D(x_i ∣ pa_i)

= N ∑_i ∑_{x_i, pa_i} p_D(x_i, pa_i) ( log( p_D(x_i, pa_i) / (p_D(x_i) p_D(pa_i)) ) + log p_D(x_i) )

= N ∑_i ( I_D(X_i, Pa_{X_i}) − H_D(X_i) )   (using the definition of mutual information)
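The identity ℓ(D; θ*) = N ∑_i ( I_D(X_i, Pa_i) − H_D(X_i) ) can be checked numerically by computing both sides from the same data. A sketch (variables indexed by column; parents given as tuples of column indices):

```python
from math import log
from collections import Counter

def loglik_direct(data, parents):
    """ℓ(D, θ*) computed directly: sum over samples of log p_D(x_i | pa_i)."""
    total = 0.0
    for i, pa in parents.items():
        fam = Counter((row[i],) + tuple(row[j] for j in pa) for row in data)
        par = Counter(tuple(row[j] for j in pa) for row in data)
        total += sum(c * log(c / par[key[1:]]) for key, c in fam.items())
    return total

def loglik_mi_form(data, parents):
    """The same quantity via N * sum_i ( I_D(X_i, Pa_i) - H_D(X_i) )."""
    n = len(data)
    total = 0.0
    for i, pa in parents.items():
        fam = Counter((row[i],) + tuple(row[j] for j in pa) for row in data)
        par = Counter(tuple(row[j] for j in pa) for row in data)
        marg = Counter(row[i] for row in data)
        mi = sum((c / n) * log((c / n) / ((marg[k[0]] / n) * (par[k[1:]] / n)))
                 for k, c in fam.items())
        ent = -sum((c / n) * log(c / n) for c in marg.values())
        total += n * (mi - ent)
    return total

data = [(0, 0), (0, 1), (1, 1), (1, 1)]
parents = {0: (), 1: (0,)}  # the DAG X0 -> X1
```

Both functions agree up to floating-point error, exactly as the derivation predicts.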

SLIDES 52-55

Optimal solution for trees

likelihood score: ℓ(D; θ*) = N ∑_i ( I_D(X_i, Pa_{X_i}) − H_D(X_i) )

the entropy term does not depend on the structure; for a tree, each node has a single parent, so the structure-dependent term is a sum of pairwise terms I_D(X_i, X_j)

structure learning algorithms use mutual information in the structure search. Chow-Liu algorithm: find the maximum spanning tree with

  • edge weights = mutual information (symmetric: I_D(X_j, X_i) = I_D(X_i, X_j))
  • add directions to the edges later, making sure each node has at most one parent (i.e., no v-structures)
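The Chow-Liu idea fits in a short sketch: compute pairwise empirical mutual information and grow a maximum spanning tree (Prim's algorithm here), orienting edges away from an arbitrary root so each node gets at most one parent:

```python
from math import log
from collections import Counter

def chow_liu_edges(data):
    """Maximum spanning tree over pairwise empirical mutual information."""
    n, k = len(data), len(data[0])

    def mi(i, j):
        cij = Counter((row[i], row[j]) for row in data)
        ci = Counter(row[i] for row in data)
        cj = Counter(row[j] for row in data)
        return sum((c / n) * log((c / n) / ((ci[a] / n) * (cj[b] / n)))
                   for (a, b), c in cij.items())

    in_tree, edges = [0], []
    while len(in_tree) < k:
        # Prim's step: heaviest edge from the tree to a new node
        w, i, j = max((mi(i, j), i, j)
                      for i in in_tree for j in range(k) if j not in in_tree)
        edges.append((i, j))  # orient the edge away from the root X0
        in_tree.append(j)
    return edges

# X1 is a copy of X0; X2 is independent noise
data = [(0, 0, 0), (0, 0, 1), (1, 1, 0), (1, 1, 1)]
edges = chow_liu_edges(data)
```

On this data the strong X0 to X1 dependence is picked first, then X2 is attached with a zero-weight edge.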

SLIDES 56-63

Bayesian Score for BayesNets

Bayesian about both structure G and parameters θ:

P(G ∣ D) ∝ P(D ∣ G) P(G)

log score_B(G, D) = log P(D ∣ G) + log P(G)

the marginal likelihood of a structure G is

P(D ∣ G) = ∫_{θ∈Θ_G} P(D ∣ θ, G) P(θ ∣ G) dθ

  • assuming local and global parameter independence, it factorizes into the marginal likelihood of each node
  • for the Dirichlet-multinomial it has a closed form

for large sample size (and any exponential-family member):

score_B(G, D) ≈ ℓ(D, θ*_G) − (1/2) log(∣D∣) K   (Bayesian Information Criterion, BIC; K = #parameters)

compare the Akaike Information Criterion (AIC): ℓ(D, θ*_G) − K
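The BIC approximation is easy to compute for a discrete BayesNet, since both the MLE log-likelihood and the parameter count decompose per node. A sketch (parents as tuples of column indices, card = cardinality of each variable):

```python
from math import log
from collections import Counter

def bic_score(data, parents, card):
    """BIC: ℓ(D, θ*) − (1/2) log(|D|) · K, with K the number of free parameters."""
    n = len(data)
    loglik, k_params = 0.0, 0
    for i, pa in parents.items():
        fam = Counter((row[i],) + tuple(row[j] for j in pa) for row in data)
        par = Counter(tuple(row[j] for j in pa) for row in data)
        loglik += sum(c * log(c / par[key[1:]]) for key, c in fam.items())
        k = card[i] - 1
        for j in pa:
            k *= card[j]  # one distribution per parent configuration
        k_params += k
    return loglik - 0.5 * log(n) * k_params

# on independent data the penalty makes BIC prefer the empty graph
data = [(0, 0), (0, 1), (1, 0), (1, 1)]
empty = bic_score(data, {0: (), 1: ()}, {0: 2, 1: 2})
edge = bic_score(data, {0: (), 1: (0,)}, {0: 2, 1: 2})
```

Here the extra edge buys no likelihood (the variables are independent) but costs an extra parameter, so the empty graph scores higher: the bias towards simpler structures from the slides.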

SLIDES 64-65

Bayesian Score for BayesNets

Example (figure: Bayesian score vs. sample size N = ∣D∣): data sampled from the ICU-alarm BayesNet; Bayesian score of the true model (509 params.) and of two simplified models (359 params. and 214 params.)

the Bayesian score is biased towards simpler structures

SLIDES 66-71

Structure search

arg max_G Score(D, G) is NP-hard

use heuristic search algorithms (discussed for MAP inference): local search using edge addition, edge deletion, edge reversal

  • O(N^2) possible moves
  • for each move: collect sufficient statistics (frequencies) and estimate the score
  • use the decomposition of the score, so each move only re-scores the affected families

example: ICU-alarm network
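The local search above can be sketched as greedy hill climbing with a decomposable BIC score. This is an illustration, not the exact algorithm from the course: it covers edge addition and deletion (with a cycle check) and omits edge reversal for brevity:

```python
from math import log
from collections import Counter

def family_score(data, i, pa, card):
    """Decomposable BIC contribution of node i with parent tuple pa."""
    n = len(data)
    fam = Counter((row[i],) + tuple(row[j] for j in pa) for row in data)
    par = Counter(tuple(row[j] for j in pa) for row in data)
    ll = sum(c * log(c / par[key[1:]]) for key, c in fam.items())
    k = card[i] - 1
    for j in pa:
        k *= card[j]
    return ll - 0.5 * log(n) * k

def is_ancestor(parents, a, b):
    """Is there a directed path a -> ... -> b?"""
    stack, seen = list(parents[b]), set()
    while stack:
        v = stack.pop()
        if v == a:
            return True
        if v not in seen:
            seen.add(v)
            stack.extend(parents[v])
    return False

def hill_climb(data, card, max_steps=50):
    """Greedy local search over edge additions and deletions."""
    k = len(card)
    parents = {i: () for i in range(k)}
    scores = {i: family_score(data, i, (), card) for i in range(k)}
    for _ in range(max_steps):
        best_gain, best_move = 0.0, None
        for i in range(k):
            for j in range(k):
                if i == j:
                    continue
                if j in parents[i]:  # try deleting the edge j -> i
                    new_pa = tuple(p for p in parents[i] if p != j)
                elif not is_ancestor(parents, i, j):  # try adding j -> i
                    new_pa = parents[i] + (j,)
                else:
                    continue  # addition would create a directed cycle
                gain = family_score(data, i, new_pa, card) - scores[i]
                if gain > best_gain:
                    best_gain, best_move = gain, (i, new_pa)
        if best_move is None:
            break  # local optimum: no move improves the score
        i, new_pa = best_move
        parents[i] = new_pa
        scores[i] = family_score(data, i, new_pa, card)
    return parents

# X1 is a deterministic copy of X0: a single edge between them pays for itself
data = [(0, 0)] * 4 + [(1, 1)] * 4
learned = hill_climb(data, {0: 2, 1: 2})
```

Because the score decomposes, each move only needs to re-score the one family it changes, which is what makes local search with O(N²) moves affordable.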

SLIDES 72-74

Summary

  • structure learning is NP-hard; make assumptions to simplify
  • constraint-based methods: limit the max number of parents, rely on CI tests; identify the I-equivalence class
  • score-based methods: assume a tree structure, or use a Bayesian score + heuristic search to find a locally optimal structure