

  1. Bayesian networks (2) Lirong Xia

  2. Last class • Bayesian networks – compact, graphical representation of a joint probability distribution – conditional independence

  3. Bayesian network • Definition of a Bayesian network (Bayes’ net or BN) • A set of nodes, one per variable X • A directed, acyclic graph • A conditional distribution for each node – a collection of distributions over X, one for each combination of the parents’ values: p(X | a1, …, an) for parents A1, …, An – CPT: conditional probability table – description of a noisy “causal” process • A Bayesian network = topology (graph) + local conditional probabilities

  4. Probabilities in BNs • Bayesian networks implicitly encode joint distributions – as a product of local conditional distributions: p(x1, x2, …, xn) = Π_{i=1..n} p(xi | parents(Xi)) – Example: p(+Cavity, +Catch, -Toothache) • This lets us reconstruct any entry of the full joint • Not every BN can represent every joint distribution – the topology enforces certain conditional independencies
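The factorization above can be sketched in a few lines of Python. This is a minimal illustration with a hypothetical two-node network A → B and made-up CPT values (the slide gives no numbers for its example):

```python
# Hypothetical two-node network A -> B with made-up CPT values,
# illustrating p(x1,...,xn) = prod_i p(xi | parents(Xi)).
p_A = {True: 0.3, False: 0.7}                   # p(A)
p_B_given_A = {True: {True: 0.9, False: 0.1},   # p(B | A), outer key is A
               False: {True: 0.2, False: 0.8}}

def joint(a, b):
    """A joint entry is the product of the local conditional probabilities."""
    return p_A[a] * p_B_given_A[a][b]

# Sanity check: the joint entries must sum to 1.
total = sum(joint(a, b) for a in (True, False) for b in (True, False))
```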

  5. Reachability (D-Separation) • Question: are X and Y conditionally independent given evidence variables {Z}? – Yes, if X and Y are “separated” by Z – Look for active paths from X to Y – No active paths = independence! • A path is active if each triple on it is active: – Causal chain A → B → C (either direction) where B is unobserved – Common cause A ← B → C where B is unobserved – Common effect A → B ← C where B or one of its descendants is observed • All it takes to block a path is a single inactive segment

  6. Checking conditional independence from the BN graph • Given random variables Z1, …, Zp, we are asked whether X ⊥ Y | Z1, …, Zp • Step 1: shade Z1, …, Zp • Step 2: for each undirected path from X to Y – if all triples on the path are active, then X and Y are NOT conditionally independent • If all paths have been checked and none of them is active, then X ⊥ Y | Z1, …, Zp
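The triple-activity rules from the previous slide, plus the path check from Step 2, can be sketched directly. The function names and the encoding of a triple as a kind plus observation flags are my own conventions, not the slides':

```python
# Activity rules for the three triple shapes from the D-separation slide.
def triple_is_active(kind, middle_observed, descendant_observed=False):
    if kind == "chain":          # A -> B -> C (either direction)
        return not middle_observed
    if kind == "common_cause":   # A <- B -> C
        return not middle_observed
    if kind == "common_effect":  # A -> B <- C
        return middle_observed or descendant_observed
    raise ValueError(f"unknown triple kind: {kind}")

# A path is active iff every triple on it is active; X and Y are
# d-separated by the shaded set iff no undirected path is active.
def path_is_active(triples):
    return all(triple_is_active(*t) for t in triples)
```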

  7. Example • R ⊥ B? Yes! • R ⊥ B | T? • R ⊥ B | T'?

  8. Example • L ⊥ T' | T? Yes! • L ⊥ B? Yes! • L ⊥ B | T? • L ⊥ B | T'? • L ⊥ B | T, R? Yes!

  9. Example • Variables: – R: Raining – T: Traffic – D: Roof drips – S: I am sad • Questions: – T ⊥ D? – T ⊥ D | R? Yes! – T ⊥ D | R, S?

  10. Today: Inference by variable elimination (dynamic programming)

  11. Inference • Inference: calculating some useful quantity from a joint probability distribution • Examples: – Posterior probability: p(Q | E1 = e1, …, Ek = ek) – Most likely explanation: argmax_q p(Q = q | E1 = e1, …)

  12. Inference • Given unlimited time, inference in BNs is easy • Recipe: – State the marginal probabilities you need – Figure out ALL the atomic probabilities you need – Calculate and combine them • Example: p(+b | +j, +m) = p(+b, +j, +m) / p(+j, +m)

  13. Example: Enumeration • In this simple method, we only need the BN to synthesize the joint entries: p(+b, +j, +m) = p(+b) p(+e) p(+a|+b,+e) p(+j|+a) p(+m|+a) + p(+b) p(+e) p(-a|+b,+e) p(+j|-a) p(+m|-a) + p(+b) p(-e) p(+a|+b,-e) p(+j|+a) p(+m|+a) + p(+b) p(-e) p(-a|+b,-e) p(+j|-a) p(+m|-a)
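The four-term expansion above is a sum over the hidden variables e and a. The slide gives only the structure, not CPT numbers, so the values below are the standard textbook (Russell & Norvig) CPTs for the burglary/alarm network, used purely for illustration:

```python
from itertools import product

# Standard textbook CPTs for the burglary/alarm network
# (NOT given on the slide; used here only to make the sum concrete).
p_b = 0.001                                          # p(+b)
p_e = {True: 0.002, False: 0.998}                    # p(e)
p_a = {(True, True): 0.95, (True, False): 0.94,
       (False, True): 0.29, (False, False): 0.001}   # p(+a | b, e)
p_j = {True: 0.90, False: 0.05}                      # p(+j | a)
p_m = {True: 0.70, False: 0.01}                      # p(+m | a)

# Sum over the hidden variables e and a, exactly as in the
# four-term expansion on the slide (b is fixed to +b).
p_bjm = sum(p_b * p_e[e]
            * (p_a[(True, e)] if a else 1 - p_a[(True, e)])
            * p_j[a] * p_m[a]
            for e, a in product([True, False], repeat=2))
```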

  14. Inference by Enumeration? 14

  15. More elaborate rain and sprinklers example • Variables: Rained (R), Sprinklers were on (S), Grass wet (G), Neighbor walked dog (N), Dog wet (D) • CPTs: – p(+R) = .2 – p(+S) = .6 – p(+N|+R) = .3, p(+N|-R) = .4 – p(+G|+R,+S) = .9, p(+G|+R,-S) = .7, p(+G|-R,+S) = .8, p(+G|-R,-S) = .2 – p(+D|+N,+G) = .9, p(+D|+N,-G) = .4, p(+D|-N,+G) = .5, p(+D|-N,-G) = .3

  16. Inference • Want to know: p(+R|+D) = p(+R,+D) / p(+D) • Let’s compute p(+R,+D)

  17. Inference • p(+R,+D) = Σs Σg Σn p(+R) p(s) p(n|+R) p(g|+R,s) p(+D|n,g) = p(+R) Σs p(s) Σg p(g|+R,s) Σn p(n|+R) p(+D|n,g)
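Before reordering the sums, the triple sum can be evaluated by brute force. A sketch using the CPT numbers from the sprinkler network above (True stands for '+', False for '-'; the helper name `pr` is mine):

```python
from itertools import product

# CPTs from the rain/sprinklers slide (probabilities of the '+' outcome).
p_R = 0.2
p_S = 0.6
p_G = {(True, True): 0.9, (True, False): 0.7,
       (False, True): 0.8, (False, False): 0.2}   # p(+G | R, S)
p_N = {True: 0.3, False: 0.4}                     # p(+N | R)
p_D = {(True, True): 0.9, (True, False): 0.4,
       (False, True): 0.5, (False, False): 0.3}   # p(+D | N, G)

def pr(p_true, value):
    """Probability of `value` given the probability p_true of True."""
    return p_true if value else 1 - p_true

# p(+R,+D) = sum_s sum_g sum_n p(+R) p(s) p(n|+R) p(g|+R,s) p(+D|n,g)
p_RD = sum(p_R * pr(p_S, s) * pr(p_N[True], n)
           * pr(p_G[(True, s)], g) * p_D[(n, g)]
           for s, g, n in product([True, False], repeat=3))
```

This enumerates all 8 assignments of (s, g, n); variable elimination on the next slides gets the same number with less repeated work.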

  18. The formula • p(+R,+D) = p(+R) Σs p(s) Σg p(g|+R,s) Σn p(n|+R) p(+D|n,g) • Order: s > g > n • p(+R,+D) is what we want to compute • f1(s) = Σg p(g|+R,s) f2(s,g) only involves s • f2(s,g) = Σn p(n|+R) p(+D|n,g) only involves s, g

  19. Variable elimination • From the factor Σn p(n|+R) p(+D|n,g) we sum out n to obtain a factor only depending on g • f2(s,g) happens to be insensitive to s, lucky! • f2(s,+G) = Σn p(n|+R) p(+D|n,+G) = p(+N|+R) p(+D|+N,+G) + p(-N|+R) p(+D|-N,+G) = .3*.9 + .7*.5 = .62 • f2(s,-G) = Σn p(n|+R) p(+D|n,-G) = p(+N|+R) p(+D|+N,-G) + p(-N|+R) p(+D|-N,-G) = .3*.4 + .7*.3 = .33

  20. Calculating f1(s) • f1(s) = p(+G|+R,s) f2(s,+G) + p(-G|+R,s) f2(s,-G) • f1(+S) = p(+G|+R,+S) f2(+S,+G) + p(-G|+R,+S) f2(+S,-G) = 0.9*0.62 + 0.1*0.33 = 0.591 • f1(-S) = p(+G|+R,-S) f2(-S,+G) + p(-G|+R,-S) f2(-S,-G) = 0.7*0.62 + 0.3*0.33 = 0.533

  21. Calculating p(+R,+D) • p(+R,+D) = p(+R) p(+S) f1(+S) + p(+R) p(-S) f1(-S) = 0.2*(0.6*0.591 + 0.4*0.533) = 0.11356
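The whole elimination pass from slides 19–21 can be written out factor by factor, in the same order s > g > n (True stands for '+', False for '-'; variable names are mine, CPT numbers are from the slides):

```python
# CPT entries actually needed once R is fixed to +R and D to +D.
p_R, p_S = 0.2, 0.6
p_G = {True: 0.9, False: 0.7}                    # p(+G | +R, s), keyed by s
p_N = 0.3                                        # p(+N | +R)
p_D = {(True, True): 0.9, (True, False): 0.4,
       (False, True): 0.5, (False, False): 0.3}  # p(+D | n, g)

# Sum out n: f2(g) = sum_n p(n|+R) p(+D|n,g)
# (the slides' f2(s,g), which turns out not to depend on s).
f2 = {g: p_N * p_D[(True, g)] + (1 - p_N) * p_D[(False, g)]
      for g in (True, False)}

# Sum out g: f1(s) = sum_g p(g|+R,s) f2(g).
f1 = {s: p_G[s] * f2[True] + (1 - p_G[s]) * f2[False]
      for s in (True, False)}

# Sum out s: p(+R,+D) = p(+R) * sum_s p(s) f1(s).
p_RD = p_R * (p_S * f1[True] + (1 - p_S) * f1[False])
```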

  22. Elimination order matters • p(+R,+D) = Σn Σs Σg p(+R) p(s) p(n|+R) p(g|+R,s) p(+D|n,g) = p(+R) Σn p(n|+R) Σs p(s) Σg p(g|+R,s) p(+D|n,g) • The last factor will depend on two variables in this case!

  23. General method for variable elimination • Compute a marginal probability p(x1, …, xp) in a Bayesian network – Let Y1, …, Yk denote the remaining variables – Step 1: fix an order over the Y’s (w.l.o.g. Y1 > … > Yk) – Step 2: rewrite the summation as Σy1 [sth only involving Y1 and the X’s] Σy2 [sth only involving Y1, Y2 and the X’s] … Σyk [anything involving Y1, …, Yk and the X’s] – Step 3: variable elimination from right to left
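The single "sum out one variable" step behind Step 3 can be sketched generically. Here a factor is represented as a dict from assignment tuples to numbers, with a variable list naming the tuple positions; these conventions are mine, not the slides':

```python
# Generic sum-out step: eliminate `var` from a factor over `variables`.
def sum_out(var, variables, factor):
    """Return (remaining variables, new factor) after summing out `var`."""
    i = variables.index(var)
    new_factor = {}
    for assignment, value in factor.items():
        key = assignment[:i] + assignment[i + 1:]  # drop var's position
        new_factor[key] = new_factor.get(key, 0.0) + value
    return variables[:i] + variables[i + 1:], new_factor
```

Repeatedly applying this step to the innermost sum, right to left, is exactly the elimination pass of slides 19–21.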
