SLIDE 1

Bayesian Networks

George Konidaris gdk@cs.duke.edu

Spring 2016

SLIDE 2

Recall

Joint distributions:

  • P(X1, …, Xn).
  • All you (statistically) need to know about X1 … Xn.
  • From it you can infer P(X1), P(X1 | Xs), etc.
Raining  Cold   Prob.
True     True   0.3
True     False  0.1
False    True   0.4
False    False  0.2
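A minimal sketch (not from the slides) of reading those quantities off the joint table above, in Python:

```python
# Joint distribution P(Raining, Cold), copied from the table above.
joint = {
    (True,  True):  0.3,   # raining, cold
    (True,  False): 0.1,
    (False, True):  0.4,
    (False, False): 0.2,
}

# Marginal P(Raining = True): sum out Cold.
p_raining = sum(p for (r, c), p in joint.items() if r)

# Conditional P(Raining = True | Cold = True) = P(R, C) / P(C).
p_cold = sum(p for (r, c), p in joint.items() if c)
p_raining_given_cold = joint[(True, True)] / p_cold

print(p_raining)             # 0.4
print(p_raining_given_cold)  # 0.3 / 0.7 ≈ 0.429
```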

SLIDE 3

Joint Distributions Are Useful

  • Classification: P(X1 | X2, …, Xn) (the thing you want to know, given the things you know).
  • Co-occurrence: P(Xa, Xb) (how likely are these two things together?).
  • Rare event detection: P(X1, …, Xn).

SLIDE 4

Modeling Joint Distributions

Gets large fast

  • 2^n entries for n binary RVs (over a million for n = 20).
  • Independence!
  • A bit too strong.
  • Rarely holds.
  • Conditional independence.
  • Good compromise.
SLIDE 5

Conditional Independence

A and B are conditionally independent given C if:

  • P(A | B, C) = P(A | C)
  • P(A, B | C) = P(A | C) P(B | C)
  • (recall independence: P(A, B) = P(A)P(B))
  • This means that, if we know C, we can treat A and B as if they were independent.
  • A and B might not be independent otherwise!
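A quick numeric sketch (invented numbers, not from the slides) of variables that satisfy this definition while failing marginal independence:

```python
from itertools import product

# Invented CPTs: C "explains" both A and B (C -> A, C -> B).
p_c = {True: 0.5, False: 0.5}
p_a = {True: 0.9, False: 0.1}   # P(A = True | C)
p_b = {True: 0.8, False: 0.2}   # P(B = True | C)

def cond(table, value, c):
    """P(value | c) from a table of True-probabilities."""
    return table[c] if value else 1 - table[c]

# Joint built so that A and B are conditionally independent given C.
joint = {(a, b, c): p_c[c] * cond(p_a, a, c) * cond(p_b, b, c)
         for a, b, c in product([True, False], repeat=3)}

# Marginally, though, A and B are dependent:
p_ab = sum(p for (a, b, c), p in joint.items() if a and b)  # P(A, B) = 0.37
pa   = sum(p for (a, b, c), p in joint.items() if a)        # P(A)    = 0.5
pb   = sum(p for (a, b, c), p in joint.items() if b)        # P(B)    = 0.5
print(p_ab, pa * pb)  # 0.37 vs 0.25: not equal, so not independent
```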
SLIDE 6

Example

Consider 3 RVs:

  • Temperature
  • Humidity
  • Season
  • Temperature and humidity are not independent.
  • But, they might be, given the season: the season explains both, and they become independent of each other.

SLIDE 7

Bayes Nets

A particular type of graphical model:

  • A directed, acyclic graph.
  • A node for each RV.
  • Given parents, each RV independent of non-descendants.

[Diagram: a three-node network over T, H, S; following the previous example, Season is the parent of both Temperature and Humidity.]
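Assuming those arrows (Season as parent of both Temperature and Humidity), the decomposition rule on the next slide gives:

  P(T, H, S) = P(S) P(T | S) P(H | S)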

SLIDE 8

Bayes Net

  • JPD decomposes:

    P(x1, ..., xn) = ∏_i P(xi | parents(xi))

  • So for each node, store a conditional probability table (CPT) giving P(xi | parents(xi)).

[Diagram: the T, H, S network again, with a CPT at each node.]
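A minimal sketch of this storage scheme for the T, H, S network, with invented CPT numbers (the structure S → T, S → H is assumed from the earlier example):

```python
# A CPT per node for the assumed S -> T, S -> H network (numbers invented).
p_s = {'winter': 0.25, 'spring': 0.25, 'summer': 0.25, 'fall': 0.25}
p_t = {'winter': 0.1, 'spring': 0.5, 'summer': 0.9, 'fall': 0.4}  # P(T = hot | S)
p_h = {'winter': 0.3, 'spring': 0.6, 'summer': 0.7, 'fall': 0.5}  # P(H = humid | S)

def joint(hot, humid, season):
    """P(T, H, S) = P(S) * P(T | S) * P(H | S)."""
    pt = p_t[season] if hot else 1 - p_t[season]
    ph = p_h[season] if humid else 1 - p_h[season]
    return p_s[season] * pt * ph

print(joint(True, True, 'summer'))  # 0.25 * 0.9 * 0.7 = 0.1575
```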

SLIDE 9

Example

Suppose we know:

  • The flu causes sinus inflammation.
  • Allergies cause sinus inflammation.
  • Sinus inflammation causes a runny nose.
  • Sinus inflammation causes headaches.
SLIDE 10

Example

[Diagram: Flu → Sinus ← Allergy; Sinus → Nose; Sinus → Headache.]

SLIDE 11

Example

[Diagram: the same network (Flu → Sinus ← Allergy; Sinus → Nose; Sinus → Headache), now with its CPTs:]

Flu    P
True   0.6
False  0.4

Allergy  P
True     0.2
False    0.8

Nose   Sinus  P
True   True   0.8
False  True   0.2
True   False  0.3
False  False  0.7

Headache  Sinus  P
True      True   0.6
False     True   0.4
True      False  0.5
False     False  0.5

Sinus  Flu    Allergy  P
True   True   True     0.9
False  True   True     0.1
True   True   False    0.6
False  True   False    0.4
True   False  False    0.2
False  False  False    0.8
True   False  True     0.4
False  False  True     0.6

Full joint: 2^5 = 32 entries (31 free parameters); the CPTs above store only 10.
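A sketch encoding the CPTs above directly; joint(...) multiplies the five factors P(F) P(A) P(S | F, A) P(N | S) P(H | S):

```python
# CPTs from the tables above (True-probabilities; False is the complement).
p_flu, p_allergy = 0.6, 0.2
p_sinus = {(True, True): 0.9, (True, False): 0.6,    # keyed by (Flu, Allergy)
           (False, True): 0.4, (False, False): 0.2}
p_nose = {True: 0.8, False: 0.3}        # keyed by Sinus
p_headache = {True: 0.6, False: 0.5}    # keyed by Sinus

def joint(f, a, s, n, h):
    """P(f, a, s, n, h) = P(f) P(a) P(s | f, a) P(n | s) P(h | s)."""
    pf = p_flu if f else 1 - p_flu
    pa = p_allergy if a else 1 - p_allergy
    ps = p_sinus[(f, a)] if s else 1 - p_sinus[(f, a)]
    pn = p_nose[s] if n else 1 - p_nose[s]
    ph = p_headache[s] if h else 1 - p_headache[s]
    return pf * pa * ps * pn * ph

# Ten stored numbers reproduce any of the 32 joint entries:
print(joint(True, False, True, True, True))  # 0.6*0.8*0.6*0.8*0.6 ≈ 0.138
```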

SLIDE 12

Naive Bayes

[Diagram: S with children W1, W2, W3, …, Wn; CPTs P(S), P(W1 | S), P(W2 | S), P(W3 | S), …, P(Wn | S). A spam filter!]

SLIDE 13

Uses

Things you can do with a Bayes Net:

  • Inference: given some variables, posterior?
  • (might be intractable: NP-hard)
  • Learning (fill in CPTs)
  • Structure Learning (fill in edges)
  • Generally:
  • Often few parents.
  • Inference cost often reasonable.
  • Can include domain knowledge.
SLIDE 14

Inference

[Diagram: the Flu/Allergy/Sinus/Nose/Headache network from before.]

What is: P(f | h)?

SLIDE 15

Inference

  • We know from definition of Bayes net:

  P(f | h) = P(f, h) / P(h)
           = Σ_{S,A,N} P(f, h, S, A, N) / Σ_{S,A,N,F} P(h, S, A, N, F)

  P(h) = Σ_{S,A,N,F} P(h, S, A, N, F)
       = Σ_{S,A,N,F} P(h | S) P(N | S) P(S | A, F) P(F) P(A)
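A sketch of this enumeration, reusing the joint() helper defined after the CPT slide (so the numbers match the tables above):

```python
from itertools import product

TF = (True, False)

# Numerator: sum over S, A, N of P(f = True, h = True, S, A, N).
num = sum(joint(True, a, s, n, True) for a, s, n in product(TF, repeat=3))

# Denominator P(h): additionally sum over F.
den = sum(joint(f, a, s, n, True) for f, a, s, n in product(TF, repeat=4))

print(num / den)  # P(flu | headache)
```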

SLIDE 16

Variable Elimination

So we have:

  • … we can eliminate variables one at a time:

  P(h) = Σ_{S,A,N,F} P(h | S) P(N | S) P(S | A, F) P(F) P(A)

  P(h) = Σ_{S,N} P(h | S) P(N | S) Σ_{A,F} P(S | A, F) P(F) P(A)    (distributive law)

  P(h) = Σ_S P(h | S) Σ_N P(N | S) Σ_{A,F} P(S | A, F) P(F) P(A)
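The same computation with the sums pushed inward, as a sketch using the CPT dictionaries from the earlier example:

```python
def prob_headache():
    """P(h) = Σ_S P(h|S) [Σ_N P(N|S)] [Σ_{A,F} P(S|A,F) P(F) P(A)]."""
    total = 0.0
    for s in (True, False):
        # Eliminate A and F first: Σ_{A,F} P(s | a, f) P(f) P(a).
        af = sum((p_sinus[(f, a)] if s else 1 - p_sinus[(f, a)])
                 * (p_flu if f else 1 - p_flu)
                 * (p_allergy if a else 1 - p_allergy)
                 for f in (True, False) for a in (True, False))
        # Eliminate N: Σ_N P(n | s) is exactly 1, so this factor drops out.
        n = sum((p_nose[s] if v else 1 - p_nose[s]) for v in (True, False))
        total += p_headache[s] * af * n
    return total

print(prob_headache())  # ≈ 0.549, matching full enumeration over all 16 terms
```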

SLIDE 17

Variable Elimination

Generically:

  • Query about Xi and Xj.
  • Write out P(X1 … Xn) in terms of P(Xi | parents(Xi))
  • Sum out all variables except Xi and Xj
  • Answer query using joint distribution P(Xi, Xj)
  • Good news:
  • Potentially exponential reduction in computation.
  • Polynomial for trees.
  • Bad news:
  • Picking the optimal elimination order is NP-hard.
  • For some networks, no elimination order avoids exponential cost.
SLIDE 18

Spam Filter (Naive Bayes)

[Diagram: S with children W1, W2, W3, …, Wn; CPTs P(S) and P(Wi | S).]

Want P(S | W1, …, Wn).

SLIDE 19

Naive Bayes

  P(S | W1, ..., Wn) = P(W1, ..., Wn | S) P(S) / P(W1, ..., Wn)

given

  P(W1, ..., Wn | S) = ∏_i P(Wi | S)    (from the Bayes net)
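A minimal spam-filter sketch applying both formulas, with invented word probabilities (all names and numbers hypothetical):

```python
# Invented parameters: P(S) and P(Wi | S) for a few words.
p_spam = 0.4
p_w_spam = {'viagra': 0.30, 'meeting': 0.05, 'free': 0.40}  # P(Wi | spam)
p_w_ham  = {'viagra': 0.01, 'meeting': 0.30, 'free': 0.10}  # P(Wi | not spam)

def posterior_spam(words):
    """P(S | W1..Wn) via Bayes' rule, with P(W1..Wn | S) = ∏_i P(Wi | S)."""
    like_spam, like_ham = p_spam, 1 - p_spam
    for w in words:  # absent words ignored here for brevity
        like_spam *= p_w_spam[w]
        like_ham  *= p_w_ham[w]
    # Denominator P(W1..Wn): sum over both values of S.
    return like_spam / (like_spam + like_ham)

print(posterior_spam(['viagra', 'free']))   # ≈ 0.99: almost surely spam
print(posterior_spam(['meeting']))          # ≈ 0.10: probably not spam
```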

SLIDE 20

Bayes Nets

Potentially very compressed, but exact:

  • Requires careful construction!

versus an approximate representation:

  • Hope you're not too wrong!

Many, many applications in all areas.