Bayesian Networks Machine Learning 10-601B Seyoung Kim - PowerPoint PPT Presentation

Bayesian ¡Networks ¡ Machine ¡Learning ¡10-‑601B ¡ Seyoung ¡Kim ¡ Many ¡of ¡these ¡slides ¡are ¡derived ¡from ¡William ¡ Cohen. ¡Thanks! ¡

Bayesian ¡Networks ¡ B ¡ E ¡ ¡ ¡ ¡ B ¡– ¡Did ¡a ¡burglary ¡occur? ¡ ¡ ¡ ¡ ¡E ¡– ¡Did ¡an ¡earthquake ¡occur? ¡ ¡ ¡ ¡ ¡A ¡– ¡Did ¡the ¡alarm ¡sound ¡off? ¡ ¡ ¡ ¡ ¡M ¡– ¡Mary ¡calls ¡ A ¡ ¡ ¡ ¡ ¡J ¡– ¡John ¡calls ¡ J ¡ M

Bayesian ¡network: ¡Inference ¡ • Once ¡the ¡network ¡is ¡constructed, ¡we ¡can ¡use ¡algorithms ¡for ¡ inferring ¡the ¡values ¡of ¡unobserved ¡variables. ¡ • For ¡example, ¡in ¡our ¡previous ¡network ¡the ¡only ¡observed ¡ variables ¡are ¡the ¡phone ¡calls. ¡However, ¡what ¡we ¡are ¡really ¡ interested ¡in ¡is ¡whether ¡there ¡was ¡a ¡burglary ¡or ¡not. ¡ • How ¡can ¡we ¡determine ¡that? ¡

Inference ¡ • Let’s ¡start ¡with ¡a ¡simpler ¡quesTon ¡ ¡ ¡ ¡-‑ ¡How ¡can ¡we ¡compute ¡a ¡joint ¡distribuTon ¡from ¡the ¡network? ¡ ¡ ¡ ¡-‑ ¡For ¡example, ¡P(B, ¬ E,A,J, ¡ ¬ M)? ¡ • Answer: ¡ ¡ ¡ ¡-‑ ¡That’s ¡easy, ¡let’s ¡use ¡the ¡network ¡

Compu;ng: ¡P(B, ¬ E,A,J, ¡ ¬ M) ¡ P(B, ¬ E,A,J, ¡ ¬ M) ¡= ¡ ¡ P(B)=.05 ¡ P(E)=.1 ¡ B ¡ E ¡ P(B)P( ¬ E)P(A ¡| ¡B, ¡ ¬ E) ¡P(J ¡| ¡A)P( ¬ M ¡| ¡A) ¡ = ¡0.05*0.9*.85*.7*.2 ¡ = ¡0.005355 ¡ P(A|B,E) ¡=.95 ¡ P(A|B, ¬ E) ¡= ¡.85 ¡ A ¡ P(A| ¡ ¬ ¡B,E) ¡=.5 ¡ P(A| ¡ ¬ ¡B, ¡ ¬ ¡E) ¡= ¡.05 ¡ P(J|A) ¡)=.7 ¡ J ¡ M P(J| ¬ A) ¡= ¡.05 ¡ P(M|A) ¡=.8 ¡ P(M| ¬ A) ¡= ¡.15 ¡

Compu;ng: ¡P(B, ¬ E,A,J, ¡ ¬ M) ¡ P(B, ¬ E,A,J, ¡ ¬ M) ¡= ¡ ¡ P(B)=.05 ¡ P(E)=.1 ¡ B ¡ E ¡ P(B)P( ¬ E)P(A ¡| ¡B, ¡ ¬ E) ¡P(J ¡| ¡A)P( ¬ M ¡| ¡A) ¡ = ¡0.05*0.9*.85*.7*.2 ¡ = ¡0.005355 ¡ P(A|B,E) ¡)=.95 ¡ We ¡can ¡easily ¡compute ¡a ¡ ¡ P(A|B, ¬ E) ¡= ¡.85 ¡ complete ¡joint ¡distribuTon. ¡ A ¡ P(A| ¡ ¬ ¡B,E) ¡)=.5 ¡ What ¡about ¡parTal ¡ P(A| ¡ ¬ ¡B, ¡ ¬ ¡E) ¡= ¡.05 ¡ distribuTons? ¡ ¡CondiTonal ¡ distribuTons? ¡ P(J|A) ¡)=.7 ¡ J ¡ M P(J| ¬ A) ¡= ¡.05 ¡ P(M|A) ¡)=.8 ¡ P(M| ¬ A) ¡= ¡.15 ¡

Inference ¡ • We ¡are ¡interested ¡in ¡queries ¡of ¡the ¡form: ¡ ¡ ¡ ¡ ¡P(B ¡| ¡J, ¬ M) ¡ • This ¡can ¡also ¡be ¡wricen ¡as: ¡ B ¡ E ¡ A ¡ • How ¡do ¡we ¡compute ¡the ¡new ¡joint? ¡ J ¡ M

Inference ¡in ¡Bayesian ¡networks ¡ We ¡will ¡discuss ¡three ¡methods: ¡ • 1. EnumeraTon ¡ ¡ 2. Variable ¡eliminaTon ¡ 3. StochasTc ¡inference ¡

Compu;ng ¡par;al ¡joints ¡ Sum ¡all ¡instances ¡with ¡these ¡seengs ¡(the ¡sum ¡is ¡ over ¡the ¡possible ¡assignments ¡to ¡the ¡other ¡two ¡ variables, ¡E ¡and ¡A) ¡

Compu;ng: ¡P(B,J, ¡ ¬ M) ¡ P(B,J, ¡ ¬ M) ¡= ¡ ¡ P(B)=.05 ¡ P(E)=.1 ¡ B ¡ E ¡ P(B,J, ¡ ¬ M,A,E) ¡+ ¡ ¡ P(B,J, ¡ ¬ M, ¡ ¬ ¡A,E) ¡+ ¡ ¡ P(B,J, ¡ ¬ M,A, ¡ ¬ ¡E) ¡+ ¡ ¡ P(A|B,E) ¡)=.95 ¡ P(A|B, ¬ E) ¡= ¡.85 ¡ P(B,J, ¡ ¬ M, ¡ ¬ ¡A, ¡ ¬ ¡E) ¡ ¡ A ¡ P(A| ¡ ¬ ¡B,E) ¡)=.5 ¡ = ¡0.0007+0.00001+0.005+0.0003 ¡ P(A| ¡ ¬ ¡B, ¡ ¬ ¡E) ¡= ¡.05 ¡ = ¡0.00601 ¡ P(J|A) ¡)=.7 ¡ J ¡ M P(J| ¬ A) ¡= ¡.05 ¡ P(M|A) ¡)=.8 ¡ P(M| ¬ A) ¡= ¡.15 ¡

Compu;ng ¡par;al ¡joints ¡ Sum ¡all ¡instances ¡with ¡these ¡seengs ¡(the ¡sum ¡is ¡over ¡the ¡ possible ¡assignments ¡to ¡the ¡other ¡two ¡variables, ¡E ¡and ¡A) ¡ • ¡This ¡method ¡can ¡be ¡improved ¡by ¡re-‑using ¡calculaTons ¡(similar ¡to ¡ dynamic ¡programming) ¡ • ¡STll, ¡the ¡number ¡of ¡possible ¡assignments ¡is ¡exponenTal ¡in ¡the ¡ number ¡of ¡unobserved ¡variables? ¡ • ¡That ¡is, ¡unfortunately, ¡the ¡best ¡we ¡can ¡do. ¡General ¡querying ¡of ¡ Bayesian ¡networks ¡is ¡NP-‑complete ¡

Inference ¡in ¡Bayesian ¡networks ¡is ¡NP ¡ complete ¡(sketch) ¡ • ReducTon ¡from ¡3SAT ¡ • Recall: ¡3SAT, ¡find ¡saTsfying ¡assignments ¡to ¡the ¡following ¡ problem: ¡(a ¡ ∨ ¡b ¡ ∨ ¡c) ¡ ∧ ¡(d ¡ ∨ ¡ ¬ ¡b ¡ ∨ ¡ ¬ ¡c) ¡… ¡ What ¡is ¡P(Y=1)? ¡ P(x i =1) ¡= ¡0.5 ¡ P(x i =1) ¡= ¡ P (( x 1 ¡ ∨ ¡ x 2 ¡ ∨ ¡ x 3 )=1) ¡ ¡ P(Y=1) ¡= ¡ P (( x 1 ¡ ∧ ¡ x 2 ¡ ∧ ¡ x 3 ¡ ∧ ¡ x 4 )=1) ¡ ¡ Y ¡

Variable ¡elimina;on ¡ P(B,J, ¡ ¬ M) ¡= ¡ ¡ P(B)=.05 ¡ P(E)=.1 ¡ B ¡ E ¡ P(B,J, ¡ ¬ M,A,E)+ ¡ ¡ P(B,J, ¡ ¬ M, ¡ ¬ ¡A,E) ¡+ ¡ ¡ P(B,J, ¬ M,A, ¡ ¬ ¡E) ¡+ ¡ ¡ P(A|B,E) ¡)=.95 ¡ P(A|B, ¬ E) ¡= ¡.85 ¡ P(B,J, ¡ ¬ M, ¡ ¬ ¡A, ¡ ¬ ¡E) ¡ ¡ A ¡ P(A| ¡ ¬ ¡B,E) ¡)=.5 ¡ = ¡0.0007+0.00001+0.005+0.0003 ¡ ¡ P(A| ¡ ¬ ¡B, ¡ ¬ ¡E) ¡= ¡.05 ¡ = ¡0.00601 ¡ P(J|A) ¡)=.7 ¡ J ¡ M P(J| ¬ A) ¡= ¡.05 ¡ P(M|A) ¡)=.8 ¡ Reuse ¡computaTons ¡ P(M| ¬ A) ¡= ¡.15 ¡ rather ¡than ¡recompute ¡ probabiliTes ¡

Compu;ng: ¡P(B,J, ¡ ¬ M) ¡ B ¡ E ¡ P(B,J, ¡ ¬ M) ¡= ¡ ¡ P(B,J, ¡ ¬ M,A,E)+ ¡ ¡ P(B,J, ¡ ¬ M, ¡ ¬ ¡A,E) ¡+ ¡ ¡ A ¡ P(B,J, ¡ ¬ M,A, ¡ ¬ ¡E) ¡+ ¡ ¡P(B,J, ¡ ¬ M, ¡ ¬ ¡A, ¡ ¬ ¡E) ¡= ¡ J ¡ M Store ¡as ¡a ¡funcTon ¡of ¡a ¡and ¡use ¡whenever ¡ necessary ¡(no ¡need ¡to ¡recompute ¡each ¡ Tme) ¡

Variable ¡elimina;on ¡ B ¡ E ¡ Set: ¡ A ¡ J ¡ M

Variable ¡elimina;on ¡ B ¡ E ¡ Lets ¡conTnue ¡with ¡these ¡funcTons: ¡ We ¡can ¡now ¡define ¡the ¡following ¡funcTon: ¡ A ¡ And ¡so ¡we ¡can ¡write: ¡ J ¡ M

Variable ¡elimina;on ¡ Lets ¡conTnue ¡with ¡another ¡funcTon: ¡ B ¡ E ¡ And ¡finally ¡we ¡can ¡write: ¡ A ¡ J ¡ M

Final ¡computa;on ¡(normaliza;on) ¡

Algorithm ¡ • e ¡-‑ ¡evidence ¡(the ¡variables ¡that ¡are ¡known) ¡ • vars ¡-‑ ¡the ¡condiTonal ¡probabiliTes ¡derived ¡from ¡the ¡network ¡ in ¡reverse ¡order ¡(bocom ¡up) ¡ • For ¡each ¡ var ¡in ¡ vars ¡ ¡ ¡ ¡ ¡-‑ ¡ factors ¡<-‑ ¡make_factor ¡( var,e ) ¡ ¡ ¡ ¡ ¡-‑ ¡if ¡ var ¡is ¡a ¡hidden ¡variable ¡then ¡create ¡a ¡new ¡factor ¡by ¡ summing ¡out ¡ var ¡ • Compute ¡the ¡product ¡of ¡all ¡factors ¡ • Normalize ¡

Computa;onal ¡complexity ¡ • We ¡are ¡reusing ¡computaTons ¡so ¡we ¡are ¡reducing ¡the ¡running ¡ Tme. ¡ • However, ¡there ¡are ¡sTll ¡cases ¡in ¡which ¡this ¡algorithm ¡will ¡lead ¡ to ¡exponenTal ¡running ¡Tme. ¡ • Consider ¡the ¡case ¡of ¡ f x (y 1 ¡… ¡y n ). ¡ When ¡factoring ¡x ¡out ¡we ¡ would ¡need ¡to ¡account ¡for ¡all ¡possible ¡values ¡of ¡the ¡y’s. ¡ Variable ¡eliminaTon ¡can ¡lead ¡to ¡ significant ¡cost ¡saving ¡but ¡its ¡ efficiency ¡depends ¡on ¡the ¡ network ¡structure ¡

Bayesian Networks Machine Learning 10-601B Seyoung Kim - PowerPoint PPT Presentation

Bayesian Networks Machine Learning 10-601B Seyoung Kim Many of these slides are derived from William Cohen. Thanks! Bayesian Networks B E

CS 331: Bayesian Networks 2 1 Bayesian Networks Youve heard about how Bayesian networks

Bayesian Networks Youve heard about how Bayesian networks have revolutionized AI

Being Bayesian About Being Bayesian About Net work St ruct ure Net work St ruct ure A Bayesian

Outline Intro to RL and Bayesian Learning History of Bayesian RL Model-based Bayesian

Bayesian networks (2) Lirong Xia Last class Bayesian networks compact, graphical

AND MACHINE LEARNING CHAPTER 8: GRAPHICAL MODELS Bayesian Networks Directed Acyclic Graph (DAG)

Bayesian Methods for Neural Networks Readings: Bishop, Neural Networks for Pattern Recognition .

Chapter14 Probabilistic Reasoning (Bayesian Networks) Sec. 1 - 2 20070607 Chap14 1

CS440/ECE448 Lecture 15: Bayesian Inference and Bayesian Learning Slides by Svetlana Lazebnik,

Bayesian Learning 1 Outline MLE, MAP vs. Bayesian Learning Bayesian Linear Regression

Bayesian Networks Philipp Koehn 2 April 2020 Philipp Koehn Artificial Intelligence: Bayesian

Bayesian Networks Philipp Koehn 6 April 2017 Philipp Koehn Artificial Intelligence: Bayesian

Probabilistic Modeling: Bayesian Networks Bioinformatics: Sequence Analysis COMP 571 - Spring

Bayesian Networks Li Xiong Slide credits: Page (Wisconsin) CS760 , Zhu (Wisconsin) KDD 12

Bayesian Networks Philipp Koehn 29 October 2015 Philipp Koehn Artificial Intelligence: Bayesian

Part 7 Bayesian hierarchical modelling, simulation and MCMC by Gero Walter 252 Bayesian

Role-Based Architecture & Network Pointers Jona Schoch Seminar in Distributed Computing Oct

Will This Paper Increase Your h -index? Scientific Impact Prediction Yuxiao Dong, Reid A.

Network impact of Web access to device APIs W3C Workshop on Security for Access to Device APIs

Clubs, Beliefs, and Entrapment Scott Duke Kominers Harvard University AMS Session on Behavioral

Based on the Number of Queries Kenta Iwasaki, Kazuyuki Shudo Tokyo Institute of Technology Tokyo

discrimination via semantic segmentation Andy Chappell 11/12/2019 DUNE UK Meeting 2 Roadmap

5G, Preparing the Future Network Society 19 January 2017 Insert Confidentiality Level in slide

Network Science Barab asi: Ch. 1 Introduction Joao Meidanis University of Campinas,

Bayesian Networks Machine Learning 10-601B Seyoung Kim - PowerPoint PPT Presentation

Bayesian Networks Machine Learning 10-601B Seyoung Kim Many of these slides are derived from William Cohen. Thanks! Bayesian Networks B E

CS 331: Bayesian Networks 2 1 Bayesian Networks Youve heard about how Bayesian networks

Bayesian Networks Youve heard about how Bayesian networks have revolutionized AI

Being Bayesian About Being Bayesian About Net work St ruct ure Net work St ruct ure A Bayesian

Outline Intro to RL and Bayesian Learning History of Bayesian RL Model-based Bayesian

Bayesian networks (2) Lirong Xia Last class Bayesian networks compact, graphical

AND MACHINE LEARNING CHAPTER 8: GRAPHICAL MODELS Bayesian Networks Directed Acyclic Graph (DAG)

Bayesian Methods for Neural Networks Readings: Bishop, Neural Networks for Pattern Recognition .

Chapter14 Probabilistic Reasoning (Bayesian Networks) Sec. 1 - 2 20070607 Chap14 1

CS440/ECE448 Lecture 15: Bayesian Inference and Bayesian Learning Slides by Svetlana Lazebnik,

Bayesian Learning 1 Outline MLE, MAP vs. Bayesian Learning Bayesian Linear Regression

Bayesian Networks Philipp Koehn 2 April 2020 Philipp Koehn Artificial Intelligence: Bayesian

Bayesian Networks Philipp Koehn 6 April 2017 Philipp Koehn Artificial Intelligence: Bayesian

Probabilistic Modeling: Bayesian Networks Bioinformatics: Sequence Analysis COMP 571 - Spring

Bayesian Networks Li Xiong Slide credits: Page (Wisconsin) CS760 , Zhu (Wisconsin) KDD 12

Bayesian Networks Philipp Koehn 29 October 2015 Philipp Koehn Artificial Intelligence: Bayesian

Part 7 Bayesian hierarchical modelling, simulation and MCMC by Gero Walter 252 Bayesian

Role-Based Architecture &amp; Network Pointers Jona Schoch Seminar in Distributed Computing Oct

Will This Paper Increase Your h -index? Scientific Impact Prediction Yuxiao Dong, Reid A.

Network impact of Web access to device APIs W3C Workshop on Security for Access to Device APIs

Clubs, Beliefs, and Entrapment Scott Duke Kominers Harvard University AMS Session on Behavioral

Based on the Number of Queries Kenta Iwasaki, Kazuyuki Shudo Tokyo Institute of Technology Tokyo

discrimination via semantic segmentation Andy Chappell 11/12/2019 DUNE UK Meeting 2 Roadmap

5G, Preparing the Future Network Society 19 January 2017 Insert Confidentiality Level in slide

Network Science Barab asi: Ch. 1 Introduction Joao Meidanis University of Campinas,

Role-Based Architecture & Network Pointers Jona Schoch Seminar in Distributed Computing Oct