
Lecture 12: Uncertainty - 3

Victor Lesser CMPSCI 683 Fall 2004


Outline

  • Continuation of Inference in Belief Networks
  • Automated Belief Propagation in Polytrees

d-separation: Direction-Dependent Separation

  • Network construction
    – Conditional independence of a node and its predecessors, given its parents
    – The absence of a link between two variables does not guarantee their independence
  • Effective inference needs to exploit all available conditional independences
    – Which sets of nodes X are conditionally independent of another set Y, given a set of evidence nodes E?
      P(X,Y | E) = P(X | E) · P(Y | E)
    – Limits propagation of information
    – Comes directly from the structure of the network


d-separation

Definition: If X, Y, and E are three disjoint subsets of nodes in a DAG, then E is said to d-separate X from Y if every undirected path from X to Y is blocked by E. A path is blocked if it contains a node Z such that:

(1) Z is in E and has one incoming and one outgoing arrow on the path; or
(2) Z is in E and has two outgoing arrows on the path; or
(3) Z has two incoming arrows on the path, and neither Z nor any of its descendants is in E.
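To make the blocking test concrete, here is a minimal Python sketch of the three cases for a single, explicitly given path. It is illustrative only: full d-separation must check every undirected path between X and Y, and the parents-dict representation and function names are my own, not from the lecture.

    def descendants(node, parents):
        """All nodes reachable from `node` by following arrows forward."""
        children = {n: {c for c in parents if n in parents[c]} for n in parents}
        found, stack = set(), [node]
        while stack:
            for c in children[stack.pop()]:
                if c not in found:
                    found.add(c)
                    stack.append(c)
        return found

    def path_blocked(path, evidence, parents):
        """True if some interior node Z blocks this undirected path."""
        for i in range(1, len(path) - 1):
            prev, z, nxt = path[i - 1], path[i], path[i + 1]
            arrow_in_from_prev = prev in parents[z]   # arrow prev -> Z ?
            arrow_in_from_next = nxt in parents[z]    # arrow nxt -> Z ?
            if arrow_in_from_prev and arrow_in_from_next:
                # case (3): head-to-head at Z; blocks only if neither Z
                # nor any descendant of Z is in the evidence
                if z not in evidence and not (descendants(z, parents) & evidence):
                    return True
            elif z in evidence:
                # cases (1) and (2): a chain or fork node that is in E
                return True
        return False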

d-separation cont.

[Figure: the three ways a node Z can block a path from X to Y given evidence E: a chain through Z, a fork at Z, and a head-to-head node Z.]


d-separation cont.

  • Property of belief networks: if X and Y are d-separated by E, then X and Y are conditionally independent given E.
  • An "if-and-only-if" relationship between the graph and the probabilistic model cannot always be achieved.


d-separation example - case 1

Whether there is Gas in the car and whether the car Radio plays are independent given evidence about whether the SparkPlugs fire [Ignition] (case 1).

P(R,G | I) = P(R | I) · P(G | I)
P(G | I,R) = P(G | I)

[Figure: car network (Battery → Radio, Battery → Ignition; Ignition, Gas → Starts; Starts → Moves). Ignition is the evidence node E, blocking the path at the chain node Z.]

d-separation example - case 2

Gas and Radio are conditionally independent if it is known whether the Battery works (case 2).

P(R | B,G) = P(R | B); P(G | B,R) = P(G | B)

[Figure: the same car network. Battery is the evidence node E, blocking the path at the fork node Z.]


d-separation example - case 3

Gas and Radio are independent given no evidence at all. But they are dependent given evidence about whether the car Starts. For example, if the car does not start, then the radio playing is increased evidence that we are out of gas. Gas and Radio are also dependent given evidence about whether the car Moves, because that is enabled by the car starting.

P(Gas | Radio) = P(Gas); P(Radio | Gas) = P(Radio)
P(Gas | Radio,Starts) ≠ P(Gas | Starts)

[Figure: the same car network. Starts and Moves are head-to-head nodes Z, but here they are also in E (case 3), so the path is not blocked.]
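The path-blocking sketch from earlier reproduces case 3; the network encoding below is mine, following the figure:

    parents = {'Battery': set(), 'Gas': set(),
               'Radio': {'Battery'}, 'Ignition': {'Battery'},
               'Starts': {'Ignition', 'Gas'}, 'Moves': {'Starts'}}
    path = ['Radio', 'Battery', 'Ignition', 'Starts', 'Gas']

    print(path_blocked(path, set(), parents))        # True: Starts blocks (case 3)
    print(path_blocked(path, {'Starts'}, parents))   # False: evidence on Starts unblocks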


Inference in Belief Networks

  • BNs are a fairly expressive and easily engineered representation for knowledge in probabilistic domains.
  • They facilitate the development of inference algorithms.
  • They are particularly suited for parallelization.
  • Current inference algorithms are efficient and can solve large real-world problems.


Network Features Affect Reasoning

  • Topology (trees, singly-connected, sparsely-connected, DAGs).
  • Size (number of nodes).
  • Type of variables (discrete, continuous, functional, noisy-logical, mixed).
  • Network dynamics (static, dynamic).


Belief Propagation in Polytrees

  • Polytree belief network, where nodes are singly connected: exact inference is linear in the size of the network.
  • Multiconnected belief network (a DAG, but not a polytree): exact inference is NP-hard in the worst case.


Reasoning in Belief Networks

Simple examples of 4 patterns of reasoning that can be handled by belief networks: Diagnostic, Causal, Intercausal (Explaining Away), and Mixed. E represents an evidence variable; Q is a query variable. In each case the query is P(Q | E) = ?

[Figure: four small networks, one per pattern, showing the relative positions of the evidence node(s) E and the query node Q.]


Belief Network Calculation in Polytree: Evidence Above

  • What is p(Y5 | Y1,Y4)?
    – Define in terms of CPTs: p(Y5,Y4,Y3,Y2,Y1) = p(Y5 | Y3,Y4) p(Y4) p(Y3 | Y1,Y2) p(Y2) p(Y1)
    – p(Y5 | Y1,Y4) = p(Y5,Y1,Y4) / p(Y1,Y4)
    – Use the CPTs to sum over the missing variables: p(Y5,Y1,Y4) = Σ_{Y2,Y3} p(Y5,Y4,Y3,Y2,Y1)
    – (assuming variables take on only truth or falsity)
  • p(Y5 | Y1,Y4) = p(Y5,Y3 | Y1,Y4) + p(Y5,¬Y3 | Y1,Y4)
    – Connect to the parents of Y5 not already part of the expression, by marginalization
  • = Σ_{Y3} p(Y5,Y3 | Y1,Y4)

[Figure: polytree with Y1, Y2 → Y3 and Y3, Y4 → Y5.]


Continuation of Example Above

  • = Σ_{Y3} p(Y5 | Y3,Y1,Y4) · p(Y3 | Y1,Y4)
    – Product rule: P(si,sj | d) = P(si | sj,d) P(sj | d)
  • = Σ_{Y3} p(Y5 | Y3,Y4) · p(Y3 | Y1,Y4)
    – Y1 conditionally independent of Y5 given Y3
    – Y3 represents all the contributions of Y1 to Y5
    – Case 1: a node is conditionally independent of its non-descendants given its parents
  • = Σ_{Y3} p(Y5 | Y3,Y4) · p(Y3 | Y1)
    – Y3 conditionally independent of Y4 given Y1
    – Case 3: Y5 is a head-to-head node on the path Y3 → Y5 ← Y4, and neither Y5 nor its descendants are in the evidence, so the path between Y3 and Y4 is blocked


Continuation of Example Above

  • = Σ_{Y3} p(Y5 | Y3,Y4) · (Σ_{Y2} p(Y3,Y2 | Y1))
    – Connect to the parents of Y3 not already part of the expression
  • = Σ_{Y3} p(Y5 | Y3,Y4) · (Σ_{Y2} p(Y3 | Y1,Y2) · p(Y2 | Y1))
    – Product rule: p(si,sj | d) = p(si | sj,d) p(sj | d)
  • = Σ_{Y3} p(Y5 | Y3,Y4) · (Σ_{Y2} p(Y3 | Y1,Y2) · p(Y2))
    – Y2 independent of Y1: p(Y2 | Y1) = p(Y2)
    – Definition of Bayesian network
    – The final expression can be computed directly from the CPTs; see the sketch below
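As a sanity check, here is a small Python sketch, with invented CPT numbers, that computes p(Y5 | Y1,Y4) both by brute-force enumeration of the joint and by the derived expression; the two agree.

    import itertools

    # Invented CPTs for the polytree Y1, Y2 -> Y3 and Y3, Y4 -> Y5.
    # Every variable is boolean; tables give p(child = True | parents).
    pY1 = {True: 0.6, False: 0.4}
    pY2 = {True: 0.3, False: 0.7}
    pY4 = {True: 0.5, False: 0.5}
    pY3 = {(True, True): 0.9, (True, False): 0.6,
           (False, True): 0.4, (False, False): 0.1}   # p(Y3=T | Y1, Y2)
    pY5 = {(True, True): 0.8, (True, False): 0.5,
           (False, True): 0.3, (False, False): 0.05}  # p(Y5=T | Y3, Y4)

    def c(table, value, *given):
        """Conditional probability of `value` given the parent values."""
        q = table[given]
        return q if value else 1 - q

    def joint(y1, y2, y3, y4, y5):
        """Joint probability from the CPT factorization."""
        return (pY1[y1] * pY2[y2] * pY4[y4]
                * c(pY3, y3, y1, y2) * c(pY5, y5, y3, y4))

    B = [True, False]

    # Brute force: p(Y5=T | Y1=T, Y4=T) = p(Y5,Y1,Y4) / p(Y1,Y4)
    num = sum(joint(True, y2, y3, True, True)
              for y2, y3 in itertools.product(B, B))
    den = sum(joint(True, y2, y3, True, y5)
              for y2, y3, y5 in itertools.product(B, B, B))

    # Derived form: Sum(Y3) p(Y5|Y3,Y4) * Sum(Y2) p(Y3|Y1,Y2) p(Y2)
    derived = sum(c(pY5, True, y3, True)
                  * sum(c(pY3, y3, True, y2) * pY2[y2] for y2 in B)
                  for y3 in B)

    print(num / den, derived)   # identical values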


Belief Network Calculation in Polytree: Evidence Below

  • What is p(Y1 | Y5)?
    – p(Y1 | Y5) = p(Y1,Y5) / p(Y5)
    – In terms of CPTs: p(Y1,Y2,Y3,Y4,Y5) = p(Y5 | Y3,Y4) p(Y3 | Y1,Y2) p(Y1) p(Y2) p(Y4)
  • p(Y1 | Y5) = p(Y5 | Y1) p(Y1) / p(Y5)
    – Bayes' rule
  • = K · p(Y5 | Y1) p(Y1)


Continuation of Example Below

  • = K · p(Y5 | Y1) p(Y1)
  • = K · (Σ_{Y3} p(Y5 | Y3) p(Y3 | Y1)) p(Y1)
    – Connect to Y3, a parent of Y5 not already part of the expression:
      P(si | sj) = Σ_d P(si | sj,d) P(d | sj)
    – Y1 conditionally independent of Y5 given Y3: p(Y5 | Y3,Y1) = p(Y5 | Y3)
  • = K · (Σ_{Y3} (Σ_{Y4} p(Y5 | Y3,Y4) p(Y4 | Y3)) p(Y3 | Y1)) p(Y1)
    – Connect to Y4, a parent of Y5 not already part of the expression:
      P(si | sj) = Σ_d P(si | sj,d) P(d | sj)
  • = K · (Σ_{Y3} (Σ_{Y4} p(Y5 | Y3,Y4) p(Y4)) p(Y3 | Y1)) p(Y1)
    – Y4 independent of Y3: p(Y4 | Y3) = p(Y4)


Continuation of Example Below

  • = K · (Σ_{Y3} (Σ_{Y4} p(Y5 | Y3,Y4) p(Y4)) p(Y3 | Y1)) p(Y1)
  • = K · (Σ_{Y3} (Σ_{Y4} p(Y5 | Y3,Y4) p(Y4)) (Σ_{Y2} p(Y3 | Y1,Y2) p(Y2 | Y1))) p(Y1)
    – Connect to Y2, a parent of Y3 not already part of the expression:
      P(si | sj) = Σ_d P(si | sj,d) P(d | sj)
  • = K · (Σ_{Y3} (Σ_{Y4} p(Y5 | Y3,Y4) p(Y4)) (Σ_{Y2} p(Y3 | Y1,Y2) p(Y2))) p(Y1)
    – Y2 independent of Y1
    – This is an expression that can be calculated from the CPTs (see the sketch below)
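Continuing the earlier sketch (same invented CPTs and helpers), the derived bottom-up expression yields the posterior p(Y1 | Y5=T) after normalizing over both values of Y1:

    def likelihood(y1):
        """p(Y5=T | Y1=y1), evaluated from the derived expression."""
        return sum(
            sum(c(pY5, True, y3, y4) * pY4[y4] for y4 in B)
            * sum(c(pY3, y3, y1, y2) * pY2[y2] for y2 in B)
            for y3 in B)

    unnorm = {y1: likelihood(y1) * pY1[y1] for y1 in B}
    k = 1 / sum(unnorm.values())                    # K = 1 / p(Y5=T)
    print({y1: k * v for y1, v in unnorm.items()})  # p(Y1 | Y5=T)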


Variable Elimination

  • Can remove a lot of re-calculation/multiplication in the expression
    K · (Σ_{Y3} (Σ_{Y4} p(Y5 | Y3,Y4) p(Y4)) (Σ_{Y2} p(Y3 | Y1,Y2) p(Y2))) p(Y1)
  • Summations over each variable are done only for those portions of the expression that depend on the variable
  • Save the results of the inner sums to avoid repeated calculation
    – Create intermediate functions, e.g. F_Y2(Y3,Y1) = Σ_{Y2} p(Y3 | Y1,Y2) p(Y2)
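In code (again reusing the invented CPTs and helpers above), each inner sum becomes a cached intermediate factor, computed once rather than for every value of the outer variables:

    # Intermediate factors, each computed exactly once:
    F_Y2 = {(y3, y1): sum(c(pY3, y3, y1, y2) * pY2[y2] for y2 in B)
            for y3 in B for y1 in B}
    F_Y4 = {y3: sum(c(pY5, True, y3, y4) * pY4[y4] for y4 in B)
            for y3 in B}

    # p(Y5=T | Y1) is now a single sum over Y3 that reuses the factors:
    lik = {y1: sum(F_Y4[y3] * F_Y2[(y3, y1)] for y3 in B) for y1 in B}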


Evidence Above and Below for Polytrees

If there is evidence both above and below, e.g. P(Y3 | Y5,Y2), we separate the evidence into an above portion, e+, and a below portion, e−, and use a version of Bayes' rule to write:

p(Q | e+,e−) = p(e− | Q,e+) p(Q | e+) / p(e− | e+)

We treat 1 / p(e− | e+) as a normalizing factor k2 and write:

p(Q | e+,e−) = k2 p(e− | Q,e+) p(Q | e+)

Q d-separates e− from e+, so:

p(Q | e+,e−) = k2 p(e− | Q) p(Q | e+)

The factor p(Q | e+) is calculated as part of the top-down procedure; p(e− | Q) is calculated directly by the bottom-up procedure.


Other types of queries

  • Most probable explanation (MPE) or most likely hypothesis: the instantiation of all the remaining variables U with the highest probability given the evidence:
    MPE(U | e) = argmax_u P(u,e)
  • Maximum a posteriori (MAP): the instantiation of some variables V with the highest probability given the evidence:
    MAP(V | e) = argmax_v P(v,e)
    Note that the assignment to A in MAP(A | e) might be completely different from the assignment to A in MAP({A,B} | e): MAP(A | e) sums over the values of B, while MAP({A,B} | e) commits to individual values of B (see the example below).
  • Other queries: probability of an arbitrary logical expression over the query variables, decision policies, information value, seeking evidence, information-gathering planning, etc.
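A tiny worked example (all numbers invented) of why the MAP assignment to A can change when B joins the query set:

    # P(a, b | e) for two binary variables:
    P = {('a1', 'b1'): 0.30, ('a1', 'b2'): 0.00,
         ('a2', 'b1'): 0.20, ('a2', 'b2'): 0.25}

    # MAP(A | e): sum out B, then maximize -> a2 (0.45 beats 0.30)
    pa = {}
    for (a, b), v in P.items():
        pa[a] = pa.get(a, 0.0) + v
    print(max(pa, key=pa.get))      # 'a2'

    # MAP({A,B} | e): maximize over joint values -> ('a1', 'b1')
    print(max(P, key=P.get))        # ('a1', 'b1')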


Incremental Updating of BN: Pearl's Message Passing Algorithm

Notation:

M_{y|x}: the conditional probability matrix P(y | x)
e: the evidence
Bel(x) = P(x | e): the posterior distribution of x
f(x) · M_{y|x} = Σ_x f(x) M_{y|x} (vector-matrix product)


Simple chains

[Figure: chain T → U → X → Y, with causal evidence e+ entering from above and evidential evidence e− entering from below.]

e = {e+, e−}
e+ represents the "causal" evidence
e− represents the "evidential" evidence
Need to compute Bel(x)


Simple Chains cont.

Bel(x) = P(x | e+,e−)
       = P(e− | x,e+) P(x | e+) / P(e− | e+)   (Bayes' rule)
       = α P(e− | x,e+) P(x | e+)              (normalization)
       = α P(e− | x) P(x | e+)                 (x d-separates e+ and e−)
       = α λ(x) π(x)


The λ(x) and π(x) Messages

λ(x) represents the degree to which x might explain the evidential support: λ(x) = P(e− | x)
π(x) represents the direct causal support for x: π(x) = P(x | e+)

Both λ(x) and π(x) can be calculated in terms of the λ and π values of the neighbors of x.

[Figure: the same chain T → U → X → Y with e+ above and e− below.]


Computing λ(x) based on λ(y)

λ(x) = P(e− | x) = Σ_y P(e− | x,y) P(y | x)
     = Σ_y P(e− | y) P(y | x)   (y d-separates x from e−)
     = Σ_y λ(y) P(y | x)
     = λ(y) · M_{y|x}


Computing π(x) based on π(u)

π(x) = P(x | e+) = Σ_u P(x | u,e+) P(u | e+)
     = Σ_u P(x | u) P(u | e+)   (u d-separates x from e+)
     = Σ_u P(x | u) π(u)
     = π(u) · M_{x|u}


Update scheme for chains

[Figure: chain T → U → X → Y with e+ above and e− below. π(u) flows forward through M_{x|u} to produce π(x); λ(y) flows backward through M_{y|x} to produce λ(x); Bel(x) combines the two.]
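Here is a minimal numpy sketch of one such update on a short chain, under the assumption of binary variables and invented CPTs; the evidence is attached at the downstream end.

    import numpy as np

    pT = np.array([0.7, 0.3])            # prior P(T); root: π(T) = prior
    M_U_T = np.array([[0.9, 0.1],        # M_{u|t}: row t, column u
                      [0.2, 0.8]])
    M_X_U = np.array([[0.6, 0.4],        # M_{x|u}: row u, column x
                      [0.1, 0.9]])

    lam_X = np.array([0.0, 1.0])         # X observed at value 1: λ(x) = (0, 1)

    pi_U = pT @ M_U_T                    # π(u) = Σ_t P(u|t) π(t)
    lam_U = M_X_U @ lam_X                # λ(u) = Σ_x P(x|u) λ(x)

    bel_U = lam_U * pi_U
    bel_U /= bel_U.sum()                 # Bel(u) = α λ(u) π(u)
    print(bel_U)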


Belief Propagation in Trees

  • Each node must combine the impact of λ-messages from several children.
  • Each node must distribute a separate π-message to each child.

[Figure: node X with parent U and children Y and Z; causal evidence e+_X arrives from above, and evidential evidence e−_Y and e−_Z arrive from the children.]


Propagation in Polytrees

[Figure: node X_i with parents U_1 … U_p and children Y_1 … Y_c. The evidence is partitioned into pieces e+_{U_i X_i} above each parent link and e−_{X_i Y_j} below each child link.]


Decomposing the evidence

e+_X = ∪_i e+_{U_i X} (the evidence above X, arriving through its parent links)
e−_X = ∪_j e−_{X Y_j} (the evidence below X, arriving through its child links)


Parameters:

π_x(U_i) = P(U_i | e+_{U_i x}): the current strength of the causal support contributed by each incoming link U_i → x

λ_{Y_j}(x) = P(e−_{x Y_j} | x): the current strength of the diagnostic support contributed by each outgoing link x → Y_j

P(x | u_1, …, u_n): the fixed conditional probability matrix


Propagation Process

Step 1: Belief updating: inspect the messages from parents and children and compute:

Bel(x) = α λ(x) π(x), where:
  λ(x) = Π_j λ_{Y_j}(x)
  π(x) = Σ_{u_1…u_n} P(x | u_1…u_n) Π_i π_x(u_i)

Step 2: Bottom-up propagation: compute the messages to send up. λ_x(U_i) is the message X sends to parent U_i:

  λ_x(u_i) = β Σ_x λ(x) Σ_{u_k: k≠i} P(x | u_1…u_n) Π_{k≠i} π_x(u_k)

β is an arbitrary constant (it factors out the contributions to Bel(x) from U_i).
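A small numpy sketch of Steps 1 and 2 for a binary node X with two parents and two children; the CPT and all incoming message values are invented for illustration.

    import numpy as np

    P_x_uu = np.array([[[0.9, 0.1],      # P(x | u1, u2), indexed [u1][u2][x]
                        [0.6, 0.4]],
                       [[0.3, 0.7],
                        [0.05, 0.95]]])
    pi_u1 = np.array([0.7, 0.3])         # π_x(U1)
    pi_u2 = np.array([0.4, 0.6])         # π_x(U2)
    lam_y1 = np.array([0.8, 0.3])        # λ_Y1(x)
    lam_y2 = np.array([0.5, 0.9])        # λ_Y2(x)

    # Step 1: Bel(x) = α λ(x) π(x)
    pi_x = np.einsum('abx,a,b->x', P_x_uu, pi_u1, pi_u2)
    lam_x = lam_y1 * lam_y2
    bel = lam_x * pi_x
    bel /= bel.sum()

    # Step 2: message to parent U1, up to the constant β:
    # λ_x(u1) = Σ_x λ(x) Σ_{u2} P(x | u1, u2) π_x(u2)
    lam_x_u1 = np.einsum('abx,b,x->a', P_x_uu, pi_u2, lam_x)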


Propagation, cont'd:

Step 3: Top-down propagation: compute the messages to send down. π_{Y_j}(x) is sent from x to child Y_j:

  π_{Y_j}(x) = α [Π_{k≠j} λ_{Y_k}(x)] Σ_{u_1…u_n} P(x | u_1…u_n) Π_i π_x(u_i)
             = α Bel(x) / λ_{Y_j}(x)   (factors out the contributions to Bel(x) from Y_j)

Boundary conditions:
  1. Root: π(x) is the prior probability distribution.
  2. Childless node: λ(x) = (1, …, 1).
  3. Evidence node: λ(x) = (0, …, 1, …, 0).


Next Lecture

  • Approximate inference techniques
  • Alternative approaches to uncertain reasoning