

SLIDE 1

CS886 (c) 2013 Pascal Poupart

Module 2 Probability Theory

CS 886 Sequential Decision Making and Reinforcement Learning University of Waterloo

SLIDE 2

A Decision Making Scenario

  • You are considering buying a used car…

– Is it in good condition?
– How much are you willing to pay?
– Should you get it inspected by a mechanic?
– Should you buy the car?

SLIDE 3

Relevant Theories

  • Probability theory

– Model uncertainty

  • Utility theory

– Model preferences

  • Decision theory

– Combine probability theory and utility theory

SLIDE 4

Introduction

  • Logical reasoning breaks down when dealing with uncertainty

  • Example: Diagnosis

– ∀p Symptom(p, Toothache) ⇒ Disease(p, Cavity)

  • But not all people with toothaches have cavities…

– ∀p Symptom(p, Toothache) ⇒ Disease(p, Cavity) ∨ Disease(p, Gumdisease) ∨ Disease(p, Hit in the Jaw) ∨ …

  • We can’t enumerate all possible causes, and the rule is not very informative

– ∀p Disease(p, Cavity) ⇒ Symptom(p, Toothache)

  • Does not work since not all cavities cause toothaches…
SLIDE 5

Introduction

  • Logic fails because

– We are lazy

  • Too much work to write down all antecedents and consequences

– Theoretical ignorance

  • Sometimes there is just no complete theory

– Practical ignorance

  • Even if we knew all the rules, we might be uncertain about a particular instance (we have not collected enough information yet)

SLIDE 6

Probabilities to the rescue

  • For many years AI danced around the fact that the world is an uncertain place

  • Then a few AI researchers decided to go back to the 18th century

– Revolutionary
– Probabilities allow us to deal with uncertainty that comes from our laziness and ignorance
– Clear semantics
– Provide principled answers for

  • Combining evidence, predictive and diagnostic reasoning, incorporation of new evidence

– Can be learned from data
– Intuitive for humans (?)

SLIDE 7

Discrete Random Variables

  • A random variable A describes an outcome that cannot be determined in advance (e.g., the roll of a die)

– Discrete random variable means that its possible values come from a countable domain (sample space)

  • E.g., if X is the outcome of a die roll, then X ∈ {1, 2, 3, 4, 5, 6}

– Boolean random variable A ∈ {True, False}

  • A = The Canadian PM in 2040 will be female
  • A = You have Ebola
  • A = You wake up tomorrow with a headache
SLIDE 8

Events

  • An event is a complete specification of the state of the world in which the agent is uncertain

  • Example:

– Cavity=True Λ Toothache=True
– Dice=2

  • Events must be

– Mutually exclusive
– Exhaustive (at least one event must be true)

SLIDE 9

Probabilities

  • We let P(A) denote the “degree of belief” we have that statement A is true

– Also “fraction of worlds in which A is true”

  • Philosophers like to discuss this (but we won’t)
  • Note:

– P(A) DOES NOT correspond to a degree of truth
– Example: Draw a card from a shuffled deck

  • The card is of some type (e.g., ace of spades)
  • Before looking at it P(ace of spades) = 1/52
  • After looking at it P(ace of spades) = 1 or 0
SLIDE 10

Visualizing A

[Venn diagram: an oval of worlds in which A is true, inside the event space of all possible worlds, whose area is 1; the worlds outside the oval are those in which A is false. P(A) = area of the oval.]

SLIDE 11

The Axioms of Probability

  • 0 ≤ P(A) ≤ 1
  • P(True) = 1
  • P(False) = 0
  • P(A ∨ B) = P(A) + P(B) − P(A Λ B)
  • These axioms limit the class of functions that can be considered as probability functions

SLIDE 12

Interpreting the axioms

  • 0 ≤ P(A) ≤ 1
  • P(True) = 1
  • P(False) = 0
  • P(A ∨ B) = P(A) + P(B) − P(A Λ B)

The area of A can’t be smaller than 0. A zero area would mean no world could ever have A as true.

SLIDE 13

Interpreting the axioms

  • 0 ≤ P(A) ≤ 1
  • P(True) = 1
  • P(False) = 0
  • P(A ∨ B) = P(A) + P(B) − P(A Λ B)

The area of A can’t be larger than 1. An area of 1 would mean all possible worlds have A as true.

SLIDE 14

Interpreting the axioms

  • 0 ≤ P(A) ≤ 1
  • P(True) = 1
  • P(False) = 0
  • P(A ∨ B) = P(A) + P(B) − P(A Λ B)

[Venn diagram: overlapping ovals A and B; the overlap region is A Λ B, which is counted twice when adding P(A) and P(B).]

SLIDE 15

Take the axioms seriously!

  • There have been attempts to use different methodologies for uncertainty

– Fuzzy logic, three-valued logic, Dempster-Shafer, non-monotonic reasoning, …

  • But if you follow the axioms of probability then no one can take advantage of you

SLIDE 16

A Betting Game [de Finetti 1931]

  • Propositions A and B
  • Agent 1 announces its “degree of belief” in A and B (P(A) and P(B))

  • Agent 2 chooses to bet for or against A and B at stakes that are consistent with P(A) and P(B)

  • If Agent 1 does not follow the axioms, it is guaranteed to lose money

Proposition | Agent 1’s belief | Agent 2’s bet | Odds   | Outcome for Agent 1: AΛB | AΛ~B | ~AΛB | ~AΛ~B
A∨B         | 0.8              | ~(A∨B)        | 2 to 8 |   2 |    2 |    2 |  -8
B           | 0.3              | B             | 3 to 7 |  -7 |    3 |   -7 |   3
A           | 0.4              | A             | 4 to 6 |  -6 |   -6 |    4 |   4
Total       |                  |               |        | -11 |   -1 |   -1 |  -1
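The table's payoffs can be verified with a short simulation. A minimal sketch, with the stake convention as an assumption ("X at m to n" means Agent 2 stakes m on X and Agent 1 stakes n against it):

```python
# Agent 1's incoherent beliefs: P(A)=0.4, P(B)=0.3, P(A v B)=0.8
# (coherence would require P(A v B) <= 0.7).
def payoff(bet_wins, stake_for, stake_against):
    """Agent 1's payoff on one bet: pays out if Agent 2's bet wins,
    collects Agent 2's stake otherwise."""
    return -stake_against if bet_wins else stake_for

totals = {}
for a in (True, False):
    for b in (True, False):
        totals[(a, b)] = (
            payoff(a, 4, 6)               # Agent 2 bets on A at 4 to 6
            + payoff(b, 3, 7)             # Agent 2 bets on B at 3 to 7
            + payoff(not (a or b), 2, 8)  # Agent 2 bets on ~(A v B) at 2 to 8
        )

# Agent 1 loses money in every possible world.
assert all(t < 0 for t in totals.values())
```

Running this reproduces the table's bottom row: -11 in the world AΛB and -1 in the other three.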
SLIDE 17

Theorems from the axioms

  • Thm: P(~A)=1-P(A)
  • Proof: P(A ∨ ~A) = P(A) + P(~A) − P(A Λ ~A)
           P(True) = P(A) + P(~A) − P(False)
           1 = P(A) + P(~A) − 0
           P(~A) = 1 − P(A)

SLIDE 18

Theorems from the axioms

  • Thm: P(A) = P(AΛB) + P(AΛ~B)
  • Proof: For you to do

Why? Because it is good for you

SLIDE 19

Multivalued Random Variables

  • Assume the domain of A (sample space) is {v1, v2, …, vk}

  • A can take on exactly one value out of this set

– P(A=vi Λ A=vj) = 0 if i ≠ j
– P(A=v1 ∨ A=v2 ∨ … ∨ A=vk) = 1

SLIDE 20

Terminology

  • Probability distribution:

– A specification of a probability for each event in our sample space
– Probabilities must sum to 1

  • Assume the world is described by two (or more) random variables

– Joint probability distribution

  • Specification of probabilities for all combinations of events

SLIDE 21

Joint distribution

  • Given two random variables A and B:
  • Joint distribution:

– Pr(A=aΛB=b) for all a,b

  • Marginalisation (sumout rule):

– Pr(A=a) = Σb Pr(A=a Λ B=b)
– Pr(B=b) = Σa Pr(A=a Λ B=b)

SLIDE 22

Example: Joint Distribution

sunny:
              cold     ~cold
headache      0.108    0.012
~headache     0.016    0.064

~sunny:
              cold     ~cold
headache      0.072    0.008
~headache     0.144    0.576

P(headache Λ sunny Λ cold) = 0.108
P(~headache Λ sunny Λ ~cold) = 0.064
P(headache ∨ sunny) = 0.108 + 0.012 + 0.072 + 0.008 + 0.016 + 0.064 = 0.28
P(headache) = 0.108 + 0.012 + 0.072 + 0.008 = 0.2   (marginalization)
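The computations on this slide can be reproduced mechanically by summing joint entries. A minimal sketch, storing the slide's joint distribution in a dict (the representation and names are mine, not the slides'):

```python
# Joint distribution over (headache, sunny, cold), from the slide.
joint = {
    # (headache, sunny, cold): probability
    (True,  True,  True):  0.108, (True,  True,  False): 0.012,
    (False, True,  True):  0.016, (False, True,  False): 0.064,
    (True,  False, True):  0.072, (True,  False, False): 0.008,
    (False, False, True):  0.144, (False, False, False): 0.576,
}

def prob(event):
    """Sum the joint over all worlds satisfying `event` (a predicate)."""
    return sum(p for world, p in joint.items() if event(*world))

# Marginalization: P(headache) sums out sunny and cold.
p_headache = prob(lambda h, s, c: h)        # 0.2
p_h_or_s   = prob(lambda h, s, c: h or s)   # 0.28
```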

SLIDE 23

Conditional Probability

  • P(A|B): fraction of worlds in which B is true that also have A true

[Venn diagram: overlapping ovals H and F]

H = “Have headache”, F = “Have flu”
P(H) = 1/10, P(F) = 1/40, P(H|F) = 1/2

Headaches are rare and flu is rarer, but if you have the flu, then there is a 50-50 chance you will have a headache.

SLIDE 24

Conditional Probability

H = “Have headache”, F = “Have flu”
P(H) = 1/10, P(F) = 1/40, P(H|F) = 1/2

P(H|F) = fraction of flu-inflicted worlds in which you have a headache
       = (# worlds with flu and headache) / (# worlds with flu)
       = (area of “H and F” region) / (area of “F” region)
       = P(H Λ F) / P(F)

SLIDE 25

Conditional Probability

  • Definition:

– P(A|B) = P(AΛB) / P(B)

  • Chain rule:

– P(AΛB) = P(A|B) P(B)

Memorize these!
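Both identities can be checked numerically with the headache/flu numbers from the previous slides (variable names are mine):

```python
p_F = 1 / 40          # P(F): prior probability of flu
p_H_given_F = 1 / 2   # P(H|F)

# Chain rule: P(H ^ F) = P(H|F) P(F)
p_H_and_F = p_H_given_F * p_F   # 1/80

# Definition, rearranged: P(H|F) = P(H ^ F) / P(F)
recovered = p_H_and_F / p_F     # recovers 1/2
```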

SLIDE 26

Inference

H = “Have headache”, F = “Have flu”
P(H) = 1/10, P(F) = 1/40, P(H|F) = 1/2

One day you wake up with a headache. You think: “Drat! 50% of flus are associated with headaches, so I must have a 50-50 chance of coming down with the flu.”

Is your reasoning correct?

SLIDE 27

Inference

H = “Have headache”, F = “Have flu”
P(H) = 1/10, P(F) = 1/40, P(H|F) = 1/2

One day you wake up with a headache. You think: “Drat! 50% of flus are associated with headaches, so I must have a 50-50 chance of coming down with the flu.”

P(F Λ H) = P(F) P(H|F) = 1/80

SLIDE 28

Inference

H = “Have headache”, F = “Have flu”
P(H) = 1/10, P(F) = 1/40, P(H|F) = 1/2

One day you wake up with a headache. You think: “Drat! 50% of flus are associated with headaches, so I must have a 50-50 chance of coming down with the flu.”

P(F Λ H) = P(F) P(H|F) = 1/80
P(F|H) = P(F Λ H) / P(H) = 1/8
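The slide's two-step inference (chain rule, then the definition of conditional probability) is easy to reproduce:

```python
p_H = 1 / 10          # P(H): probability of a headache
p_F = 1 / 40          # P(F): probability of flu
p_H_given_F = 1 / 2   # P(H|F)

p_F_and_H = p_F * p_H_given_F   # chain rule: P(F ^ H) = 1/80
p_F_given_H = p_F_and_H / p_H   # definition: P(F|H) = 1/8, not 1/2
```

So the 50-50 reasoning is wrong: the posterior probability of flu given a headache is only 1/8.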

SLIDE 29

Example: Joint Distribution

sunny:
              cold     ~cold
headache      0.108    0.012
~headache     0.016    0.064

~sunny:
              cold     ~cold
headache      0.072    0.008
~headache     0.144    0.576

P(headache Λ cold | sunny) = P(headache Λ cold Λ sunny) / P(sunny)
  = 0.108 / (0.108 + 0.012 + 0.016 + 0.064) = 0.54

P(headache Λ cold | ~sunny) = P(headache Λ cold Λ ~sunny) / P(~sunny)
  = 0.072 / (0.072 + 0.008 + 0.144 + 0.576) = 0.09

SLIDE 30

Bayes Rule

  • Note

– P(A|B) P(B) = P(A Λ B) = P(B Λ A) = P(B|A) P(A)

  • Bayes Rule

– P(B|A) = P(A|B) P(B) / P(A)

Memorize this!

SLIDE 31

Using Bayes Rule for inference

  • Often we want to form a hypothesis about the world based on what we have observed

  • Bayes rule is vitally important when viewed in terms of stating the belief given to hypothesis H, given evidence e:

P(H|e) = P(e|H) P(H) / P(e)

posterior probability = likelihood × prior probability / normalizing constant

SLIDE 32

More General Forms of Bayes Rule
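The equations for this slide did not survive the transcription; as a reconstruction, the standard general forms (conditionalizing on background evidence e, and for a multivalued variable Y) are:

```latex
P(Y \mid X, e) = \frac{P(X \mid Y, e)\, P(Y \mid e)}{P(X \mid e)}
\qquad
P(Y = y_i \mid X) = \frac{P(X \mid Y = y_i)\, P(Y = y_i)}{\sum_k P(X \mid Y = y_k)\, P(Y = y_k)}
```

The second form shows that the normalizing constant P(X) can always be computed by summing the numerator over all values of Y.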

SLIDE 33

Example

  • A doctor knows that the flu causes a fever 95% of the time. She knows that if a person is selected at random from the population, they have a 10^-7 chance of having the flu. 1 in 100 people suffer from a fever.

  • You go to the doctor complaining about the symptom of having a fever. What is the probability that the flu is the cause of the fever?

SLIDE 34

Example

  • A doctor knows that Asian flu causes a fever 95% of the time. She knows that if a person is selected at random from the population, they have a 10^-7 chance of having Asian flu. 1 in 100 people suffer from a fever.

  • You go to the doctor complaining about the symptom of having a fever. What is the probability that Asian flu is the cause of the fever?

A = Asian flu, F = fever
Evidence = symptom (F), Hypothesis = cause (A)

P(A|F) = P(F|A) P(A) / P(F) = 0.95 × 10^-7 / 0.01 = 9.5 × 10^-6
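Plugging the slide's numbers into Bayes rule takes three lines (variable names are mine):

```python
p_F_given_A = 0.95   # P(fever | Asian flu): flu causes fever 95% of the time
p_A = 1e-7           # P(Asian flu): prior for a randomly selected person
p_F = 0.01           # P(fever): 1 in 100 people have a fever

p_A_given_F = p_F_given_A * p_A / p_F   # Bayes rule: about 9.5e-6
```

Even after observing a fever, Asian flu remains extremely unlikely, because its prior is so small.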

SLIDE 35

Computing conditional probabilities

  • Often we are interested in the posterior joint distribution of some query variables Y given specific evidence e for evidence variables E

  • Set of all variables: X
  • Hidden variables: H = X − Y − E
  • If we had the joint probability distribution then we could marginalize

  • P(Y | E=e) = α Σh P(Y Λ E=e Λ H=h)

– α is the normalization factor
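The sum-out-then-normalize recipe can be sketched on the earlier headache/sunny/cold example (restated here so the snippet is self-contained; the dict representation is mine). Query Y = headache, evidence sunny = True, hidden variable cold:

```python
# Joint distribution over (headache, sunny, cold), from the slides.
joint = {
    (True,  True,  True):  0.108, (True,  True,  False): 0.012,
    (False, True,  True):  0.016, (False, True,  False): 0.064,
    (True,  False, True):  0.072, (True,  False, False): 0.008,
    (False, False, True):  0.144, (False, False, False): 0.576,
}

# Unnormalized: for each value y of headache, sum out the hidden cold
# while fixing the evidence sunny=True.
unnorm = {
    y: sum(joint[(y, True, c)] for c in (True, False))
    for y in (True, False)
}
alpha = 1 / sum(unnorm.values())               # normalization factor
posterior = {y: alpha * p for y, p in unnorm.items()}
# posterior[True] = P(headache | sunny) = 0.12 / 0.2 = 0.6
```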

SLIDE 36

Computing conditional probabilities

  • Often we are interested in the posterior joint distribution of some query variables Y given specific evidence e for evidence variables E

  • Set of all variables: X
  • Hidden variables: H = X − Y − E
  • If we had the joint probability distribution then we could marginalize

  • P(Y | E=e) = α Σh P(Y Λ E=e Λ H=h)

– α is the normalization factor

Problem: the joint distribution is usually too big to handle

SLIDE 37

Independence

  • Two variables A and B are independent if knowledge of A does not change uncertainty of B (and vice versa)

– P(A|B) = P(A)
– P(B|A) = P(B)
– P(A Λ B) = P(A) P(B)
– In general P(X1, X2, …, Xn) = Π_{i=1..n} P(Xi)

Need only n numbers to specify a joint distribution!
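A sketch of the "n numbers suffice" claim for Boolean variables: under full independence, the n marginals determine every joint entry via the product rule (the marginal values below are made up for illustration):

```python
import itertools

# Marginals P(X_i = True) for three hypothetical independent variables.
p = [0.1, 0.5, 0.8]

def joint(assignment):
    """P(X1=x1 ^ ... ^ Xn=xn) = product of per-variable marginals."""
    prod = 1.0
    for pi, xi in zip(p, assignment):
        prod *= pi if xi else 1 - pi
    return prod

# Sanity check: the implied joint still sums to 1 over all 2^n worlds.
total = sum(joint(a) for a in itertools.product((True, False), repeat=3))
```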

SLIDE 38

Conditional Independence

  • Absolute independence is often too strong a requirement

  • Two variables A and B are conditionally independent given C if

– P(a|b,c) = P(a|c) for all a, b, c
– i.e., knowing the value of B does not change the prediction of A if the value of C is known

SLIDE 39

Conditional Independence

  • Diagnosis problem

– Fl = Flu, Fv = Fever, C = Cough

  • The full joint distribution has 2^3 − 1 = 7 independent entries

  • If someone has the flu, we can assume that the probability of a cough does not depend on having a fever

– P(C|Fl,Fv) = P(C|Fl)

  • If the patient does not have the flu, then C and Fv are again conditionally independent

– P(C|~Fl,Fv) = P(C|~Fl)

SLIDE 40

Conditional Independence

  • The full distribution can be written as

– P(C,Fl,Fv) = P(C,Fv|Fl) P(Fl) = P(C|Fl) P(Fv|Fl) P(Fl)
– That is, we only need 5 numbers now!
– Huge savings if there are lots of variables

SLIDE 41

Conditional Independence

  • The full distribution can be written as

– P(C,Fl,Fv) = P(C,Fv|Fl) P(Fl) = P(C|Fl) P(Fv|Fl) P(Fl)
– That is, we only need 5 numbers now!
– Huge savings if there are lots of variables

Such a probability distribution is sometimes called a naïve Bayes model. In practice, they work well – even when the independence assumption is not true.
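The factorization P(C,Fl,Fv) = P(C|Fl) P(Fv|Fl) P(Fl) can be sketched directly: five numbers reconstruct all eight joint entries. The parameter values below are made up for illustration:

```python
import itertools

# The 5 parameters of the naive Bayes model (hypothetical values).
p_fl = 0.05                            # P(Fl)
p_c_given  = {True: 0.8, False: 0.1}   # P(C=True  | Fl)
p_fv_given = {True: 0.9, False: 0.05}  # P(Fv=True | Fl)

def joint(c, fl, fv):
    """Reconstruct any joint entry P(C=c ^ Fl=fl ^ Fv=fv) from the 5 numbers."""
    pc  = p_c_given[fl]  if c  else 1 - p_c_given[fl]
    pfv = p_fv_given[fl] if fv else 1 - p_fv_given[fl]
    pfl = p_fl if fl else 1 - p_fl
    return pc * pfv * pfl

# The reconstructed joint is a valid distribution: it sums to 1.
total = sum(joint(*w) for w in itertools.product((True, False), repeat=3))
```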