Decision Making under Uncertainty AI Class 10 (Ch. 15.1-15.2.1, - PDF document

Decision Making under Uncertainty AI Class 10 (Ch. 15.1-15.2.1, 16.1-16.3) sensors ? environment agent actuators Material from Marie desJardin, Lise Getoor, Jean-Claude Cynthia Matuszek – CMSC 671 Latombe, Daphne Koller, and Paula Matuszek 1 Bookkeeping • HW 3 out • Group work for non-programming parts! • Heavy on CSPs and probability • Forms groups today or in Piazza • Soon: form project teams! 2 1

Today’s Class • Making Decisions Under Uncertainty • Tracking Uncertainty over Time • Decision Making under Uncertainty • Project groups, part 1 ß ? 3 Introduction • The world is not a well-defined place. • Sources of uncertainty • Uncertain inputs : What’s the temperature? • Uncertain (imprecise) definitions : Is Obama a good president? • Uncertain (unobserved) states : Where is the pit? • There is uncertainty in inferences • If I have a blistery, itchy rash and was gardening all weekend I probably have poison ivy 4 2

Sources of Uncertainty • Uncertain outputs • Uncertain inputs • Default reasoning (even • Missing data deduction) is uncertain • Noisy data • Abduction & induction inherently uncertain • Uncertain knowledge • Incomplete deductive • >1 cause à >1 effect inference can be uncertain • Incomplete knowledge of • Derived result is formally causality correct, but wrong in real • Probabilistic effects world Probabilistic reasoning only gives probabilistic results (summarizes uncertainty from various sources) 5 Reasoning Under Uncertainty • People make successful decisions all the time anyhow. • How? • More formally: how do we do reasoning under uncertainty, with inexact knowledge? • Step one: understanding what we know 6 3

MODELING UNCERTAINTY OVER TIME 7 States and Observations • We don’t have a continuous view of world • People don’t either! • We see things as a series of snapshots • Observations , associated with time slices • t 1 , t 2 , t 3 , … • Each snapshot contains all variables, observed or not • X t = (unobserved) state variables at time t; observation at t is E t • This is world state at time t 8 4

Temporal Probabilistic Agent sensors ? environment agent actuators t 1 , t 2 , t 3 , … 9 Time and Uncertainty • The world changes • Examples: diabetes management, traffic monitoring • Tasks: track it; predict it • Basic idea: • Copy state and evidence variables for each time step • Model uncertainty in change over time • Incorporate new observations as they arrive 10 5

Time and Uncertainty • Basic idea: • Copy state and evidence variables for each time step • Model uncertainty in change over time • Incorporate new observations as they arrive • X t = unobservable state variables at time t: BloodSugar t , StomachContents t • E t = evidence variables at time t: MeasuredBloodSugar t , PulseRate t , FoodEaten t • Assuming discrete time steps 11 States, Slightly More formally Process of change is viewed as series of snapshots • • Time slices • Each describing the state of the world at a particular time • Each time slice is represented by a set of random variables indexed by t: 1. the set of unobservable state variables X t 2. the set of observable evidence variables E t The observation at time t is E t = e t for some set of • values e t X a:b denotes the set of variables from X a to X b • 12 6

Transition and Sensor Models • Transition model How big • Models how the world changes over time can this get? • Specifies a probability distribution • Over state variables at time t P ( X t | X 0:t-1 ) • Given values at previous times • Sensor model • Models how evidence gets its values (sensor data) • E.g.: BloodSugar t à MeasuredBloodSugar t 13 Markov Assumption • Markov Assumption : • X t depends on some finite (usually fixed) number of previous X i ’s • First-order Markov process : P(X t |X 0:t-1 ) = P(X t |X t-1 ) • k th order: depends on previous k time steps • Sensor Markov assumption : P(E t |X 0:t , E 0:t-1 ) = P(E t |X t ) • Agent’s observations depend only on the actual current state of the world 14 X 7

Stationary Process • Infinitely many possible values of t • Does each timestep need a distribution? • Assume stationary process : • Changes in the world state are governed by laws that do not themselves change over time • Transition model P(X t |X t-1 ) and sensor model P(E t |X t ) are time-invariant, i.e., they are the same for all t 15 Complete Joint Distribution • Given: • Transition model: P(X t |X t-1 ) • Sensor model: P(E t |X t ) • Prior probability: P(X 0 ) • Then we can specify complete joint distribution of a sequence of states: t ∏ P ( X 0 , X 1 ,..., X t , E 1 ,..., E t ) = P ( X 0 ) P ( X i | X i − 1 ) P ( E i | X i ) i = 1 16 8

Example R t-1 P(R t |R t-1 ) T 0.7 F 0.3 Rain t-1 Rain t Rain t+1 Umbrella t-1 Umbrella t Umbrella t+1 R t P(U t | R t ) T 0.9 F 0.2 This should look like a finite state automaton (since it is one) 17 Inference Tasks • Filtering or monitoring: P(X t |e 1 ,…,e t ) Compute the current belief state, given all evidence to date • Prediction : P(X t+k |e 1 ,…,e t ) Compute the probability of a future state • Smoothing : P(X k |e 1 ,…,e t ) Compute the probability of a past state (hindsight) • Most likely explanation : arg max x1,..xt P(x 1 ,…,x t |e 1 ,…,e t ) Given a sequence of observations, find the sequence of states that is most likely to have generated those observations 18 9

Examples • Filtering: What is the probability that it is raining today, given all of the umbrella observations up through today? • Prediction: What is the probability that it will rain the day after tomorrow, given all of the umbrella observations up through today? • Smoothing: What is the probability that it rained yesterday, given all of the umbrella observations through today? • Most likely explanation: If the umbrella appeared the first three days but not on the fourth, what is the most likely weather sequence to produce these umbrella sightings? 19 Filtering • Maintain a current state estimate and update it • Rather than looking at all percepts (observed values) in history • So, given result of filtering up to t , compute t+1 from e t+1 • We use recursive estimation to compute P(X t+1 | e 1:t+1 ) as a function of e t+1 and P(X t | e 1:t ) • We can write this as: P ( X t + 1 | e 1: t + 1 ) = P ( X t + 1 | e 1: t , e t + 1 ) 20 10

Decision Making Under Uncertainty 23 Decision Making Under Uncertainty • Many environments have multiple possible outcomes • Some of these outcomes may be good; others may be bad • Some may be very likely; others unlikely • What’s a poor agent to do? 24 12

Reasoning Under Uncertainty • So how do we do reasoning under uncertainty and with inexact knowledge? • Heuristics • Mimic heuristic knowledge processing methods used by experts • Empirical associations • Experiential reasoning • Based on limited observations • Probabilities • Objective (frequency counting) • Subjective (human experience ) 25 Non-deterministic vs. Probabilistic Uncertainty ? ? a b c a b c {a,b,c} {a(p a ), b(p b ), c(p c )} à decision that is à decision that maximizes best for worst case expected utility value Non-deterministic model Probabilistic model ~ Adversarial search 26 13

Decision Theory • Combine probability and utility à Agent that makes rational decisions • On average, lead to desired outcome • Immediate simplifications: • Want most desirable immediate outcome (episodic) • nondeterministic, partially observable world • Definition: result of an action a leads to outcome s ’: • RESULT ( a ) is a random variable; domain is possible outcomes • P ( RESULT ( a ) = s ’ | a, e ) 27 Expected Utility • Goal: find best expected outcome • Random variable X with: • n values x 1 ,…,x n • Distribution (p 1 ,…,p n ) • X is the state reached after doing an action A under uncertainty • Utility function U(s) is the utility of a state, i.e., desirability 28 14

Expected Utility • X is state reached after doing an action A under uncertainty • U(s) is the utility of a state ß desirability • The expected utility of action A, given evidence EU( a | e ), is average utility of outcomes (states in S), weighted by probability an action occurs: EU[A] = S i=1,…,n p(x i |A)U(x i ) 29 One State/One Action Example s 0 U(A1, S0) = 100 x 0.2 + 50 x 0.7 + 70 x 0.1 = 20 + 35 + 7 A1 = 62 s 1 s 2 s 3 0.2 0.7 0.1 100 50 70 30 15

Decision Making under Uncertainty AI Class 10 (Ch. 15.1-15.2.1, - PDF document

Decision Making under Uncertainty AI Class 10 (Ch. 15.1-15.2.1, 16.1-16.3) sensors ? environment agent actuators Material from Marie desJardin, Lise Getoor, Jean-Claude Cynthia Matuszek CMSC 671 Latombe, Daphne Koller, and Paula

Decision Making Under Uncertainty Making Decisions Under Uncertainty AI C LASS 10 (C H .

Decision making under uncertainty Course overview Christos Dimitrakakis October 29, 2013 . .

Decision Making Under Decision Making . . . General Set Uncertainty: Proof of This Result

Decision Making Privacy-Motivated . . . under Uncertainty: Uncertainty Leads to . . .

6 Decision- -Making Making MVC (revisited) 6 Decision MVC (revisited) decision

DECISION MAKING readysetpresent.com Decision Making Program Objectives ( 1 of 2 ) To examine

Uncertainty AIMA Chapter 13 Outline Uncertainty Uncertainty Probability Syntax and

Decision Making 1 Decision Making Skills Establishing a positive decision-making environment.

Decision Making Under Uncertainty 14.123 Microeconomic Theory III Muhamet Yildiz Decision

Decision Problems Decision Making under Uncertainty, Part III Christos Dimitrakakis Chalmers

Mathematical Fuzzy Logic in Reasoning and Decision Making under Uncertainty Hykel Hosni

under Uncertainty Roberto Thern, PROVIDE DH Dimitri Van de Ville, IVAN Bucharest, April 4,

MI MI and Shared MI MI and Shared and Shared Decision Making and Shared Decision Making

Decision Models Provide a framework for decision making under uncertainty Help structure

UNCERTAINTY IN KNOWLEDGE Ch. 9 Uncertainty in Knowledge 1 Sources of Uncertainty

Repairing Decision-Making Programs Under Uncertainty Samuel Drews Aws Albarghouthi Loris

Dierence-in-Dierences for Ordinal Outcomes Soichiro Yamauchi Harvard University Applied

Fair division of indivisible goods and compact preference representation: an ordinal approach

Maximum Expected Utility CS 188: Artificial Intelligence Why should we average utilities?

High order multi- block/patch evolutions of Einsteins equations: an update or From

Game playing Chapter 5 Chapter 5 1 Outline Games Perfect play minimax decisions

Outline Games Perfect play minimax decisions Adversarial Search pruning

lld, a linker framework Presented by: Shankar Easwaran Qualcomm Innovation Center, Inc Open

An exercise in separating client-specific parameters from your program Erik Tilanus The