Introduction to Decision Networks Alice Gao Lecture 13 Based on - - PowerPoint PPT Presentation

introduction to decision networks
SMART_READER_LITE
LIVE PREVIEW

Introduction to Decision Networks Alice Gao Lecture 13 Based on - - PowerPoint PPT Presentation

1/20 Introduction to Decision Networks Alice Gao Lecture 13 Based on work by K. Leyton-Brown, K. Larson, and P. van Beek 2/20 Outline Learning Goals Introduction to Decision Theory Decision Network for Mail Delivery Robot Evaluating the


slide-1
SLIDE 1

1/20

Introduction to Decision Networks

Alice Gao

Lecture 13 Based on work by K. Leyton-Brown, K. Larson, and P. van Beek

slide-2
SLIDE 2

2/20

Outline

Learning Goals Introduction to Decision Theory Decision Network for Mail Delivery Robot Evaluating the Robot Decision Network Revisiting the Learning goals

slide-3
SLIDE 3

3/20

Learning Goals

By the end of the lecture, you should be able to

▶ Model a one-ofg decision problem by constructing a decision

network containing nodes, arcs, conditional probability distributions, and a utility function.

▶ Choose the best action by evaluating a decision network.

slide-4
SLIDE 4

4/20

Decision Theory

Decision theory = Probability theory + Utility theory

▶ Decision theory: describes what an agent should do ▶ Probability theory: describes what an agent should believe on

the basis of the evidence

▶ Utility theory: describes what an agent wants

slide-5
SLIDE 5

5/20

Decision Networks

Decision networks = Bayesian network + actions + utilities

slide-6
SLIDE 6

6/20

A robot that delivers mail

The robot must choose its route to pickup the mail. There is a short route and a long route. The long route is slower, but on the short route the robot might slip and fall. The robot can put on

  • pads. This won’t change the probability of an accident, but it will

make it less severe if it happens. Unfortunately, the pads add weight and slow the robot down. The robot would like to pick up the mail quickly with little/no damage. What should the robot do?

slide-7
SLIDE 7

7/20

Variables

What are the random variables? What are the decision variables (actions)?

slide-8
SLIDE 8

8/20

Nodes in a Decision Network

Three kinds of nodes:

Chance nodes represent random variables (as in Bayesian networks).

Decision nodes represent actions (decision variables).

Utility node represents agent’s utility function on states (happiness in each state).

slide-9
SLIDE 9

9/20

Robot decision network

slide-10
SLIDE 10

10/20

Arcs in the Decision Network

How do the random variables and the decision variables relate to

  • ne another?
slide-11
SLIDE 11

11/20

Robot decision network

slide-12
SLIDE 12

12/20

CQ: The robot’s happiness

CQ: Which variables directly infmuence the robot’s happiness? (A) P only (B) S only (C) A only (D) Two of (A), (B), and (C) (E) All of (A), (B) and (C) The robot must choose its route to pickup the

  • mail. There is a short route and a long route.

The long route is slower, but on the short route the robot might slip and fall. The robot can put on pads. This won’t change the probability of an accident, but it will make it less severe if it happens. Unfortunately, the pads add weight and slow the robot down. The robot would like to pick up the mail quickly with little/no damage.

slide-13
SLIDE 13

13/20

Robot decision network

slide-14
SLIDE 14

14/20

CQ The robot’s utility function

CQ: When an accident does not happen, which of the following is true? (A) The robot prefers not wearing pads than wearing pads. (B) The robot prefers the long route over the short route. (C) Both (A) and (B) are true. (D) Neither (A) and (B) is true.

slide-15
SLIDE 15

15/20

The robot’s utility function

State U(wi) ¬P, ¬S, ¬A w0 slow, no weight 6 ¬P, ¬S, A w1 impossible ¬P, S, ¬A w2 quick, no weight 10 ¬P, S, A w3 severe damage P, ¬S, ¬A w4 slow, extra weight 4 P, ¬S, A w5 impossible P, S, ¬A w6 quick, extra weight 8 P, S, A w7 moderate damage 2

slide-16
SLIDE 16

16/20

The robot’s utility function

How does the robot’s utility/happiness depend on the random variables and the decision variables?

slide-17
SLIDE 17

17/20

The robot’s utility function

How does the robot’s utility/happiness depend on the random variables and the decision variables?

slide-18
SLIDE 18

18/20

Robot decision network

slide-19
SLIDE 19

19/20

Evaluating a decision network

How do we choose an action?

  • 1. Set evidence variables for current state
  • 2. For each possible value of decision node

(a) set decision node to that value (b) calculate posterior probability for parent nodes of the utility node (c) calculate expected utility for the action

  • 3. Return action with highest expected utility
slide-20
SLIDE 20

20/20

Revisiting the Learning Goals

By the end of the lecture, you should be able to

▶ Model a one-ofg decision problem by constructing a decision

network containing nodes, arcs, conditional probability distributions, and a utility function.

▶ Choose the best action by evaluating a decision network.