Goals and Preferences

Alice ... went on: "Would you tell me, please, which way I ought to go from here?"
"That depends a good deal on where you want to get to," said the Cat.
"I don't much care where —" said Alice.
"Then it doesn't matter which way you go," said the Cat.
Lewis Carroll (1832–1898), Alice's Adventures in Wonderland, 1865, Chapter 6

© D. Poole and A. Mackworth 2008, Artificial Intelligence, Lecture 9.1
Preferences

- Actions result in outcomes.
- Agents have preferences over outcomes.
- A rational agent will do the action that has the best outcome for it.
- Sometimes agents don't know the outcomes of their actions, but they still need to compare actions.
- Agents have to act (doing nothing is often an action).
Preferences Over Outcomes

If o1 and o2 are outcomes:
- o1 ⪰ o2 means o1 is at least as desirable as o2.
- o1 ∼ o2 means o1 ⪰ o2 and o2 ⪰ o1.
- o1 ≻ o2 means o1 ⪰ o2 and not o2 ⪰ o1.
Lotteries

An agent may not know the outcomes of its actions, but only have a probability distribution over the outcomes.
A lottery is a probability distribution over outcomes. It is written
  [p1 : o1, p2 : o2, ..., pk : ok]
where the oi are outcomes and the pi > 0 satisfy Σi pi = 1.
The lottery specifies that outcome oi occurs with probability pi.
When we talk about outcomes, we will include lotteries.
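A lottery can be sketched in code as a list of (probability, outcome) pairs. This representation and the helper names are our own illustration, not from the slides:

```python
import random

def make_lottery(pairs):
    """Validate a lottery [p1:o1, ..., pk:ok]: all pi > 0 and sum to 1."""
    probs = [p for p, _ in pairs]
    if any(p <= 0 for p in probs):
        raise ValueError("all probabilities must be > 0")
    if abs(sum(probs) - 1.0) > 1e-9:
        raise ValueError("probabilities must sum to 1")
    return pairs

def sample(lottery):
    """Draw one outcome according to the lottery's probabilities."""
    r = random.random()
    cumulative = 0.0
    for p, outcome in lottery:
        cumulative += p
        if r < cumulative:
            return outcome
    return lottery[-1][1]  # guard against floating-point round-off

lottery = make_lottery([(0.5, "$0"), (0.5, "$2,000,000")])
print(sample(lottery))  # "$0" or "$2,000,000", each with probability 0.5
```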
Properties of Preferences

Completeness: agents have to act, so they must have preferences:
  ∀o1 ∀o2: o1 ⪰ o2 or o2 ⪰ o1
Transitivity: preferences must be transitive:
  if o1 ⪰ o2 and o2 ⪰ o3 then o1 ⪰ o3.
Otherwise o1 ⪰ o2 and o2 ⪰ o3 and o3 ≻ o1. If the agent is prepared to pay to get from o1 to o3 → money pump. (Similarly for mixtures of ≻ and ⪰.)
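The money-pump argument can be sketched as a simulation. The outcome names and the fee are illustrative; the point is that an agent with the intransitive preferences above can be cycled back to where it started, minus the fee, arbitrarily often:

```python
# Agent holds o1 and has intransitive preferences: o1 >= o2, o2 >= o3,
# yet o3 > o1. A trader exploits this by charging a fee for the one
# trade the agent strictly wants, then undoing it for free.
def money_pump(rounds, fee=1.0):
    holding = "o1"
    wealth = 0.0
    for _ in range(rounds):
        wealth -= fee      # agent pays to trade o1 for o3 (since o3 > o1)
        holding = "o3"
        holding = "o2"     # accepts o2 for o3 for free (o2 >= o3)
        holding = "o1"     # accepts o1 for o2 for free (o1 >= o2)
    return wealth          # back to holding o1, but poorer each round

print(money_pump(10))  # -10.0: ten cycles, ten fees, same outcome held
```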
Properties of Preferences (cont.)

Monotonicity: an agent prefers a larger chance of getting a better outcome to a smaller chance:
  if o1 ≻ o2 and p > q then [p : o1, 1 − p : o2] ≻ [q : o1, 1 − q : o2].
Consequence of the Axioms

Suppose o1 ≻ o2 and o2 ≻ o3. Consider whether the agent would prefer
- o2, or
- the lottery [p : o1, 1 − p : o3]
for different values of p ∈ [0, 1]. You can plot which one is preferred as a function of p:
[Figure: a bar over p from 0 to 1 — o2 is preferred for small p, the lottery for large p.]
Properties of Preferences (cont.)

Continuity: suppose o1 ≻ o2 and o2 ≻ o3. Then there exists a p ∈ [0, 1] such that
  o2 ∼ [p : o1, 1 − p : o3].
Properties of Preferences (cont.)

Decomposability ("no fun in gambling"): an agent is indifferent between lotteries that have the same probabilities over the same outcomes. This includes lotteries over lotteries. For example:
  [p : o1, 1 − p : [q : o2, 1 − q : o3]]
  ∼ [p : o1, (1 − p)q : o2, (1 − p)(1 − q) : o3]
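Decomposability says a nested lottery is equivalent to its flattened single-stage form. A sketch, using the (probability, outcome) list representation assumed earlier (a sub-lottery appears as a list in the outcome position):

```python
# Flatten a nested lottery into the equivalent single-stage lottery by
# multiplying probabilities along each path, as decomposability requires.
def flatten(lottery):
    flat = []
    for p, outcome in lottery:
        if isinstance(outcome, list):            # outcome is a sub-lottery
            for q, inner in flatten(outcome):
                flat.append((p * q, inner))
        else:
            flat.append((p, outcome))
    return flat

# [0.5 : o1, 0.5 : [0.4 : o2, 0.6 : o3]]
nested = [(0.5, "o1"), (0.5, [(0.4, "o2"), (0.6, "o3")])]
print(flatten(nested))  # [(0.5, 'o1'), (0.2, 'o2'), (0.3, 'o3')]
```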
Properties of Preferences (cont.)

Substitutability: if o1 ∼ o2, then the agent is indifferent between lotteries that differ only in o1 and o2:
  [p : o1, 1 − p : o3] ∼ [p : o2, 1 − p : o3]
Alternative Axiom for Substitutability

Substitutability: if o1 ⪰ o2, then the agent weakly prefers lotteries that contain o1 instead of o2, everything else being equal. That is, for any probability p and outcome o3:
  [p : o1, (1 − p) : o3] ⪰ [p : o2, (1 − p) : o3]
What We Would Like

We would like a measure of preference that can be combined with probabilities, so that
  value([p : o1, 1 − p : o2]) = p × value(o1) + (1 − p) × value(o2).
Money does not act like this. Which would you prefer: $1,000,000 or [0.5 : $0, 0.5 : $2,000,000]?
It may seem that preferences are too complex and multi-faceted to be represented by single numbers.
Theorem

If preferences follow the preceding properties, then preferences can be measured by a function
  utility : outcomes → [0, 1]
such that o1 ⪰ o2 if and only if utility(o1) ≥ utility(o2).
Utilities are linear with probabilities:
  utility([p1 : o1, p2 : o2, ..., pk : ok]) = Σ(i=1..k) pi × utility(oi)
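The linearity in the theorem makes comparing lotteries a one-line computation. A minimal sketch, with a toy utility table of our own:

```python
# Expected utility of a lottery: utility([p1:o1,...]) = sum of pi * utility(oi).
# The utility values below are illustrative, not from the slides.
utility = {"worst": 0.0, "ok": 0.6, "best": 1.0}

def expected_utility(lottery):
    return sum(p * utility[outcome] for p, outcome in lottery)

print(expected_utility([(0.5, "best"), (0.5, "worst")]))  # 0.5
print(expected_utility([(1.0, "ok")]))                    # 0.6
# This agent therefore prefers the sure "ok" outcome to the 50/50 gamble.
```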
Proof

If all outcomes are equally preferred, set utility(oi) = 0 for all outcomes oi.
Otherwise, suppose the best outcome is best and the worst outcome is worst. For any outcome oi, define utility(oi) to be the number ui such that
  oi ∼ [ui : best, 1 − ui : worst]
This exists by the continuity property.
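The number ui can be found by binary search using only preference queries. In this sketch the preference oracle is backed by a hidden numeric utility (our own assumption, purely to make the example runnable); the point is that ui is recovered from comparisons alone:

```python
# Find the indifference probability u_i such that
# o_i ~ [u_i : best, 1 - u_i : worst], via binary search on p.
def prefers_lottery(o, p, hidden_utility):
    """Does the agent prefer [p:best, 1-p:worst] to outcome o?
    best has hidden utility 1 and worst has 0, so the lottery is worth p."""
    return p > hidden_utility[o]

def indifference_point(o, hidden_utility, tol=1e-6):
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        p = (lo + hi) / 2
        if prefers_lottery(o, p, hidden_utility):
            hi = p   # lottery too good: lower p
        else:
            lo = p   # outcome still preferred: raise p
    return (lo + hi) / 2

print(round(indifference_point("ok", {"ok": 0.6}), 3))  # 0.6
```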
Proof (cont.)

Suppose o1 ⪰ o2 and utility(oi) = ui. Then by substitutability,
  [u1 : best, 1 − u1 : worst] ⪰ [u2 : best, 1 − u2 : worst]
which, by completeness and monotonicity, implies u1 ≥ u2.
Proof (cont.)

Suppose p = utility([p1 : o1, p2 : o2, ..., pk : ok]) and utility(oi) = ui. We know:
  oi ∼ [ui : best, 1 − ui : worst]
By substitutability, we can replace each oi by [ui : best, 1 − ui : worst], so
  p = utility([p1 : [u1 : best, 1 − u1 : worst],
               ...,
               pk : [uk : best, 1 − uk : worst]])
By decomposability, this is equivalent to:
  p = utility([p1 u1 + ··· + pk uk : best,
               p1 (1 − u1) + ··· + pk (1 − uk) : worst])
Thus, by the definition of utility,
  p = p1 × u1 + ··· + pk × uk
Utility as a Function of Money

[Figure: utility (0 to 1) as a function of money from $0 to $2,000,000. A concave curve is risk averse, a straight line is risk neutral, and a convex curve is risk seeking.]
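The three risk attitudes can be illustrated by comparing a sure $1,000,000 with the lottery [0.5 : $0, 0.5 : $2,000,000] under three utility functions of money. The specific functions (square root, identity, square) are our own stand-ins for "concave", "linear", and "convex":

```python
import math

risk_averse  = lambda m: math.sqrt(m)   # concave
risk_neutral = lambda m: m              # linear
risk_seeking = lambda m: m * m          # convex

for name, u in [("averse", risk_averse), ("neutral", risk_neutral),
                ("seeking", risk_seeking)]:
    sure = u(1_000_000)                          # utility of the sure thing
    gamble = 0.5 * u(0) + 0.5 * u(2_000_000)     # expected utility of the lottery
    choice = ("sure thing" if sure > gamble
              else "indifferent" if sure == gamble
              else "gamble")
    print(name, "->", choice)
# averse -> sure thing; neutral -> indifferent; seeking -> gamble
```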
Possible Utility as a Function of Money

Someone who really wants a toy worth $30, but who would also like one worth $20:
[Figure: utility (0 to 1) as a function of dollars from $0 to $100, rising in steps around $20 and $30 rather than smoothly.]
Allais Paradox (1953)

What would you prefer:
  A: $1m — one million dollars
  B: the lottery [0.10 : $2.5m, 0.89 : $1m, 0.01 : $0]
What would you prefer:
  C: the lottery [0.11 : $1m, 0.89 : $0]
  D: the lottery [0.10 : $2.5m, 0.90 : $0]
It is inconsistent with the axioms of preferences to have A ≻ B and D ≻ C:
  A, C: the lottery [0.11 : $1m, 0.89 : X]
  B, D: the lottery [0.10 : $2.5m, 0.01 : $0, 0.89 : X]
where X is $1m in the first choice and $0 in the second, so by substitutability the two choices should go the same way.
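The decomposition above can be checked numerically: for any utility function u of money, EU(A) − EU(B) and EU(C) − EU(D) are the same number, so an expected-utility maximizer cannot have A ≻ B and D ≻ C. A sketch (amounts in $ millions; the sample utility functions are arbitrary):

```python
# Expected utility of a lottery of (probability, dollar amount in $m) pairs.
def eu(lottery, u):
    return sum(p * u(o) for p, o in lottery)

def gaps(u):
    """Return (EU(A) - EU(B), EU(C) - EU(D)) for a utility function u."""
    A = [(1.00, 1.0)]
    B = [(0.10, 2.5), (0.89, 1.0), (0.01, 0.0)]
    C = [(0.11, 1.0), (0.89, 0.0)]
    D = [(0.10, 2.5), (0.90, 0.0)]
    return eu(A, u) - eu(B, u), eu(C, u) - eu(D, u)

# Both gaps equal 0.11*u(1) - 0.10*u(2.5) - 0.01*u(0), whatever u is:
for u in [lambda m: m, lambda m: m ** 0.5, lambda m: 1 - 2 ** -m]:
    gap1, gap2 = gaps(u)
    print(abs(gap1 - gap2) < 1e-12)  # True for every utility function
```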
The Ellsberg Paradox

Two bags:
  Bag 1: 40 white chips, 30 yellow chips, 30 green chips
  Bag 2: 40 white chips, 60 chips that are yellow or green (in unknown proportion)
What do you prefer:
  A: receive $1m if a white or yellow chip is drawn from bag 1
  B: receive $1m if a white or yellow chip is drawn from bag 2
  C: receive $1m if a white or green chip is drawn from bag 2
What about:
  D: the lottery [0.5 : B, 0.5 : C]?
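The win probabilities can be worked out directly. Writing y for the unknown number of yellow chips in bag 2 (our notation), the mixture D wins with the same probability as A no matter what y is, which is what makes preferring A to both B and C hard to square with the axioms:

```python
# Win probabilities in the Ellsberg setup; y = yellow chips in bag 2.
def p_win_A():
    return (40 + 30) / 100            # white or yellow from bag 1

def p_win_B(y):
    return (40 + y) / 100             # white or yellow from bag 2

def p_win_C(y):
    return (40 + (60 - y)) / 100      # white or green from bag 2

def p_win_D(y):
    return 0.5 * p_win_B(y) + 0.5 * p_win_C(y)

for y in [0, 15, 30, 45, 60]:
    # D is always 0.7 (up to float rounding), exactly like A.
    print(y, p_win_A(), round(p_win_D(y), 10))
```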