Rational preferences Idea: preferences of a rational agent must obey - PDF document

Rational preferences Idea: preferences of a rational agent must obey constraints. Rational preferences ⇒ behavior describable as maximization of expected utility Rational decisions Constraints: Orderability ( A ≻ B ) ∨ ( B ≻ A ) ∨ ( A ∼ B ) Transitivity Chapter 16 ( A ≻ B ) ∧ ( B ≻ C ) ⇒ ( A ≻ C ) Continuity A ≻ B ≻ C ⇒ ∃ p [ p, A ; 1 − p, C ] ∼ B Substitutability A ∼ B ⇒ [ p, A ; 1 − p, C ] ∼ [ p, B ; 1 − p, C ] Monotonicity A ≻ B ⇒ ( p ≥ q ⇔ [ p, A ; 1 − p, B ] ≻ ∼ [ q, A ; 1 − q, B ]) Chapter 16 1 Chapter 16 4 Outline Rational preferences contd. ♦ Rational preferences Violating the constraints leads to self-evident irrationality ♦ Utilities For example: an agent with intransitive preferences can be induced to give away all its money ♦ Money A If B ≻ C , then an agent who has C ♦ Multiattribute utilities 1c would pay (say) 1 cent to get B 1c ♦ Decision networks If A ≻ B , then an agent who has B ♦ Value of information would pay (say) 1 cent to get A B C If C ≻ A , then an agent who has A 1c would pay (say) 1 cent to get C Chapter 16 2 Chapter 16 5 Preferences Maximizing expected utility An agent chooses among prizes ( A , B , etc.) and lotteries, i.e., situations Theorem (Ramsey, 1931; von Neumann and Morgenstern, 1944): with uncertain prizes Given preferences satisfying the constraints there exists a real-valued function U such that A A ≻ U ( A ) ≥ U ( B ) ⇔ ∼ B p U ([ p 1 , S 1 ; . . . ; p n , S n ]) = Σ i p i U ( S i ) L 1−p MEU principle: Lottery L = [ p, A ; (1 − p ) , B ] B Choose the action that maximizes expected utility Note: an agent can be entirely rational (consistent with MEU) Notation: without ever representing or manipulating utilities and probabilities A ≻ B A preferred to B A ∼ B indifference between A and B E.g., a lookup table for perfect tictactoe A ≻ ∼ B B not preferred to A Chapter 16 3 Chapter 16 6

Utilities Decision networks Utilities map states to real numbers. Which numbers? Add action nodes and utility nodes to belief networks to enable rational decision making Standard approach to assessment of human utilities: compare a given state A to a standard lottery L p that has Airport Site “best possible prize” u ⊤ with probability p “worst possible catastrophe” u ⊥ with probability (1 − p ) adjust lottery probability p until A ∼ L p Air Traffic Deaths continue as before 0.999999 Litigation Noise U pay $30 ~ L Construction Cost 0.000001 instant death Algorithm: For each value of action node compute expected value of utility node given action, evidence Return MEU action Chapter 16 7 Chapter 16 10 Utility scales Multiattribute utility Normalized utilities: u ⊤ = 1 . 0 , u ⊥ = 0 . 0 How can we handle utility functions of many variables X 1 . . . X n ? E.g., what is U ( Deaths, Noise, Cost ) ? Micromorts: one-millionth chance of death useful for Russian roulette, paying to reduce product risks, etc. How can complex utility functions be assessed from preference behaviour? QALYs: quality-adjusted life years useful for medical decisions involving substantial risk Idea 1: identify conditions under which decisions can be made without complete identification of U ( x 1 , . . . , x n ) Note: behavior is invariant w.r.t. +ve linear transformation Idea 2: identify various types of independence in preferences U ′ ( x ) = k 1 U ( x ) + k 2 where k 1 > 0 and derive consequent canonical forms for U ( x 1 , . . . , x n ) (analogous to using Bayes nets for P ( x 1 , . . . , x n ) With deterministic prizes only (no lottery choices), only ordinal utility can be determined, i.e., total order on prizes Chapter 16 8 Chapter 16 11 Money Strict dominance Typically define attributes such that U is monotonic in each Money does not behave as a utility function Strict dominance: choice B strictly dominates choice A iff Given a lottery L with expected monetary value EMV ( L ) , ∀ i X i ( B ) ≥ X i ( A ) (and hence U ( B ) ≥ U ( A ) ) usually U ( L ) < U ( EMV ( L )) , i.e., people are risk-averse X This region X Utility curve: for what probability p am I indifferent between a prize x and 2 2 dominates A a lottery [ p, $ M ; (1 − p ) , $0] for large M ? B C B Typical empirical data, extrapolated with risk-prone behavior: C A A +U o D o o o o o o o o o o o +$ X X −150,000 o 800,000 1 1 o Deterministic attributes Uncertain attributes o Strict dominance seldom holds in practice Chapter 16 9 Chapter 16 12

Stochastic dominance Label the arcs + or – SocioEcon 1.2 1 Age GoodStudent 1 0.8 ExtraCar Mileage 0.8 RiskAversion Probability Probability 0.6 VehicleYear 0.6 S1 S2 S1 0.4 SeniorTrain S2 0.4 + 0.2 DrivingSkill MakeModel 0.2 DrivingHist 0 0 -6 -5.5 -5 -4.5 -4 -3.5 -3 -2.5 -2 -6 -5.5 -5 -4.5 -4 -3.5 -3 -2.5 -2 Antilock Negative cost Negative cost DrivQuality AntiTheft CarValue HomeBase Airbag Distribution p 1 stochastically dominates distribution p 2 iff Accident � t � t Ruggedness ∀ t −∞ p 1 ( x ) dx ≤ −∞ p 2 ( t ) dt Theft OwnDamage If U is monotonic in x , then A 1 with outcome distribution p 1 Cushioning OwnCost OtherCost stochastically dominates A 2 with outcome distribution p 2 : � ∞ � ∞ −∞ p 1 ( x ) U ( x ) dx ≥ −∞ p 2 ( x ) U ( x ) dx MedicalCost Multiattribute case: stochastic dominance on all attributes ⇒ optimal LiabilityCost PropertyCost Chapter 16 13 Chapter 16 16 Stochastic dominance contd. Label the arcs + or – SocioEcon Stochastic dominance can often be determined without Age exact distributions using qualitative reasoning GoodStudent ExtraCar Mileage E.g., construction cost increases with distance from city RiskAversion VehicleYear S 1 is closer to the city than S 2 + SeniorTrain ⇒ S 1 stochastically dominates S 2 on cost + DrivingSkill MakeModel E.g., injury increases with collision speed DrivingHist Can annotate belief networks with stochastic dominance information: Antilock DrivQuality AntiTheft CarValue HomeBase + Airbag X → Y ( X positively influences Y ) means that − For every value z of Y ’s other parents Z Accident Ruggedness ∀ x 1 , x 2 x 1 ≥ x 2 ⇒ P ( Y | x 1 , z ) stochastically dominates P ( Y | x 2 , z ) Theft OwnDamage Cushioning OwnCost OtherCost MedicalCost LiabilityCost PropertyCost Chapter 16 14 Chapter 16 17 Label the arcs + or – Label the arcs + or – SocioEcon SocioEcon Age Age GoodStudent GoodStudent ExtraCar ExtraCar Mileage Mileage RiskAversion RiskAversion VehicleYear VehicleYear + SeniorTrain SeniorTrain + DrivingSkill MakeModel DrivingSkill MakeModel DrivingHist DrivingHist Antilock Antilock DrivQuality DrivQuality AntiTheft AntiTheft CarValue HomeBase CarValue HomeBase Airbag Airbag − Accident Accident Ruggedness Ruggedness Theft Theft OwnDamage OwnDamage Cushioning Cushioning OwnCost OwnCost OtherCost OtherCost MedicalCost MedicalCost LiabilityCost PropertyCost LiabilityCost PropertyCost Chapter 16 15 Chapter 16 18

Rational preferences Idea: preferences of a rational agent must obey - PDF document

Rational preferences Idea: preferences of a rational agent must obey constraints. Rational preferences behavior describable as maximization of expected utility Rational decisions Constraints: Orderability ( A B ) ( B A ) (

Rational preferences Idea: preferences of a rational agent must obey constraints. Rational

Extending Rational Apex Extending Rational Apex Greg Bek Greg Bek gab@rational.com

Rational points, rational curves, rational varieties Rational and integral points We study

Rational Robot A Test Automation Tool What is Rational Robot? Rational Robot is a complete

Rational decisions Chapter 16 Chapter 16 1 Outline Rational preferences Utilities

Rational decisions Chapter 16 Chapter 16 1 Outline Rational preferences Utilities

2.5: Rational Expressions and Equations College Algebra Week 2 Rational Expression

E XAMPLE 1 Identify the sum of product as rational or irrational. a. 5 + 8 rational / irrational

On the convergence of rational Ritz values Applications to rational interpolation of rational

Classes of Rational Graphs Christophe Morvan Irisa Journ ees Montoises 2006 1/25 Classes of

Rational Agents (Ch. 2) Rational agent An agent/robot must be able to perceive and interact with

17. Structs and Classes I C++ does not provide a built-in type for rational numbers Goal Rational

Rational isogenies Computing rational isogenies from the equations of the kernel David Lubicz,

Moral Preferences F R A N C E S C A R O S S I Decision making Based on our preferences

Sustainable intergenerational preferences preferences Geir B. Combining sensitivity for the

Decision Theory 1 Outline Rational preferences Utilities Money Multiattribute

Knowledge Discovery from Transportation Network Data Paper Review Jiang, W., Vaidya, J.,

Heterogeneity and State Dependence from Studies in Labor Markets , ed. Sherwin Rosen, NBER

Malaysian Healthy Ageing Society UNDERSTANDING STRESS TOWARDS MANAGEMENT A PRACTITIONERS

M o n t e C a r l o M e t h o d s L e c t u r e 1 Nothing Gluon action density:

Lectur ture 2 e 23 Planni anning U ng Under er U Uncer ertai tainty nty a and Decision

How to Decide Which Cracks Practical Case of . . . Should be Repaired First: Empirical

ROAD DESIGN FOR BETTER SAFETY Abdul Rahman Baharuddin Public Works Department Malaysia CONTENT

Spotlight on STRMU National HOPWA Institute 2017 Tampa, FL Spotlight on STRMU During this

Sambuz

Useful Links

Newsletter

Mail Us