game theory lecture 8
play

Game Theory: Lecture #8 Outline: Individual Optimization Security - PDF document

Game Theory: Lecture #8 Outline: Individual Optimization Security Strategies Optimization and Strategy Previous focus: Given individual preferences, compute group preference? Given individual preferences, compute satisfactory


  1. Game Theory: Lecture #8 Outline: • Individual Optimization • Security Strategies

  2. Optimization and Strategy • Previous focus: – Given individual preferences, compute group preference? – Given individual preferences, compute satisfactory matching? – Given (sub-)group costs, divide costs so all are happy? • Not addressed: Individual strategy. – Given social choice or matching rule, do individuals willingly share preferences? – How to model individual choice? • Overarching question: How does individual choice impact collective behavior? • First challenge: how to think about individual choice? 1

  3. Individual Optimization • Setup: a single decision-maker i – A set of actions for the individual, denoted by A i . – A set of “other things that could happen in the world,” denoted A − i – This induces the set of states of the world A = A i × A − i – The individual’s preferences over states characterized by a function: U i : A → R • Terminology: – U i ( · ) referred to as “payoff” or “utility” or “reward” function – The individual i is referred to as an “agent,” “player,” “decision-maker,” or “user” • Player i prefers state a to state a ′ if and only if U i ( a ) > U i ( a ′ ) In case U i ( a ) = U i ( a ′ ) player i is “indifferent” 2

  4. Alice and the Umbrella • Example: Alice leaves home. Bring umbrella? – Loves walking in the rain with umbrella – Hates walking in the rain without umbrella – If no rain, better to leave umbrella at home • How to model this setup with a utility function? • Matrix form is a convenient representation: The Weather Rain Sun Umbrella 5 2 Alice’s Choice: No Umbrella 0 3 • Action sets (Alice is i , the weather is − i ): A i = { U, ¬ U } A − i = { R, S } • Possible states of the world: A = { ( U, R ) , ( U, S ) , ( ¬ U, R ) , ( ¬ U, S ) } • Utility function gives Alice a payoff for each possible state: U i ( U, R ) = 5 U i ( U, S ) = 2 U i ( ¬ U, R ) = 0 U i ( ¬ U, S ) = 3 • Question: What should Alice do? 3

  5. If Alice knows what will happen The Weather Rain Sun Umbrella 5 2 Alice’s Choice: No Umbrella 0 3 • Simple Question: If Alice knows the weather, what should she do? • Definition: The best response function of player i , B i ( · ) , is B i ( a − i ) = { a i : U i ( a i , a − i ) ≥ U i ( a ′ i , a − i ) for all a ′ i ∈ A i } Note that the best response “function” is actually a set • To visualize, focus on a column: If raining, Alice’s payoff matrix is Rain Umbrella 5 Alice’s Choice: No Umbrella 0 Thus, B i ( R ) = U . If raining, bring umbrella. • If sunny, Alice’s payoff matrix is Sun Umbrella 2 Alice’s Choice: No Umbrella 3 Thus, B i ( S ) = ¬ U . If sunny, leave umbrella at home. • Note: to decide, Alice only compares numbers from a single column! 4

  6. What if Alice Can’t Predict the Weather? • Harder Question: If Alice can’t predict weather, what should she do? • Why not use best response function here? • New idea: Alice could “play it safe,” and try to limit her losses. • Conceptually: assume the worst possible weather. • Visually: optimize based on the smallest number in each payoff row : The Weather Rain Sun Umbrella 5 2 Alice’s Choice: No Umbrella 3 0 • Alice’s pessimistic “payoff matrix” is Worst-case Umbrella 2 Alice’s Choice: No Umbrella 0 • Terminology: – Security value: v = 2 – Security strategy: Umbrella • Interpretation: If Alice always brings her umbrella, the worst payoff she’ll ever get is 2 . • Question: can she guarantee any better? 5

  7. What if Alice Can’t Predict the Weather? • Question: can Alice guarantee a better payoff than 2 ? • Thought experiment: what if Alice occasionally left umbrella at home? • Setup: – bring umbrella a fraction p of the days. – leave umbrella a fraction 1 − p of the days. • What is the worst thing that could happen? – If always rainy, expected (average) payoff is 5 p. – If always sunny, expected payoff is 2 p + 3(1 − p ) = 3 − p . – Plot of these as a function of p : – ( p = 1) corresponds to “umbrella” payoff row – ( p = 0) corresponds to “no umbrella” payoff row • Note: to maximize guaranteed expected payoff, p = 1 / 2 . • When p = 1 / 2 , No matter what the weather does, expected payoff is at least 2 . 5 . 6

  8. Security Strategies • No matter what the weather does, expected payoff is at least 2 . 5 . • How do we formalize this? • Need a notion of probabilistic strategies. • Write ∆( A i ) to denote the set of all probability distributions over player i ’s action set. • We call s i = ( p 1 , p 2 , . . . , p |A i | ) ∈ ∆( A i ) a mixed strategy for player i . That is, – p k ≥ 0 for each k and – � |A i | k =1 p k = 1 . • Similarly, write ∆( A − i ) to denote the set of all probability distributions over other states of the world. • Given a joint mixed strategy s ∈ ∆( A i ) × ∆( A − i ) , player i ’s expected utility is � � U i ( s i , s − i ) = E s U i ( a i , a − i ) = p a i × p a − i × U i ( a i , a − i ) a i ∈A i a − i ∈A − i • A mixed strategy s i guarantees a payoff of v is for any s − i ∈ ∆( A − i ) : U i ( s i , s − i ) ≥ v • A player’s security value v is the highest payoff that the player can guarantee for any strategy s i ∈ ∆( A i ) • A player’s security strategy s ∗ i is any strategy that guarantees the payoff v 7

  9. Interpret • Revisit Alice: • Alice has a security value v = 2 . 5 • Alice has a security strategy s ∗ i = (1 / 2 , 1 / 2) • Question: what is Alice assuming about the chance of rain to obtain this guarantee? • Alice is assuming nothing! No matter what the weather does, she’ll always get at least this amount. • Next lecture: What if the weather were “out to get her?” 8

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend