Plan for Today This is an introduction to Game Theory. In - PowerPoint PPT Presentation

Game Theory Multiagent Systems 2006 Game Theory Multiagent Systems 2006 Plan for Today This is an introduction to Game Theory. In particular, we’ll discuss: • Introductory examples: Prisoners Dilemma , Game of Chicken , . . . • Distinguishing dominant strategies and equilibrium strategies Multiagent Systems: Spring 2006 • Distinguishing pure and mixed Nash equilibria Ulle Endriss • Existence of mixed Nash equilibria Institute for Logic, Language and Computation University of Amsterdam • Computing mixed Nash equilibria We are going to concentrate on non-cooperative (rather than cooperative) strategic (rather than extensive) games with perfect (rather than imperfect) information. We’ll see later what these distinctions actually mean. Ulle Endriss (ulle@illc.uva.nl) 1 Ulle Endriss (ulle@illc.uva.nl) 3 Game Theory Multiagent Systems 2006 Game Theory Multiagent Systems 2006 Prisoner’s Dilemma Game Theory Two partners in crime, A and B , are separated by police and each one of them is offered the following deal: • Last week we have looked into the problem of collective decision • only you confess ❀ go free making from a social point of view — what kind of decision would be good for society? • only the other one confesses ❀ spend 5 years in prison • Today we are going to analyse the behaviour of individual agents • both confess ❀ spend 3 years in prison in the context of making collective decisions. • neither one confesses ❀ get 1 year on remand • Game Theory is the branch of Economic Sciences that studies the u A / u B B confesses B does not strategic behaviour of rational agents in the context of interactive decision-making problems. A confesses 2/2 5/0 A does not 0/5 4/4 • Given the rules of the “game” (the negotiation mechanism , the protocol ), what strategy should a rational agent adopt? (utility = 5 − years in prison) ◮ What would be a rational strategy? Ulle Endriss (ulle@illc.uva.nl) 2 Ulle Endriss (ulle@illc.uva.nl) 4

Game Theory Multiagent Systems 2006 Game Theory Multiagent Systems 2006 Nash Equilibria Dominant Strategies • A Nash equilibrium is a set of strategies, one for each player, such • A strategy is called (strictly) dominant iff, independently of what that no player could improve their payoff by unilaterally deviating any of the other players do, following that strategy will result in a from their assigned strategy (named so after John F. Nash, Nobel larger payoff than any other strategy. Prize in Economic Sciences in 1994; Academy Award in 2001). • Prisoner’s Dilemma: both players have a dominant strategy, • Battle of the Sexes: two Nash equilibria namely to confess: – Both Ann and Bob go to the theatre. – from A ’s point of view: – Both Ann and Bob go to see the football match. ∗ if B confesses, then A is better off confessing as well • In cases where there are no dominant strategies, a set of ∗ if B does not confess, then A is also better off confessing equilibrium strategies is the next best thing. – similarly for B • Discussion: Games with a Nash equilibrium are of great interest to • Terminology: For games of this kind, we say that each player may MAS, because you do not need to keep your strategy secret and either cooperate with its opponent (e.g. by not confessing) or you do not need to waste resources on trying to find out about defect (e.g. by confessing). other agents’ strategies. Naturally, a unique equilibrium is better. Ulle Endriss (ulle@illc.uva.nl) 5 Ulle Endriss (ulle@illc.uva.nl) 7 Game Theory Multiagent Systems 2006 Game Theory Multiagent Systems 2006 Back to the Prisoner’s Dilemma • Unique Nash equilibrium, namely when both players confess: – if A changes strategy unilaterally, she will do worse – if B changes strategy unilaterally, she will also do worse Battle of the Sexes • Discussion: Our analysis shows that it would be rational to Ann ( A ) and Bob ( B ) have different preferences as to what to do on a confess. However, this seems counter-intuitive, because both Saturday night . . . players would be better off if both of them were to remain silent. u A / u B Bob: theatre Bob: football • So there’s a conflict: the stable solution given by the equilibrium is not efficient , because the outcome is not Pareto optimal. Ann: theatre 2/1 0/0 Ann: football 0/0 1/2 • Iterated Prisoner’s Dilemma: – In each round, each player can either cooperate or defect. – Because the other player could retaliate in the next round, it is rational to cooperate. – But it does not work if the number of rounds is fixed . . . Ulle Endriss (ulle@illc.uva.nl) 6 Ulle Endriss (ulle@illc.uva.nl) 8

Game Theory Multiagent Systems 2006 Game Theory Multiagent Systems 2006 Game of Chicken How many Nash equilibria? James and Marlon are driving their cars towards each other at top Keep in mind that the first player chooses the row (T/B) and the speed. Whoever swerves to the right first is a “chicken”. second player chooses the column (L/R) . . . L R L R L R u J / u M M drives on M turns T 2/2 2/1 T 2/2 2/2 T 1/2 2/1 J drives on 0/0 8/1 B 1/3 3/2 B 2/2 2/2 B 2/1 1/2 J turns 1/8 5/5 Ulle Endriss (ulle@illc.uva.nl) 9 Ulle Endriss (ulle@illc.uva.nl) 11 Game Theory Multiagent Systems 2006 Game Theory Multiagent Systems 2006 Notation and Formal Definition Analysing the Game of Chicken A strategic game consists of a set of players , a set of actions for each • No dominant strategy (best move depends on the other player) player, and a preference relation over action profiles for each player. • Two Nash equilibria: • Players: i ∈ { 1 , . . . , n } – James drives on and Marlon turns • Actions: each player i has a set A i of possible actions ∗ if James deviates (and turns), he will be worse off • Action profiles: a = ( a 1 , a 2 , . . . , a n ) for players 1 , . . . , n ∗ if Marlon deviates (and drives on), he will be worse off • Preferences: represented by utilities u i : A 1 × · · · × A n → R – Marlon drives on and James turns (similar argument) Write ( a − i , a ′ i ) for the action profile that is like a , except that player i • If you have reason to believe your opponent will turn, then you chooses a ′ i rather than a i . should drive on. If you have reason to believe your opponent will drive on, then you should turn. ◮ Then a Nash equilibrium is an action profile a such that u i ( a ) ≥ u i ( a − i , a ′ i ) for every player i and every action a ′ i of player i . Ulle Endriss (ulle@illc.uva.nl) 10 Ulle Endriss (ulle@illc.uva.nl) 12

Game Theory Multiagent Systems 2006 Game Theory Multiagent Systems 2006 Remarks A Game without Nash Equilibria • As we have seen, there are games that have no Nash equilibrium. Recall that the following game does not have a Nash equilibrium: • Observe that while we use utilities for ease of presentation, only L R ordinal preferences matter (cardinal intensities are irrelevant). T 1/2 2/1 • Here we only model one-off decisions . In MAS, in particular, it is B 2/1 1/2 however more likely that following a given protocol requires taking a sequence of decisions . Whichever action the row player chooses, the column player can react But we can map an agent’s decision making capability to a single in such a way that the row player would have rather chosen the other strategy encoding what the agent would do in any given situation. way. And so on . . . Hence, the game theoretical-models do apply here as well (see also ◮ Idea: Use a probability distribution over all actions as your strategy. extensive games). Ulle Endriss (ulle@illc.uva.nl) 13 Ulle Endriss (ulle@illc.uva.nl) 15 Game Theory Multiagent Systems 2006 Game Theory Multiagent Systems 2006 Mixed Strategies A mixed strategy p i of a player i is a probability distribution over the actions A i available to i . Surprise Exam Example: Suppose player 1 has three actions: T, M and B; and suppose their order is clear from the context. Then the mixed strategy Suppose a newspaper announces the following competition: to play T with probability 1 2 , M with probability 1 6 , and B with ◮ Every reader may send in a (rational) number between 0 and 100. probability 1 3 , is written as p 1 = ( 1 2 , 1 6 , 1 3 ) . The winner is the player whose number is closest to 2 3 times the The expected payoff of a profile p of mixed strategies: arithmetic mean of all submissions (in case of a tie the prize payoff for a money is split equally amongst those with the best guesses). � �� E i ( p ) = ( u i ( a ) × p i ( a i ) ) What number would you submit (and why)? a ∈ A 1 ×···× A n i ∈{ 1 ,...,n } � �� sum over all probability of action profiles a choosing a Ulle Endriss (ulle@illc.uva.nl) 14 Ulle Endriss (ulle@illc.uva.nl) 16

Plan for Today This is an introduction to Game Theory. In - PowerPoint PPT Presentation

Game Theory Multiagent Systems 2006 Game Theory Multiagent Systems 2006 Plan for Today This is an introduction to Game Theory. In particular, well discuss: Introductory examples: Prisoners Dilemma , Game of Chicken , . . .

PHASE IA PLAN ULTIMATE PLAN 13 PHASE IB PLAN ULTIMATE PLAN 14 ULTIMATE PLAN ULTIMATE PLAN

NEW COURTHOUSE 1 ST FLOOR PLAN ANNEX 3rd FLOOR PLAN 2nd FLOOR PLAN BASEMENT FLOOR PLAN ANNEX

What is the League Today 1 1/23/2017 What is the League Today What is the League Today 2

Today marks our kick off for the 2040 Long Range Transportation Plan. Today marks our kick off for

Medical Plan Comparison Central Care Plan Medical / Prescription Benefit Summary Advantage HDHP/HSA

Master Plan Open House #3 Preferred Alternative Master Plan Master Plan Process What is a

Site Plan May 2009 Site Plan February 2010 Site Plan May 5, 2010 Site Plan

Social/Network/Analysis mohamed.bouguessa@uqo.ca/ 1 Web/today 2

Lecture 15 Logistics HW4 is due today HW5 posted today HW5 posted today Exam

Retirement Plan Information Session Todays meeting Why plan for retirement? Your Plan

To Plan or Not To Plan Failure to plan is a plan to fail Questions to ask yourself Who is this

Retirement Plan Changes Update Overview Retirement Plan Options at W&M No changes to

Impr Improvement ement Plan Orienta Plan Orientation tion Building Your Plan for Academic

San Pedro Community Plan San Pedro Community Plan Presentation Overview Community Plan

Local Development Plan Local Development Plan Local Development Plan Local Development Plan

Project Area Vilas Park Vilas Park Master Plan Vilas Park Master Plan Plan Maestro De Vilas Park

Classical and Weak Solutions to Local First Order Mean Field Games through Elliptic Regularity

Multiple Comparison Procedures Cohen Chapter 13 For EDUC/PSY 6600 1 We have to go to the

Logicism - Frege Caroline Foster / Edgar Andrade Philosophy of Mathematics ILLC - Master of

Introduction to Large-Scale ML Shan-Hung Wu shwu@cs.nthu.edu.tw Department of Computer Science,

Decisions with Multiple Agents: Game Theory Alice Gao Lecture 24 Based on work by K.

What? The study of interacting decision makers Economy Biology Sociology Computer Science

Council Retreat JANUARY 29-30, 2020 Sea Level Rise Subcommittee Priorities S tormwater

Using Public Private Partnerships to Deliver Successful Rail Projects Dr Nick Higton Director,

Sambuz

Useful Links

Newsletter

Mail Us

Plan for Today This is an introduction to Game Theory. In - PowerPoint PPT Presentation

Game Theory Multiagent Systems 2006 Game Theory Multiagent Systems 2006 Plan for Today This is an introduction to Game Theory. In particular, well discuss: Introductory examples: Prisoners Dilemma , Game of Chicken , . . .

PHASE IA PLAN ULTIMATE PLAN 13 PHASE IB PLAN ULTIMATE PLAN 14 ULTIMATE PLAN ULTIMATE PLAN

NEW COURTHOUSE 1 ST FLOOR PLAN ANNEX 3rd FLOOR PLAN 2nd FLOOR PLAN BASEMENT FLOOR PLAN ANNEX

What is the League Today 1 1/23/2017 What is the League Today What is the League Today 2

Today marks our kick off for the 2040 Long Range Transportation Plan. Today marks our kick off for

Medical Plan Comparison Central Care Plan Medical / Prescription Benefit Summary Advantage HDHP/HSA

Master Plan Open House #3 Preferred Alternative Master Plan Master Plan Process What is a

Site Plan May 2009 Site Plan February 2010 Site Plan May 5, 2010 Site Plan

Social/Network/Analysis mohamed.bouguessa@uqo.ca/ 1 Web/today 2

Lecture 15 Logistics HW4 is due today HW5 posted today HW5 posted today Exam

Retirement Plan Information Session Todays meeting Why plan for retirement? Your Plan

To Plan or Not To Plan Failure to plan is a plan to fail Questions to ask yourself Who is this

Retirement Plan Changes Update Overview Retirement Plan Options at W&amp;M No changes to

Impr Improvement ement Plan Orienta Plan Orientation tion Building Your Plan for Academic

San Pedro Community Plan San Pedro Community Plan Presentation Overview Community Plan

Local Development Plan Local Development Plan Local Development Plan Local Development Plan

Project Area Vilas Park Vilas Park Master Plan Vilas Park Master Plan Plan Maestro De Vilas Park

Classical and Weak Solutions to Local First Order Mean Field Games through Elliptic Regularity

Multiple Comparison Procedures Cohen Chapter 13 For EDUC/PSY 6600 1 We have to go to the

Logicism - Frege Caroline Foster / Edgar Andrade Philosophy of Mathematics ILLC - Master of

Introduction to Large-Scale ML Shan-Hung Wu shwu@cs.nthu.edu.tw Department of Computer Science,

Decisions with Multiple Agents: Game Theory Alice Gao Lecture 24 Based on work by K.

What? The study of interacting decision makers Economy Biology Sociology Computer Science

Council Retreat JANUARY 29-30, 2020 Sea Level Rise Subcommittee Priorities S tormwater

Using Public Private Partnerships to Deliver Successful Rail Projects Dr Nick Higton Director,

Sambuz

Useful Links

Newsletter

Mail Us

Retirement Plan Changes Update Overview Retirement Plan Options at W&M No changes to