Game Theory -- Lecture 3 Patrick Loiseau EURECOM Fall 2016 1

Lecture 2 recap • Defined Pareto optimality – Coordination games • Studied games with continuous action space – Always have a Nash equilibrium with some conditions – Cournot duopoly example à Can we always find a Nash equilibrium for all games? à How? 2

Outline 1. Mixed strategies Best response and Nash equilibrium – 2. Mixed strategies Nash equilibrium computation 3. Interpretations of mixed strategies 3

Example: installing checkpoints • Two road, Police choose on which to check, Terrorists choose on which to pass Terrorist • Can you find a Nash R1 R2 equilibrium? R1 1 , -1 -1, 1 à Players must Police randomize -1, 1 1, -1 R2 5

Matching pennies • Similar examples: – Checkpoint placement Player 2 heads tails – Intrusion detection – Penalty kick heads 1 , -1 -1, 1 – Tennis game Player 1 • Need to be unpredictable -1, 1 1, -1 tails 6

Pure strategies/Mixed strategies • Game ( ) ( ) i ∈ N , u i ( ) i ∈ N N , A i • A i : set of actions of player i (what we called S i before) • Action = pure strategy • Mixed strategy : distribution over pure strategies s i ∈ S i = Δ ( A i ) – Include pure strategy as special case – Support: supp s i = { a i ∈ A i : s i ( a i ) > 0} • Strategy profile: s = ( s 1 ,  , s n ) ∈ S = S 1 ×  × S n 7

Matching pennies: payoffs What is Player 1’s payoff if Player 2 • plays s 2 = (1/4, 3/4) and he plays: Player 2 – Heads? heads tails heads 1 , -1 -1, 1 – Tails? Player 1 -1, 1 1, -1 tails – s 1 = (½, ½)? 8

Payoffs in mixed strategies: general formula • Game , let ( ) ( ) i ∈ N , u i ( ) i ∈ N N , A i A = × i ∈ N A i • If players follow a mixed-strategy profile s, the expected payoff of player i is: ∑ ∏ u i ( s ) = u i ( a )Pr( a | s ) where Pr( a | s ) = s i ( a i ) a ∈ A i ∈ N • a: pure strategy (or action) profile • Pr(a|s): probability of seeing a given the mixed strategy profile s 9

Matching pennies: payoffs check What are the payoffs of Player 1 • and Player 2 if s = ((½, ½), (¼, ¾))? Player 2 heads tails heads 1 , -1 -1, 1 Player 1 -1, 1 1, -1 tails Does that look like it could be a • Nash equilibrium? 10

Best response • The definition for mixed strategies is unchanged! Definition: Best Response Player i ’s strategy ŝ i is a BR to strategy s -i of other players if: u i (ŝ i , s -i ) ≥ u i (s’ i , s -i ) for all s’ i in S i • BR i (s -i ): set of best responses of i to s -i 11

Matching pennies: best response • What is the best response of Player 1 to s 2 = (¼, ¾)? Player 2 heads tails • For all s 1 , u 1 (s 1 , s 2 ) lie between u 1 (heads, s 2 ) and u 1 (tails, s 2 ) heads 1 , -1 -1, 1 (the weighted average lies between the pure strategies exp. Payoffs) Player 1 -1, 1 1, -1 tails à Best response is tails! 12

Important property • If a mixed strategy is a best response then each of the pure strategies in the mix must be best responses è They must yield the same expected payoff Proposition: For any (mixed) strategy s -i , if , then s i ∈ BR i ( s − i ) . a i ∈ BR i ( s − i ) for all a i such that s i ( a i ) > 0 In particular, u i (a i , s -i ) is the same for all a i such that s i ( a i ) > 0 13

Wordy proof • Suppose it were not true. Then there must be at least one pure strategy a i that is assigned positive probability by my best-response mix and that yields a lower expected payoff against s i • If there is more than one, focus on the one that yields the lowest expected payoff. Suppose I drop that (low-yield) pure strategy from my mix, assigning the weight I used to give it to one of the other (higher-yield) strategies in the mix • This must raise my expected payoff • But then the original mixed strategy cannot have been a best response: it does not do as well as the new mixed strategy • This is a contradiction 14

Matching pennies again • What is the best response of Player 1 to s 2 = (¼, ¾)? Player 2 heads tails • What is the best response heads 1 , -1 -1, 1 of Player 1 to s 2 = (½, ½)? Player 1 -1, 1 1, -1 tails 15

Nash equilibrium definition Definition: Nash Equilibrium A strategy profile (s 1 *, s 2 *,…, s N *) is a Nash Equilibrium (NE) if, for each i, her choice s i * is a best response to the other players’ choices s -i * • Same definition as for pure strategies! – But here the strategies s i* are mixed strategies 16

Matching pennies again • Nash equilibrium: ((½, ½), (½, ½)) Player 2 heads tails heads 1 , -1 -1, 1 Player 1 -1, 1 1, -1 tails 17

Nash equilibrium existence theorem Theorem: Nash (1951) Every finite game has a Nash equilibrium. • In mixed strategy! – Not true in pure strategy • Finite game: finite set of player and finite action set for all players – Both are necessary! • Proof: reduction to Kakutani’s fixed-point thm 18

Computation of mixed strategy NE • Hard if the support is not known • If you can guess the support, it becomes very easy, using the property shown earlier: Proposition: For any (mixed) strategy s -i , if , then s i ∈ BR i ( s − i ) . a i ∈ BR i ( s − i ) for all a i such that s i ( a i ) > 0 In particular, u i (a i , s -i ) is the same for all a i such that (i.e., a i in the support of s i ) s i ( a i ) > 0 20

Example: battle of the sexes Player 2 Soccer Opera 2,1 0,0 Opera Player 1 0,0 1,2 Soccer • We have seen that (O, O) and (S, S) are NE • Is there any other NE (in mixed strategies)? – Let’s try to find a NE with support {O, S} for each player 21

Example: battle of the sexes (2) Player 2 Soccer Opera 2,1 0,0 Opera Player 1 0,0 1,2 Soccer • Let s 2 = (p, 1-p) • If s 1 is a BR with support {O, S}, then Player 1 must be indifferent between O and S à p = 1/3 22

Example: battle of the sexes (3) Player 2 Soccer Opera 2,1 0,0 Opera Player 1 0,0 1,2 Soccer • Similarly, let s 1 = (q, 1-q) • If s 2 is a BR with support {O, S}, then Player 2 must be indifferent between O and S à q = 2/3 23

Example: battle of the sexes (4) Player 2 Soccer Opera 2,1 0,0 Opera Player 1 0,0 1,2 Soccer • Conclusion: ((2/3, 1/3), (1/3, 2/3)) is a NE 24

Example: prisoner’s dilemma Prisoner 2 • We know that (D, D) is NE D C • Can we find a NE with support {C, D} with each? D -5, -5 0, -6 Prisoner 1 -6, 0 -2, -2 C • A NE in strictly dominant strategies is unique! 25

General methods to compute Nash equilibrium • If you know the support, write the equations translating indifference between strategies in the support (works for any number of actions!) • Otherwise: – The Lemke-Howson Algorithm (1964) – Support enumeration method (Porter et al. 2004) • Smart heuristic search through all sets of support • Exponential time worst case complexity 26

Complexity of finding Nash equilibrium • Is it NP-complete? – No, we know there is a solution – But many derived problems are (e.g., does there exists a strictly Pareto optimal Nash equilibrium?) • PPAD (“Polynomial Parity Arguments on Directed graphs”) [Papadimitriou 1994] • Theorem: Computing a Nash equilibrium is PPAD-complete [Chen, Deng 2006] 27

Complexity of finding Nash equilibrium (2) NP-hard NP-complete NP PPAD P 28

Mixed strategies interpretations • Players randomize • Belief of others’ actions (that make you indifferent) • Empirical frequency of play in repeated interactions • Fraction of a population – Let’s see an example of this one 30

The Income Tax Game (1) Tax payer Cheat Honest 2,0 4,-10 A p Auditor 4,0 0,4 N (1-p) 1-q q • Assume simultaneous move game • Is there a pure strategy NE? • Find mixed strategy NE 31

The Income Tax Game: NE computation • Mixed strategies NE: [ ] ( ( ) ) - = + - ü E U A , q , 1 q 2 q 4 ( 1 q ) 2 1 = - Þ = 2 q 4 ( 1 q ) q ý [ ] ( ( ) ) - = + - E U N , q , 1 q 4 q 0 ( 1 q ) 3 þ 1 [ ] ( ( ) ) - = ü E U H , p , 1 p 0 2 2 = Þ = 4 14 p p ý [ ] ( ( ) ) - = - + - E U C , p , 1 p 10 p 4 ( 1 p ) 7 þ 2 Look at tax To find payers auditors payoffs mixing 32

The Income Tax Game: mixed strategy interpretation • From the auditor’s point of view, he/she is going to audit a single tax payer 2/7 of the time è This is actually a randomization (which is applied by law) • From the tax payer perspective, he/she is going to be honest 2/3 of the time è This in reality implies that 2/3 rd of population is going to pay taxes honestly, i.e., this is a fraction of a large population paying taxes 33

Game Theory -- Lecture 3 Patrick Loiseau EURECOM Fall 2016 1 - PowerPoint PPT Presentation

Game Theory -- Lecture 3 Patrick Loiseau EURECOM Fall 2016 1 Lecture 2 recap Defined Pareto optimality Coordination games Studied games with continuous action space Always have a Nash equilibrium with some conditions

e-Bug Junior Game Junior Game Game Style Game Process Demo Game Mechanics and

e-Bug Senior Game Senior Game Game Style Game Process Demo Game Puzzles and

Game interoperability with functors functor AgsFun (structure Game : GAME) :> sig structure

Game Theory and Nuclear Weapons Game Theory and Nuclear Weapons Game Theory and Nuclear Warfare

Lecture 7: Game theory David Aldous February 24, 2016 STAT 155 is an entire course on Game

Game theory (Ch. 17.5) Announcements Midterm Thursday Game theory Typically game theory uses a

Game Theory: Definition and Assumptions Game Theory and Strategy Game theory studies strategic

Introduction to game theory Introduction to game theory Jie Gao Computer Science Department

Game Theory: Spring 2020 Ulle Endriss Institute for Logic, Language and Computation University

Introduction to Game Theory (1) Mehdi Dastani BBL-521 M.M.Dastani@uu.nl Game Theory What is

Coalitional Game Theory Game Theory MohammadAmin Fazli Algorithmic Game Theory 1 TOC

Game theory (Ch. 17.5) Game theory Typically game theory uses a payoff matrix to represent the

Game Theory CS 188: Artificial Intelligence Game theory: study of strategic situations,

Game Loops CIS 580 - Fundamentals of Game Programming Hangman Game Phases Game Loop

VIDEOGAMES ARE A MESS Ian Bogost WHAT IS A GAME? Is a game a system of rules, or is a game a

Nash demand game Julio D avila 2009 Julio D avila Nash demand game Nash demand game

Computing an LLL-reduced Basis of the Orthogonal Lattice Jingwei Chen Damien Stehl e Gilles

Advances in isogeny-based cryptography Benjamin Smith Inria + Laboratoire dInformatique de

The Variational Predictive Natural Gradient Da Tang 1 Rajesh Ranganath 2 1 Columbia University 2

Mixed Integer Linear Programming Combinatorial Problem Solving (CPS) Javier Larrosa Albert

CS780 Discrete-State Models Instructor: Peter Kemper R 006, phone 221-3462, email:kemper@cs.wm.edu

Transforming Clinical Practice Initiative: A Service Delivery Innovation Model Better Health.

Entanglement Spectroscopy and its application to the fractional quantum Hall phases N. Regnault

http://cs224w.stanford.edu Degree distribution: P(k) Path length: h Clustering coefficient: C

Sambuz

Useful Links

Newsletter

Mail Us