

SLIDE 1

Zero-Sum Games Game Theory 2020

Game Theory: Spring 2020

Ulle Endriss Institute for Logic, Language and Computation University of Amsterdam

Ulle Endriss 1

SLIDE 2

Plan for Today

Today we are going to focus on the special case of zero-sum games and discuss two positive results that do not hold for games in general.

  • new solution concepts: maximin and minimax solutions
  • Minimax Theorem: maximin = minimax = NE for zero-sum games
  • fictitious play: basic model for learning in games
  • convergence result for the case of zero-sum games

The first part of this is also covered in Chapter 3 of the Essentials.

  • K. Leyton-Brown and Y. Shoham. Essentials of Game Theory: A Concise, Multidisciplinary Introduction. Morgan & Claypool Publishers, 2008. Chapter 3.


SLIDE 3

Zero-Sum Games

Today we focus on two-player games ⟨N, A, u⟩ with N = {1, 2}.

Notation: Given player i ∈ {1, 2}, we refer to her opponent as −i.

Recall: A zero-sum game is a two-player normal-form game ⟨N, A, u⟩ for which u_i(a) + u_−i(a) = 0 for all action profiles a ∈ A.

Examples include (but are not restricted to) games in which you can win (+1), lose (−1), or draw (0), such as matching pennies:

            H          T
    H    1, −1     −1, 1
    T    −1, 1     1, −1

The slide also shows a second zero-sum example, with row actions T, B against column actions L, R and payoff entries 5, −3, 2, −5, 3, −2.
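The zero-sum property is easy to check mechanically. A minimal sketch in Python (the dictionary encoding of the payoff table is my own, not from the slides):

```python
# Matching pennies: each action profile maps to the pair (u_1, u_2).
matching_pennies = {
    ('H', 'H'): (1, -1), ('H', 'T'): (-1, 1),
    ('T', 'H'): (-1, 1), ('T', 'T'): (1, -1),
}

def is_zero_sum(game):
    """Check u_i(a) + u_-i(a) == 0 for every action profile a."""
    return all(u1 + u2 == 0 for (u1, u2) in game.values())

print(is_zero_sum(matching_pennies))  # True
```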


SLIDE 4

Constant-Sum Games

A constant-sum game is a two-player normal-form game ⟨N, A, u⟩ for which there exists a c ∈ ℝ such that u_i(a) + u_−i(a) = c for all a ∈ A.

Thus: A zero-sum game is a constant-sum game with constant c = 0.

Everything about zero-sum games to be discussed today also applies to constant-sum games, but for simplicity we only talk about the former.

Fun Fact: Football is not a constant-sum game, as you get 3 points for a win, 0 for a loss, and 1 for a draw. But prior to 1994, when the “three-points-for-a-win” rule was introduced, World Cup games were constant-sum (with 2, 0, and 1 points for a win, a loss, and a draw, respectively).
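The reduction from constant-sum to zero-sum is just a shift: subtracting c/2 from every payoff leaves best responses unchanged and makes the payoffs sum to zero. A small sketch, with a toy encoding of the pre-1994 football scoring (the helper name and dictionary layout are my own):

```python
def to_zero_sum(game, c):
    """Turn a constant-sum game (u1 + u2 == c in every cell) into an
    equivalent zero-sum game by subtracting c/2 from every payoff.
    Shifting all utilities by a constant does not change best responses."""
    return {a: (u1 - c / 2, u2 - c / 2) for a, (u1, u2) in game.items()}

# Pre-1994 football scoring as a constant-sum game with c = 2:
football = {('win', 'lose'): (2, 0), ('lose', 'win'): (0, 2),
            ('draw', 'draw'): (1, 1)}
zs = to_zero_sum(football, 2)
print(zs[('draw', 'draw')])  # (0.0, 0.0) -- every cell now sums to 0
```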


SLIDE 5

Maximin Strategies

The definitions on this slide apply to arbitrary normal-form games . . .

Suppose player i wants to maximise her worst-case expected utility (e.g., if all others conspire against her). Then she should play:

  s⋆_i ∈ argmax_{s_i ∈ S_i} min_{s_−i ∈ S_−i} u_i(s_i, s_−i)

Any such s⋆_i is called a maximin strategy (usually there is just one).

Solution concept: assume each player will play a maximin strategy.

Call max_{s_i} min_{s_−i} u_i(s_i, s_−i) player i’s maximin value (or security level).
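For matching pennies, the definition above can be evaluated numerically by searching over the row player's probability of playing H. This brute-force grid is only an illustration of the definition, not a general algorithm (exact maximin strategies are usually computed by linear programming):

```python
# Row player's worst-case expected utility in matching pennies when she
# plays H with probability p; the column player answers with a pure action.
def worst_case(p):
    u_vs_H = p * 1 + (1 - p) * (-1)   # column plays H
    u_vs_T = p * (-1) + (1 - p) * 1   # column plays T
    return min(u_vs_H, u_vs_T)        # worst case over column's responses

# Maximise the worst case over a grid of mixed strategies.
best_p = max((k / 100 for k in range(101)), key=worst_case)
print(best_p, worst_case(best_p))  # 0.5 0.0 -- security level 0 at p = 1/2
```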

SLIDE 6

Exercise: Maximin and Nash

Consider the two-player game shown on the slide, with row actions T, B, column actions L, R, and payoff entries 8, 8, 2, 2.

What is the maximin solution? How does this relate to Nash equilibria?

Note: This is neither a zero-sum nor a constant-sum game.


SLIDE 7

Exercise: Maximin and Nash Again

Now consider the very similar game shown on the slide, which is zero-sum, with row actions T, B, column actions L, R, and payoff entries 8, 8, −8, −8.

What is the maximin solution? How does this relate to Nash equilibria?


SLIDE 8

Minimax Strategies

Now focus on two-player games only, with players i and −i . . .

Suppose player i wants to minimise −i’s best-case expected utility (e.g., to punish her). Then i should play:

  s⋆_i ∈ argmin_{s_i ∈ S_i} max_{s_−i ∈ S_−i} u_−i(s_i, s_−i)

Remark: For a zero-sum game, an alternative interpretation is that player i has to play first and her opponent −i can respond.

Any such s⋆_i is called a minimax strategy (usually there is just one).

Call min_{s_i} max_{s_−i} u_−i(s_i, s_−i) player −i’s minimax value.

So i’s minimax value is min_{s_−i} max_{s_i} u_i(s_−i, s_i) = min_{s_−i} max_{s_i} u_i(s_i, s_−i).

SLIDE 9

Equivalence of Maximin and Minimax Values

Recall: For two-player games, we have seen the following definitions.

  • Player i’s maximin value is max_{s_i} min_{s_−i} u_i(s_i, s_−i).
  • Player i’s minimax value is min_{s_−i} max_{s_i} u_i(s_i, s_−i).

Lemma 1 In a two-player game, maximin and minimax value coincide:

  max_{s_i} min_{s_−i} u_i(s_i, s_−i) = min_{s_−i} max_{s_i} u_i(s_i, s_−i)

We omit the proof. For the case of two actions per player, there is a helpful visualisation in the Essentials. Note that one direction is easy: (⩽) the LHS is what i can achieve when she has to move first, while the RHS is what i can achieve when she can move second.

Remark: The lemma does not hold if we quantify over actions rather than strategies (counterexample: Matching Pennies).
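The remark about quantifying over actions can be checked directly: in Matching Pennies the two values computed over pure actions only do not coincide. A small sketch (the payoff encoding is my own):

```python
# Row player's payoffs in matching pennies.
U = {('H', 'H'): 1, ('H', 'T'): -1, ('T', 'H'): -1, ('T', 'T'): 1}
A = ['H', 'T']

# Quantifying over pure actions instead of mixed strategies:
pure_maximin = max(min(U[(ai, aj)] for aj in A) for ai in A)
pure_minimax = min(max(U[(ai, aj)] for ai in A) for aj in A)
print(pure_maximin, pure_minimax)  # -1 1 -- the two values differ
```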

SLIDE 10

The Minimax Theorem

Recall: A zero-sum game is a two-player game with u_i(a) + u_−i(a) = 0.

Theorem 2 (Von Neumann, 1928) In a zero-sum game, a strategy profile is a NE iff each player’s expected utility equals her minimax value.

Proof: Let v_i be the minimax/maximin value of player i (and v_−i = −v_i that of player −i).

(1) Suppose u_i(s_i, s_−i) ≠ v_i. Then one player does worse than she could (note that here we use the zero-sum property!). So (s_i, s_−i) is not a NE.

(2) Suppose u_i(s_i, s_−i) = v_i. Then each player already defends optimally against this worst of all possible attacks. So (s_i, s_−i) is a NE.

John von Neumann (1903–1957)

  • J. von Neumann. Zur Theorie der Gesellschaftsspiele. Mathematische Annalen, 100(1):295–320, 1928.
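For matching pennies the theorem can be verified numerically: in the Nash equilibrium both players mix 50/50, and each player's expected utility equals her minimax value of 0. A small check, assuming the standard matching-pennies payoffs:

```python
# Row player's payoffs in matching pennies; column player gets the negation.
U = {('H', 'H'): 1, ('H', 'T'): -1, ('T', 'H'): -1, ('T', 'T'): 1}

p = q = 0.5  # NE of matching pennies: both players play H with probability 1/2
prob = {'H': p, 'T': 1 - p}  # same distribution for both, since p == q

u_row = sum(U[(a1, a2)] * prob[a1] * prob[a2] for a1 in 'HT' for a2 in 'HT')
print(u_row)  # 0.0 -- equals the row player's minimax/maximin value
```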

SLIDE 11

Learning in Games

Suppose you keep playing the same game against the same opponents. You might try to learn their strategies. A good hypothesis might be that the frequency with which player i plays action a_i is approximately her probability of playing a_i.

Now suppose you always best-respond to those hypothesised strategies. And suppose everyone else does the same. What will happen?

We are going to see that for zero-sum games this process converges to a NE. This yields a method for computing a NE for the (non-repeated) game: just imagine players engage in such “fictitious play”.


SLIDE 12

Empirical Mixed Strategies

Given a history of actions H^ℓ_i = a^0_i, a^1_i, . . . , a^{ℓ−1}_i played by player i in ℓ prior plays of game ⟨N, A, u⟩, fix her empirical mixed strategy s^ℓ_i ∈ S_i:

  s^ℓ_i(a_i) = (1/ℓ) · #{k < ℓ | a^k_i = a_i}   for all a_i ∈ A_i

That is, s^ℓ_i(a_i) is the relative frequency of a_i in H^ℓ_i.
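The empirical mixed strategy is just an action-frequency table over the history. A one-function sketch (the function name is my own):

```python
def empirical_strategy(history, actions):
    """Relative frequency of each action in a history of l prior plays."""
    l = len(history)
    return {a: history.count(a) / l for a in actions}

print(empirical_strategy(['H', 'H', 'T'], ['H', 'T']))
# H was played in 2 of 3 rounds, T in 1 of 3
```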

SLIDE 13

Best Pure Responses

Recall: Strategy s⋆_i ∈ S_i is a best response for player i to the (partial) strategy profile s_−i if u_i(s⋆_i, s_−i) ⩾ u_i(s′_i, s_−i) for all s′_i ∈ S_i.

Due to the linearity of expected utilities we get:

Observation 3 For any given (partial) strategy profile s_−i, the set of best responses for player i must include at least one pure strategy.

So we can restrict attention to best pure responses for player i to s_−i:

  a⋆_i ∈ argmax_{a_i ∈ A_i} u_i(a_i, s_−i)

SLIDE 14

Fictitious Play

Take any action profile a^0 ∈ A for the normal-form game ⟨N, A, u⟩. Fictitious play of ⟨N, A, u⟩, starting in a^0, is the following process:

  • In round ℓ = 0, each player i ∈ N plays action a^0_i.
  • In any round ℓ > 0, each player i ∈ N plays a best pure response to her opponents’ empirical mixed strategies:

      a^ℓ_i ∈ argmax_{a_i ∈ A_i} u_i(a_i, s^ℓ_−i),

    where s^ℓ_{i′}(a_{i′}) = (1/ℓ) · #{k < ℓ | a^k_{i′} = a_{i′}} for all i′ ∈ N and a_{i′} ∈ A_{i′}.

Assume some deterministic way of breaking ties between maxima.

This yields a sequence a^0 ⇝ a^1 ⇝ a^2 ⇝ · · · with a corresponding sequence of empirical-mixed-strategy profiles s^0 ⇝ s^1 ⇝ s^2 ⇝ · · ·

Question: Does lim_{ℓ→∞} s^ℓ exist and is it a meaningful strategy profile?
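The process above can be sketched for matching pennies, breaking ties in favour of H. The encoding and function names are my own; Python's `max` returns the first maximiser, which implements the deterministic tie-breaking since H is listed first:

```python
# Row player's payoffs in matching pennies; the column player gets the negation.
U = {('H', 'H'): 1, ('H', 'T'): -1, ('T', 'H'): -1, ('T', 'T'): 1}
ACTIONS = ('H', 'T')  # listing H first breaks ties in favour of H

def fictitious_play(rounds, a0=('H', 'H')):
    history = [a0]
    for l in range(1, rounds):
        # Empirical mixed strategies after l plays.
        f1 = {a: sum(h[0] == a for h in history) / l for a in ACTIONS}
        f2 = {a: sum(h[1] == a for h in history) / l for a in ACTIONS}
        # Best pure responses to the opponent's empirical strategy.
        br1 = max(ACTIONS, key=lambda a1: sum(U[(a1, a2)] * f2[a2] for a2 in ACTIONS))
        br2 = max(ACTIONS, key=lambda a2: sum(-U[(a1, a2)] * f1[a1] for a1 in ACTIONS))
        history.append((br1, br2))
    return history

print(fictitious_play(7))
# first seven profiles: HH, HT, HT, TT, TT, TT, TH
```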

SLIDE 15

Example: Matching Pennies

Let’s see what happens when we start in the upper left-hand corner HH (and break ties between equally good responses in favour of H):

            H          T
    H    1, −1     −1, 1
    T    −1, 1     1, −1

Any strategy can be represented by a single probability (of playing H).

  HH (1/1, 1/1) ⇝ HT (2/2, 1/2) ⇝ HT (3/3, 1/3) ⇝ TT (3/4, 1/4) ⇝ TT (3/5, 1/5) ⇝ TT (3/6, 1/6) ⇝ TH (3/7, 2/7) ⇝ TH (3/8, 3/8) ⇝ TH (3/9, 4/9) ⇝ TH (3/10, 5/10) ⇝ HH (4/11, 6/11) ⇝ HH (5/12, 7/12) ⇝ · · ·

Exercise: Can you guess what this will converge to?

SLIDE 16

Convergence Profiles are Nash Equilibria

In general, lim_{ℓ→∞} s^ℓ does not exist (no guaranteed convergence). But:

Lemma 4 If fictitious play converges, then it converges to a Nash equilibrium.

Proof: Suppose s⋆ = lim_{ℓ→∞} s^ℓ exists. To see that s⋆ is a NE, note that s⋆_i is the strategy that i seems to play when she best-responds to s⋆_−i, which she believes to be the profile of strategies of her opponents.

Remark: This lemma is true for arbitrary (not just zero-sum) games.

SLIDE 17

Convergence for Zero-Sum Games

Good news:

Theorem 5 (Robinson, 1951) For any zero-sum game and initial action profile, fictitious play will converge to a Nash equilibrium.

We know that if FP converges, then it converges to a NE. Thus, we still have to show that it will converge. The proof of this fact is difficult and we are not going to discuss it here.

Julia Robinson (1919–1985)

  • J. Robinson. An Iterative Method of Solving a Game. Annals of Mathematics, 54(2):296–301, 1951.

SLIDE 18

Summary

We have seen that zero-sum games are particularly well-behaved:

  • Minimax Theorem: your expected utility in a Nash equilibrium will simply be your minimax/maximin value
  • Convergence of fictitious play: if each player keeps best-responding to their opponent’s estimated strategy based on observed frequencies, these estimates will converge to a Nash equilibrium

Both results give rise to alternative methods for computing a NE.

What next? Players who have incomplete information (are uncertain) about certain aspects of the game, such as their opponents’ utilities.