CS 598 RM : Algorithmic game theory Lecture 1

Two-player games

For any two-player game, we have the following basic notation.

Table 1: Basic notation

                    Player 1 (P1)   Player 2 (P2)
  Set of actions    S1              S2
  Action            i ∈ S1          j ∈ S2
  Payoff/gain       Aij             Bij

When the two players choose actions i, j respectively, their payoffs are Aij, Bij respectively. These can be conveniently represented as two matrices A, B, each of size m × n, where m = |S1| and n = |S2|, as follows:

        1           · · ·     j      · · ·      n
  1  ⎛ (A11, B11)                                ⎞
  ⋮  ⎜          ⋱                                ⎟
  i  ⎜               (Aij, Bij)                  ⎟
  ⋮  ⎜                          ⋱                ⎟
  m  ⎝                              (Amn, Bmn)   ⎠

Due to this representation, these games are also called bi-matrix games.

Example : Matching pennies

Both players have two actions each, given by S1 = S2 = {Heads, Tails}. P1 aims to match the outcomes, while P2 does not. The following payoffs capture this situation (rows are P1's actions, columns are P2's):

           H          T
  H   (1, −1)   (−1, 1)
  T   (−1, 1)   (1, −1)

In this game, no pair of actions is stable. In such a case, the players can randomize. We formalize this next.
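As a quick sanity check (a minimal numpy sketch, not part of the notes), one can enumerate all pure action pairs and confirm that none is stable, i.e., that in no cell is each player's action a best response to the other's:

```python
import numpy as np

# Matching pennies payoff matrices.
A = np.array([[1, -1], [-1, 1]])   # P1's payoffs (row player)
B = -A                             # P2's payoffs (zero-sum here)

# (i, j) is a pure Nash equilibrium iff A[i, j] is maximal in
# column j (P1 cannot improve) and B[i, j] is maximal in row i
# (P2 cannot improve).
pure_ne = [(i, j)
           for i in range(A.shape[0])
           for j in range(A.shape[1])
           if A[i, j] == A[:, j].max() and B[i, j] == B[i, :].max()]
print(pure_ne)  # [] -- no pure action pair is stable
```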


More notation and fundamentals

The randomization between possible actions is achieved by what is called a mixed strategy. We denote the sets of mixed strategies for P1 and P2 by ∆1 and ∆2 respectively, given by

  ∆1 = {x = (x1, x2, . . . , x|S1|) | xi ≥ 0 ∀i ∈ S1, and Σ_{i∈S1} xi = 1}, and
  ∆2 = {y = (y1, y2, . . . , y|S2|) | yj ≥ 0 ∀j ∈ S2, and Σ_{j∈S2} yj = 1}.

When the two players play strategies x ∈ ∆1 and y ∈ ∆2 respectively, the expected payoff of P1 is given by

  Σ_{i∈S1} Σ_{j∈S2} Aij xi yj = x^T A y,

and similarly, that of P2 is x^T B y. Thus, P1 tries to maximize x^T A y, and P2 tries to maximize x^T B y.

Definition (Nash equilibrium). A strategy profile (x′, y′) is a Nash equilibrium (NE) iff

  x′ ∈ argmax_{x∈∆1} x^T A y′   and   y′ ∈ argmax_{y∈∆2} x′^T B y.

Having defined the NE, one would like to answer the following questions:

  • How to check if a given strategy profile is a NE?
  • Does a NE exist in a given game? In every game?
  • How to compute a NE?

Theorem (Nash ’51). Every finite n-player game has a NE (n ∈ N).

Characterization of NE

Fix y for P2. Then, P1 gets a payoff of (Ay)i from action i ∈ S1. Thus, the maximum possible from any action is max_{i∈S1} (Ay)i = v (say). Hence, playing x gives P1 a payoff of

  x^T A y = Σ_{i∈S1} xi (Ay)i = a convex combination of the (Ay)i's.

∴ x^T A y ≤ v, and x^T A y = v iff (∀i ∈ S1, xi > 0 ⇒ (Ay)i = v).

A similar analysis works for P2 as well. Fixing P1's strategy to x, P2 gets a payoff of (x^T B)j from action j ∈ S2. Letting w = max_{j∈S2} (x^T B)j, we can deduce

  ∀y ∈ ∆2, x^T B y ≤ w, and x^T B y = w iff (∀j ∈ S2, yj > 0 ⇒ (x^T B)j = w).

We summarize this analysis as the following theorem characterizing Nash equilibria:


Theorem 1. (x, y) is a NE iff

  ∀i ∈ S1 : xi > 0 ⇒ (Ay)i = v,   and   ∀j ∈ S2 : yj > 0 ⇒ (x^T B)j = w,

where v = max_{i∈S1} (Ay)i and w = max_{j∈S2} (x^T B)j.

This theorem allows us to easily check if a strategy profile is a NE.
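Theorem 1 translates directly into a checker. The following is a minimal numpy sketch (the function name and tolerance are our own, not from the notes): every action played with positive probability must attain the best-response payoff against the opponent's strategy.

```python
import numpy as np

def is_nash(A, B, x, y, tol=1e-9):
    """Check (x, y) against Theorem 1: each action in a player's
    support must achieve that player's best-response payoff."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    v = (A @ y).max()   # best payoff P1 can get against y
    w = (x @ B).max()   # best payoff P2 can get against x
    ok1 = all(abs((A @ y)[i] - v) <= tol for i in range(len(x)) if x[i] > tol)
    ok2 = all(abs((x @ B)[j] - w) <= tol for j in range(len(y)) if y[j] > tol)
    return ok1 and ok2

# Matching pennies: uniform mixing is a NE, a pure profile is not.
A = np.array([[1.0, -1.0], [-1.0, 1.0]])
B = -A
print(is_nash(A, B, [0.5, 0.5], [0.5, 0.5]))  # True
print(is_nash(A, B, [1, 0], [1, 0]))          # False
```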

Zero-sum games

In these games, we have Bij = −Aij ∀i ∈ S1, ∀j ∈ S2, i.e., simply B = −A. Hence, these games are described by just one matrix A. P1 tries to maximize its payoff, and thus maximize x^T A y. Similarly, P2 tries to maximize x^T (−A) y, and thus minimize x^T A y. Hence, P1 is called the maximizer and P2 is called the minimizer.

Minimax play in zero-sum games

Suppose both players play pessimistically. To elaborate, P1 assumes that P2 can find out its strategy x ahead of time and play y accordingly to achieve its goal of minimizing x^T A y. P2 takes a similar approach in choosing its strategy. Suppose they decide on x∗, y∗ as their respective strategies by playing pessimistically as described. Then it must be that

  x∗ ∈ argmax_{x∈∆1} min_{y∈∆2} x^T A y   and   y∗ ∈ argmin_{y∈∆2} max_{x∈∆1} x^T A y.

Now, let π1 denote P1's guaranteed payoff, that is, the best worst-case payoff it can ensure, precisely as demonstrated in the pessimistic approach mentioned above. That is,

  π1 = max_{x∈∆1} min_{y∈∆2} x^T A y   (1)
     = min_{y∈∆2} x∗^T A y.            (2)

Similarly, let π2 be P2's guaranteed payoff, that is,

  π2 = min_{y∈∆2} max_{x∈∆1} x^T A y   (3)
     = max_{x∈∆1} x^T A y∗.            (4)

We now show a remarkable result.


Theorem 2. For x∗, y∗, π1, π2 as defined above, the following hold.

  1. π1 = π2 = x∗^T A y∗.
  2. If (x′, y′) is a NE, then x′^T A y′ = x∗^T A y∗.
  3. (x∗, y∗) is a NE.

Proof. Using the definition of π1 as in (2), it follows that π1 ≤ x∗^T A y∗, since the minimum over y ∈ ∆2 is at most the value at y∗. Similarly, using the definition of π2 in (4), it follows that π2 ≥ x∗^T A y∗. Combining the two, we get

  π1 ≤ x∗^T A y∗ ≤ π2.   (5)

Further, for a NE (x′, y′), by definition of NE, we have

  x′^T A y′ = max_{x∈∆1} x^T A y′,   (6)
  x′^T A y′ = min_{y∈∆2} x′^T A y.   (7)

From (7) and (1), we get π1 ≥ x′^T A y′. Similarly, from (6) and (3), we get π2 ≤ x′^T A y′. Combining the two, we get

  π2 ≤ x′^T A y′ ≤ π1.   (8)

(5) and (8) together prove the first two parts of the theorem. Having proven π2 = x∗^T A y∗, and again from the definition of π2 in (4), it follows that x∗ ∈ argmax_{x∈∆1} x^T A y∗. Similarly, from π1 = x∗^T A y∗ and (2), we get y∗ ∈ argmin_{y∈∆2} x∗^T A y. Hence, (x∗, y∗) is a NE by definition, proving part 3 of the theorem.

Linear Programming Formulation (in zero-sum games)

Suppose the players are playing to optimize their worst-case payoffs as in the previous section. From P2's perspective, fixing its strategy to y ∈ ∆2, P1's best payoff is max_{i∈S1} (Ay)i = vy (say). Hence, to minimize this, P2 wants to solve for min_{y∈∆2} vy, or equivalently, this linear program LP:

  min v
  s.t. v ≥ (Ay)i ∀i ∈ S1,   (1)
       Σ_{j∈S2} yj = 1,      (2)
       yj ≥ 0 ∀j ∈ S2.       (3)
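As a sketch of how this LP can be solved in practice (assuming scipy, which the notes do not mention), here is the program above instantiated for matching pennies, with decision variables z = (v, y1, y2):

```python
import numpy as np
from scipy.optimize import linprog  # assumption: scipy is available

# LP for matching pennies, A = [[1, -1], [-1, 1]].
# min v  s.t.  (A y)_i - v <= 0 for each row i,  y1 + y2 = 1,  y >= 0.
A = np.array([[1.0, -1.0], [-1.0, 1.0]])
m, n = A.shape
c = np.r_[1.0, np.zeros(n)]                    # objective: minimize v
A_ub = np.hstack([-np.ones((m, 1)), A])        # encodes A y - v <= 0
b_ub = np.zeros(m)
A_eq = np.r_[0.0, np.ones(n)].reshape(1, -1)   # y sums to 1
b_eq = [1.0]
bounds = [(None, None)] + [(0, None)] * n      # v free, y >= 0
res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq, bounds=bounds)
print(res.x)  # game value res.x[0] is 0, y* = (0.5, 0.5)
```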


The constraints (2) and (3) ensure that y ∈ ∆2. Letting the dual variables corresponding to the inequalities in (1) be the xi's, and the dual variable corresponding to (2) be w, the dual DLP of the linear program above can be written as:

  max w
  s.t. w ≤ (x^T A)j ∀j ∈ S2,   (4)
       Σ_{i∈S1} xi = 1,         (5)
       xi ≥ 0 ∀i ∈ S1.          (6)

Then, it is easy to see that DLP is equivalent to solving for max_{x∈∆1} wx, where wx = min_{j∈S2} (x^T A)j, and the constraints (5) and (6) ensure that x ∈ ∆1. Thus, this is precisely what P1 wants to do to maximize its worst-case payoff. Consequently, we have the following theorem:

Theorem 3. The solution of LP gives y∗, and that of DLP gives x∗. Further, the following facts follow from properties of linear programming solutions:

  • The set of Nash equilibria of a zero-sum game is convex.
  • Computing an equilibrium can be done in polynomial time.
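Computing an equilibrium this way can be illustrated numerically. The following sketch (again assuming scipy; the example matrix A is an arbitrary choice, not from the notes) solves both LP and DLP for a small zero-sum game and confirms that the two optimal values coincide, i.e., π1 = π2, as LP duality and Theorem 2 guarantee:

```python
import numpy as np
from scipy.optimize import linprog  # assumption: scipy is available

A = np.array([[3.0, -1.0], [-2.0, 2.0]])   # an arbitrary example game
m, n = A.shape

# LP (P2's side): variables (v, y); min v s.t. A y - v <= 0, sum y = 1, y >= 0.
lp = linprog(np.r_[1.0, np.zeros(n)],
             A_ub=np.hstack([-np.ones((m, 1)), A]), b_ub=np.zeros(m),
             A_eq=[np.r_[0.0, np.ones(n)]], b_eq=[1.0],
             bounds=[(None, None)] + [(0, None)] * n)

# DLP (P1's side): variables (w, x); max w s.t. w - (x^T A)_j <= 0,
# sum x = 1, x >= 0. Maximizing w = minimizing -w.
dlp = linprog(np.r_[-1.0, np.zeros(m)],
              A_ub=np.hstack([np.ones((n, 1)), -A.T]), b_ub=np.zeros(n),
              A_eq=[np.r_[0.0, np.ones(m)]], b_eq=[1.0],
              bounds=[(None, None)] + [(0, None)] * m)

# lp.x[1:] is y*, dlp.x[1:] is x*; the optimal values agree.
print(lp.fun, -dlp.fun)  # both equal the game value, pi1 = pi2
```

Here both programs are solved in polynomial time by a generic LP solver, and (x∗, y∗) read off from their solutions is a NE of the game by Theorem 2.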
