Parameterized Two-Player Nash Equilibrium Danny Hermelin, - PowerPoint PPT Presentation

Parameterized Two-Player Nash Equilibrium Danny Hermelin, Chien-Chung Huang, .. Stefan Kratsch, and Magnus Wahlstrom

Bimatrix Game  Played by two players: Row and Column – Two payoff matrices. A,B ∈ Q n × n . 0 1 -2 0 2 0 0 2 2 0 -2 2 1 2 -1 1 1 1 Row chooses i Column chooses j Row payoff A[i,j] = -2 Column payoff B[i,j] = 0

Bimatrix Game  Example: – Rock, paper, scissors: 0 -1 1 0 1 -1 1 0 -1 -1 0 1 -1 1 0 1 -1 0  This example is a zero-sum game: – Row and column payoffs sum up to zero.  General bimatrix games are not necessarily such. – In fact, the interesting cases (to us) are not zero-sum.

Bimatrix Game  Players can play mixed strategies. y – Distribution over rows and columns. 0 0 1 1/2 0 1 -2 0 2 0 x 0 2 2 1/2 0 -2 2 0 1 2 -1 1 1 1 Row chooses distribution x Column chooses distribution y Row expected payoff Column expected payoff x T Ay = 0 x T By = 1

Nash Equilibrium  Neither player can improve their payoff, assuming the other player plays the same. 0 1 -2 0 2 0 0 2 2 0 -2 2 1 2 -1 1 1 1 Not Nash ! Row can improve by switching to row 2.

Nash Equilibrium  Neither player can improve their payoff, assuming the other player plays the same. 0 1 -2 0 2 0 0 2 2 0 -2 2 1 2 -1 1 1 1 Nash ! Theorem (Nash) : Any bimatrix rational game has a mixed equilibrium.

Computing Nash Equilibrium  The Nash Equilibrium (NE) problem: Given a bimatrix rational game, find an equilibrium.  NP-completeness theory does not apply because solution always exists.  PPAD-complete by a series of papers: – Daskalakis, Goldberg, and Papadimitriou [STOC’06,STOC’06]. – Daskalakis and Papadimitriou [ECCC’05] – Chen and Deng [ECCC’05] – Chen and Deng [FOCS’06]  The 3-SAT of algorithmic game theory !

Computing Nash Equilibrium  Support: Set of strategies played with non-zero probability.  When support of both players is known, NE is easy.  Solve LP with the following constraints: x s > 0 ⇒ (Ay) s ≥ (Ay) j for all j ≠ s. ‒ y s > 0 ⇒ (x T B) s ≥ (x T B) j for all j ≠ s ‒

Computing Nash Equilibrium Theorem : NE can be solved in n O(k) time, when the supports of each player are bounded by k. – Can this be improved substantially? – Can we remove k out of the exponent? Theorem (Estivill-Castro, Parsa) : NE cannot be solved in n o(k) time unless FPT=W[1]. GOAL: find interesting special cases that circumvent this

Graph Representation of Bimatrix Games  Bipartite graph on rows and columns 0 1 -2 0 2 0 + ⇒ 0 2 2 0 -2 2 1 1 1 1 2 -1 (i,j) is an edge ⇔ A[i,j] ≠ 0 or B[i,j] ≠ 0

Interesting Special Cases 1. l -sparse games: previously studied Degrees ≤ l . – games 1. k-unbalanced games: One side has ≤ k vertices. – 1. Locally bounded treewidth: Every d-neighborhood has treewidth ≤ f(d). – – Generalizes both previous cases. ≤ l ≤ k (1) (2) (3)

Our Results Theorem : NE in l -sparse games, where the support is bounded by k, can be solved in l O(k l ) n O(1) time. – Without the restriction on the support size the problem is PPAD- complete [Chen, Deng, and Teng ‘06]. Theorem : NE in k-unbalanced games, where the row player’s payoff 2 matrix has l different values, can be solved in l O(k ) n O(1) time. – General k-sparse games is not known to be FPT. – But how do we show its not ? Theorem : NE in locally bounded treewidth games, where the support is bounded by k, and both payoff matrices have l different values, can be solved in f( l, k) n O(1) time for some computable f().

l -Sparse Games  Recall l := max-degree and k:= support size.  Two easy observations: 1. Enough to search for minimal equilibriums. Definition: An equilibrium (x,y) is minimal if for any equilibrium (x’,y’) with S(x’) ⊆ S(x) and S(y’) ⊆ S(y), we have S(x’) = S(x) and S(y’) = S(y). 2. If n > k l , then both players receive non-negative payoffs on any k × k equilibrium. If a player get negative payoff and n > k l , there will always be a zero-payoff strategy to switch to.

l -Sparse Games Definition: The extended support of (x,y) is S(x) ∪ N(S(y)) for the row player, and S(y) ∪ N(S(x)) for the column player. S(y) 1 1 S(x) 1 1 extended 1 1 N(S(y)) support 1 1 of row 1 1 1 1 1 1 1 1 The size of the extended support of each player ≤ k + k l .

l -Sparse Games  Main technical lemma: Lemma: If (x,y) is minimal equilibrium, then the subgraph H ⊆ G induced by the extended supports has at most 2 connected components. Proof sketch: 1. Prove separately for the case where A s(x),s(y) = 0 and B s(x),s(y) = 0 , and for the case when one of these matrices is not all-zero. 2.In the latter case, normalize probabilities on some connected component of H. 3. In the former case, argue the same on G[N(S(x))] and G[N(S(y))].

l -Sparse Games  Folklore FPT lemma: Lemma: Let G be a graph on n vertices of maximum degree ∆ . Then one can enumerate all induced subgraphs H on h vertices and c connected components in H ⊆ G in ∆ O(h) n O(c) time. Proof sketch: 1. Guess c vertices S in G to be the targets of vertices in different connected components of H. 2.Branch on the h-neighborhood of S to enumerate all H ⊆ G. 3.The size of each branch-tree is ∆ O(h) .

l -Sparse Games  The algorithm: 1.Guess the number h of strategies in both extended support. 2.Guess the number of connected components c ∈ {1,2} in the corresponding induced subgraph. 3.Enumerate all induced subgraphs on h vertices and c connected components. 4.For each such subgraph, the supports of both players are known. Thus, one can use LP to determine if it corresponds to an equilbrium.

l -Sparse Games  Extensions: 1.We can improve running-time to l O(k l ) n O(1) in case both payoff matrices are non-negative. 2.Another route to a well-known PTAS. 3.Connectivity lemma can be used to show that the problem has no “polynomial kernel”.

Open questions 1. k-unbalanced games with an arbitrary number of payoffs. 2. Bounded treewidth games with an arbitrary number of payoffs. 3. Parameterized analog of the PPAD class. Conjecture: NE parameterized by k in k-unbalanced games is Para-PPAD-Complete.

Parameterized Two-Player Nash Equilibrium Danny Hermelin, - PowerPoint PPT Presentation

Parameterized Two-Player Nash Equilibrium Danny Hermelin, Chien-Chung Huang, .. Stefan Kratsch, and Magnus Wahlstrom Bimatrix Game Played by two players: Row and Column Two payoff matrices. A,B Q n n . 0 1 -2 0 2 0 0 2 2

Beyond Nash Equilibrium: Solution Concepts for the 21st Century Joe Halpern and many

Nash demand game Julio D avila 2009 Julio D avila Nash demand game Nash demand game

Algorithms for solving two- player normal form games Recall: Nash equilibrium Let A and B be

LABOR MARKET EQUILIBRIUM Competitive Equilibrium I Equilibrium as the intersection of supply and

Finding Nash Equilibria in Certain Classes of 2-Player Game Adrian Vetta McGill University

ARTigo Tag Cluster tags of player 2 player 4 player 1 player 3 1 russian 1 army 1

Nash Equilibrium in Tullock Contests Aidas Masiliunas 1 1 Aix-Marseille School of Economics

COMPUTING A NASH EQUILIBRIUM Who cares?? If centralized, specially designed algorithms cannot

Nash Equilibrium 14.12 Game Theory Muhamet Yildiz 1 Road Map 1. Definition 2. Examples 3.

CS 598 RM : Algorithmic game theory Lecture 1 Two-player games For any two-player game, we have

Parameterized Power Vertex Cover Eric Angel, Evripidis Bampis, Bruno Escoffier, Michael Lampis

Strategies and Nash Equilibrium A strategy is a specification for how to play A Whirlwind Tour of

Equilibrium Refinements Mihai Manea MIT Sequential Equilibrium In many games information is

The Player Agent The Player Agent Are they the most important league official right now? right

Game theory (Ch. 17.5) Find best strategy How do we formally find a Nash equilibrium? If it is

Two-Player Game State Machine 2-Player Game State Diagram 2PG2 2-Player

Inverse Optimization and Equilibrium with Applications in Finance and Statistics Jong-Shi Pang

Networks Mostly adapted from Goodfellows 2016 NIPS tutorial:

On equilibrium problems on the real axis. Applications Ram on Orive Universidad de La Laguna.

90 Years of Computability and Complexity Stathis Zachos National Technical University of Athens

Noise vs Computational unpredictability in dynamics Crist obal Rojas Joint with M. Braverman

Monte Carlo methods for magnetic systems Zoltn Nda Babe-Bolyai University Dept of

Bi-Lipschitz Solutions to the Prescribed Jacobian Inequality in the Plane and Applications to

Energy-momentum tensor correlators in hot Yang-Mills theory Aleksi Vuorinen University of

Sambuz

Useful Links

Newsletter

Mail Us