Computing Equilibria Christos H. Papadimitriou UC Berkeley - - PowerPoint PPT Presentation

computing equilibria
SMART_READER_LITE
LIVE PREVIEW

Computing Equilibria Christos H. Papadimitriou UC Berkeley - - PowerPoint PPT Presentation

Computing Equilibria Christos H. Papadimitriou UC Berkeley christos Games 1/3 1/3 1/3 1/3 0, 0 1, -1 -1, 1 zero-sum game Min-max theorem 1/3 -1, 1 0, 0 1, -1 von Neumann 1928: a (probabilistic) 1/3 equilibrium exists


slide-1
SLIDE 1

Computing Equilibria

Christos H. Papadimitriou UC Berkeley “christos”

slide-2
SLIDE 2

NVTI Theory Day, March 14 2008

Games 0, 0

  • 1, 1

1, -1 1, -1 0, 0

  • 1, 1
  • 1, 1

1, -1

0, 0

zero-sum game Min-max theorem von Neumann 1928: “a (probabilistic) equilibrium exists” 1/3 1/3 1/3 1/3 1/3 1/3

slide-3
SLIDE 3

NVTI Theory Day, March 14 2008

Non-zero sum games? 0, 0

  • 1, 1

1, -1 1, -1 0, 0

  • 2, 1
  • 1, 1

2, -1

0, 0

Nash (1951): “An equilibrium still exists”

1/4 1/4 1/2 1/4 1/4 1/2

slide-4
SLIDE 4

NVTI Theory Day, March 14 2008

[First Parenthesis: Why study games?

  • Thought experiments that help us

understand strategic rational behavior

  • Studying them may help us understand the

Internet

  • Deep, rich subject
  • It interacts exquisitely with computation]
slide-5
SLIDE 5

NVTI Theory Day, March 14 2008

How to compute a Nash equilibrium?

  • All known algorithms are exponential
  • By the way, it’s a combinatorial problem:
  • ne has to guess the supports
  • Is it then NP-complete?
  • No, because a solution always exists
slide-6
SLIDE 6

NVTI Theory Day, March 14 2008

…and why bother? [a parenthesis

  • Equilibrium concepts provide some of the most

intriguing specimens of computational problems

  • They are notions of rationality, aspiring models of

behavior

  • Efficient computability is an important modeling

prerequisite “if your laptop can’t find it, then neither can the market…” ]

slide-7
SLIDE 7

NVTI Theory Day, March 14 2008

Complexity of Nash

  • Nash’s existence proof relies on Brouwer’s

fixpoint theorem

  • Finding a Brouwer fixpoint is a hard problem

[HPV91]

  • Not quite NP-complete, but as hard as any

problem that always has an answer can be…

  • Technical term: PPAD-complete [P 1991]
  • But is Nash as hard? Or easier?
slide-8
SLIDE 8

NVTI Theory Day, March 14 2008

What is PPAD?

  • Class of problems that always have a solution
  • They all have the same existence proof

“If a finite directed graph has an unbalanced node, then it must have another

  • ne”
slide-9
SLIDE 9

NVTI Theory Day, March 14 2008

Exponential directed graph with indegree, outdegree < 2

Standard source (given)

?

(there must be a sink…)

slide-10
SLIDE 10

NVTI Theory Day, March 14 2008

The four existence proofs

“if a directed graph has an unbalanced node, then it has another” PPAD “if an undirected graph has an odd-degree node, then it has another” PPA “every dag has a sink” PLS “pigeonhole principle” PPP

slide-11
SLIDE 11

NVTI Theory Day, March 14 2008

Back to Nash

  • For n players with s strategies each, input

has length

nsn

  • Exponenial!
slide-12
SLIDE 12

NVTI Theory Day, March 14 2008

The embarrassing subject

  • f many players

[a parenthesis

  • With games we are supposed to model

markets, auctions, the Internet

  • These have many players
  • Thus they require exponential input!
slide-13
SLIDE 13

NVTI Theory Day, March 14 2008

The embarrassing subject

  • f many players (cont.)
  • These important games cannot require

astronomically long descriptions “if your problem is important, then its input cannot be astronomically long…”

  • Conclusion: Many interesting games are

1. multi-player 2. succinctly representable ]

slide-14
SLIDE 14

NVTI Theory Day, March 14 2008

e.g., Graphical Games

  • [Kearns et al. 2002] Players are vertices of a

graph, each player is affected only by his/her neighbors

  • If degrees are bounded by d, nsd numbers

suffice to describe the game

  • Also: multimatrix, congestion, location,

anonymous, hypergraphical, …

slide-15
SLIDE 15

NVTI Theory Day, March 14 2008

An Easier Problem: Correlated equilibrium [Aumann 73]

0,0 5,1 1,5 4,4

Chicken:

  • Two pure equilibria {me, you}
  • Mixed (½, ½) (½, ½) payoff 5/2
slide-16
SLIDE 16

NVTI Theory Day, March 14 2008

Correlated Equilibrium

  • “Traffic signal”

with payoff 3

  • Compare with mixed

Nash equilibrium

  • Even better

with payoff 3 1/3

½ ½

1/4 1/4 1/4 1/4 1/3 1/3 1/3

Probabilities in a lottery drawn by an impartial

  • utsider, and

announced to each player separately

slide-17
SLIDE 17

NVTI Theory Day, March 14 2008

Correlated equilibrium

  • Self-enforcing distribution on the game states

(“boxes”)

  • Generalizes the Nash equilibrium

= uncorrelated (product) distribution

  • Can be computed and optimized over by linear

programming

slide-18
SLIDE 18

NVTI Theory Day, March 14 2008

Correlated equilibrium (cont)

  • But how about the succinct case?
  • Can be computed in polynomial time by the

“ellipsoid against hope” method [Pa05]

  • As long as the utility expectation problem can

be solved in polynomial time

  • Some cases of succinct games can even be
  • ptimized over [PR05]
slide-19
SLIDE 19

NVTI Theory Day, March 14 2008

Ellipsoid against hope: Details

  • Existence ≡ LP is unbounded
  • LP has exponentially many variables
  • “Solve” DLP by ellipsoid method
  • When done (“no”), the facets used are also an

infeasible RDLP, of polynomial size

  • Solving their dual, RLP, solves LP
slide-20
SLIDE 20

NVTI Theory Day, March 14 2008

So…

  • Nash equilibrium seems difficult (in PPAD,

not known to be complete)

  • Correlated equilibrium is easier
slide-21
SLIDE 21

NVTI Theory Day, March 14 2008

And suddenly, the summer of 2005

Theorem [DGP05]: Computing a Nash equilibrium is PPAD-complete even for 4 players

One key insight: Games that do arithmetic!

slide-22
SLIDE 22

NVTI Theory Day, March 14 2008

“Multiplication is the name of the game and each generation plays the same…”

Bobby Darren, 1961

slide-23
SLIDE 23

NVTI Theory Day, March 14 2008

The multiplication game

x y z = x · y

“affects”

w

if w plays 0, then it gets xy. if it plays 1, then it gets z, but z gets punished z wins when it plays 1 and w plays 0

slide-24
SLIDE 24

NVTI Theory Day, March 14 2008

Reduction Brouwer ⇒ Nash: a very rough sketch

  • Graphical games that do multiplication,

addition, comparison, Boolean operations…

  • Simulate the circuit that computes the

Brouwer function by a huge graphical game

  • “Brittle comparator” problem solved by

averaging

  • Simulate the graphical game by a 4-player

game: 4-color the graph

slide-25
SLIDE 25

NVTI Theory Day, March 14 2008

Recall:

  • Nash’s theorem reduces Nash to

Brouwer

  • This is a reduction in the opposite

direction

slide-26
SLIDE 26

NVTI Theory Day, March 14 2008

Brouwer ≡ Nash

So….

slide-27
SLIDE 27

NVTI Theory Day, March 14 2008

Later that Fall

[DP05, CD05]: 3-player Nash is also PPAD-complete [Chen and Deng]: Even 2-player Nash!

slide-28
SLIDE 28

NVTI Theory Day, March 14 2008

game over?

slide-29
SLIDE 29

NVTI Theory Day, March 14 2008

Approximate Nash

Defecting players can only gain (additive) ε (and utilities are normalized to [0,1], 2 players)

  • ε = .75 [KPS06]
  • ε = .5 [DMP06], cf [FNS06]
  • ε = .3393… [ST07]
  • Open: PTAS [any ε > 0 in time O(n1/ε)]
slide-30
SLIDE 30

NVTI Theory Day, March 14 2008

Anonymous games

  • All players have the same set of strategies
  • Each player has own utility
  • But it depends on how many players [of

each type] play each strategy

  • Very Recent: PTAS [DP07] in the bounded

strategy case (the only succinct one…)

slide-31
SLIDE 31

NVTI Theory Day, March 14 2008

Anonymous games: The idea

  • Each player’s strategy is a number in [0,1]
  • Discretize to multiples of ε
  • Important Lemma: payoffs change by √ε
  • Search exhaustively for an approximate

equilibrium

slide-32
SLIDE 32

NVTI Theory Day, March 14 2008

Finally, repeated games

(0,4) (1,1) (3,3) (4,0) “threat point” “individually rational region”

slide-33
SLIDE 33

NVTI Theory Day, March 14 2008

Nash equilibria?

The Folk Theorem [ca. 1980]: Under very general conditions, any point in the IRR can be implemented as a Nash equilibrium. Indeed [L2005]: For two players, a Nash equilibrium of the repeated game can be computed in polynomial time.

slide-34
SLIDE 34

NVTI Theory Day, March 14 2008

The Myth of the Folk Theorem

Theorem [BCIKPR2007]: For 3 or more players, the threat point is NP-hard to compute Furthermore, finding a Nash equilibrium in a repeated game is PPAD-complete.

slide-35
SLIDE 35

NVTI Theory Day, March 14 2008

So…

  • The Nash equilibrium is intractable
  • Also in repeated games
  • Aproximation? Alternative notions?

Special cases? Intractability of those?

  • Computational results and concepts inform

the game theoretic discourse

slide-36
SLIDE 36

NVTI Theory Day, March 14 2008

Thank You!