Strategic Games: a Mini-Course Krzysztof R. Apt CWI, Amsterdam, the - - PowerPoint PPT Presentation

strategic games a mini course
SMART_READER_LITE
LIVE PREVIEW

Strategic Games: a Mini-Course Krzysztof R. Apt CWI, Amsterdam, the - - PowerPoint PPT Presentation

Strategic Games: a Mini-Course Krzysztof R. Apt CWI, Amsterdam, the Netherlands , University of Amsterdam Strategic Games: a Mini-Course p. 1/11 Basic Concepts Strategic Games: a Mini-Course p. 2/11 Overview Best response. Nash


slide-1
SLIDE 1

Strategic Games: a Mini-Course

Krzysztof R. Apt

CWI, Amsterdam, the Netherlands, University of Amsterdam

Strategic Games: a Mini-Course – p. 1/11

slide-2
SLIDE 2

Basic Concepts

Strategic Games: a Mini-Course – p. 2/11

slide-3
SLIDE 3

Overview

Best response. Nash equilibrium. Pareto efficient outcomes. Social welfare. Social optima. Examples.

Strategic Games: a Mini-Course – p. 3/11

slide-4
SLIDE 4

Strategic Games: Definition

Strategic game for n ≥ 2 players: (possibly infinite) set Si of strategies, payoff function pi : S1 × . . . × Sn → R, for each player i. Basic assumptions: players choose their strategies simultaneously, each player is rational: his objective is to maximize his payoff, players have common knowledge of the game and of each

  • thers’ rationality.

Strategic Games: a Mini-Course – p. 4/11

slide-5
SLIDE 5

Three Examples

Prisoner’s Dilemma C D C 2, 2 0, 3 D 3, 0 1, 1 The Battle of the Sexes F B F 2, 1 0, 0 B 0, 0 1, 2 Matching Pennies H T H 1, −1 −1, 1 T −1, 1 1, −1

Strategic Games: a Mini-Course – p. 5/11

slide-6
SLIDE 6

Main Concepts

Notation: si, s′

i ∈ Si, s, s′, (si, s−i) ∈ S1 × . . . × Sn.

si is a best response to s−i if ∀s′

i ∈ Si pi(si, s−i) ≥ pi(s′ i, s−i).

s is a Nash equilibrium if ∀i si is a best response to s−i: ∀i ∈ {1, . . ., n} ∀s′

i ∈ Si pi(si, s−i) ≥ pi(s′ i, s−i).

s is Pareto efficient if for no s′ ∀i ∈ {1, . . ., n} pi(s′) ≥ pi(s), ∃i ∈ {1, . . ., n} pi(s′) > pi(s).

Social welfare of s: n

j=1 pj(s).

s is a social optimum if n

j=1 pj(s) is maximal.

Strategic Games: a Mini-Course – p. 6/11

slide-7
SLIDE 7

Nash Equlibrium

Prisoner’s Dilemma:

1 Nash equilibrium

C D C 2, 2 0, 3 D 3, 0 1, 1 The Battle of the Sexes:

2 Nash equilibria

F B F 2, 1 0, 0 B 0, 0 1, 2 Matching Pennies:

no Nash equlibrium

H T H 1, −1 −1, 1 T −1, 1 1, −1

Strategic Games: a Mini-Course – p. 7/11

slide-8
SLIDE 8

Prisoner’s Dilemma

C D C 2, 2 0, 3 D 3, 0 1, 1

1 Nash equilibrium: (D, D), 3 Pareto efficient outcomes: (C, C), (C, D),(D, C), 1 social optimum: (C, C).

Strategic Games: a Mini-Course – p. 8/11

slide-9
SLIDE 9

Prisoner’s Dilemma in Practice

Strategic Games: a Mini-Course – p. 9/11

slide-10
SLIDE 10

Prisoner’s Dilemma for n Players

n > 1 players,

two strategies: 1 (formerly C), 0 (formerly D).

pi(s) :=

  • 2

j=i sj + 1 if si = 0

2

j=i sj

if si = 1

For n = 2 we get the original Prisoner’s Dilemma game. Let 1 = (1, . . ., 1) and 0 = (0, . . ., 0). 0 is the unique Nash equilibrium, with social welfare n. Social optimum: 1, with social welfare 2n(n − 1).

Strategic Games: a Mini-Course – p. 10/11

slide-11
SLIDE 11

Tragedy of the Commons

Common resources: goods that are are not excludable (people cannot be prevented from using them) but are rival (one person’s use of them diminishes another person’s enjoyment of it). Examples: congested toll-free roads, fish in the ocean, the environment, . . ., Problem: Overuse of such common resources leads to their destruction. This phenomenon is called the tragedy of the commons (Hardin ’81).

Strategic Games: a Mini-Course – p. 11/11

slide-12
SLIDE 12

Tragedy of the Commons I

(Gardner ’95)

n > 1 players,

two strategies: 1 (use the resource), 0 (don’t use), payoff function:

pi(s) := 0.1

if si = 0

F(m)/m

  • therwise

where m = n

j=1 sj and

F(m) := 1.1m − 0.1m2.

Strategic Games: a Mini-Course – p. 12/11

slide-13
SLIDE 13

Tragedy of the Commons I, ctd

payoff function:

pi(s) := 0.1

if si = 0

F(m)/m

  • therwise

where m = n

j=1 sj and F(m) := 1.1m − 0.1m2.

Note: F(m)/m is strictly decreasing,

F(9)/9 = 0.2, F(10)/10 = 0.1, F(11)/11 = 0.

Nash equilibria:

n < 10: all players use the resource, n ≥ 10: 9 or 10 players use the resource,

Social optimum: 5 players use the resource.

Strategic Games: a Mini-Course – p. 13/11

slide-14
SLIDE 14

Tragedy of the Commons II

(Osborne ’04)

n > 1 players,

strategies: [0, 1], payoff function:

pi(s) := si(1 − n

j=1 sj)

if n

j=1 sj ≤ 1

  • therwise

Strategic Games: a Mini-Course – p. 14/11

slide-15
SLIDE 15

Tragedy of the Commons II, ctd

payoff function:

pi(s) := si(1 − n

j=1 sj)

if n

j=1 sj ≤ 1

  • therwise

‘Best’ Nash equilibrium: when each si =

1 n+1,

with social welfare

n (n+1)2 and n j=1 sj = n n+1.

Social optimum, when n

j=1 sj = 1 2,

with social welfare 1

4.

For all n > 1,

n (n+1)2 < 1 4.

limn → ∞

n (n+1)2 = 0 and limn → ∞ n n+1 = 1.

So when n → ∞, social welfare → 0, while resource usage

→ 1.

Strategic Games: a Mini-Course – p. 15/11

slide-16
SLIDE 16

Dominance Notions

Strategic Games: a Mini-Course – p. 16/11

slide-17
SLIDE 17

Strict and Weak Dominance

s′

i is strictly dominated by si if

∀s−i ∈ S−i pi(si, s−i) > pi(s′

i, s−i),

s′

i is weakly dominated by si if

∀s−i ∈ S−i pi(si, s−i) ≥ pi(s′

i, s−i),

∃s−i ∈ S−i pi(si, s−i) > pi(s′

i, s−i).

Strategic Games: a Mini-Course – p. 17/11

slide-18
SLIDE 18

Prisoner’s Dilemma Reviewed

C D C 2, 2 0, 3 D 3, 0 1, 1

Why a dilemma? (Another interpretation.)

(C, C) is a unique social optimum. (D, D) is a unique Nash equilibrium.

For each player C is strictly dominated by D.

Strategic Games: a Mini-Course – p. 18/11

slide-19
SLIDE 19

Quiz

H T E H 1, −1 −1, 1 −1, −1 T −1, 1 1, −1 −1, −1 E −1, −1 −1, −1 −1, −1

What are the Nash equilibria of this game?

Strategic Games: a Mini-Course – p. 19/11

slide-20
SLIDE 20

Answer

H T E H 1, −1 −1, 1 −1, −1 T −1, 1 1, −1 −1, −1 E −1, −1 −1, −1 −1, −1 (E, E) is the only Nash equilibrium.

It is a Nash equilibrium in weakly dominated strategies.

Strategic Games: a Mini-Course – p. 20/11

slide-21
SLIDE 21

IESDS: Example 1

L M R T 3, 0 2, 1 1, 0 C 2, 1 1, 1 1, 0 B 0, 1 0, 1 0, 0 B is strictly dominated by T, R is strictly dominated by M.

By eliminating them we get:

L M T 3, 0 2, 1 C 2, 1 1, 1

Strategic Games: a Mini-Course – p. 21/11

slide-22
SLIDE 22

IESDS, Example 1ctd

L M T 3, 0 2, 1 C 2, 1 1, 1

Now C is strictly dominated by T, so we get:

L M T 3, 0 2, 1

Now L is strictly dominated by M, so we get:

M T 2, 1

We solved the game by IESDS.

Strategic Games: a Mini-Course – p. 22/11

slide-23
SLIDE 23

IESDS

Theorem If G′ is an outcome of IESDS starting from a finite G, then s is a Nash equilibrium of G′ iff it is a Nash equilibrium of G. If G is finite and is solved by IESDS, then the resulting joint strategy is a unique Nash equilibrium of G. (Gilboa, Kalai, Zemel, ’90) Outcome of IESDS is unique (order independence).

Strategic Games: a Mini-Course – p. 23/11

slide-24
SLIDE 24

IESDS: Example

Location game (Hotelling ’29) 2 companies decide simultaneously their location, customers choose the closest vendor. Example: Two bakeries, one (discrete) street. For instance:

3 8

Then baker1(3, 8) = 5, baker2(3, 8) = 6. Where do I put my bakery?

Strategic Games: a Mini-Course – p. 24/11

slide-25
SLIDE 25

Answer

6

Then: baker1(6, 6) = 5.5, baker2(6, 6) = 5.5.

(6, 6) is the outcome of IESDS.

Hence (6, 6) is a unique Nash equilibrium.

Strategic Games: a Mini-Course – p. 25/11

slide-26
SLIDE 26

IEWDS

Theorem If G′ is an outcome of IEWDS starting from a finite G and s is a Nash equilibrium of G′, then s is a Nash equilibrium of G. If G is finite and is solved by IEWDS, then the resulting joint strategy is a Nash equilibrium of G. Outcome of IEWDS does not need to be unique (no order independence).

Strategic Games: a Mini-Course – p. 26/11

slide-27
SLIDE 27

IEWDS: Beauty-contest Game

Example: The 2nd Maldives Mr & Miss Beauty Contest.

Strategic Games: a Mini-Course – p. 27/11

slide-28
SLIDE 28

Beauty-contest Game (ctd)

[Moulin, ’86] each set of strategies = {1, . . ., 100}, payoff to each player: 1 is split equally between the players whose submitted number is closest to 2

3 of the average.

Example submissions: 29, 32, 29; average: 30, payoffs: 1

2, 0, 1 2.

This game is solved by IEWDS. Hence it has a Nash equilibrium, namely (1, . . ., 1).

Strategic Games: a Mini-Course – p. 28/11

slide-29
SLIDE 29

IEWDS: Example 2

The following game has two Nash equilibria:

X Y Z A 2, 1 0, 1 1, 0 B 0, 1 2, 1 1, 0 C 1, 1 1, 0 0, 0 D 1, 0 0, 1 0, 0 D is weakly dominated by A, Z is weakly dominated by X.

By eliminating them we get:

X Y A 2, 1 0, 1 B 0, 1 2, 1 C 1, 1 1, 0

Strategic Games: a Mini-Course – p. 29/11

slide-30
SLIDE 30

Example 2, ctd

X Y A 2, 1 0, 1 B 0, 1 2, 1 C 1, 1 1, 0

Next, we get

X A 2, 1 B 0, 1 C 1, 1

and finally

X A 2, 1

Strategic Games: a Mini-Course – p. 30/11

slide-31
SLIDE 31

IEWDS: Example 3

L R T 1, 1 1, 1 B 1, 1 0, 0

can be reduced both to

L R T 1, 1 1, 1

and to

L T 1, 1 B 1, 1

Strategic Games: a Mini-Course – p. 31/11

slide-32
SLIDE 32

Infinite Games

Consider the game with

Si := N, pi(s) := si.

Here every strategy is strictly dominated, in one step we can eliminate all strategies, all = 0 strategies,

  • ne strategy per player.

Strategic Games: a Mini-Course – p. 32/11

slide-33
SLIDE 33

Infinite Games (2)

Conclusions For infinite games IESDS is not order independent, definition of order independence has to be modified.

Strategic Games: a Mini-Course – p. 33/11

slide-34
SLIDE 34

IENBR: Example 1

X Y A 2, 1 0, 0 B 0, 1 2, 0 C 1, 1 1, 2

No strategy strictly or weakly dominates another one.

C is never a best response.

Eliminating it we get

X Y A 2, 1 0, 0 B 0, 1 2, 0

from which in two steps we get

X A 2, 1

Strategic Games: a Mini-Course – p. 34/11

slide-35
SLIDE 35

IENBR

Theorem If G′ is an outcome of IENBR starting from a finite G, then s is a Nash equilibrium of G′ iff it is a Nash equilibrium of G. If G is finite and is solved by IENBR, then the resulting joint strategy is a unique Nash equilibrium of G. (Apt, ’05) Outcome of IENBR is unique (order independence).

Strategic Games: a Mini-Course – p. 35/11

slide-36
SLIDE 36

IENBR: Example 2

Location game on the open real interval (0, 100).

pi(si, s3−i) :=          si + s3−i − si 2

if si < s3−i

100 − si + si − s3−i 2

if si > s3−i

50

if si = s3−i No strategy strictly or weakly dominates another one. Only 50 is a best response to some strategy (namely 50). So this game is solved by IENBR, in one step.

Strategic Games: a Mini-Course – p. 36/11

slide-37
SLIDE 37

More on Nash Equilibria

Strategic Games: a Mini-Course – p. 37/11

slide-38
SLIDE 38

Overview

Best response dynamics. Potential games. Congestion games. Examples. Price of Stability. Mixed strategies. Nash Theorem.

Strategic Games: a Mini-Course – p. 38/11

slide-39
SLIDE 39

Best Response Dynamics

Consider a game G := (S1, . . ., Sn, p1, . . ., pn). An algorithm to find a Nash equilibrium: choose s ∈ S1 × . . . × Sn; while s is not a NE do choose i ∈ {1, . . ., n} such that

si is not a best response to s−i; si := a best response to s−i

  • d

Strategic Games: a Mini-Course – p. 39/11

slide-40
SLIDE 40

Best Response Dynamics, ctd

Best response dynamics may miss a Nash equilibrium. Example (Shoham and Leyton-Brown ’09)

H T E H 1, −1 −1, 1 −1, −1 T −1, 1 1, −1 −1, −1 E −1, −1 −1, −1 −1, −1

Strategic Games: a Mini-Course – p. 40/11

slide-41
SLIDE 41

Potential Games

(Monderer and Shapley ’96) Consider a game G := (S1, . . ., Sn, p1, . . ., pn). Function P : S1 × . . .Sn → R is a potential function for G if

∀i ∈ {1, . . ., n} ∀s−i ∈ S−i ∀si, s′

i ∈ Si

pi(si, s−i) − pi(s′

i, s−i) = P(si, s−i) − P(s′ i, s−i).

Intuition: P tracks the changes in the payoff when some player deviates. Potential game: a game that has a potential function.

Strategic Games: a Mini-Course – p. 41/11

slide-42
SLIDE 42

Example

Prisoner’s dilemma for n players.

pi(s) :=

  • 2

j=i sj + 1 if si = 0

2

j=i sj

if si = 1

For i = 1, 2

pi(0, s−i) − pi(1, s−i) = 1.

So P(s) := − n

j=1 sj is a potential function.

Strategic Games: a Mini-Course – p. 42/11

slide-43
SLIDE 43

Potential Games, ctd

Theorem (Monderer and Shapley ’96) Every finite potential game has a Nash equilibrium. Proof 1. The games (S1, . . ., Sn, p1, . . ., pn) and (S1, . . ., Sn, P, . . ., P) have the same set of Nash equilibria. Take s for which P reaches maximum. Then s is a Nash equilibrium of (S1, . . ., Sn, P, . . ., P). Proof 2. For finite potential games the best response dynamics terminates.

Strategic Games: a Mini-Course – p. 43/11

slide-44
SLIDE 44

Congestion Games

n > 1 players,

Finite set E of facilities (road segments, primary production factors, . . .), each strategy is a non-empty subset of E, each player has a possibly different set of strategies,

dj : {1, . . ., n} → R is the delay function for using j ∈ E, dj(k) is the delay for using j when there are k users of j, xj(s) := |{r ∈ {1, . . ., n} | j ∈ sr}| is the number of users of

facility j given s,

ci(s) :=

j∈si dj(xj(s)),

(We use here cost functions ci instead of payoff functions pi.)

Strategic Games: a Mini-Course – p. 44/11

slide-45
SLIDE 45

Example

5 drivers. Each driver chooses a road from Katowice to Gliwice, More drivers choose the same road: more delays.

1/2/3 1/4/5 1/5/6 GLIWICE KATOWICE

Strategic Games: a Mini-Course – p. 45/11

slide-46
SLIDE 46

Example as a Congestion Game

5 players, 3 facilities (roads), each strategy: (a singleton set consisting of) a road, cost function:

ci(s) :=                    1 if si = 1 and |{j | sj = 1}| = 1 2 if si = 1 and |{j | sj = 1}| = 2 3 if si = 1 and |{j | sj = 1}| ≥ 3 1 if si = 2 and |{j | sj = 2}| = 1 . . . 6 if si = 3 and |{j | sj = 3}}| ≥ 3

Strategic Games: a Mini-Course – p. 46/11

slide-47
SLIDE 47

Possible Evolution (1)

1/2/3 1/4/5 1/5/6 GLIWICE KATOWICE

Strategic Games: a Mini-Course – p. 47/11

slide-48
SLIDE 48

Possible Evolution (2)

1/2/3 1/4/5 1/5/6 GLIWICE KATOWICE

Strategic Games: a Mini-Course – p. 48/11

slide-49
SLIDE 49

Possible Evolution (3)

1/2/3 1/4/5 1/5/6 GLIWICE KATOWICE

Strategic Games: a Mini-Course – p. 49/11

slide-50
SLIDE 50

Possible Evolution (4)

1/2/3 1/4/5 1/5/6 GLIWICE KATOWICE

We reached a Nash equilibrium using the best response dynamics.

Strategic Games: a Mini-Course – p. 50/11

slide-51
SLIDE 51

Congestion Games, ctd

Theorem (Rosenthal, ’73) Every congestion game is a potential game. Proof.

P(s) :=

  • j∈s1∪. . .∪sn

xj(s)

  • k=1

dj(k),

where (recall) xj(s) = |{r ∈ {1, . . ., n} | j ∈ sr}|, is a potential function. Conclusion Every congestion game has a Nash equilibrium.

Strategic Games: a Mini-Course – p. 51/11

slide-52
SLIDE 52

Another Example

Assumptions: 4000 drivers drive from A to B. Each driver has 2 options (strategies).

T/100 T/100 45 U R B 45 A

Problem: Find a Nash equilibrium (T = number of drivers).

Strategic Games: a Mini-Course – p. 52/11

slide-53
SLIDE 53

Nash Equilibrium

T/100 T/100 45 U R B 45 A

Answer: 2000/2000. Travel time: 2000/100 + 45 = 45 + 2000/100 = 65.

Strategic Games: a Mini-Course – p. 53/11

slide-54
SLIDE 54

Braess Paradox

Add a fast road from U to R. Each driver has now 3 options (strategies): A - U - B, A - R - B, A - U - R - B.

T/100 T/100 45 U R B 45 A

Problem: Find a Nash equilibrium.

Strategic Games: a Mini-Course – p. 54/11

slide-55
SLIDE 55

Nash Equilibrium

T/100 T/100 45 U R B 45 A

Answer: Every driver will choose the road A - U - R - B. Why?: The road A - U - R - B is always a best response.

Strategic Games: a Mini-Course – p. 55/11

slide-56
SLIDE 56

Small Complication

T/100 T/100 45 U R B 45 A

Travel time: 4000/100 + 4000/100 = 80!

Strategic Games: a Mini-Course – p. 56/11

slide-57
SLIDE 57

Does it happen?

from Wikipedia (‘Braess Paradox’): In Seoul, South Korea, a speeding-up in traffic around the city was seen when a motorway was removed as part of the Cheonggyecheon restoration project. In Stuttgart, Germany after investments into the road network in 1969, the traffic situation did not improve until a section of newly-built road was closed for traffic again. In 1990 the closing of 42nd street in New York City reduced the amount of congestion in the area. In 2008 Youn, Gastner and Jeong demonstrated specific routes in Boston, New York City and London where this might actually occur and pointed out roads that could be closed to reduce predicted travel times.

Strategic Games: a Mini-Course – p. 57/11

slide-58
SLIDE 58

Price of Stability

Definition PoS: social welfare of the best Nash equilibrium social welfare of the social optimum Question: What is the price of stability for the congestion games?

Strategic Games: a Mini-Course – p. 58/11

slide-59
SLIDE 59

Example

B x n A

n - (even) number of players. x - number of drivers on the bottom road.

Two Nash equilibria

1/(n − 1), with the social cost n + (n − 1)2. 0/n, with the social cost n2.

Social optimum Take f(x) = x · x + (n − x) · n = x2 − n · x + n2. We want to find a minimum of f.

f′(x) = 2x − n, so f′(x) = 0 if x = n

2.

Strategic Games: a Mini-Course – p. 59/11

slide-60
SLIDE 60

Example

B x n A

Best Nash equilibrium

1/(n − 1), with social cost n + (n − 1)2.

Social optimum

f(x) = x2 − n · x + n2.

Social optimum = f(n

2) = 3 4n2.

PoS = (n + (n − 1)2)/3

4n2 = 4 3 n+(n−1)2 n2

.

limn→∞ PoS = 4

3.

Strategic Games: a Mini-Course – p. 60/11

slide-61
SLIDE 61

Price of Stability

Theorem (Roughgarden and Tárdos, 2002) Suppose delay functions (e.g., T/100) are linear. Then the price of stability for the congestion games is ≤ 4

3.

Strategic Games: a Mini-Course – p. 61/11

slide-62
SLIDE 62

Mixed Extension of a Finite Game

Probability distribution over a finite non-empty set A:

π : A → [0, 1]

such that

a∈A π(a) = 1.

Notation: ∆A. Fix a finite strategic game G := (S1, . . ., Sn, p1, . . ., pn). Mixed strategy of player i in G: mi ∈ ∆Si. Joint mixed strategy: m = (m1, . . ., mn).

Strategic Games: a Mini-Course – p. 62/11

slide-63
SLIDE 63

Mixed Extension of a Finite Game (2)

Mixed extension of G:

(∆S1, . . ., ∆Sn, p1, . . ., pn),

where

m(s) := m1(s1) · . . . · mn(sn)

and

pi(m) :=

  • s∈S

m(s) · pi(s).

Theorem (Nash ’50) Every mixed extension of a finite strategic game has a Nash equilibrium.

Strategic Games: a Mini-Course – p. 63/11

slide-64
SLIDE 64

Kakutani’s Fixed Point Theorem

Theorem (Kakutani ’41) Suppose A is a compact and convex subset of Rn and

Φ : A → P(A)

is such that

Φ(x) is non-empty and convex for all x ∈ A,

for all sequences (xi, yi) converging to (x, y)

yi ∈ Φ(xi) for all i ≥ 0,

implies that

y ∈ Φ(x).

Then x∗ ∈ A exists such that x∗ ∈ Φ(x∗).

Strategic Games: a Mini-Course – p. 64/11

slide-65
SLIDE 65

Proof of Nash Theorem

Fix (S1, . . ., Sn, p1, . . ., pn). Define

besti : Πj=i∆Sj → P(∆Si)

by

besti(m−i) := {m′

i ∈ ∆Si | pi(m′ i, m−i) attains the maximum}.

Then define

best : ∆S1 × . . .∆Sn → P(∆S1 × . . . × ∆Sn)

by

best(m) := best1(m−1) × . . . × best1(m−n).

Note m is a Nash equilibrium iff m ∈ best(m).

best(·) satisfies the conditions of Kakutani’s Theorem.

Strategic Games: a Mini-Course – p. 65/11

slide-66
SLIDE 66

Comments

First special case of Nash theorem: Cournot (1838). Nash theorem generalizes von Neumann’s Minimax Theorem (’28). An alternative proof (also by Nash) uses Brouwer’s Fixed Point Theorem. Search for conditions ensuring existence of Nash equilibrium.

Strategic Games: a Mini-Course – p. 66/11

slide-67
SLIDE 67

2 Examples

Matching Pennies H T H 1, −1 −1, 1 T −1, 1 1, −1 (1

2 · H + 1 2 · T, 1 2 · H + 1 2 · T) is a Nash equilibrium.

The payoff to each player in the Nash equilibrium: 0.

The Battle of the Sexes F B F 2, 1 0, 0 B 0, 0 1, 2 (2/3 · F + 1/3 · B, 1/3 · F + 1/3 · B) is a Nash equilibrium.

The payoff to each player in the Nash equilibrium: 2/3.

Strategic Games: a Mini-Course – p. 67/11

slide-68
SLIDE 68

Mechanism Design

Strategic Games: a Mini-Course – p. 68/11

slide-69
SLIDE 69

Overview

Decision problems. Direct mechanisms. Groves mechanisms. Examples. Optimality results.

Strategic Games: a Mini-Course – p. 69/11

slide-70
SLIDE 70

Intelligent Design

Strategic Games: a Mini-Course – p. 70/11

slide-71
SLIDE 71

Intelligent Design

A theory of an intelligently guided invisible hand wins the Nobel prize WHAT on earth is mechanism design? was the typical reaction to this year’s Nobel prize in economics, announced on October 15th. [...] In fact, despite its dreary name, mechanism design is a hugely important area of economics, and underpins much of what dismal scientists do today. It goes to the heart of one of the biggest challenges in economics: how to arrange

  • ur economic interactions so that, when everyone

behaves in a self-interested manner, the result is something we all like. (The Economist, Oct. 18th, 2007)

Strategic Games: a Mini-Course – p. 71/11

slide-72
SLIDE 72

Decision Problems

Decision problem for n players: set D of decisions, for each player i a set of (private) types Θi and a utility function

vi : D × Θi → R.

Intuitions Type is some private information known only to the player (e.g., player’s valuation of the item for sale),

vi(d, θi) represents the benefit to player i of type θi from

the decision d ∈ D. Assume the individual types are θ1, . . ., θn. Then

n

i=1 vi(d, θi) is the social welfare from d ∈ D.

Strategic Games: a Mini-Course – p. 72/11

slide-73
SLIDE 73

Decision Rules

Decision rule is a function

f : Θ1 × . . . × Θn → D.

Decision rule f is efficient if

n

  • i=1

vi(f(θ), θi) ≥

n

  • i=1

vi(d, θi)

for all θ ∈ Θ and d ∈ D. Intuition f is efficient if it always maximizes the social welfare.

Strategic Games: a Mini-Course – p. 73/11

slide-74
SLIDE 74

Set up

Each player i receives/has a type θi, each player i submits to the central authority a type θ′

i,

the central authority computes decision

d := f(θ′

1, . . ., θ′ n),

and communicates it to each player i. Basic problem How to ensure that θ′

i = θi.

Strategic Games: a Mini-Course – p. 74/11

slide-75
SLIDE 75

Example 1: Sealed-Bid Auction

Set up There is a single object for sale. Each player is a buyer. The decision is taken by means of a sealed-bid auction. The

  • bject is sold to the highest bidder.

D = {1, . . . , n},

each Θi is R+,

vi(d, θi) :=

  • θi if d = i
  • therwise

Let argsmax θ := µi(θi = maxj∈{1,...,n} θj).

f(θ) := argsmax θ.

Note f is efficient. Payments will be treated later.

Strategic Games: a Mini-Course – p. 75/11

slide-76
SLIDE 76

Example 2: Public Project Problem

Each person is asked to report his or her willingness to pay for the project, and the project is undertaken if and only if the aggregate reported willingness to pay exceeds the cost of the project. (15 October 2007, The Royal Swedish Academy of Sciences, Press Release, Scientific Background)

Strategic Games: a Mini-Course – p. 76/11

slide-77
SLIDE 77

Public Project Problem, formally

c: cost of the public project (e.g., building a bridge),

D = {0, 1},

each Θi is R+,

vi(d, θi) := d(θi − c

n),

f(θ) :=

  • 1

if n

i=1 θi ≥ c

  • therwise

Note f is efficient.

Strategic Games: a Mini-Course – p. 77/11

slide-78
SLIDE 78
  • Ex. 3: Reversed Sealed-bid Auction

Set up Each player offers the same service. The decision is taken by means of a sealed-bid auction. The service is purchased from the lowest bidder.

D = {1, . . . , n},

each Θi is R−;

−θi is the price player i offers, vi(d, θi) :=

  • θi if d = i
  • therwise

f(θ) := argsmax θ.

Example f(−8, −5, −4, −6) = 3. That is, given the offers 8, 5, 4, 6, the service is bought from player 3.

Strategic Games: a Mini-Course – p. 78/11

slide-79
SLIDE 79
  • Ex. 4: Buying a Path in a Network

Set up Given a graph G := (V, E).

  • Each edge e ∈ E is owned by player e.
  • Two distinguished vertices s, t ∈ V .
  • Each player e submits the cost θe of using the edge e.
  • The central authority selects the shortest s − t path in G.

D = {p | p is a s − t path in G},

each Θi is R+,

vi(p, θi) :=

  • −θi if i ∈ p
  • therwise

f(θ) := p, where p is the shortest s − t path in G.

Strategic Games: a Mini-Course – p. 79/11

slide-80
SLIDE 80

Manipulations

Example An optimal strategy for player i in public project problem: if θi ≥ c

n submit θ′ i = c.

if θi < c

n submit θ′ i = 0.

For example, assume c = 30. player type A

6

B

7

C

25

Players A and B should submit 0. Player c should submit 30.

Strategic Games: a Mini-Course – p. 80/11

slide-81
SLIDE 81

Revised Set-up: Direct Mechanisms

Each player i receives/has a type θi, each player i submits to the central authority a type θ′

i,

the central authority computes decision

d := f(θ′

1, . . ., θ′ n),

and taxes

(t1, . . ., tn) := g(θ′

1, . . ., θ′ n) ∈ Rn,

and communicates to each player i both d and ti. final utility function for player i:

ui(d, θi) := vi(d, θi) + ti.

Strategic Games: a Mini-Course – p. 81/11

slide-82
SLIDE 82

Direct Mechanisms, ctd

Direct mechanism (f, t) is incentive compatible if for all θ ∈ Θ, i ∈ {1, . . ., n} and θ′

i ∈ Θi

ui((f, t)(θi, θ−i), θi) ≥ ui((f, t)(θ′

i, θ−i), θi).

Intuition Submitting false type (so θ′

i = θi) does not pay off.

Direct mechanism (f, t) is feasible if

n

i=1 ti(θ) ≤ 0 for all θ.

Intuition External financing is never needed.

Strategic Games: a Mini-Course – p. 82/11

slide-83
SLIDE 83

Groves Mechanisms

ti(θ) :=

j=i vj(f(θ), θj) + hi(θ−i), where

hi : Θ−i → R is an arbitrary function.

Note

ui((f, t)(θ), θi) = n

j=1 vj(f(θ), θj) + hi(θ−i).

Intuitions Player i cannot manipulate the value of hi(θ−i). Suppose hi(θ−i) = 0. When the individual types are θ1, . . ., θn

ui((f, t)(θ), θi) is the social welfare from f(θ).

Strategic Games: a Mini-Course – p. 83/11

slide-84
SLIDE 84

Groves Theorem

Theorem (Groves ’73) Suppose f is efficient. Then each Groves mechanism is incentive compatible. Proof. For all θ ∈ Θ, i ∈ {1, . . . , n} and θ′

i ∈ Θi

ui((f, t)(θi, θ−i), θi) =

n

  • j=1

vj(f(θi, θ−i), θj) + hi(θ−i) (f is efficient) ≥

n

  • j=1

vj(f(θ′

i, θ−i), θj) + hi(θ−i)

= ui((f, t)(θ′

i, θ−i), θi).

Strategic Games: a Mini-Course – p. 84/11

slide-85
SLIDE 85

Special Case: Pivotal Mechanism

hi(θ−i) := − maxd∈D

  • j=i vj(d, θj).

Then

ti(θ) :=

  • j=i

vj(f(θ), θj) − max

d∈D

  • j=i

vj(d, θj) ≤ 0.

Note Pivotal mechanism is feasible.

Strategic Games: a Mini-Course – p. 85/11

slide-86
SLIDE 86

Re: Sealed-Bid Auction

Note In the pivotal mechanism

ti(θ) =

  • − maxj=i θj if i = argsmax θ.
  • therwise

So the pivotal mechanism is Vickrey auction (Vickrey ’61): the winner pays the 2nd highest bid.

Strategic Games: a Mini-Course – p. 86/11

slide-87
SLIDE 87

Example

player bid tax to authority util. A

18

B

24 −21 3

C

21

Social welfare: 0 + 0 + 3 = 3.

Strategic Games: a Mini-Course – p. 87/11

slide-88
SLIDE 88

Maximizing Social Welfare

Question: Does Vickrey auction maximize social welfare? Notation θ∗: the reordering of θ is descending order. Example For θ = (1, 4, 2, 3, 1) we have

θ−2 = (1, 2, 3, 0), (θ−2)∗ = (3, 2, 1, 0),

so (θ−2)∗

2 = 2.

Intuition (θ−2)∗

2 is the second highest bid among other bids.

Strategic Games: a Mini-Course – p. 88/11

slide-89
SLIDE 89

Bailey-Cavallo Mechanism

(Bailey ’97, Cavallo ’06) Assume n ≥ 3.

ti(θ) := tp

i (θ) + (θ−i)∗ 2

n

Note Bailey-Cavallo mechanism is a Groves mechanism. Example player bid tax to authority util. why? A

18 7

(= 1/3 of 21) B

24 −2 9

(= 24 − 2 − 7 − 6) C

21 6

(= 1/3 of 18)

Strategic Games: a Mini-Course – p. 89/11

slide-90
SLIDE 90

Bailey-Cavallo Mechanism, ctd

Note Bailey-Cavallo mechanism is feasible.

  • Proof. For all i and θ, (θ−i)∗

2 ≤ θ∗ 2, so n

  • i=1

ti(θ) = −θ∗

2 + n

  • i=1

(θ−i)∗

2

n =

n

  • i=1

−θ∗

2 + (θ−i)∗ 2

n ≤ 0.

Strategic Games: a Mini-Course – p. 90/11

slide-91
SLIDE 91

Re: Public Project Problem

Assume the pivotal mechanism. Examples Suppose c = 30 and n = 3. player type tax

ui

A

6 −4

B

7 −3

C

25 −7 8

Social welfare can be negative. player type tax

ui

A

4 −5 −5

B

3 −6 −6

C

22

Strategic Games: a Mini-Course – p. 91/11

slide-92
SLIDE 92

Formally

Note In the pivotal mechanism

ti(θ) =          if

j=i θj ≥ n−1 n c and n j=1 θj ≥ c

  • j=i θj − n−1

n c if j=i θj < n−1 n c and n j=1 θj ≥ c

if

j=i θj ≤ n−1 n c and n j=1 θj < c n−1 n c − j=i θj if j=i θj > n−1 n c and n j=1 θj < c

This is the mechanism essentially proposed in Clarke ’71).

Strategic Games: a Mini-Course – p. 92/11

slide-93
SLIDE 93

Optimality Result (1)

Theorem (Apt, Conitzer, Guo and Markakis ’08) Consider the sealed bid auction. No tax-based mechanism exists that is feasible, incentive compatible, ‘better’ than Bailey-Cavallo mechanism.

Strategic Games: a Mini-Course – p. 93/11

slide-94
SLIDE 94

Optimality Result (2)

Theorem (Apt, Conitzer, Guo and Markakis ’08) Consider the public project problem. No tax-based mechanism exists that is feasible, incentive compatible, ‘better’ than Clarke’s tax.

Strategic Games: a Mini-Course – p. 94/11

slide-95
SLIDE 95

Proof Steps

  • 1. Limit attention to Groves mechanisms.

(B. Holmström, ’79)

  • 2. Introduce anonymous Groves mechanisms.
  • 3. Pivotal mechanism t is here anonymous.
  • 4. Each Groves mechanism t entails an anonymous Groves

mechanism t′.

  • 5. Lemma

If t is feasible, then so is t′. If t is ‘better’ than t0, then so is t′.

  • 6. Lemma No feasible anonymous Groves mechanism is

‘better’ than the pivotal mechanism t.

Strategic Games: a Mini-Course – p. 95/11

slide-96
SLIDE 96

However . . .

Pivotal mechanism is not optimal in the public project problem when the payments per player can differ. Note: Pivotal mechanism then ceases to be anonymous.

Strategic Games: a Mini-Course – p. 96/11

slide-97
SLIDE 97

Re: Reversed Sealed-Bid Auction

Take

ti(θ) :=

  • j=i

vj(f(θ), θj) − max

d∈D\{i}

  • j=i

vj(d, θj).

Note

ti(θ) =

  • − maxj=i θj if i = argsmax θ.
  • therwise

So in this mechanism the winner receives the amount equal to the 2nd lowest offer. Example Consider Θ = (−8, −5, −4, −6). The service is bought from player 3 who receives for it 5.

Strategic Games: a Mini-Course – p. 97/11

slide-98
SLIDE 98

Re: Buying a Path in a Network

(Nisan, Ronen ’99) Take

ti(θ) :=

  • j=i

vj(f(θ), θj) − max

p∈D(G\{i})

  • j=i

vj(p, θj).

Note

ti(θ) =

  • cost(p2) − cost(p1 − {i}) if i ∈ p1
  • therwise

where

p1 is the shortest s − t path in G(θ), p2 is the shortest s − t path in (G \ {i})(θ−i).

Strategic Games: a Mini-Course – p. 98/11

slide-99
SLIDE 99

Example

Consider the player owning the edge e. To compute the payment he receives determine the shortest s − t path. Its length is 7. It contains e. determine the shortest s − t path that does not include e. Its length is 12. So player e receives 12 − (7 − 4) = 9. His final utility is 9 − 4 = 5.

Strategic Games: a Mini-Course – p. 99/11

slide-100
SLIDE 100

Pre-Bayesian Games

Strategic Games: a Mini-Course – p. 100/11

slide-101
SLIDE 101

Pre-Bayesian Games

(Hyafil, Boutilier ’04, Ashlagi, Monderer, Tennenholtz ’06,) In a strategic game after each player selected his strategy each player knows all the payoffs (complete information). In a pre-Bayesian game after each player selected his strategy each player knows only his payoff (incomplete information). This is achieved by introducing (private) types.

Strategic Games: a Mini-Course – p. 101/11

slide-102
SLIDE 102

Pre-Bayesian Games: Definition

Pre-Bayesian game for n ≥ 2 players: (possibly infinite) set Ai of actions, (possibly infinite) set Θi of (private) types, payoff function

pi : A1 × . . . × An × Θi → R,

for each player i. Basic assumptions: Nature moves first and provides each player i with a θi, players do not know the types received by other players, players choose their actions simultaneously, each player is rational (wants to maximize his payoff), players have common knowledge of the game and of each

  • thers’ rationality.

Strategic Games: a Mini-Course – p. 102/11

slide-103
SLIDE 103

Ex-post Equilibrium

A strategy for player i:

si(·) ∈ AΘi

i .

Joint strategy s(·) is an ex-post equilibrium if each si(·) is a best response to s−i(·):

∀θ ∈ Θ ∀i ∈ {1, . . ., n} ∀s′

i(·) ∈ AΘi i

pi(si(θi), s−i(θ−i), θi) ≥ pi(s′

i(θi), s−i(θ−i), θi).

Note: For each θ ∈ Θ we have one strategic game.

s(·) is an ex-post equilibrium if for each θ ∈ Θ the joint action (s1(θ1), . . ., sn(θn)) is a Nash equilibrium in the θ-game.

Strategic Games: a Mini-Course – p. 103/11

slide-104
SLIDE 104

Quiz

Θ1 = {U, D}, Θ2 = {L, R}, A1 = A2 = {F, B}. U L F B F 2, 1 2, 0 B 0, 1 2, 1 R F B F 2, 0 2, 1 B 0, 0 2, 1 D F B F 3, 1 2, 0 B 5, 1 4, 1 F B F 3, 0 2, 1 B 5, 0 4, 1

Which strategies form an ex-post equilibrium?

Strategic Games: a Mini-Course – p. 104/11

slide-105
SLIDE 105

Answer

Θ1 = {U, D}, Θ2 = {L, R}, A1 = A2 = {F, B}. U L F B F 2, 1 2, 0 B 0, 1 2, 1 R F B F 2, 0 2, 1 B 0, 0 2, 1 D F B F 3, 1 2, 0 B 5, 1 4, 1 F B F 3, 0 2, 1 B 5, 0 4, 1

Strategies

s1(U) = F, s1(D) = B, s2(L) = F, s2(R) = B

form an ex-post equilibrium.

Strategic Games: a Mini-Course – p. 105/11

slide-106
SLIDE 106

But . . .

Ex-post equilibrium does not need to exist in mixed extensions of finite pre-Bayesian games. Example: Mixed extension of the following game.

Θ1 = {U, B}, Θ2 = {L, R}, A1 = A2 = {C, D}. U L C D C 2, 2 0, 0 D 3, 0 1, 1 R C D C 2, 1 0, 0 D 3, 0 1, 2 B C D C 1, 2 3, 0 D 0, 0 2, 1 C D C 1, 1 3, 0 D 0, 0 2, 2

Strategic Games: a Mini-Course – p. 106/11

slide-107
SLIDE 107

Safety-level Equilibrium

Strategy si(·) for player i is a safety-level best response to

s−i(·) if for all strategies s′

i(·) of player i and all θi ∈ Θi

min

θ−i∈Θ−i pi(si(θi), s−i(θ−i), θi) ≥

min

θ−i∈Θ−i pi(s′ i(θi), s−i(θ−i), θi).

Intuition minθ−i∈Θ−i pi(si(θi), s−i(θ−i), θi) is the guaranteed payoff to player i when his type is θi and s(·) are the selected strategies. Joint strategy s(·) is a safety-level equilibrium if each si(·) is a safety-level best response to s−i(·). Theorem (Ashlagi, Monderer, Tennenholtz ’06) Every mixed extension of a finite pre-Bayesian game has a safety-level equilibrium.

Strategic Games: a Mini-Course – p. 107/11

slide-108
SLIDE 108

Relation to Mechanism Design

Strategy si(·) is dominant if for all a ∈ A and θi ∈ Θi

∀a ∈ A pi(si(θi), a−i, θi) ≥ pi(ai, a−i, θi).

A pre-Bayesian game is of a revelation-type if Ai = Θi for all

i ∈ {1, . . ., n}.

So in a revelation-type pre-Bayesian game the strategies of player i are the functions on Θi. A strategy for player i is called truth-telling if it is the identity function πi(·).

Strategic Games: a Mini-Course – p. 108/11

slide-109
SLIDE 109

Relation to Mechanism Design, ctd

Mechanism design (as discussed here) can be viewed as an instance of the revelation-type pre-Bayesian games. With each direct mechanism (f, t) we can associate a revelation-type pre-Bayesian game: Each Θi as in the mechanism, Each Ai = Θi,

pi(θ′

i, θ−i, θi) := ui((f, t)(θ′ i, θ−i), θi).

Note Direct mechanism (f, t) is incentive compatible iff in the associated pre-Bayesian game for each player truth-telling is a dominant strategy. Conclusion In the pre-Bayesian game associated with a Groves mechanism, (π1(·), . . ., πi(·)) is a dominant strategy ex-post equilibrium.

Strategic Games: a Mini-Course – p. 109/11

slide-110
SLIDE 110

References

  • N. Nisan, Introduction to Mechanism Design

(for Computer Scientists), Chapter 9 in: Algorithmic Game Theory, Cambridge University Press, 2007. K.R. Apt, V. Conitzer, M. Guo and E. Markakis, Welfare Undominated Groves Mechanisms,

  • Proc. of the Workshop on Internet & Network Economic

(WINE), 2008.

Strategic Games: a Mini-Course – p. 110/11

slide-111
SLIDE 111

Thank you for your attention

Strategic Games: a Mini-Course – p. 111/11