EC487 Advanced Microeconomics, Part I: Lecture 10 Leonardo Felli - - PowerPoint PPT Presentation

ec487 advanced microeconomics part i lecture 10
SMART_READER_LITE
LIVE PREVIEW

EC487 Advanced Microeconomics, Part I: Lecture 10 Leonardo Felli - - PowerPoint PPT Presentation

EC487 Advanced Microeconomics, Part I: Lecture 10 Leonardo Felli 32L.LG.04 1 December 2017 Repeated Games This is the class of dynamic games which is best understood in game theory. Players face in each period the same normal form


slide-1
SLIDE 1

EC487 Advanced Microeconomics, Part I: Lecture 10

Leonardo Felli

32L.LG.04

1 December 2017

slide-2
SLIDE 2

Repeated Games

◮ This is the class of dynamic games which is best understood

in game theory.

◮ Players face in each period the same normal form stage game. ◮ Players’ payoffs are a weighted discounted average of the

payoffs players receive in every stage game.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 2 / 66

slide-3
SLIDE 3

Repeated Games (cont’d)

Main point of the analysis:

◮ players’ overall payoffs depend on the present and the future

stage game payoffs,

◮ it is possible that the threat of a lower future payoff may

induce a player at present to choose a strategy different from the stage game best reply.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 3 / 66

slide-4
SLIDE 4

Example: the repeated prisoner dilemma

◮ Stage game:

1\2 C D C 1, 1 −1, 2 D 2, −1 0, 0

◮ Per period payoff depends on current action: gi(at) . ◮ Players’ common discount factor δ. ◮ It is convenient to label the first period t = 0.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 4 / 66

slide-5
SLIDE 5

Repeated Prisoner Dilemma (cont’d)

◮ Since we are going to compare the equilibrium payoffs for

different time horizons we need to re-normalize the payoffs so that they are comparable.

◮ The average discounted payoff for a T-periods game is:

Π = 1 − δ 1 − δT

T−1

  • t=0

δtgi(at)

◮ Clearly if gi(at) = 1

Π = 1 − δ 1 − δT

T−1

  • t=0

δt = 1 − δ 1 − δT 1 − δT 1 − δ

  • = 1

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 5 / 66

slide-6
SLIDE 6

Finitely Repeated Prisoner Dilemma

◮ Assume first that the prisoners’ dilemma game is repeated a

finite number of times.

◮ Nash equilibrium payoffs of the stage game: (0, 0). ◮ Subgame Perfect equilibrium strategies: each player chooses

action D independently of the period and the action the other player chose in the past. 1\2 C D C 1, 1 −1, 2 D 2, −1 0, 0 Proof: backward induction.

◮ Subgame Perfection seems to prevent any gain from repeated,

but finite interaction, but...

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 6 / 66

slide-7
SLIDE 7

Finitely Repeated Game

◮ Consider a different finitely repeated game. ◮ Stage game:

L C R T 1, 1 5, 0 0, 0 M 0, 5 4, 4 0, 0 B 0, 0 0, 0 3, 3

◮ Nash equilibria of the stage game: (T, L) and (B, R).

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 7 / 66

slide-8
SLIDE 8

Finitely Repeated Game (cont’d)

Assume the game is played twice and consider the following strategies: Player 1:

◮ play M in the first period; ◮ in the second period play B if the observed outcome is (M, C); ◮ in the second period play T if the observed outcome is not

(M, C); Player 2:

◮ play C in the first period; ◮ in the second period play R if the observed outcome is (M, C); ◮ in the second period play L if the observed outcome is not

(M, C);

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 8 / 66

slide-9
SLIDE 9

Finitely Repeated Game (cont’d)

Proposition

If δ ≥ 1

2 then these strategies are a subgame perfect equilibrium of

the game. L C R T 1, 1 5, 0 0, 0 M 0, 5 4, 4 0, 0 B 0, 0 0, 0 3, 3 Proof: Backward induction: in the last period the strategies prescribe a Nash equilibrium. In the first period both player 1 and player 2 conform to the strategies if and only if: 1 − δ 1 − δ2

  • [4 + δ 3] = 4 + δ 3

1 + δ ≥ 1 − δ 1 − δ2

  • [5 + δ] = 5 + δ

1 + δ The inequality is satisfied for δ ≥ 1

2.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 9 / 66

slide-10
SLIDE 10

Infinitely Repeated Prisoner Dilemma

◮ Consider now the the infinitely repeated prisoner dilemma:

T = +∞.

◮ Stage game:

1\2 C D C 1, 1 −1, 2 D 2, −1 0, 0

Proposition

Both player choosing strategy D in every period is an SPE of the repeated game.

◮ Proof: by one deviation principle. Notice that an infinitely

repeated game is continuous at infinity.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 10 / 66

slide-11
SLIDE 11

Infinitely Repeated Prisoner Dilemma (cont’d)

Proposition

The (D, D) equilibrium is the only equilibrium if we restrict players’ strategies to be history independent.

Proposition

If δ ≥ 1

2 then the following strategy profile (σA, σB) is a SPE of

the repeated game:

◮ Player i chooses C in the first period. ◮ Player i continues to choose C as long as no player has

chosen D in any previous period.

◮ Player i will choose D if a player has chosen D in the past

(for the rest of the game).

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 11 / 66

slide-12
SLIDE 12

Infinitely Repeated Prisoner Dilemma (cont’d)

Proof: If a player i conforms to the prescribed strategies the payoff is 1. If a player deviates in one period and conforms to the prescribed strategy from there on (one deviation principle) the continuation payoff is: (1 − δ)(2 + 0 + . . .) = (1 − δ) 2 If δ ≥ 1

2 then

1 ≥ (1 − δ) 2.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 12 / 66

slide-13
SLIDE 13

Infinitely Repeated Prisoner Dilemma (cont’d)

We still need to check that in the subgame in which both players are choosing D neither player wants to deviate. However, choosing D in every period is a SPE of the entire game hence it is a SPE of the (punishment) subgames. Notice that using this type of strategies not only choosing (C, C) in every period is a SPE outcome, a big number of other SPE

  • utcomes are also achievable.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 13 / 66

slide-14
SLIDE 14

Infinitely Repeated Prisoner Dilemma (cont’d)

Indeed there exists a Folk Theorem.

✻ ✲ q q ❍❍❍❍❍❍❍❍ ❆ ❆ ❆ ❆ ❆ ❆ ❆ ❆ ❆ ❍ ❍ ❍ ❍ ❍ ❍ ❍ ❍ ❆ ❆ ❆ ❆ ❆ ❆ ❆ ❆ ❆

(1, 1) (0, 0) (−1, 2) (2, −1) Π2 Π1

❆ ❆ ❆ ❆ ❆ ❍ ❍ ❍ ❍ q q

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 14 / 66

slide-15
SLIDE 15

General repeated normal form game

Definition

Let G be a given stage game: a normal form game G =

  • N, Ai, gi(at)
  • Definition

Let G ∞ be the infinitely repeated game associated with the stage game above: G ∞ = {N, H, P, Ui(σ)} such that:

◮ H = ∞ t=0 At where A0 = ∅; ◮ P(h) = N for every h ∈ H − Z;

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 15 / 66

slide-16
SLIDE 16

Repeated Normal form Game (cont’d)

◮ The payoffs for the game G ∞ in the case δ < 1 are:

Ui(σ) = (1 − δ)

  • t=0

δtgi(σt(ht))

◮ Denote ht the history known to the players at the beginning

  • f period t: ht = {a0, a1, . . . , at−1}.

◮ Let Ht = At−1 to be the space of all possible period t

histories.

◮ A pure strategy for player i ∈ {1, 2} in the game G ∞ is then

the infinite sequence of mappings: {st

i }∞ t=0 such that

st

i : Ht → Ai.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 16 / 66

slide-17
SLIDE 17

Repeated Normal form Game (cont’d)

In general we will allow players to mix in every possible stage game: ∆i(Ai) set of probability distributions on Ai. A behavioral mixed strategy in this environment is instead an infinite sequence of mappings: {σt

i }∞ t=0 such that

σt

i : Ht → ∆(Ai).

Notice that mixed strategies cannot depend on past mixed strategies by the opponents but only on their realizations. The payoffs for the game G ∞ in the case δ < 1 are: Ui = Eσ(1 − δ)

  • t=0

δtgi(σt(ht))

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 17 / 66

slide-18
SLIDE 18

Repeated Normal form Game (cont’d)

◮ Notice that Eσ(·) is the expectation with respect to the

distribution over the infinite histories generated by the profile

  • f mixed behavioral strategies {σt

i }∞ t=0. ◮ Notice that this specification of payoffs allows us to

reinterpret the discount factor δ as:

◮ the probability that the game will be played in the following

period, where these probabilities are assumed to be independent across periods.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 18 / 66

slide-19
SLIDE 19

Repeated Normal form Game (cont’d)

We allow the players to coordinate their strategies through the use

  • f a public randomizing device whose realization in period t is ωt .

Therefore a period t history for player i is: ht = {a0, . . . , at−1; ω0, . . . , ωt}.

Proposition

If α∗ is a NE strategy profile for the stage game G, then the strategy: “each player i plays α∗

i independently of the history of

play” are a NE and a SPE of the infinitely repeated game G ∞(δ).

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 19 / 66

slide-20
SLIDE 20

Repeated Normal form Game (cont’d)

◮ The proof that the strategies above are a Subgame Perfect

equilibrium of the game G ∞(δ) is easily obtained by using

  • ne-deviation-only principle.

◮ In any given period consider the deviation in the immediate

period and then let the players continue playing the equilibrium strategies.

◮ Then any deviation cannot be profitable: it is a deviation from

the Nash equilibrium action of stage game G.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 20 / 66

slide-21
SLIDE 21

Repeated Normal form Game (cont’d)

Assume now that the stage game G has n NE

  • αj,∗n

j=1.

Proposition

Then, for any map j(t) from time periods into an index of the NE

  • αj,∗n

j=1, the strategies:

“each player i plays αj(t),∗

i

in period t” are a SPE of the game G ∞(δ). These SPE strategies are history independent. Therefore each player’s best response in every period t is to play the stage game best response in t: today’s decision does not affect the future play.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 21 / 66

slide-22
SLIDE 22

Repeated Normal form Game (cont’d)

◮ In other words, playing repeatedly the stage game G does not

reduce the set of equilibrium payoffs.

◮ To be able to move to the Folk Theorem we first need to

define an area known as the set of feasible and individually rational payoffs.

◮ Consider as an example the following battle of sexes game G:

1\2 B F B 1, 2 0, 0 F 0, 0 2, 1

◮ Assume this game is repeated an infinite number of times

G ∞(δ).

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 22 / 66

slide-23
SLIDE 23

Repeated Normal form Game (cont’d)

◮ We first need to define the set of feasible payoffs of G ∞. ◮ Recall that when choosing actions in every period players can

coordinate using a public randomizing device.

◮ This implies that: every payoff associated with a pure strategy

profile a can be achieved: (1, 2), (0, 0), (2, 1).

◮ It also implies that every payoff generated by any linear and

convex combination of the pure strategy profile can be achieved.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 23 / 66

slide-24
SLIDE 24

Repeated Normal form Game (cont’d)

◮ In general these payoffs are all in the convex hull of the

payoffs associated with the pure strategy profiles.

◮ This is the smallest convex set that includes the payoffs

associated with the pure strategy profiles.

◮ Formally:

V = convex hull {π | πi = gi(a) ∀a ∈ A}

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 24 / 66

slide-25
SLIDE 25

Repeated Normal form Game (cont’d)

Graphically:

✻ ✲ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁❅ ❅ ❅ ❅ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ q q q

(0, 0) (1, 2) (2, 1) π2 π1 V

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 25 / 66

slide-26
SLIDE 26

Repeated Normal form Game (cont’d)

◮ We now define the set of individually rational payoffs of G ∞. ◮ We first need to define the minmaxing payoff of each player. ◮ The minmaxing payoff to player 1 is the lowest payoff that

player 2 can impose on player 1.

◮ Given that player 1 is rational this is the lowest payoff among

the ones that are player 1’s best reply to player 2 strategies.

◮ This payoff is a best reply for player 1 since he is trying to

achieve the best for himself, given his rationality.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 26 / 66

slide-27
SLIDE 27

Repeated Normal form Game (cont’d)

◮ In other words, among the best reply payoffs for player 1,

player 2 chooses her strategy that minimizes these payoffs.

◮ In the battle of sexes game:

1\2 B F B 1, 2 0, 0 F 0, 0 2, 1

◮ the minmax payoff for player 1 is π1 = 1. ◮ the minmax payoff for player 2 is π2 = 1.

◮ In general:

πi = min

α−i

  • max

αi gi(αi, α−i)

  • Leonardo Felli (LSE)

EC487 Advanced Microeconomics, Part II 1 December 2017 27 / 66

slide-28
SLIDE 28

Repeated Normal form Game (cont’d)

◮ Denote mi −i the profile of minimax strategies for players −i if

they minmax player i.

◮ This is the lowest payoff player i’s opponents can hold player i

to by choice of α−i.

Definition

A payoff πi for player i is individually rational if and only if: πi ≥ πi = min

α−i

  • max

αi gi(αi, α−i)

  • Leonardo Felli (LSE)

EC487 Advanced Microeconomics, Part II 1 December 2017 28 / 66

slide-29
SLIDE 29

Repeated Normal form Game (cont’d)

Definition

We define the set of individual rational payoffs to be the set of payoffs that give to each player a payoff I = {(πi, π−i) | πi ≥ πi} The relevant set for us is the set of feasible and individually rational payoffs: V = I ∩ V

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 29 / 66

slide-30
SLIDE 30

Repeated Normal form Game (16)

The region of feasible and individually rational payoffs V:

✻ ✲ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁❅ ❅ ❅ ❅ ❅ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ q

(0, 0) (1, 2) (2, 1) π2 π1

q ❅ ❅ ❅ ❅ ❅ q q q q

π2 π1

(1, 1) V

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 30 / 66

slide-31
SLIDE 31

Repeated Normal form Game (cont’d)

◮ Consider the following game:

1\2 L R U −2, 2 1, −2 M 1, −2 −1, 2 D 0, 1 0, 1

◮ Restricting attention to pure strategies then we obtain:

◮ m2

1 = D and π2 = 1;

◮ m1

2 ∈ {L, R} and π1 = 1 .

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 31 / 66

slide-32
SLIDE 32

Repeated Normal form Game (cont’d)

◮ Consider mixed strategies: assume that player 2 randomizes

with probability q on L.

◮ Then player 1’s expected payoffs for every possible strategy

choice are: Π1(U, q) = 1 − 3q Π1(M, q) = 2q − 1 Π1(D, q) =

◮ This implies that m1 2 = q ∈

1

3, 1 2

  • and Π1 = 0.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 32 / 66

slide-33
SLIDE 33

Repeated Normal form Game (cont’d)

◮ Assume that player 1 randomizes with probability pU on U

and probability pM on M.

◮ Then player 2’s expected payoffs for every possible strategy

choice are: Π2(pU, pM, L) = 2 (pU − pM) + 1 − pU − pM Π2(pU, pM, R) = 2 (pM − pU) + 1 − pU − pM

◮ This implies that m2 1 = (pU, pM) =

1

2, 1 2

  • and that Π2 = 0.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 33 / 66

slide-34
SLIDE 34

Folk Theorem

◮ Consider a general finite normal form stage game:

G = {N; Ai, gi(a), ∀i ∈ N}

◮ and the dynamic game that consist of the infinitely repeated

play of the game G when players’ discount factor is δ: G ∞(δ).

◮ The payoffs of the infinitely repeated game are:

πi = (1 − δ)

  • t=0

δt gi(ai, a−i)

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 34 / 66

slide-35
SLIDE 35

Subgame Perfect Folk Theorem

Theorem (Subgame Perfect Folk Theorem – Fudenberg and Maskin (1986))

Consider a stage game such that dim(V) = #N, where #N denotes the number of players and V denotes the set of feasible and individually rational payoffs. Then for any v ∈ V such that (vi > πi), there exists a δv such that for every δ ≥ δv there exists a SPE of G ∞(δ) with payoff vector v.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 35 / 66

slide-36
SLIDE 36

Subgame Perfect Folk Theorem (cont’d)

◮ Notice that the extra condition dim(V) = #N is not tight, in

particular the theorem can be proved when dim(V) = #N − 1.

◮ The dimensionality assumption is for example satisfied in the

following example.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 36 / 66

slide-37
SLIDE 37

Subgame Perfect Folk Theorem: Example

◮ Let G be the following finite normal form game:

L R U 2, 1 0, 2 D 0, 0 −1, −1

◮ Consider the dynamic game G ∞(δ). ◮ The minmax payoff for both players in pure and mixed

strategies are: π1 = 0 π2 = 0.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 37 / 66

slide-38
SLIDE 38

Subgame Perfect Folk Theorem: Example (cont’d)

◮ The set V satisfies the dimensionality assumption dim(V) = 2:

✻ ✲

(0, 2) (0, 0) (−1, −1) (2, 1) π2 π1 Π2 Π1 V

❍❍❍❍❍❍❍❍ q ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✑✑✑✑✑✑ ❍ ❍ ❍ ❍ ❍ ❍ ❍ ❍ q ❍ ❍ ❍ ❍ ❍ ❍ ❍ ❍ q q

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 38 / 66

slide-39
SLIDE 39

Subgame Perfect Folk Theorem: Proof

Proof:

◮ For simplicity we focus on the case in which there exists a

pure action profile a such that g(a) = v.

◮ Assume first that the minmax action profile mi −i for every

i ∈ N is also a pure strategy so that any deviation from minmax behavior is easy to detect.

◮ Choose v′ ∈ int(V) — recall that (vi > πi) — such that:

πi < v′

i < vi

∀i ∈ N

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 39 / 66

slide-40
SLIDE 40

Subgame Perfect Folk Theorem: Proof (cont’d)

◮ Choose also an ε > 0 and a

v′(i) = (v′

1 + ε, . . . , v′ i−1 + ε, v′ i , v′ i+1 + ε, . . . , v′ I + ε)

such that: v′(i) ∈ V ∀i ∈ N.

◮ Notice that the role of the full-dimensionality assumption is to

assure that there exists a v′(i) for all i and for some ε and v′.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 40 / 66

slide-41
SLIDE 41

Subgame Perfect Folk Theorem: Proof (cont’d)

◮ Once again for simplicity assume that for every i ∈ N there

exists an action profile a(i) such that g(a(i)) = v′(i).

◮ Further denote wj i = gi(mj) player i’s payoff when minmaxing

player j.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 41 / 66

slide-42
SLIDE 42

Subgame Perfect Folk Theorem: Proof (cont’d)

◮ Finally, choose n such that

max

a

gi(a) + nπi < min

a gi(a) + nv′ i

  • r

n > max

a

gi(a) − min

a gi(a)

v′

i − πi

.

◮ Clearly there exists an n satisfying the condition above being

the numerator bounded above and the denominator bounded below.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 42 / 66

slide-43
SLIDE 43

Subgame Perfect Folk Theorem: Proof (cont’d)

◮ We label n the length of a punishment. ◮ To understand the condition above notice that for δ close to 1:

(1 − δn) ≃ (1 − δ)n.

◮ Consider now the following strategy profile.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 43 / 66

slide-44
SLIDE 44

Subgame Perfect Folk Theorem: Proof (cont’d)

  • 1. The play starts in Phase I.

Phase I: play the action profile a, (g(a) = v).

  • 2. The play remains in Phase I so long as in each period:

◮ either the realized action is a ◮ or the realized action differs from a in two or more

components.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 44 / 66

slide-45
SLIDE 45

Subgame Perfect Folk Theorem: Proof (cont’d)

  • 3. If a single player j deviates from a then the play moves to

Phase IIj. Phase IIj: play mj each period.

  • 4. The play stays in Phase IIj for n periods so long as in each

period:

◮ either the realized action is mj ◮ or the realized action differs from mj in two or more

components.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 45 / 66

slide-46
SLIDE 46

Subgame Perfect Folk Theorem: Proof (cont’d)

  • 5. After n subsequent periods in Phase IIj the play switches to

Phase IIIj. Phase IIIj: play a(j).

  • 6. If during Phase IIj a single player i’s action differs from mj

i

begin Phase IIi.

  • 7. The play stays in Phase IIIj so long as in each period:

◮ either the realized action is a(j) ◮ or the realized action differs from a(j) in two or more

components.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 46 / 66

slide-47
SLIDE 47

Subgame Perfect Folk Theorem: Proof (cont’d)

  • 8. If during Phase IIIj a single player i’s action differs from ai(j)

then begin Phase IIi.

◮ Using one-deviation-principle we check now that no player has

an incentive to deviate from the prescribed action in any subgame.

◮ Clearly each phase corresponds to a different type of proper

subgame.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 47 / 66

slide-48
SLIDE 48

Subgame Perfect Folk Theorem: Proof (cont’d)

Consider Phase I.

◮ By conforming player i receives payoff vi while by deviating he

cannot receive a payoff higher than: πi = (1 − δ) max

a

gi(a) + δ

  • (1 − δn) πi + δnv′

i

  • ◮ Since by construction vi > v′

i for δ sufficiently close to 1:

πi < vi.

◮ Notice indeed that if δ = 0 then πi = max a

gi(a) and if δ = 1 then πi = v′

i .

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 48 / 66

slide-49
SLIDE 49

Subgame Perfect Folk Theorem: Proof (cont’d)

Consider Phase IIIj, j = i.

◮ By conforming player i receives payoff v′ i + ε while by

deviating he cannot receive more than: πi = (1 − δ) max

a

gi(a) + δ

  • (1 − δn) πi + δnv′

i

  • ◮ Payoff πi < v′

i + ε for δ sufficiently close to 1.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 49 / 66

slide-50
SLIDE 50

Subgame Perfect Folk Theorem: Proof (cont’d)

Consider Phase IIIi.

◮ By conforming player i receives payoff v′ i while by deviating he

cannot receive more than: πi = (1 − δ) max

a

gi(a) + δ

  • (1 − δn) πi + δnv′

i

  • ◮ Indeed we need:

v′

i > (1 − δ) max a

gi(a) + δ

  • (1 − δn) πi + δnv′

i

  • ◮ That can be re-written as:

(1 − δn+1)v′

i > (1 − δ) max a

gi(a) + δ (1 − δn) πi

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 50 / 66

slide-51
SLIDE 51

Subgame Perfect Folk Theorem: Proof (cont’d)

◮ Using the approximation (1 − δn) ≃ (1 − δ)n we get:

(n + 1)v′

i > max a

gi(a) + δnπi

◮ Since v′ i > mina gi(a) and δ < 1 the following is a sufficient

condition for the inequality above: min

a gi(a) + nv′ i > max a

gi(a) + nπi

◮ Clearly from the definition of n for δ sufficiently close to 1

v′

i > πi

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 51 / 66

slide-52
SLIDE 52

Subgame Perfect Folk Theorem: Proof (cont’d)

Consider Phase IIj, j = i.

◮ If n′ periods remaining in Phase IIj player i’s payoff by

conforming is ui =

  • 1 − δn′

wj

i + δn′(v′ i + ε) ◮ while by deviating he cannot obtain more than:

πi = (1 − δ) max

a

gi(a) + δ

  • (1 − δn) vi + δnv′

i

  • ◮ Notice that for δ sufficiently close to 1

ui > πi

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 52 / 66

slide-53
SLIDE 53

Subgame Perfect Folk Theorem: Proof (cont’d)

Finally consider Phase IIi.

◮ If n′ < n periods remain in Phase IIi player i’s payoff by

conforming is u′

i =

  • 1 − δn′

πi + δn′v′

i ◮ while by deviating:

π′

i = (1 − δn) πi + δnv′ i ◮ Clearly u′ i > π′ i.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 53 / 66

slide-54
SLIDE 54

Application of Repeated Games: Cartels

◮ Consider two firms repeatedly involved in a Cournot Duopoly

for an infinite number of periods.

◮ Both firms produce a perfectly homogeneous good with cost

functions: c(qi) = c qi ∀i ∈ {1, 2}.

◮ and inverse demand function:

P(q1 + q2) = a − (q1 + q2) where c < a.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 54 / 66

slide-55
SLIDE 55

Application of Repeated Games: Cartels (cont’d)

◮ The two firms’ profit functions are:

Π1(q1, q2) = q1 [a − (q1 + q2) − c] Π2(q1, q2) = q2 [a − (q1 + q2) − c]

◮ The stage game equilibrium choices (q1, q2) are:

max

q1∈R+ q1 [a − (q1 + q2) − c]

max

q2∈R+ q2 [a − (q1 + q2) − c] ◮ which is the solution to the following problem:

qc

1 = 1

2 (a − qc

2 − c)

qc

2 = 1

2 (a − qc

1 − c) .

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 55 / 66

slide-56
SLIDE 56

Application of Repeated Games: Cartels (cont’d)

◮ This solution is:

qc

1 = qc 2 = (a − c)

3

◮ with profits:

πc

1 = πc 2 = (a − c)2

9

◮ Consider now a single firm that is a monopolist in this market

and produces a quantity Q.

◮ This firm profit maximization problem is:

max

Q∈R+ Q [a − Q − c]

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 56 / 66

slide-57
SLIDE 57

Application of Repeated Games: Cartels (cont’d)

◮ The first order conditions are then:

a − 2Q − c = 0

◮ or the monopolist quantity:

Qm = (a − c) 2 , Πm = (a − c)2 4

◮ Assume now that the two firms, without any explicit deal,

decide each to produce half of the monopolist quantity: qm = 1 2 Qm = (a − c) 4

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 57 / 66

slide-58
SLIDE 58

Application of Repeated Games: Cartels (cont’d)

◮ Each firm’s profit in this case is:

πm

1 = πm 2 = (a − c)2

8

◮ Notice that clearly:

πc

i = (a − c)2

9 < πm

i

= (a − c)2 8

◮ The quantity qm does dominate qc i for both firms. ◮ However, if one of the firm, say firm 1, produces quantity

qm

1 = (a − c)

4 then firm 2 can gain by choosing a different quantity.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 58 / 66

slide-59
SLIDE 59

Application of Repeated Games: Cartels (cont’d)

◮ In particular, if firm 2 chooses the quantity:

¯ q2 = (a − qm − c) 2 = 3 (a − c) 8

◮ Then firm 2’s profit is:

¯ π2 = 9 (a − c)2 64

◮ which clearly is:

¯ π2 = 9 (a − c)2 64 > πm = (a − c)2 8

◮ This is the reason why for both firms to choose (qm 1 , qm 2 ) is

not a Nash equilibrium of the Cournot model.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 59 / 66

slide-60
SLIDE 60

Application of Repeated Games: Cartels (cont’d)

◮ Assume however that the two firms compete for an infinite

number of periods.

◮ Consider the following strategies: ◮ Firm 1:

◮ choose quantity qm

1 in the first period;

◮ in every subsequent period choose quantity qm

1 if the observed

  • utcome in the previous period is (qm

1 , qm 2 );

◮ in every subsequent period choose quantity qc

1 if in the

previous period you observe that firm 2 chose quantity ¯ q2;

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 60 / 66

slide-61
SLIDE 61

Application of Repeated Games: Cartels (cont’d)

◮ Firm 2:

◮ choose quantity qm

2 in the first period;

◮ in every subsequent period choose quantity qm

2 if the observed

  • utcome in the previous period is (qm

1 , qm 2 );

◮ in every subsequent period choose quantity qc

2 if in the

previous period you observe that firm 2 chose quantity ¯ q1;

◮ Recall that the average discounted payoff of each firm is:

(1 − δ)

  • t=0

δtπi(t)

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 61 / 66

slide-62
SLIDE 62

Application of Repeated Games: Cartels (cont’d)

◮ These strategies do not require an explicit agreement between

the two firms provided each firm believes the other firm behaves this way.

◮ Question: for which δ neither firm wants to deviate from

these strategies?

◮ Consider firm i:

πm

i

≥ (1 − δ)¯ πi + δπc

i

  • r

(a − c)2 8 ≥ (1 − δ)9 (a − c)2 64 + δ(a − c)2 9

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 62 / 66

slide-63
SLIDE 63

Application of Repeated Games: Cartels (cont’d)

◮ which is satisfied if and only if:

δ ≥ 9 17

◮ Moreover no firm has an incentive to deviate from punishment

strategies since (qc

1, qc 2) is a Nash equilibrium of the Cournot

stage game.

◮ Therefore the cartel behaviour described by the strategies

above is a Subgame Perfect equilibrium of the infinitely repeated game if and only if δ ≥ 9/17.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 63 / 66

slide-64
SLIDE 64

Subgame Perfect Folk Theorem: Comment

◮ Notice that in the theory of repeated games there does not

exists a commonly accepted theory predicting that the player will play an equilibrium whose payoff is on the Pareto-frontier

  • f V.

◮ In other words nothing guarantees that the outcome will be

  • n the Pareto frontier.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 64 / 66

slide-65
SLIDE 65

Subgame Perfect Folk Theorem: Comment (cont’d)

Indeed:

✻ ✲

(0, 2) (0, 0) (−1, −1) (2, 1) π2 π1 Π2 Π1 V

❍❍❍❍❍❍❍❍ q ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✑✑✑✑✑✑ ❍ ❍ ❍ ❍ ❍ ❍ ❍ ❍ q ❍ ❍ ❍ ❍ ❍ ❍ ❍ ❍ q q

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 65 / 66

slide-66
SLIDE 66

Subgame Perfect Folk Theorem: Comment (cont’d)

Overall the Folk Theorem warns us to use caution when arguing that the best way of making predictions in a strategic setting is by using Nash and even Subgame Perfect equilibria.

◮ Office hours in the next two weeks:

Tuesday, Dec. 5 and 12: 11:00-13:00 am

◮ Exam scheduled for:

Thursday, January 4, 2018 at 14:30p.m.

Leonardo Felli (LSE) EC487 Advanced Microeconomics, Part II 1 December 2017 66 / 66