Conventions and Coalitions in Repeated Games S. Nageeb Ali Ce Liu - - PowerPoint PPT Presentation

conventions and coalitions in repeated games
SMART_READER_LITE
LIVE PREVIEW

Conventions and Coalitions in Repeated Games S. Nageeb Ali Ce Liu - - PowerPoint PPT Presentation

Conventions and Coalitions in Repeated Games S. Nageeb Ali Ce Liu (Will) Penn State UCSD May 29, 2018 Cooperative game theory takes a birds eye view to group decisions. Studies stable static conventions immune to group


slide-1
SLIDE 1

Conventions and Coalitions in Repeated Games

  • S. Nageeb Ali

Ce Liu (Will)

Penn State UCSD

May 29, 2018

slide-2
SLIDE 2

Cooperative game theory takes a “bird’s eye” view to group decisions. Studies stable “static conventions” immune to group deviations. Theory of repeated games studies dynamic ongoing relationships. Studies stable “dynamic conventions” immune to individual deviations.

slide-3
SLIDE 3

Cooperative game theory takes a “bird’s eye” view to group decisions. Studies stable “static conventions” immune to group deviations. Theory of repeated games studies dynamic ongoing relationships. Studies stable “dynamic conventions” immune to individual deviations. Our objective: Develop a theory of repeated games for coalitional behavior.

slide-4
SLIDE 4

the enterprise

Framework nests both noncooperative and cooperative environments. Results speak both to repeated play of a cooperative game as well as coalitional deviations in the repeated play of a non-cooperative game. Solution-concept combines farsightedness and limited commitment.

slide-5
SLIDE 5

two results

slide-6
SLIDE 6

two results

Perfect monitoring in both NTU + TU games: Folk Theorem. Every payoff in the set of feasible and strictly individually rational payoffs is supported as δ → 1. ⇒ coalitional deviations do not refine play if players are patient.

slide-7
SLIDE 7

two results

Perfect monitoring in both NTU + TU games: Folk Theorem. Every payoff in the set of feasible and strictly individually rational payoffs is supported as δ → 1. ⇒ coalitional deviations do not refine play if players are patient. TU games + secret transfers: Anti-Folk Theorem. Only payoffs in the β-core are supported, and any such payoff is supported as δ → 1. ⇒ secret transfers + coalitional deviations undo dynamic incentives.

slide-8
SLIDE 8

1 Introduction 2 Examples 3 Model 4 Folk Theorem for NTU 5 Transferable Utility 6 Related Lit + Summary

slide-9
SLIDE 9

the roommates problem

2 rooms, 4 people

Ann Bob Carol

slide-10
SLIDE 10

the roommates problem

2 rooms, 4 people

Ann Bob Carol David

slide-11
SLIDE 11

the cycle

Arrows point to first preference.

slide-12
SLIDE 12

the cycle

Arrows point to first preference. No one is a worse roommate than David.

slide-13
SLIDE 13

the cycle

Arrows point to first preference. No one is a worse roommate than David. Ui(ij) = 4 − k if j is i’s kth favorite partner.

slide-14
SLIDE 14

no stable match

slide-15
SLIDE 15

no stable match

slide-16
SLIDE 16

no stable match

slide-17
SLIDE 17

no stable match

slide-18
SLIDE 18

Suppose individuals play repeatedly. Coalitions cannot commit to future behavior, but can jointly deviate in the stage-game. Our solution-concept seeks robustness to one-shot coalitional deviations. The key idea:

  • Coalitions have some short-term commitment (which one models

implicitly in the cooperative stage-game).

  • But coalitions lack long-term commitment.
slide-19
SLIDE 19

A stable convention if δ > 1

2.

slide-20
SLIDE 20

A stable convention if δ > 1

2.

slide-21
SLIDE 21

A stable convention if δ > 1

2.

slide-22
SLIDE 22

A stable convention if δ > 1

2.

slide-23
SLIDE 23

A divide-the-dollar example

Suppose that {1, 2, 3} are dividing a dollar. Any division requires approval of at least two players. Player 1 has veto power, and must be included in any winning coalition.

slide-24
SLIDE 24

A divide-the-dollar example

Suppose that {1, 2, 3} are dividing a dollar. Any division requires approval of at least two players. Player 1 has veto power, and must be included in any winning coalition. Core of stage-game:

  • Player 1 captures entire dollar.
  • Any other outcome has a profitable deviation for {1, 2} or {1, 3}.
slide-25
SLIDE 25

repeated divisions: perfect monitoring

Suppose the environment is repeated and coalitions cannot commit to long-term deviations. We claim that one can now support every division as δ → 1.

slide-26
SLIDE 26

Claim: ( 0, 1

2, 1 2

) is supportable using a stable convention if δ ⩾ 1

2.

slide-27
SLIDE 27

Claim: ( 0, 1

2, 1 2

) is supportable using a stable convention if δ ⩾ 1

2.

Pf: Use a core-reversion convention:

  • Choose

( 0, 1

2, 1 2

) every period on path of play.

  • Revert to (1, 0, 0) if any other allocation is chosen.
slide-28
SLIDE 28

Claim: ( 0, 1

2, 1 2

) is supportable using a stable convention if δ ⩾ 1

2.

Pf: Use a core-reversion convention:

  • Choose

( 0, 1

2, 1 2

) every period on path of play.

  • Revert to (1, 0, 0) if any other allocation is chosen.

Even if 1 offers entire surplus to coalition-partner: (1 − δ)(1) + δ(0) ⩽ 1

2.

Key idea: Veto player can’t recruit anyone to join his coalition.

slide-29
SLIDE 29

Claim: ( 0, 1

2, 1 2

) is supportable using a stable convention if δ ⩾ 1

2.

Pf: Use a core-reversion convention:

  • Choose

( 0, 1

2, 1 2

) every period on path of play.

  • Revert to (1, 0, 0) if any other allocation is chosen.

Even if 1 offers entire surplus to coalition-partner: (1 − δ)(1) + δ(0) ⩽ 1

2.

Key idea: Veto player can’t recruit anyone to join his coalition. Core-reversion supports any payoff vector u where min{u2, u3} ⩾ 1 − δ.

slide-30
SLIDE 30
slide-31
SLIDE 31
slide-32
SLIDE 32
slide-33
SLIDE 33

anti-folk theorem with secret transfers

Suppose players can make “secret transfers.” Convention conditions only on identity of the blocking coalition, and chosen allocation, but not eventual payoffs.

slide-34
SLIDE 34

anti-folk theorem with secret transfers

Suppose players can make “secret transfers.” Convention conditions only on identity of the blocking coalition, and chosen allocation, but not eventual payoffs. Stable convention supports only stage-game core: (1, 0, 0). (regardless of δ).

slide-35
SLIDE 35

1 Introduction 2 Examples 3 Model 4 Folk Theorem for NTU 5 Transferable Utility 6 Related Lit + Summary

slide-36
SLIDE 36

primitives: NTU stage-game

Choice over alternatives: a ∈ A. Generated Payoffs: v : A → Rn. Set of Coalitions: C ≡ 2N\∅. If alternative a is recommended, EC(a) is the set of alternatives to which coalition C can deviate, assuming no one else deviates. EC(·) is coalition C’s effectivity function.

slide-37
SLIDE 37

example: strategic-form game

a is action profile of the stage-game. Effectivity function extends usual notion of individual deviations to coalitional deviations. EC(a) ≡ { a′ ∈ A : a′

−C = a−C

}

slide-38
SLIDE 38

example: cooperative game

a is a partition of N, and A is the set of all partitions. vi(a) is the payoff from partition a. EC(a) specifies partitions that coalition C can generate, where “untouched” coalitions continue to follow a. EC(a) ⊆ {a′ : C′ ∩ C = ∅ and C′ ∈ a ⇒ C′ ∈ a′}

slide-39
SLIDE 39

examples

Voting: Convention specifies a default policy. But a coalition can introduce an alternative and block the default if it has more than a majority of players. Matching: Convention specifies who matches with whom. But pairs of players can defect and match differently.

slide-40
SLIDE 40

assumptions on effectivity

Transitivity: For every C, a′ ∈ EC(a) & a′′ ∈ EC(a′) ⇒ a′′ ∈ EC(a). Reflexivity: For every C and a, a ∈ EC(a). Omnipotence of the Grand Coalition: EN(a) = A for every a. (For the β-core result)

slide-41
SLIDE 41

repeated interaction

For a path p where at is alternative in period t, player i’s payoff is Ui(p) ≡ (1 − δ)

t=0

δtvi(at). Outcome at t ≡ ( chosen alternative, coalition that blocks (if any) ) . A history is a sequence of outcomes. A convention is a mapping, σ : Histories → A.

slide-42
SLIDE 42

repeated interaction

For a path p where at is alternative in period t, player i’s payoff is Ui(p) ≡ (1 − δ)

t=0

δtvi(at). Outcome at t ≡ ( chosen alternative, coalition that blocks (if any) ) . A history is a sequence of outcomes. A convention is a mapping, σ : Histories → A. a(h | σ) ≡ σ(h) is alternative recommended by σ after history h. p(h | σ) is path recommended by σ after history h. (computed recursively from a(h | σ)).

slide-43
SLIDE 43
  • Definition. A convention is stable if for every history h, ∄ coalition C

and a′ ∈ EC(a(h | σ)) such that for all i in C, (1 − δ)vi(a′) + δUi(p(h, (a′, C)|σ)) > Ui(p(h | σ)). No coalition wishes to deviate at any history, anticipating no deviations at all other histories.

slide-44
SLIDE 44

features of stable conventions

Recursivity. Relationship to core of stage-game. One-shot Deviation Principles?

slide-45
SLIDE 45

recursivity

Can use APS to characterize implementable outcomes for a fixed δ.

slide-46
SLIDE 46

relationship to core of stage-game

  • Definition. An alternative a is a core-alternative of stage-game if ∄

coalition C and a′ ∈ EC(a) such that for all i in C. vi(a′) > vi(a).

slide-47
SLIDE 47

relationship to core of stage-game

  • Definition. An alternative a is a core-alternative of stage-game if ∄

coalition C and a′ ∈ EC(a) such that for all i in C. vi(a′) > vi(a). [Stable convention of G∞: Core of G] ≃ [SPE of G∞ : NE of G].

slide-48
SLIDE 48

Formally, warding off one-shot coalitional deviations. Because this includes singleton coalitions, this is a refinement of SPE. But the sufficiency of examining only one-shot deviations (the so-called “one-shot deviation principle”) is typically a result, not an assumption. What comes from assuming it?

slide-49
SLIDE 49
  • ne-shot deviation principle

for an individual

If Alice profits from stringing together a (possibly infinite) sequence of

  • ne-shot coalitional deviations, each of which is profitable for every

deviating coalition partner but not necessarily for herself, then there is a profitable one-shot coalitional deviation. We prove this statement using an argument analogous to classical result.

slide-50
SLIDE 50
  • ne-shot deviation principle

for coalitions

If a coalition of two or more players have a profitable multi-shot deviation, then it has a profitable one-shot deviation. This version is false, except in the case of secret transfers.

slide-51
SLIDE 51

1 Introduction 2 Examples 3 Model 4 Folk Theorem for NTU 5 Transferable Utility 6 Related Lit + Summary

slide-52
SLIDE 52

Minmax and Folk Theorem

vi ≡ min

a∈A

max

a′∈E{i}(a) vi(a′).

(Player i’s minmax)

slide-53
SLIDE 53

Minmax and Folk Theorem

vi ≡ min

a∈A

max

a′∈E{i}(a) vi(a′).

(Player i’s minmax)

Theorem

Suppose that set of feasible and strictly individually rational payoffs has a non-empty interior. For every payoff in in that set, there is a stable convention that sustains that payoff if players are sufficiently patient.

slide-54
SLIDE 54

who guards the guardians?

Classical question highlights difficulties of simultaneously punishing too many people. Our solution: punish a player in each coalition, grant amnesty to others.

slide-55
SLIDE 55

step 1: individual minmax is lower-bound

vi ≡ min

a∈A

max

a′∈E{i}(a) vi(a′).

No stable convention can generate payoffs below vi for player i. Usual argument: at every history, player i can take a myopic best-response to the play of others.

slide-56
SLIDE 56

step 2: player-specific punishments

slide-57
SLIDE 57

step 2: player-specific punishments

Suppose you would like to support payoff vector v∗ in the interior of the feasible and strictly individually rational payoffs.

  • Design player-specific punishments (vi)n

i=1 where each vi is a vector.

  • Player i’s punishment: vi

i < v∗ i.

  • Better off if someone else is punished: vi

i < vj i.

Assume (for now) that for each payoff, ∃ an alternative that generates it.

slide-58
SLIDE 58

step 3: convention-design

Use Fudenberg & Maskin (1986) to ward off individual deviations:

  • Play v∗ on the path of play.
  • Player i deviates ⇒ Minmax for T(δ) periods, and then vi.
  • vi

j > vj j ⇒ player j doesn’t wish to deviate when i is being punished.

Use a scapegoat policy to ward off coalitional deviations: at each history, for each coalition C, pick a scapegoat i ∈ C, and punish her when coalition C deviates as if she were the only deviator. Scapegoat can be picked arbitrarily, except someone being minmaxed.

slide-59
SLIDE 59

step 4: how to convexity

Public correlation devices. Sequences of play. (Sorin 1986; Fudenberg & Maskin 1991).

slide-60
SLIDE 60

We have looked at only one-shot coalitional deviations. Can allow for coalitional deviations that are up to T < ∞ periods long. Holding T fixed, and taking δ → 1 results in the same theorem.

slide-61
SLIDE 61

1 Introduction 2 Examples 3 Model 4 Folk Theorem for NTU 5 Transferable Utility 6 Related Lit + Summary

slide-62
SLIDE 62

how we model transfers

Describe transfers using a matrix m ≡ [mij] where:

  • mii = 0.
  • mij ⩾ 0 are the transfers paid to player j by player i.

M is the set of all such matrices. Player i’s payoff = vi(a) + Incoming Transfers - Outgoing Transfers. Set of feasible payoffs is U = { u ∈ Rn : ∃a ∈ A such that ∑

i∈N

ui = ∑

i∈N

vi(a) } .

slide-63
SLIDE 63

Outcomes are now (a, C, m). Conventions are mappings of the form σ : Histories→ A × M. If coalition C blocks (a, m), it chooses:

  • Feasible alternative deviation: a′ ∈ EC(a).
  • Transfers: (m′

C, m−C).

Those outside the blocking coalition still make transfers as before.

slide-64
SLIDE 64

A restriction

We consider conventions such that continuation values are (uniformly) bounded across all histories. Formally, {u ∈ Rn : ∃h ∈ H such that U(h | σ = u} is a bounded subset of Rn.

slide-65
SLIDE 65

stable conventions with public transfers

definition

A convention is stable if for every history h, ∄ coalition C, alternative a′ ∈ EC(a(h | σ)), and transfers m′

C such that for all i in C,

Ui(p(h | σ)) < (1 − δ) + δ .

slide-66
SLIDE 66

stable conventions with public transfers

definition

A convention is stable if for every history h, ∄ coalition C, alternative a′ ∈ EC(a(h | σ)), and transfers m′

C such that for all i in C,

Ui(p(h | σ)) < (1 − δ) ( vi(a′) + ∑

j∈C m′ ji + ∑ j/ ∈C mji − ∑ j∈N m′ ij

) + δ .

slide-67
SLIDE 67

stable conventions with public transfers

definition

A convention is stable if for every history h, ∄ coalition C, alternative a′ ∈ EC(a(h | σ)), and transfers m′

C such that for all i in C,

Ui(p(h | σ)) < (1 − δ) ( vi(a′) + ∑

j∈C m′ ji + ∑ j/ ∈C mji − ∑ j∈N m′ ij

) + δ Ui(p(h, (a′, C, m′

C, m−C(h | σ)))).

slide-68
SLIDE 68

folk theorem with public transfers

When transfers are observable, punish players not only for participating in coalitions but also for their transfers. Transfers may allow coalitions to consider their joint surplus today, but continuation values may be adjusted on the basis of these transfers.

slide-69
SLIDE 69

secret transfers

Secret transfers: if a coalition blocks, then transfers within that coalition are unobserved by those outside the coalition. We model this as a measurability condition. σ can condition on:

  • chosen alternative a
  • the blocking coalition (if any)
  • payoffs outside a blocking coalition

But not on payoffs within a blocking coalition.

slide-70
SLIDE 70

Say that two histories are identical up to transfers if the only difference between the two are transfers within blocking coalitions. A convention under secret transfers must respect σ(h) = σ(h′) for any two histories h and h′ that are identical up to transfers.

slide-71
SLIDE 71

a one-shot deviation principle

coalitions

  • Lemma. If a coalition has a profitable multi-shot deviation, then it has a

profitable one-shot deviation.

slide-72
SLIDE 72

a one-shot deviation principle

coalitions

  • Lemma. If a coalition has a profitable multi-shot deviation, then it has a

profitable one-shot deviation. Idea of proof:

  • We show that the total gain for a coalition C from the multi-shot

deviation is bounded.

  • Therefore, there is a profitable multi-shot deviation that deviates at

finitely many steps.

  • Use standard backward-induction logic to find a profitable one-shot

deviation for the coalition.

slide-73
SLIDE 73

statement

The β-characteristic function is defined by ψ(C) ≡ min

a∈A

max

a′∈EC(a)

i∈C

vi(a′)

slide-74
SLIDE 74

statement

The β-characteristic function is defined by ψ(C) ≡ min

a∈A

max

a′∈EC(a)

i∈C

vi(a′)

Definition

The β-core is the set K ≡ {u ∈ U : ∑

i∈C

ui ⩾ ψ(C) for all C ∈ C} The strong β-core is the set Ks ≡ {u ∈ U : ∑

i∈C

ui > ψ(C) for all C ∈ C\{N}; ∑

i∈N

ui = ψ(N)}

slide-75
SLIDE 75

Anti-Folk Theorem

Theorem

Fix δ < 1. No payoff profile outside of the β-core can be supported by a stable convention.

slide-76
SLIDE 76

Anti-Folk Theorem

Theorem

Fix δ < 1. No payoff profile outside of the β-core can be supported by a stable convention. If the strong β-core is non-empty, then for every element of it, ∃ δ such that for all δ > δ, that payoff profile can be supported by a stable convention.

slide-77
SLIDE 77

step 1: coalitional minmax is lower-bound

ψ(C) = min

a∈A

max

a′∈EC(a)

i∈C

vi(a′) If joint surplus for coalition C is less than ψ(C), that coalition has a profitable multi-shot deviation (myopic BR for coalitional value). ⇒ coalition C has a profitable one-shot deviation.

slide-78
SLIDE 78

step 2: achieving any point in β-core

Payoffs are on efficiency frontier: chosen alternative a maximizes ∑

i∈N vi(a).

Our goal: generate a folk theorem on that frontier. Set of coalitions is 2n − 1. Payoffs satisfies Abreu-Dutta-Smith’s NEU condition: no two coalitions have payoffs that are positive affine transformations of each other. ⇒ we can generate player-specific punishments for the 2n − 1 players.

slide-79
SLIDE 79
slide-80
SLIDE 80
slide-81
SLIDE 81
slide-82
SLIDE 82
slide-83
SLIDE 83
slide-84
SLIDE 84
slide-85
SLIDE 85

1 Introduction 2 Examples 3 Model 4 Folk Theorem for NTU 5 Transferable Utility 6 Related Lit + Summary

slide-86
SLIDE 86

related literature

A Natural Approach: Collapse infinitely repeated game into its normal-form, and study its core.

slide-87
SLIDE 87

related literature

A Natural Approach: Collapse infinitely repeated game into its normal-form, and study its core. Corresponds to Aumann (1959):

  • Strong Nash Equilibria of Repeated Game.
  • Limit-of-means discounting (⇒ no gain from short-term deviations).
slide-88
SLIDE 88

related literature

A Natural Approach: Collapse infinitely repeated game into its normal-form, and study its core. Corresponds to Aumann (1959):

  • Strong Nash Equilibria of Repeated Game.
  • Limit-of-means discounting (⇒ no gain from short-term deviations).

Solution-concept is peculiar:

  • Coalitions deviate to strategies in the continuation game.

⇒ Coalitions have unlimited commitment in the infinite-horizon. ⇒ Why don’t they commit from the very beginning?

slide-89
SLIDE 89

Farsighted Stability

Konishi and Ray (2003) Hyndman and Ray (2007) Ray and Vohra (1997,1998,2015) Chwe (1994) Gomes and Jehiel (2005) Jordan (2006) Acemoglu, Egorov, and Sonin (2012) Mert Kimya (2015) Dutta and Vohra (2016)

slide-90
SLIDE 90

“It would be of interest to investigate dynamic noncooperative games with (nonbinding) coalition formation. One might begin with the partition function, so that the formation of a coalition structure at any date has a definite impact on payoffs, perhaps through the writing of binding agreements within coalitions in any period. But the important difference...is that such agreements would—by assumption—be up for grabs at the end

  • f every period. There are no binding agreements that last for

longer than a single date.” - Ray’07

slide-91
SLIDE 91

legal vs. community enforcement

Laws reflect a shared understanding of how future unfolds as a consequence of choices today. In the end, all are caught in a web of self-reinforcing sanctions. Law’s empire, tangible and all-encompassing as it may seem, is founded on little else than beliefs. – Kaushik Basu (2000) Words written on a piece of paper don’t alter the laws of

  • physics. Yet laws do influence behavior. - Mailath, Morris, &

Postlewaite (2001,2015). But people being punished attempt to side-deal with enforcers. ⇒ Credible laws must ward off individuals and coalitional deviations.

slide-92
SLIDE 92

Framework for studying coalitional behavior in repeated games.

  • Coalitional deviations in repeated games.
  • Self-enforcing behavior in repeated cooperative games.

With full observability, short-term coalitional deviations come at little cost as δ → 1. Secret transfers severely limit what may be implemented for every δ. Offers foundations for β-core in exponential discounting.

slide-93
SLIDE 93

Thanks!