Repeated Games George J Mailath A talk prepared for the Nemmers - PowerPoint PPT Presentation

Repeated Games George J Mailath A talk prepared for the Nemmers Conference Northwestern University May 5-7, 2005 Based on chapters 10 and 12 of Repeated Games and Reputations , with Larry Samuelson. http://www.ssc.upenn.edu/ ∼ gmailath/book.html May 5, 2005

Introduction Repeated games with perfect and public monitoring are (thought to be) well understood. Repeated games with private monitoring are more complicated, and until recently, little was known. Now, we know something, and this has shed light on games with perfect and public monitoring.

Since Abreu, Pearce, and Stacchetti (1990), the analysis of public monitoring games has tended to emphasize characterizing the set of equilibrium payoffs, rather than the structure of behavior. Earlier analyses of perfect monitoring games did also focus on structure—optimal penal codes for example (Abreu, 1988), and the complexity literature (Rubinstein (1986), Kalai and Stanford (1988), Abreu and Rubinstein (1988)). The theoretical reputation literature has also focused on the payoff bounds, rather than on the structure of the equilibria. Interesting things can be learnt from focusing on the structure of behavior.

Prisoners’ Dilemma Partnership A i = { E,S } , imperfect monitoring Y = { y, ¯ y } . ¯  p, if a = EE ,      Pr { ¯ y | a } = ρ (¯ y | a ) = q, if a = ES or SE ,    r, if a = SS .   y y ¯ PD E S ¯ 3 − 2 q − p − p − 2 q ex post E ex ante E 2 , 2 − 1 , 3 p − q p − q 3(1 − r ) − 3 r payoffs S payoffs S 3 , − 1 0 , 0 q − r q − r

Two periods, payoffs added. Second period stage game: G B G 3 , 3 0 , 0 B 0 , 0 1 , 1 Trigger profile: EE in first period, GG in the second after ¯ y , and BB after y . ¯ A PPE if 2( p − q ) ≥ 1.

Private Monitoring Player i observes y i ∈ Y i ≡ { y i , ¯ y i } . Joint distribution over signal vector ¯ ( y 1 ,y 2 ) ∈ Y 1 × Y 2 given by π ( y 1 y 2 | a ). Marginal distribution, π i ( y i | a ). ex post payoffs: u ∗ i ( y i ,a i ) y i ∈ Y i u ∗ ex ante payoffs: u i ( a ) = � i ( y,a i ) π i ( y i | a ).

Almost-public private monitoring: ρ ( y | a ) > 0 and for all a , | ρ ( y | a ) − π ( yy | a ) | < ε. For ε sufficiently small, under almost-public monitoring, players signals are highly correlated. a 1 a 2 y 2 y 2 ¯ ¯ y 1 (1 − α )(1 − 2 ε ) ε ¯ y 1 ¯ ε α (1 − 2 ε )

Conditionally-independent private monitoring: for all a , π ( y 1 y 2 | a ) = π 1 ( y 1 | a ) π 2 ( y 2 | a ) . For example,  1 − ε, if y i = ¯ y i and a j = E , or    π i ( y i | a ) = y i = y i and a j = S , j � = i , ¯   ε, otherwise,  EE y 2 y 2 ¯ SE y 2 y 2 ¯ ¯ ¯ ε 2 ε 2 y 1 (1 − ε ) ε y 1 (1 − ε ) ε ¯ ¯ (1 − ε ) 2 (1 − ε ) 2 y 1 ¯ (1 − ε ) ε y 1 ¯ (1 − ε ) ε For ε small, this is almost-perfect private monitoring.

More generally, a private-monitoring game with private monitoring distribution (Ω ,π ) has almost-perfect monitoring if, for all players i , there is a partition of Ω i , { Ω i ( a ) } a ∈ A , such that for all action profiles a ∈ A , � ω i ∈ Ω i ( a ) π i ( ω i | a ) > 1 − η. Almost-perfect private monitoring does not make any assumptions about the correlation structure: both almost-public and conditionally-independent private monitoring distributions can be almost-perfect.

Equilibria when Almost-Public Monitoring (Mailath and Morris, 2002, 2005) Induced behavior by trigger eq: both play E in first period, in second period, player i plays G after ¯ y i and B after y i . ¯ For π close to ρ , this is an eq: • Pr( y 1 = y 2 | a ) ≈ 1, so BB or GG in second period with probability close to ρ . • first period incentives are close to first period incentives under ρ .

Infinitely repeated games. A forgiving profile y y y w w EE SS y strict PPE if 1 1 (3 p − 2 q − r ) < δ < ( p +2 q − 3 r ) .

This forgiving profile has bounded recall: last period’s signal completely determines current state. Behavior induced by public forgiving profile in private monitoring game: ( W ,w 0 ,f i ,τ i ), where W = { w EE ,w SS } is set of states, w 0 = w EE is the initial state (common to both players), f i ( w a ) = a i is the decision rule, and τ i : W × Y i → W is the private transition function. i ; ...,y t − 1 ,a t − 1 After private history h t i = ( y 0 i ,a 0 ), player i has beliefs i i β i ( ·| h t i ) ∈ ∆( W j ) over player j ’s current private state. Private history also implies a current private state for i , w t i = τ i ( w 0 ,h t i ). Eg., w 2 i = τ ( τ ( w 0 i ,y 0 i ) ,y 1 i ). In forgiving profile, after h t i , i ) = Pr( y t − 1 = y t − 1 β i ( w t i | h t i ) = Pr( w t j = w t i | h t | h t i ) ≈ 1 . j i

Grim trigger y y , y y w w EE SS strict PPE if 1 δ > (3 p − 2 q ) . Profile has unbounded recall.

• If q > r , implied private profile is not a Nash equilibrium in any close-by game with full-support private monitoring. • If r ≥ q , implied private profile is a Nash equilibrium in every close- by game with full-support private monitoring. (And so there is a sequential equilibrium with the same outcome as that of grim trigger.) • For all r , q , implied private profile is not a sequential equilibrium in any close-by game with full-support private monitoring.

q > r : Grim trigger is not Nash. S is not optimal after long histories of the form � � E,y 1 ; S, ¯ y 1 ; S, ¯ y 1 ; S, ¯ y 1 ; ··· : Immediately after y 1 , 1 assigns prob very close to 0 to 2 being in w EE (because with prob close to 1, player 2 also observed y 2 ). Thus, playing S in the subsequent period is optimal. But π has full support = ⇒ 1 does not know that 2 is in w SS . ρ (¯ y | SE ) = q > r = ρ (¯ y | SS ) = ⇒ ¯ y 1 after playing S is an indication that player 2 had played E .

q ≤ r : Grim trigger is Nash. � � S is optimal after E,y 1 ; S, ¯ y 1 ; S, ¯ y 1 ; S, ¯ y 1 ; ··· : Immediately after y 1 , 1 assigns prob very close to 0 to 2 being in w EE (because with prob close to 1, player 2 also observed y 2 ). Thus, playing S in the subsequent period is optimal. ρ (¯ y | SE ) = q ≤ r = ρ (¯ y | SS ) = ⇒ ¯ y 1 after playing S is an indication that player 2 had played S (if q = r , ¯ y 1 is uninformative). Observing y 1 is signal that 2 had played E , but if 2 had also observed y 2 , then 2 transits to w SS .

E is optimal after ( E, ¯ y 1 ; E, ¯ y 1 ; E, ¯ y 1 ; E, ¯ y 1 ; ··· ): Posterior that 2 is in state w EE cannot fall very far. π full support = ⇒ Pr { 2 in w SS | 1 in w EE } > 0. But ¯ y 1 is signal that 2 had played E .

For all r , q : Grim trigger is not sequential. S is not optimal after long histories of the form � � Ey 1 ; E ¯ y 1 ; E ¯ y 1 ; E ¯ y 1 ; ··· : Immediately after y 1 , 1 assigns prob very close to 0 to 2 being in w EE (because with prob close to 1, player 2 also observed y 2 ). Thus, playing S in the subsequent period is optimal. But π has full support = ⇒ 1 is not sure that 2 is in w SS . ρ (¯ y | EE ) = p > q = ρ (¯ y | ES ) = ⇒ ¯ y 1 after playing E is an indication that player 2 had played E .

Important to understand the structure of equilibrium behavior. Mailath and Morris (2002) obtain folk thm for almost-perfect almost-public monitoring. Folk theorem for perfect monitoring can be proved using profiles with bounded recall. Unknown if folk theorem for public monitoring (Fudenberg, Levine, and Maskin, 1994) can be proved using bounded recall strategies. However, for some repeated prisoners’ dilemmas, the restriction to strongly symmetric bounded recall PPE results in a dramatic collapse of the set of equilibrium payoffs (Cole and Kocherlakota, forthcoming). Essentially, only bounded recall strict PPE are robust to sufficiently rich almost-public private monitoring (Mailath and Morris, 2005).

Equilibria with Conditionally-Independent Monitoring (Bhaskar and van Damme, 2002) In every pure strategy equilibrium of the two period game, SS is played in the first period (no matter how close to perfect the monitoring is). Consider putative equilibrium with EE in the first period. To support this, player i should play G after ¯ y i and B after y i . ¯ But, i ’s beliefs over the signals observed by j are independent of the signal he observes, and so are his best replies. For ε small, these are strict, and so sequentially rational play must ignore the signal.

Different situation with mixing. Consider symmetric profile with probability µ on E in the first period. Implies a type space for i , T i = { E,S }×{ y i , ¯ y i } , with joint dsn: ¯ E ¯ y 2 E y 2 S ¯ y 2 S y 2 ¯ ¯ µ 2 (1 − ε ) 2 µ 2 ε (1 − ε ) µ (1 − µ ) ε 2 E ¯ y 1 µ (1 − µ ) ε (1 − ε ) µ (1 − µ )(1 − ε ) 2 µ 2 ε (1 − ε ) µ 2 ε 2 E y 1 µ (1 − µ ) ε (1 − ε ) ¯ µ (1 − µ )(1 − ε ) 2 (1 − µ ) 2 ε 2 (1 − µ ) 2 ε (1 − ε ) S ¯ y 1 µ (1 − µ ) ε (1 − ε ) (1 − µ ) 2 ε (1 − ε ) (1 − µ ) 2 (1 − ε ) 2 µ (1 − µ ) ε 2 S y 1 µ (1 − µ ) ε (1 − ε ) ¯

Repeated Games George J Mailath A talk prepared for the Nemmers - PowerPoint PPT Presentation

Repeated Games George J Mailath A talk prepared for the Nemmers Conference Northwestern University May 5-7, 2005 Based on chapters 10 and 12 of Repeated Games and Reputations , with Larry Samuelson. http://www.ssc.upenn.edu/

Dynamic Games in Environmental Economics PhD minicourse Part I: Repeated Games and Self-Enforcing

Game Theory Repeated Games Levent Ko ckesen Ko c University Levent Ko ckesen (Ko c

Repeated Games with Perfect Monitoring Mihai Manea MIT Repeated Games normal-form stage game

Repeated games Felix Munoz-Garcia Strategy and Game Theory - Washington State University Repeated

Finitely Repeated Games 14.12 Game Theory Muhamet Yildiz 1 Road Map 1.

REPEATED GAMES Overview Context: players (e.g., firms) interact with each other on an ongoing

Environmental Economics 4910 Brd Harstad UiO February 2019 Brd Harstad (UiO) Repeated

Game theory for wireless networks static games; dynamic games; repeated games; strict and weak

Analysis of variance and regression 2009-3-11 Lene Theil Skovgaard Repeated measurements May

Games Miheer Dewaskar Chennai Mathematical Institute April 27, 2016 1 / 19 Outline Finite

S S S S erious Games erious Games erious Games erious Games + Computer S + Computer S +

Potential Games Matoula Petrolia April 14, 2011 Examples Potential Games Potential vs

Pre-Grundy Games Games And Graphs Workshop 2017 In collaboration with : Eric Duch ene,

Beliefs and Learning in Repeated Games Florin Constantin and Ivo Parashkevov March 15, 2006

Infinitely Repeated Games 14.12 Game Theory Muhamet Yildiz 1 3. Examples dVDC Road Map 1.

Repeated Games and Networks Literature Francesco Nava London School of Economics April 2015

2 The examples in our paper [2] Example 1: Voronoi Note, in the figure above, that the ( ,

Constraints on general second-order scalar-tensor models from gravitational Cherenkov radiation

Beyond Theories of Change: Working Politically for Transparency and Accountability Brendan

LIGHTER SIMPLER CHEAPER INDUSTRIAL ROBOT-MANIPULATOR polymeric materials

Kochen-Specker theorem and games Laura Mancinska University of Waterloo, Department of C&O

Problems for Breakfast Shaking Hands Seven people in a room start shaking hands. Six of them

U G A V michael.johanson@gmail.com ! " # ! K Q $ @mikebjohanson A J ! 0 1

Tool for Responsible Games London Workshop on Problem Gambling: Theory and (Best) Practice

Sambuz

Useful Links

Newsletter

Mail Us

Repeated Games George J Mailath A talk prepared for the Nemmers - PowerPoint PPT Presentation

Repeated Games George J Mailath A talk prepared for the Nemmers Conference Northwestern University May 5-7, 2005 Based on chapters 10 and 12 of Repeated Games and Reputations , with Larry Samuelson. http://www.ssc.upenn.edu/

Dynamic Games in Environmental Economics PhD minicourse Part I: Repeated Games and Self-Enforcing

Game Theory Repeated Games Levent Ko ckesen Ko c University Levent Ko ckesen (Ko c

Repeated Games with Perfect Monitoring Mihai Manea MIT Repeated Games normal-form stage game

Repeated games Felix Munoz-Garcia Strategy and Game Theory - Washington State University Repeated

Finitely Repeated Games 14.12 Game Theory Muhamet Yildiz 1 Road Map 1.

REPEATED GAMES Overview Context: players (e.g., firms) interact with each other on an ongoing

Environmental Economics 4910 Brd Harstad UiO February 2019 Brd Harstad (UiO) Repeated

Game theory for wireless networks static games; dynamic games; repeated games; strict and weak

Analysis of variance and regression 2009-3-11 Lene Theil Skovgaard Repeated measurements May

Games Miheer Dewaskar Chennai Mathematical Institute April 27, 2016 1 / 19 Outline Finite

S S S S erious Games erious Games erious Games erious Games + Computer S + Computer S +

Potential Games Matoula Petrolia April 14, 2011 Examples Potential Games Potential vs

Pre-Grundy Games Games And Graphs Workshop 2017 In collaboration with : Eric Duch ene,

Beliefs and Learning in Repeated Games Florin Constantin and Ivo Parashkevov March 15, 2006

Infinitely Repeated Games 14.12 Game Theory Muhamet Yildiz 1 3. Examples dVDC Road Map 1.

Repeated Games and Networks Literature Francesco Nava London School of Economics April 2015

2 The examples in our paper [2] Example 1: Voronoi Note, in the figure above, that the ( ,

Constraints on general second-order scalar-tensor models from gravitational Cherenkov radiation

Beyond Theories of Change: Working Politically for Transparency and Accountability Brendan

LIGHTER SIMPLER CHEAPER INDUSTRIAL ROBOT-MANIPULATOR polymeric materials

Kochen-Specker theorem and games Laura Mancinska University of Waterloo, Department of C&amp;O

Problems for Breakfast Shaking Hands Seven people in a room start shaking hands. Six of them

U G A V michael.johanson@gmail.com ! &quot; # ! K Q $ @mikebjohanson A J ! 0 1

Tool for Responsible Games London Workshop on Problem Gambling: Theory and (Best) Practice

Sambuz

Useful Links

Newsletter

Mail Us

Kochen-Specker theorem and games Laura Mancinska University of Waterloo, Department of C&O

U G A V michael.johanson@gmail.com ! " # ! K Q $ @mikebjohanson A J ! 0 1