Learning to Coordinate
Very preliminary - Comments welcome Edouard Schaal1 Mathieu Taschereau-Dumouchel2
1New York University 2Wharton School
University of Pennsylvania
1 / 30
Learning to Coordinate Very preliminary - Comments welcome Edouard - - PowerPoint PPT Presentation
Learning to Coordinate Very preliminary - Comments welcome Edouard Schaal 1 Mathieu Taschereau-Dumouchel 2 1 New York University 2 Wharton School University of Pennsylvania 1 / 30 Introduction We want to understand how agents learn to
1New York University 2Wharton School
University of Pennsylvania
1 / 30
◮ In most models, information comes from various exogenous signals ◮ In reality, agents learn from endogenous sources (prices, aggregates,
◮ The mechanism dampens the impact of small shocks... ◮ ...but amplifies and propagates large shocks 2 / 30
◮ Payoff of action depends on actions of others and on unobserved
◮ Agents use private and public information about θ ◮ Observables (output,...) aggregate individual decisions
◮ When public information is very good or very bad, agents rely less on
◮ The observables becomes less informative ◮ Learning is impeded and the economy can deviate from fundamental
3 / 30
◮ Characterize equilibria and derive conditions for uniqueness ◮ Explore relationship between decisions and information ◮ Study the planner’s problem ◮ Provide numerical examples and simulations along the way 4 / 30
◮ Angeletos and Werning (2004); Hellwig, Mukherji and Tsyvinksi
◮ Angeletos, Hellwig and Pavan (2007): dynamic environment,
◮ Chamley (1999): stylized model with cycles, learning from actions of
5 / 30
6 / 30
◮ Two-state Markov process θt ∈ {θl, θh}, θh > θl with
7 / 30
1 A private signal vit
◮ Drawn from cdf Gθ for θ ∈ {θl, θh} with support v ∈ [a, b] ◮ Gθ are continuously differentiable with pdf gθ ◮ Monotone likelihood ratio property: gh(v)/gl(v) is increasing
2 An exogenous public signal zt drawn from cdf F z θ and pdf f z θ 3 An endogenous public signal mt
◮ Agents observe the mass of projects realized with some additive
◮ νt ∼ iid cdf F ν with associated pdf f ν ◮ Assume without loss of generality that F ν has mean 0 8 / 30
1 θt is realized 2 Private signals vit are observed 3 Decisions are made 4 Public signals mt and zt are observed
9 / 30
10 / 30
11 / 30
pit LR−1 it
12 / 30
t
h (zt)
l (zt)
t
t
Full expression for dynamic of p 13 / 30
p gl(vi) gh(vi)
14 / 30
1 Agent i realizes his project if and only if his vi is higher than ˆ
2 The public signal m is defined by m = 1 − Gθ (ˆ
3 Public and private beliefs are consistent with Bayesian learning
15 / 30
16 / 30
gl is differentiable and there exists ρ > 0 such that
1 If β 1−β ≤ θh − θl, all equilibria are monotone, 2 If β 1−β ≤ ρPhlPlh max{gh,gl}3 , there exists a unique equilibrium.
1 an upper bound on β;
Role of β
2 enough beliefs dispersion.
Role of dispersion 17 / 30
200 400 600 800 1000 0.0 0.2 0.4 0.6 0.8 1.0
1 − Gθ(ˆ v) θ
200 400 600 800 1000 0.0 0.2 0.4 0.6 0.8 1.0
1 − Gθ(ˆ v) θ
18 / 30
ν), then the mutual information between θ and m is
ν
General Lemma
1 The current beliefs p 2 The amount of noise σν added to the signal 3 The difference between Gl (ˆ
Definition of mutual information 19 / 30
2
20 / 30
21 / 30
0.0 0.2 0.4 0.6 0.8 1.0
Current beliefs p
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
Mutual information
Parameters 22 / 30
1 If (1 − β) θl ≤ c ≤ (1 − β) θh, there exists p ∈ [0, 1], such that for
2 If (1 − β) θl + β ≤ c ≤ (1 − β) θh + β, there exists p ∈ [0, 1], such
3 For p ≤ p and p ≥ p, m contains no information about θ.
Details
23 / 30
0.0 0.2 0.4 0.6 0.8 1.0
Current beliefs p
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
Mutual information
β = 0 β = 0.3
Parameters Details
2, higher β implies more projects realized (ˆ
2, higher β implies fewer projects realized (ˆ
24 / 30
25 / 30
10 20 30 40 50 0.0 0.2 0.4 0.6 0.8 1.0
θ m with β =0.0 m with β =0.3
10 20 30 40 50 0.0 0.2 0.4 0.6 0.8 1.0
θ m with β =0.0 m with β =0.3 26 / 30
10 20 30 40 50 0.0 0.2 0.4 0.6 0.8 1.0
1 − Gθ(ˆ v) ν
10 20 30 40 50 0.0 0.2 0.4 0.6 0.8 1.0
1 − Gθ(ˆ v) ν
10 20 30 40 50 0.0 0.2 0.4 0.6 0.8 1.0
1 − Gθ(ˆ v) ν 27 / 30
1 Complementarity: a higher m increases the payoff of others 2 Information: m influences the amount of information revealed
ˆ v
ˆ v
28 / 30
10 20 30 40 50 0.0 0.2 0.4 0.6 0.8 1.0
θ Efficient m Equilibrium m
29 / 30
◮ Large government spending multiplier?
30 / 30
h (z) f (m − 1 + Gh (ˆ
l (z) f (m − 1 + Gl (ˆ
h (z) f (m − 1 + Gh (ˆ
l (z) f (m − 1 + Gl (ˆ
Details 30 / 30
ν), then Γ = (2σ2 ν)−1.
Return 30 / 30
m
Return 30 / 30
Return 30 / 30
Return 30 / 30
1 0.2 0.4 0.6 0.8 1 pi vi Fundamental 1 0.2 0.4 0.6 0.8 1 Ei[1 − Gθ(vi)] vi Complementarity
30 / 30
Return 30 / 30
Return
30 / 30
0.2 0.2 0.4 0.6 0.8 1 π(ˆ v; ˆ v, p) ˆ v
0.2 0.2 0.4 0.6 0.8 1 π(ˆ v; ˆ v, p) ˆ v higher p p p m = 0 m = 1
Return 30 / 30
2
2
2, higher β implies more projects realized (ˆ
2, higher β implies fewer projects realized (ˆ
Return 30 / 30