A Distributional Framework for Matched Employer Employee Data Nov - - PowerPoint PPT Presentation

a distributional framework for matched employer employee
SMART_READER_LITE
LIVE PREVIEW

A Distributional Framework for Matched Employer Employee Data Nov - - PowerPoint PPT Presentation

A Distributional Framework for Matched Employer Employee Data Nov 2017 Introduction Many important labor questions rely on rich worker and firm heterogeneity - decomposing wage inequality, understanding earnings dynamics, mobility


slide-1
SLIDE 1

A Distributional Framework for Matched Employer Employee Data

Nov 2017

slide-2
SLIDE 2

Introduction

  • Many important labor questions rely on rich worker and firm

heterogeneity

  • decomposing wage inequality, understanding earnings

dynamics, mobility (individual and aggregate)

  • mobility between jobs, in and out of employment
  • This heterogeneity might be unobserved
  • but we have repeated measures (matched data)
  • we can learn about latent types
  • Economists have developed frameworks for two-sided

heterogeneity, observed and unobserved

slide-3
SLIDE 3

Two influential literatures for worker and firm heterogeneity

Log linear fixed effect wages

Abowd, Kramarz, and Margolis (1999); Card, Heining, and Kline (2013)

  • yit = αi + ψj(i,t) + ǫit
  • spurred both applied and theoretical literature
  • pros:

allows for 2-sided unobserved heterogeneity, tractable

  • limitations:

imposes additivity (= theory, Eeckhout and Kircher (2011)), suffers from limited mobility bias

Equilibrium search structural models

Burdett and Mortensen (1998); Shimer and Smith (2000); Postel-Vinay and Robin (2004); Hagedorn, Law, and Manovskii (2014)

  • pros:

allows for complex wage functions, can address efficiency/policy questions

  • limitations:

imposes strong structural assumptions (vacancy mechanism, wage setting, mobility decision ...)

slide-4
SLIDE 4

Two influential literatures for worker and firm heterogeneity

Log linear fixed effect wages

Abowd, Kramarz, and Margolis (1999); Card, Heining, and Kline (2013)

  • yit = αi + ψj(i,t) + ǫit
  • spurred both applied and theoretical literature
  • pros:

allows for 2-sided unobserved heterogeneity, tractable

  • limitations:

imposes additivity (= theory, Eeckhout and Kircher (2011)), suffers from limited mobility bias

Equilibrium search structural models

Burdett and Mortensen (1998); Shimer and Smith (2000); Postel-Vinay and Robin (2004); Hagedorn, Law, and Manovskii (2014)

  • pros:

allows for complex wage functions, can address efficiency/policy questions

  • limitations:

imposes strong structural assumptions (vacancy mechanism, wage setting, mobility decision ...)

slide-5
SLIDE 5

This paper:

  • Proposes a distributional model of wages
  • assume discrete heterogeneity: firms (k) and workers (ℓ)
  • non-parametric conditional wage distributions Fkℓ(w)
  • unrestricted firm compositions πk(ℓ)
  • Non-parametric identification & estimation for 2 types of

mobility assumptions:

  • 2 period static model ( ∼ AKM assumptions )
  • 4 period dynamic model
  • Applies method to Swedish matched employee-employer data

Important properties:

  • works with very short panels (2 to 4 periods)
  • relax additivity and mobility
  • provide a ”

regularization”

  • testing framework:
  • compatible with many theoretical models:
  • informative about patterns without imposing full structure,
  • without further assumptions, can’t address efficiency questions
slide-6
SLIDE 6

This paper:

  • Proposes a distributional model of wages
  • assume discrete heterogeneity: firms (k) and workers (ℓ)
  • non-parametric conditional wage distributions Fkℓ(w)
  • unrestricted firm compositions πk(ℓ)
  • Non-parametric identification & estimation for 2 types of

mobility assumptions:

  • 2 period static model ( ∼ AKM assumptions )
  • 4 period dynamic model
  • Applies method to Swedish matched employee-employer data

Important properties:

  • works with very short panels (2 to 4 periods)
  • relax additivity and mobility
  • provide a ”

regularization”

  • testing framework:
  • compatible with many theoretical models:
  • informative about patterns without imposing full structure,
  • without further assumptions, can’t address efficiency questions
slide-7
SLIDE 7

Plan of the talk

1 Framework & identification overview 2 Data and empirical results 3 Performance on a theoretical sorting model

slide-8
SLIDE 8

Model and Indentification

slide-9
SLIDE 9

Heterogeneity and wages

  • Workers indexed by i with discrete types ω(i) ∈ {1, ..., L}
  • Firms indexed by j with discrete classes f (j) ∈ {1, ..., K}.
  • Let jit denote the identifier of the firm where i works at t.
  • The proportion of type-l workers in firm j is πf (j)(l), where:

Pr [ω(i) = ℓ | f (ji1) = k] = πk(ℓ).

  • The conditional cdf of log wages Yi1 is:

Pr [Yi1 ≤ y | ω(i) = ℓ, f (ji1) = k] = Fkℓ(y).

  • Interactions between workers are ruled out.
  • At this K and L are assumed known, which is an important
  • restriction. In a different paper we are extending this. We also

provide theorems of ℓ continuous.

slide-10
SLIDE 10

Heterogeneity and wages

  • Workers indexed by i with discrete types ω(i) ∈ {1, ..., L}
  • Firms indexed by j with discrete classes f (j) ∈ {1, ..., K}.
  • Let jit denote the identifier of the firm where i works at t.
  • The proportion of type-l workers in firm j is πf (j)(l), where:

Pr [ω(i) = ℓ | f (ji1) = k] = πk(ℓ).

  • The conditional cdf of log wages Yi1 is:

Pr [Yi1 ≤ y | ω(i) = ℓ, f (ji1) = k] = Fkℓ(y).

  • Interactions between workers are ruled out.
  • At this K and L are assumed known, which is an important
  • restriction. In a different paper we are extending this. We also

provide theorems of ℓ continuous.

slide-11
SLIDE 11

Job mobility

static model: 2 periods

Yi1 Yi2 k k′ move

  • Consider a worker of type ℓ in firm k in period 1
  • He gets a wage Yi1 drawn from Fkℓ(y).
  • The worker moves to a class-k′ firm with a probability that

depends on k and ℓ, not on Y i1.

  • In period 2 he draws a wage Yi2 from a distribution Gk′ℓ(y′)

that depends on ℓ and k′, not on (k, Y i1).

slide-12
SLIDE 12

Job mobility

dynamic model: 4 periods

Yi1 Yi2 Yi3 Yi4 k k′ move

  • Consider a worker of type ℓ in firm k at t = 2
  • Wages (Yi1, Yi2) are drawn from a bivariate distribution that

depends on (k, ℓ).

  • At t = 2, the worker moves to a type-k′ firm with a

probability that depends on ℓ, k and Yi2, not on Y i1.

  • At t = 3, If he moves, the worker draws a wage Yi3 from a

distribution that depends on ℓ, k′, k, Yi2, not on Y i1.

  • At t = 4, the worker draws a wage Yi4 that depends on ℓ, k′,

Yi3, not on (k, Y i2, Y i1).

slide-13
SLIDE 13

Link to theoretical models

  • 2-periods model:
  • Example: Shimer and Smith (2000), without or with
  • n-the-job search (workers’ threat points being the value of

unemployment).

  • No role for match-specific draws, unless independent over time
  • r measurement error. No sequential auctions.
  • 4-periods model:
  • All models where state variables (ℓ, kt, Yt) are first-order

Markov.

  • Examples: wage posting, sequential auctions (Lamadon, Lise,

Meghir and Robin 2015), with aggregate shocks (Lise and Robin 2014). more ⊲

  • No latent human capital accumulation (ℓt), no

permanent+transitory within-job wage dynamics (example: random walk+i.i.d. shock).

slide-14
SLIDE 14

Plan of attack

1 Identification with large firms 2 Empirical content of means & event study 3 Grouping firms in discrete types

slide-15
SLIDE 15

Main restrictions

Static model

  • Under the assumptions of the static model, we have,
  • For movers from firm k to firm k′ we have:

Pr

  • Yi1≤y, Yi2≤y′|k, k′

=

K

  • ℓ=1

pkk′ (ℓ) Fkℓ(y)Fk′ℓ(y′),

  • For the cross-section in k we have

Pr [Yi1 ≤ y| k] =

K

  • ℓ=1

πk (ℓ) Fkℓ(y).

slide-16
SLIDE 16

Main restrictions

Dynamic model

  • Using mobility assumptions of the dynamic model
  • conditioning on Y2 = y2Y3 = y3, we get:

Pr

  • Yi1≤y, Yi4≤y′|y2, y3, k, k′

=

K

  • ℓ=1

pkk′y2y3 (ℓ) Fkℓ(y|y2)Gk′ℓ(y′|y3)

  • Similar structure as in static model:
  • use 4 period of data
  • replace Fkℓ(y) with Fkℓ(y|y′)
  • replace pkk′ with pkk′y2y3
slide-17
SLIDE 17

Identification Wage Functions

Large firms

  • Consider two larger firms k and k′ and joint Y1, Y2 wages for

movers k → k′ Ak,k′(y1, y2) =

Fkℓ(y1)pkk′(ℓ)Fk′ℓ(y2)

  • Discretize wage (nw) support and write in Matrix form:

A(k, k′)

nw×nw

= F(k)

nw×nℓ

P(k, k′)

nℓ×nℓ diag.

F(k′)⊺

  • Consider case where nw=nℓ, and both k → k′ and k′ → k:

A(k, k′)A−1(k′, k)⊺ = F(k)P(k, k′)P−1(k′, k)F −1(k)

  • Which is an eigen value decomposition.
slide-18
SLIDE 18

Identification Wage function

  • In general, the identification relies on a joint diagonalization
  • f all A(k, k′).

A(k, k′) = F(k)P(k, k′)F(k′)⊺

  • It is sufficient (but not necessary, see paper) for identification
  • f Fkℓ that:
  • pkk ′(ℓ) = 0 for ℓ = 1, ..., L.
  • pkk′(ℓ

pk′k(ℓ), k = 1, ..., L, are distinct.

  • The columns of F(k) (the Fkℓ) are linearly independent.
  • once Fkℓ is known, go to cross-section to get πk(ℓ)
  • In the 4 period model, replace Y1, Y2 with Y1|Y2, Y4|Y3 and

do everything conditional on k, k′, Y2, Y3.

slide-19
SLIDE 19

Empirical content of wage means

intro

  • In the linear framework (AKM) where Yit = αi + ψj(i,t) + ǫit
  • ne can focus on movers to get:

E(Yit+1 − Yit|m = 1) = ψj(i,t+1) − ψj(i,t), (1) which can be recovered with OLS.

  • Now consider an interacted model at the class level:

Yit = a(kit) + b(kit)αi + ǫit with E[ǫit|αi, ki1, ki2, mi1] = 0.

  • what can we do?
slide-20
SLIDE 20

Empirical content of wage means

intro

  • In the linear framework (AKM) where Yit = αi + ψj(i,t) + ǫit
  • ne can focus on movers to get:

E(Yit+1 − Yit|m = 1) = ψj(i,t+1) − ψj(i,t), (1) which can be recovered with OLS.

  • Now consider an interacted model at the class level:

Yit = a(kit) + b(kit)αi + ǫit with E[ǫit|αi, ki1, ki2, mi1] = 0.

  • what can we do?
slide-21
SLIDE 21

Empirical content of wage means

interactions

  • Interacted model Yit = a(kit) + b(kit)αi + ǫit with 2 firms:

¯ y21(2) ¯ y21(1) ¯ y12(1) ¯ y12(2) a(2) + b(2) · ¯ α21 = a(1) + b(1) · ¯ α12 = = a(1) + b(1) · ¯ α21 = a(2) + b(2) · ¯ α12 k=2 k=2 k=1 k=1 ∆21 ∆12 wage time

  • Comparing changes: ∆21 + ∆12 = (¯

α12 − ¯ α21)(b(2) − b(1))

  • 0 if no interactions b(2) = b(1)
  • also 0 when composition is identical ¯

α12 = ¯ α21

slide-22
SLIDE 22

Empirical content of wage means

interactions

  • Consider the following differences:

¯ y21(2) − ¯ y12(2) = b(2)(¯ α21 − ¯ α12) ¯ y21(1) − ¯ y12(1) = b(1)(¯ α21 − ¯ α12)

  • Taking ration whenever ¯

α21 = ¯ α12: b(2) b(1) = ¯ y21(1) − ¯ y12(2) ¯ y21(2) − ¯ y12(1)

  • this recovers the interaction term.
slide-23
SLIDE 23

Empirical content of wage means

Event Study in wages from Shimer Smith

Wages in Shimer Smith Event study

0.90 0.95 1.00 1.05 2.5 5.0 7.5 10.0

firm class

model log wages (PAM)

slide-24
SLIDE 24

Firm types

Intro

  • Identification relies on large flows of workers between k and

k′, However mobility is low at the firm pair.

  • We propose to discretized firm types: assume K discrete type

in the population

  • k drives the unconditional firm wage distribution:

Hk(y) =

πk(ℓ)Fkℓ(y)

  • Recover types using cross-section, then treat groups as large
  • firms. Requires that Hk(y) are separable.
  • This first stage classification achieves a double purpose:
  • reduces the problem of limited mobility.
  • it breaks the complicated dependence structure between firms.
slide-25
SLIDE 25

Firm types

Grouping in practice

  • Under the assumption that in the population i) firms are

clustered in K groups and ii) the Hk are separated, then:

  • Then firms partition f (j) ∈ 1..K is identified, and can

recovered by k-mean on firms wage distributions. min

f (1),...,f (J),H1,...,HK J

  • j=1

nj

D

  • d=1
  • Fj (yd) − H f (j) (yd)

2 ,

  • when K is known and Hk(y) are separable, this classification

is super-consistent in firm size.

slide-26
SLIDE 26

Firm types

Distribution of wages

0.90 0.95 1.00 1.05 2.5 5.0 7.5 10.0

firm class

model log wages (PAM)

  • 0.4

0.5 0.6 0.7 2.5 5.0 7.5 10.0

firm class

quantile log wages (PAM)

0.96 1.00 1.04 1.08 1.12 2.5 5.0 7.5 10.0

firm class

model log wages (NAM)

  • 0.50

0.55 0.60 0.65 0.70 2.5 5.0 7.5 10.0

firm class

quantile log wages (NAM)

Notes: Model based on Shimer and Smith (2000) with on-the-job search.

slide-27
SLIDE 27

Recap of the full method

1 get Firm classes membership f (j) ∈ [1..K].

  • group firms based on wage distributions
  • in practice we use k-means
  • cluster based on cross-section (or combine with movers)

more ⊲

2 get pkk′(ℓ), and Fkℓ or Fkℓ(y1|y2)

  • use movers, treat worker type as random
  • non-parametric identification in the paper
  • in practice we use the EM algorithm

3 get firm compositions πk(ℓ)

  • using stayers (cross-section or 4 periods)
  • another EM
slide-28
SLIDE 28

Applying framework to Swedish data

slide-29
SLIDE 29

Sample description

  • We use matched employer-employee data from Sweden

between 1997 and 2006.

  • We select full-year employed males in 2002 (period 1) and in

2004 (period 2): 1, 000, 000 workers and 60, 000 firms.

  • From this we define movers as workers whose firm IDs are

different in 2002 and 2004. We focus on continuing firms and get 20, 000 job changers, with 13, 000 firms.

more ⊲

  • We use log pre-tax annual earnings, net of time dummies

(interacted with education*cohort).

slide-30
SLIDE 30

Estimated firm classes

  • We estimate firm classes on the 2002 cross-section using a

weighted k-means algorithm (empirical cdfs with 40 points, 10000 starting values). We allow for K = 10 classes.

  • Wage variation across firms is captured well: the between-class

variance of log wages is 90% of the between-firm variance.

  • Note: the ordering of firm classes (by mean log wages) is

arbitrary.

  • Differences between classes in terms of worker composition

(education, age), but also log value added per worker.

slide-31
SLIDE 31

Descriptive statistics on estimated firm classes

firm cluster: 1 2 3 4 5 6 7 8 9 10 all number of workers 21,662 62,929 110,792 114,324 100,080 78,837 137,971 85,806 58,728 27,023 798,152 number of firms 6,487 7,972 7,804 6,494 4,663 3,748 4,209 3,984 3,157 2,812 51,330 % HS dropout 28.9 28 26.6 26.9 23.7 21.1 18.9 12.2 5.31 3 20.7 % HS grade 59.7 62.5 62.6 62.5 61.7 57.8 58.6 47.2 32.9 23.9 56.1 % some college 11.4 9.42 10.7 10.7 14.6 21.2 22.5 40.5 61.8 73.1 23.3 % workers younger than 30 25 20.9 20.5 18.4 16.4 18.4 14.3 15.3 16 14.8 17.5 % workers between 31 and 50 52.7 52.2 53.2 54.1 55.4 54.6 56 57 59.1 63.6 55.3 % workers older than 51 22.3 26.9 26.2 27.5 28.2 27.1 29.7 27.7 24.9 21.6 27.2 mean log wages 9.6 9.87 9.99 10.1 10.1 10.1 10.2 10.4 10.5 10.8 10.2 variance of log wages 0.15 0.0841 0.0934 0.0732 0.0699 0.141 0.0918 0.114 0.116 0.177 0.148 between firm variance of log wages 0.0576 0.00614 0.0039 0.00185 0.0016 0.0056 0.00184 0.00367 0.00456 0.039 0.0544 mean of log value added per worker 12.4 12.5 12.7 12.7 12.8 12.8 12.9 13 13 13.2 12.7 variance of log value added per worker 0.202 0.163 0.155 0.141 0.175 0.255 0.249 0.304 0.398 0.594 0.52 median number of workers per firm 2 4 6 5 5 6 5 5 4 3 4

Notes: Sample 1 in 2002. All workers are males, employed during the full year

  • 2002. “HS”is high school.
slide-32
SLIDE 32

Parametrization of the 2-periods model

  • We specify a model with Gaussian error:
  • Yit ∼ N(µtkℓ, σtkl)
  • πk(ℓ) and pkk ′(ℓ) left unrestricted
  • With K = 10, L = 6 we get 900 parameters.
  • We have also estimated a mixture of mixture models.
  • We also have an interacted model
  • Yit = a(k) + b(k)α + ǫit
slide-33
SLIDE 33

Estimated mean log wages and type proportions (2-periods model)

Mean log wages Proportions of worker types

9.5 10.0 10.5 11.0 1 2 3 4 5 6 7 8 9 10

firm class k log−earnings

0.00 0.25 0.50 0.75 1.00 1 2 3 4 5 6 7 8 9 10

firm class k type proportions

Notes: The left graph plots the mean of Fkℓ. The L = 10 firm classes (on the x-axis) are ordered by mean log wage. The K = 6 worker types correspond to the 6 different colors. 95% confidence intervals based on the parametric bootstrap (200 replications). The right graph plots type proportions πk(ℓ).

slide-34
SLIDE 34

Variance Decomposition and mean effects (2-periods model)

Variance decomposition (×100)

Var(α) Var(α+ψ) Var(ψ) Var(α+ψ) 2Cov(α,ψ) Var(α+ψ)

Corr(α, ψ) R2 80.3

(.8)

3.4

(.2)

16.3

(.6)

49.1

(.9)

74.8

(.6)

Reallocation exercise (×100)

Mean Median 10%-quantile 90%-quantile Variance .5

(.09)

.6

(.10)

2.7

(.20)

−1.2

(.30)

−1.1

(.11)

slide-35
SLIDE 35

Simulations and decompositions

  • We simulate the model based on the estimated parameters,

conditional on the job moves in the data.

  • We simulate entire employment spells, using the spell lengths

in the data.

  • We run linear regressions of the form:

Yi1 = α(ω0

i ) + ψ(f 0 ji1) + εi1

  • We compare our results with AKM on real and simulated data.
slide-36
SLIDE 36

Variance decompositions (2-periods model)

Var(α) Var(α+ψ) Var(ψ) Var(α+ψ) 2Cov(α,ψ) Var(α+ψ)

Corr(α, ψ)

Data (K = 10)

estimate 79.6% 4.3% 16.1% 0.436

Monte-Carlo (K = 10, 100 reps)

mean 80.3% 3.4% 16.3% 0.489 0.025 quantile 79% 3% 15% 0.455 0.975 quantile 82% 3.9% 17.4% 0.526

Varying the number of classes

K = 5 81.5% 3.6% 14.9% 0.437 K = 15 79% 4.5% 16.5% 0.436 K = 20 79.2% 4.7% 16.2% 0.42

Mixture model

estimate 77% 5.2% 17.8% 0.443

Monte-Carlo (K = 10, 100 reps)

mean 77.8% 4.4% 17.8% 0.481 0.025 quantile 76.3% 4% 16.8% 0.465 0.975 quantile 79.1% 5% 18.8% 0.494

slide-37
SLIDE 37

Fixed effect, limited mobility bias

min spell rep

Var(α) Var(α+ψ) Var(ψ) Var(α+ψ) 2Cov(α,ψ) Var(α+ψ)

Corr(α, ψ)

Data

This paper 0.7766 0.0473 0.1762 0.4598 Fixed-effects 0.9813 0.3014

  • 0.2826
  • 0.2599

Simulated from the model

This paper 1 1 0.7669 0.0466 0.1866 0.4934 Fixed-effects 1 1 1.0879 0.3447

  • 0.4326
  • 0.3532

Simulated from the model without limited mobility

Fixed-effects 4 1 0.8948 0.1602

  • 0.055
  • 0.0727

Fixed-effects 4 10 0.7816 0.053 0.1654 0.4064

slide-38
SLIDE 38

Estimation of the 4-periods model

  • We add two employment periods: 2001 and 2005. There are

12,519 workers moving in 2003 employed in the five years.

  • We write a model for Pr[Y1, Y2, Y3, Y4|ℓ, k, k′] :

Yi1 = ρ1|2Yi2+ a1(k) + b(k)α+ εi1 Yi2 = a2(k) + b(k)α+ ξ2(k′)+ εi2 Yi3 = a3(k′) + b(k′)α+ ξ3(k)+ εi3 Yi4 = ρ4|3Yi3+ a4(k′) + b(k′)α+ εi4 with ε’s covariance matrix has to respect the Markovian property.

  • we leave E(α|k, k′) and Var(α|k, k′) unrestricted
slide-39
SLIDE 39

Estimated mean log wages and type proportions

(4-periods model)

Mean log-earnings Proportions of worker types

9.5 10.0 10.5 11.0 1 2 3 4 5 6 7 8 9 10

firm class k log−earnings

0.00 0.25 0.50 0.75 1.00 1 2 3 4 5 6 7 8 9 10

firm class k type proportions

slide-40
SLIDE 40

Variance decompositions on Swedish data (4-periods model)

Var(α) Var(α+ψ) Var(ψ) Var(α+ψ) 2Cov(α,ψ) Var(α+ψ)

Corr(α, ψ) ρ1|2 ρ4|3

Data (K = 10)

estimate 79.9% 5.4% 14.8% 0.357 0.2452 0.6603

Monte-Carlo (K = 10, 100 reps)

mean 77.6% 5.9% 16.5% 0.389 0.2138 0.6639 0.025 quantile 69.9% 4% 13.6% 0.33 0.205 0.6601 0.975 quantile 81.8% 9.4% 21% 0.441 0.2219 0.6675

Varying the number of classes

K = 5 81.8% 3.9% 14.3% 0.4 0.2674 0.6565 K = 15 76.3% 6.4% 17.3% 0.391 0.2469 0.6582 K = 20 74.8% 7.3% 17.9% 0.382 0.2476 0.6566

slide-41
SLIDE 41

Dynamic results

(4-periods model

slide-42
SLIDE 42

Dynamic results

(4-periods model)

slide-43
SLIDE 43

Dimensionality of firm heterogeneity

(4-periods model)

slide-44
SLIDE 44

Performance on sorting models

slide-45
SLIDE 45

Theoretical search-matching model: wage distributions

  • A simple extension of Shimer and Smith (2000)
  • worker x and firm y, on-the-job search (λ0, λ1)
  • production function f (x, y) = a + (νx ρ + (1 − ν)yρ)1/ρ
  • wages are continuously bargained ( outside option is

unemployment)

eqs ⊲

  • consider PAM (ρ = −3) and NAM (ρ = 4)
slide-46
SLIDE 46

Theoretical search-matching model: wage distributions

0.90 0.95 1.00 1.05 2.5 5.0 7.5 10.0

firm class

model log wages (PAM)

  • 0.4

0.5 0.6 0.7 2.5 5.0 7.5 10.0

firm class

quantile log wages (PAM)

0.96 1.00 1.04 1.08 1.12 2.5 5.0 7.5 10.0

firm class

model log wages (NAM)

  • 0.50

0.55 0.60 0.65 0.70 2.5 5.0 7.5 10.0

firm class

quantile log wages (NAM)

Notes: Model based on Shimer and Smith (2000) with on-the-job search.

slide-47
SLIDE 47

Theoretical search-matching model: simulation results

dim %bw %wwbf %wwwf

Var(α) Var(α+ψ) Var(ψ) Var(α+ψ) 2Cov(α,ψ) Var(α+ψ)

Corr(α, ψ) PAM model 6 × 10 0.693 0.103 0.203 0.791 0.054 0.156 0.377 BLM 6 × 10 0.636 0.101 0.263 0.756 0.069 0.175 0.385 NAM model 6 × 10 0.661 0.136 0.203 1.082 0.125

  • 0.206
  • 0.281

BLM 6 × 10 0.625 0.114 0.262 1.049 0.099

  • 0.148
  • 0.23

PAM model 50 × 50 0.693 0.108 0.2 0.758 0.071 0.171 0.367 BLM 6 × 10 0.591 0.121 0.288 0.701 0.095 0.204 0.396 NAM model 50 × 50 0.685 0.115 0.201 1.079 0.107

  • 0.186
  • 0.273

BLM 6 × 10 0.668 0.044 0.288 1.009 0.041

  • 0.05
  • 0.122

Notes: Model based on Shimer and Smith (2000) with on-the-job search.

slide-48
SLIDE 48

Theoretical search-matching model: wage distributions

slide-49
SLIDE 49

Conclusion

  • We introduce a new framework for wages in matched data:
  • unrestricted interactions, short panels, robust to low mobility
  • compatible with many structural models (micro and macro)
  • the method is important for many applications: teachers value

added, sorting among cities, intergenerational mobility, ...

  • Important lessons for model of the labor market:
  • serial correlation for movers is large and first order
  • prelimanary results suggest large firm effects
  • endogenous mobility is empiricaly important (effect of ℓ′ on Y2

and of ℓ on Y3)

  • Clustering on distribution:
  • Important insight for structural estimation,
  • We are currently working on statistical properties when

heterogene- ity may not be grouped in the population, and clustering provides an approximation to the structure of heterogeneity

slide-50
SLIDE 50

Wages of job movers

Eℓ1ℓ2 (Yi2 − Yi1) (x-axis) vs Eℓ2ℓ1 (Yi1 − Yi2) (y-axis), ℓ1 < ℓ2

  • 0.0

0.1 0.2 0.3 0.0 0.1 0.2 0.3 0.4

E[y2 − y1|l1,l2] wage gain when moving from l1 to l2 E[y1 − y2|l2,l1] wage loss when moving from l2 to l1

N

  • 200

400 600

slide-51
SLIDE 51

Estimated standard deviations of log wages by worker type and firm class

0.0 0.2 0.4 2.5 5.0 7.5 10.0

firm cluster (ordered my mean wage) sd log wage factor(k)

1 2 3 4 5 6

Notes: The graph plots the standard deviation of Fkℓ. The L = 10 firm classes

slide-52
SLIDE 52

Cluster to cluster transitions for each type

posterior transitions

slide-53
SLIDE 53

Fit of log wage densities

1 2 3 4 5 6 7 8 9 10 1 2 3 1 2 3 0.0 0.5 1.0 1.5 1 2 3 1 2 3 0.0 0.5 1.0 1.5 2.0 1 2 3 1 2 3 0.0 0.5 1.0 1.5 2.0 2.5 1 2 3 4 5 6 7 8 9 8 9 1011 8 9 101112 8 10 12 8 10 12 14 8 9 10111213 8 10 12 148 10 12 8 10 12 148 9 101112138 10 12

lw density

Notes: Marginal densities of log wages for each x cell (in rows) and firm class (in columns). Sample 1, 2002. The red line is the model, the shaded area is from the data.

back ⊲

slide-54
SLIDE 54

Fit of log wage correlations

  • −1.0

−0.5 0.0 0.5 1.0 −1.0 −0.5 0.0 0.5 1.0

data model

N

  • 200

400 600

Notes: Log wage correlations Corr(Y1, Y2|ℓ1, ℓ2), for job movers, by pairs of firm classes. Sample 2. In the data (x-axis) and in the simulated data (y-axis).

back ⊲

slide-55
SLIDE 55

Theoretical search-matching model: setup

  • worker x and firm y, on-the-job search (λ0, λ1)

back ⊲

  • firm post vacancies
  • production function f (x, y) = a + (νx ρ + (1 − ν)yρ)1/ρ

Surplus equation is given by:

(r + δ)S(x, y) = (1 + r) (f (x, y) − δ(b(x) − c(y))) − r(1 − δ)(Π0(y) + W0(x)) + (1 − δ)λ1

  • P(x, y, y′)(αS(x, y′) − S(x, y))v1/2(y′)dy′.

Wage equation is given by:

(1 + r)w(x, y) = (r + δ)αS(x, y) + (1 − δ)rW0(x)− (1 − δ)λ1

  • P(x, y, y′)(αS(x, y′) − αS(x, y))v1/2(y′)dy′.
slide-56
SLIDE 56

Theoretical search-matching model: plots

Production PAM

x y

Surplus PAM

x y

Allocation PAM

x y

Production NAM

x y

Surplus NAM

x y

Allocation NAM

x y

Notes: Model based on Shimer and Smith (2000) with on-the-job search.

slide-57
SLIDE 57

Descriptive statistics and data selection

earnings from 2002 to 2003 earnings from 2001 to 2005 all

  • empl. either
  • empl. both
  • cont. firms
  • empl. either
  • empl. both
  • cont. firms

firms in 2002 54,753 53,610 46,597 43,884 53,159 43,670 40,987 firms in 2003 55,623 54,674 47,553 43,845 54,218 44,760 40,958 firms in 2004 56,374 54,867 46,450 43,887 54,427 43,605 40,986 workers in 2002 1,091,509 907,883 635,186 599,963 891,256 587,614 554,489 workers in 2003 1,082,028 910,454 635,135 598,834 893,392 587,573 553,681 workers in 2004 1,073,174 886,573 635,186 599,963 870,419 587,614 554,489 mean reported firm size in 2002 34.4 34.9 37.6 37.1 35.1 39.4 39 median reported firm size in 2002 10 10 10 10 10 11 11 movers between 2002 and 2004 142,580 121,090 54,968 19,745 117,389 50,629 17,504 % movers employed 12 months in 2003 0.755 0.868 0.952 0.968 0.873 0.953 0.969 co-movers 90 percentile 2 2 2 1 2 2 1 co-movers 99 percentile 13 15 22 7 15 22 7 co-movers 100 percentile 2,458 2,439 2,137 233 2,432 2,033 222 quaterly j2j probability 0.0196 0.0183 0.0149 0.00678 0.0181 0.0148 0.00652 quaterly e2u probability 0.0228 0.0133 0.00396 0.0127 0.00375 quaterly u2e probability 0.155 0.34 0.477 1 0.348 0.509 1

Description of the data in the different samples.

back ⊲

slide-58
SLIDE 58

Wages of job movers

  • 19.5

20.0 20.5 21.0 21.5 19.6 20.0 20.4 20.8 21.2

E[y1+y2|l1,l2] E[y1+y2|l2,l1]

N

  • 200

400 600

Eℓ1ℓ2 (Yi1 + Yi2) (x-axis) vs Eℓ2ℓ1 (Yi1 + Yi2) (y-axis), ℓ1 < ℓ2

back ⊲

slide-59
SLIDE 59

Variance of firm fixed-effects

slide-60
SLIDE 60

Approximate clustering

  • In a second paper we consider the case where

Yit = η(i, t) + ǫ(i, t) where bot η(i, t) and ǫ(i, t) are unobserverd.

  • Assuming that η(i, t) is low complexity d
  • meaning the number of ǫ−balls to cover η is ∝ ǫ−d
  • for example η(i, t) = φ(ξi, t) with xi d−dimensional
  • We show that the convergence rate of the clustering as

(K, N , T) → ∞ is: Op(log K T ) + Op(K N ) + Op(K − 2

d )

back ⊲

slide-61
SLIDE 61

stayers movers y′ in same firm m = 1 decision to move ℓ′ realization y′ in new firm Pr[y′|k, ℓ, y, Ω] 4 Pr[m|k, ℓ, y, Ω] Pr[ℓ′|k, ℓ, y, m = 1, Ω] Pr[y′|k, ℓ, y, ℓ′, Ω] 2-period

  • 3

(k, ℓ) (k, ℓ) (k, ℓ′) 4-period (k, ℓ, y) (k, ℓ, y) (k, ℓ, y) (k, ℓ′, ℓ, y) Shimer Smith (k, ℓ)

  • 2

(k) (k, ℓ′) Shimer Smith + OTJ 1 (k, ℓ) (k, ℓ) (k, ℓ) (k, ℓ′) Postel-Vinay Robin (k, ℓ, y) (k, ℓ) (k, ℓ) (k, ℓ, ℓ′) Lamadon, Lise, Meghir, Robin 5 (k, ℓ, y) (k, ℓ) (k, ℓ) (k, ℓ, ℓ′) Burdett Mortensen - wage posting 6 (k, ℓ) (k, ℓ) (k, ℓ) (k, ℓ) Burdett Coles - contract posting (k, ℓ, y) (k, ℓ, y) (k, ℓ, ℓ′) (k, ℓ′) Lise Robin 2014 (z, k, ℓ, y) (z, k, ℓ) (z, k, ℓ, ℓ′) (z, k, ℓ′)

4 Call Ω the information set that contains all the past. Each cell of the table shows what the subset of variable that are sufficient

for the probability.

1 We consider a model where bargaining is with value of unemployment, not a sequential acution model. 2 Mobility here is only by exogenous match separation 3 We don’t model stayers in this version 5 A model with sequential auction and sorting in equilibrium 6 Burdet Mortensen does not include heterogeneity directly, we refer to the wage posting mechanism. This also includes Shi and

Delacroix paper.

7 z is the aggregate state.

back ⊲

slide-62
SLIDE 62

Re - classification

step

Var(α) Var(α+ψ) Var(ψ) Var(α+ψ) 2Cov(α,ψ) Var(α+ψ)

Corr(α, ψ) cor(kj , ˜ kj )

Data (K = 10)

79.6% 4.3% 16.1% 0.436 1

Reclassifying the firms

1 74.2% 6.6% 19.1% 0.43 0.976 2 74.7% 7.3% 18% 0.383 0.966 3 76.5% 5.7% 17.8% 0.428 0.961 4 77.3% 5.4% 17.3% 0.424 0.957 5 77.3% 5.6% 17% 0.408 0.954 6 75.1% 6.6% 18.3% 0.412 0.951 7 74.6% 6.9% 18.5% 0.408 0.949 8 74.5% 6.7% 18.8% 0.421 0.945 9 75.4% 6.4% 18.1% 0.412 0.939 10 75.8% 6% 18.1% 0.424 0.934

slide-63
SLIDE 63

Estimation of ρ

rho 1|2 rho 4|3 0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00 10 20 30 40

value of rho value of the objective

slide-64
SLIDE 64

Mobility pattern

  • 0.40

0.45 0.50 0.55 0.60 2 4 6

worker type probability of moving to higher firm classes

slide-65
SLIDE 65

Worker type composition

drop out high school some college 0.00 0.25 0.50 0.75 1.00 1 2 3 4 5 6 7 8 1 2 3 4 5 6 7 8 1 2 3 4 5 6 7 8

firm cluster (ordered my mean wage) proportions

slide-66
SLIDE 66

Probability model

static model Pr[Y1, Y2, k′|k, ℓ] = Pr[Y2|Y1, k, k′, ℓ] · Pr[k′|Y1, k, ℓ] · Pr[Y1|k, ℓ] = Pr[Y2|k′, ℓ] · Pr[k′|k, ℓ] · Pr[Y1|k, ℓ] dynamic model Pr[Y3, Y4, k′|Y1, Y2, k, ℓ] = Pr[Y4|Y1, Y2, Y3, k′, k, ℓ] · Pr[Y3|Y1, Y2, k′, k, ℓ] · Pr[k′|Y1, Y2, k, ℓ] Pr[Y4|Y3, k′, k, ℓ] · Pr[Y3|Y2, k′, k, ℓ] · Pr[k′|Y2, k, ℓ]

slide-67
SLIDE 67

Table of content

Main Supplements lit review Endogenous mobility contribution Proportions without x Fit marginal wages distr framework Fit cov(y1, y2) mobility wages sd identification mini model Search model desc general case Search model plots data data selection firm classes sec-approx-clustering conclusion wages Approximate clustering proportions pk(ℓ, ℓ′) plot *** 2 period var decomposition 4 period var decomposition

slide-68
SLIDE 68

References

Abowd, J. M., F. Kramarz, and D. N. Margolis (1999): “High Wage Workers and High Wage Firms,”Econometrica, 67(2), 251–333. Burdett, K., and D. T. Mortensen (1998): “Wage differentials, employer size, and unemployment,”Int. Econ. Rev., pp. 257–273. Card, D., J. Heining, and P. Kline (2013): “Workplace Heterogeneity and the Rise of West German Wage Inequality*,”

  • Q. J. Econ., 128(3), 967–1015.

Eeckhout, J., and P. Kircher (2011): “Identifying sorting in theory,”

  • Rev. Econ. Stud., 78(3), 872–906.

Hagedorn, M., T. H. Law, and I. Manovskii (2014): “Identifying Equilibrium Models of Labor Market Sorting,”Working Paper. Postel-Vinay, F., and J.-M. Robin (2004): “To match or not to match?: Optimal wage policy with endogenous worker search intensity,”Rev. Econ. Dyn., 7(2), 297–330. Shimer, R., and L. Smith (2000): “Assortative Matching and Search,”Econometrica, 68(2), 343–369.