A Computationally Practical Simulation Estimation Algorithm for - - PDF document

a computationally practical simulation estimation
SMART_READER_LITE
LIVE PREVIEW

A Computationally Practical Simulation Estimation Algorithm for - - PDF document

A Computationally Practical Simulation Estimation Algorithm for Dynamic Panel Data Models with Unobserved Endogenous State Variables and Implications of Classification Error for the Dynamics of Female Labor Supply: A Comment on Hyslop (1999)


slide-1
SLIDE 1

A Computationally Practical Simulation Estimation Algorithm for Dynamic Panel Data Models with Unobserved Endogenous State Variables and Implications of Classification Error for the Dynamics of Female Labor Supply: A Comment on Hyslop (1999) Michael P. Keane University of Technology Sydney Arizona State University Robert M. Sauer University of Southampton

slide-2
SLIDE 2

Introduction  Missing endogenous state variables is a widespread problem in panel discrete choice models  Present when unobserved initial conditions and missing choices during sample period  Assess and implement a new SML algorithm that specifically addresses these issues  Main advantage: computationally easy  relies on unconditional simulation of data from a model  conditional simulation of choice probabilities when history not observed is difficult (e.g., GHK, MCMC, EM)

slide-3
SLIDE 3

Plan of Presentation  Specify Panel Data Probit Model  Discuss Models of Classification Error (CE)  Describe the SML Algorithm  Show Monte Carlo Results  Application to Female Labor Supply with PSID data  Implications of CE itself for endogeneity of fertility and nonlabor income

slide-4
SLIDE 4

Panel Probit Model uit  0  1xit  ∑

0 t−1

di  it dit  1 if uit ≥ 0 0 otherwise. it  i  it it  1i,t−1  it xit  2xi,t−1  it i  N0,

2 

it  N0,

2

it  N0,v

2

simulate data from model so algorithm can easily handle wider range of distributions

slide-5
SLIDE 5

Classification Error (CE)  Required to form likelihood in our approach  CE probabilistically "matches" simulated choice and reported choice  General model of CE we consider 10t  Pr dit

∗  0 | dit  1

01t  Pr dit

∗  1 | dit  0

00t  1 − 01t 11t  1 − 10t as in Poterba and Summers 1986,1995 and HAS 1998  HAS 1998 develops identification conditions (more later)  Only need tractable expression for jkt’s to form likelihood

slide-6
SLIDE 6

CE Model 1: Unbiased Classification Error  imposes unconditional prob report an

  • ption equals true prob

Prdit

∗  1  Prdit  1

generates tractable linear expressions for jkt’s 11t  Pr dit

∗  1 | dit  1

 E  1 − EPrdit  1 01t  Pr dit

∗  1 | dit  0

 1 − EPrdit  1 because Prdit

∗  1  11t Prdit  1  01t Prdit  0

11t  E  1 − EPrdit  1  Prdit

∗  1  Prdit  1

slide-7
SLIDE 7

CE Model 1: Unbiased Classification Error  In 11t  E  1 − EPrdit  1  E is estimable parameter where  low prob events have prob equal to E

  • f being classified correctly

 prob of correct classification increases linearly in E  Prdit  1 is easily simulated

slide-8
SLIDE 8

CE Model 2: Biased Classification Error  Do not impose Prdit

∗  1  Prdit  1

rather assume: lit  0  1dit  2dit−1

 it dit

∗ 

1 if lit ≥ 0 0 otherwise. it  logistic  Get tractable expressions for jkt’s: 11t  Pr dit

∗  1 | dit  1

 e012dit−1

1  e012dit−1

01t  Pr dit

∗  1 | dit  0

 e02dit−1

1  e02dit−1

 Note that easily incorporates dynamic (persistent) misreporting  dit−1

can be simulated from lit model if missing

slide-9
SLIDE 9

Identification  Again, rewrite Prdit

∗  1

 11t Prdit  1  01t Prdit  0  1 − 10tPrdit  1  01t1 − Prdit  1  01t  1 − 10t − 01tPrdit  1  need non-linear Prdit  1 and monotonicity 10t  01t  1  otherwise standard identification issues  SD effects from causal effect of lagged X’s  SD effects sensitive to modeling of serial correlation (RE or AR(1), etc.)  in biased CE model, lagged reported choice identified because not perfectly correlated with lagged X’s

slide-10
SLIDE 10

The SML Estimation Algorithm Data: Di

∗,xii1 N , Di ∗  dit ∗t1 T , xi  xitt1 T

  • 1. Draw M times from the it distribution to

form it

mt1 T i1 N m1 M

  • 2. Given xitt1

T i1 N and

it

mt1 T i1 N m1 M , construct

dit

mt1 T i1 N m1 M

according to model

  • 3. Construct conditional probs

 jkt

m t1 T m1 M

  • 4. Form a simulator likelihood contribution:

P Di

∗ | ,xi

1 M ∑ m1 M

t1 T

j0 1

k0 1 

jkt

m Idit m  j,dit ∗  k I dit

∗ observed

slide-11
SLIDE 11

Important Things to Note  any observed choice history has non-zero prob conditional on any simulated history  building likelihood off of unconditional simulations  state space updated according to simulated outcomes  completely circumvents problem of partially observed choice history  consistency and asymptotic normality require that

M N →  as N →  (as in

Pakes and Pollard 1989 and Lee 1992)  but still need to check small sample properties of estimator

slide-12
SLIDE 12

Some More Important Things to Note  if missing xit’s, can simulate them and modify likelihood to include density of xit’s  if initial conditions problem can simulate model from t  0,...,T with di0  xi0  0  or can imbed Heckman’s solution - specify marginal distribution for di

, then simulate

from t   ,...,T  or can imbed Wooldridge’s solution - random effect function of di

 and

covariates, simulate from t     1,...,T  for standard errors, or to use gradient based

  • ptimization, have smooth version of the

algorithm

slide-13
SLIDE 13

Importance Sampling (Smooth) Version of Algorithm  construct dit

m0t1 T

and Uit

m0t1 T

and hold fixed as vary   calculate importance sampling weights as vary  Wm 

g Ui1

m0,...,UiT m0 | ,xi

g Ui1

m0,...,UiT m0 | 0,xi

gUi

m0|,xi   t1 T 1   a  

a  Uit

m0 − 0 − 1xit − ∑ 0 t−1

di

m0

 likelihood contribution becomes P Di

∗ | ,xi

1 M ∑ m1 M

Wm

t  T

fmxitI xit observed

j0 1

k0 1 

jkt

m Idit m  j,dit ∗  k I dit

∗ observed

slide-14
SLIDE 14

Repeated Sampling Experiments  REAR(1) Polya Model with Exponential Decay: uit  0  1xit  ∑

0 t−1

di  it   e−t−−1 it  i  it it  1i,t−1  it it  N0,1 − 

2 1 − 1 2

xit  2xi,t−1  it,it  N0,v

2

 Sample Size: N  500, T  10  Replications: R  50  Simulated histories per person: M  1000  Also do experiments on Markov model

slide-15
SLIDE 15

Table 3 Repeated Sampling Experiments Random Effects Polya Model Unbiased Classification Error (Missing X’s, No Initial Conditions Problem) Parameter True Value Mean b β Median b β Std(c β) RMSE t-Stat 20% Missing Choices and X’s (t = 1, ..., 10) β0

  • .1000
  • .1051
  • .1023

.0436 .0439

  • .83

β1 1.0000 1.0167 1.0191 .0611 .0634 1.92 ρ 1.0000 1.0479 1.0446 .0444 .0653 7.63 α .5000 .4977 .5031 .0656 .0657

  • .24

φ2 .2500 .2520 .2505 .0176 .0177 .80 σν .5000 .5015 .5016 .0057 .0059 1.86 σμ .8000 .8056 .8017 .0287 .0292 1.38 E .7500 .7428 .7430 .0172 .0187

  • 2.95

40% Missing Choices and X’s (t = 1, ..., 10) β0

  • .1000
  • .1087
  • .1099

.0539 .0546

  • 1.15

β1 1.0000 1.0141 1.0233 .0678 .0692 1.48 ρ 1.0000 1.0458 1.0374 .0636 .0784 5.10 α .5000 .4953 .4949 .0600 .0602 .56 φ2 .2500 .2521 .2546 .0253 .0254 .59 σν .5000 .5012 .5012 .0069 .0070 1.21 σμ .8000 .8046 .8063 .0347 .0350 .94 E .7500 .7474 .7416 .0245 .0246

  • .74

60% Missing Choices and X’s (t = 1, ..., 10) β0

  • .1000
  • .0997
  • .1116

.0542 .0543 .05 β1 1.0000 1.034 1.0258 .0894 .0924 1.85 ρ 1.0000 1.0401 1.0512 .0682 .0791 4.15 α .5000 .4957 .4973 .0721 .0722

  • .42

φ2 .2500 .2507 .2498 .0372 .0373 .13 σν .5000 .5011 .5017 .0089 .0090 .88 σμ .8000 .8096 .8044 .0421 .0432 1.61 E .7500 .7493 .7440 .0288 .0288

  • .16

Note: The number of replications in each experiment is 50 and the number of individuals in the sample is 500. Std(c β) and RMSE refer to the sample standard deviation and the root mean square error, respectively, of the estimated parameters. The t-statistics are calculated as √ 50 µ

Mean β−β Std( β)

¶ . The model is the same as in Table 1. 1

slide-16
SLIDE 16

Table 4 Repeated Sampling Experiments Random Effects Polya Model Unbiased Classification Error (No Missing Choices or X’s, Initial Conditions Problem) Parameter True Value Mean b β Median b β Std(c β) RMSE t-Stat Simulate from start of process with di0 = 0 (t = 11,..., 20) β0

  • .1000
  • .1001
  • .1022

.0295 .0295

  • .02

β1 1.0000 1.0286 1.0337 .0454 .0537 4.46 ρ 1.0000 1.0298 1.0253 .0324 .0440 6.51 α .5000 .5044 .5004 .0320 .0323 .98 φ2 .2500 .2501 .2526 .0135 .0135 .05 σν .5000 .5015 .5025 .0042 .4985 2.56 σμ .8000 .8130 .8145 .0245 .0277 3.74 E .7500 .7450 .7410 .0193 .0199

  • 1.82

Assume process starts with di,10 = 0 (t = 11, ..., 20) β0

  • .1000

.9367 .9513 .0543 1.0381 135.05 β1 1.0000 .2966 .2844 .0938 .7096

  • 53.01

ρ 1.0000 .9543 .9333 .3278 .3310

  • .99

α .5000 .4187 .3995 .2957 .3067

  • 1.94

σμ .8000 .9905 .9923 .0090 .1907 149.11 E .7500 .7144 .7125 .0230 .0424

  • 10.96

Use reported data from t = 11, ..., 20 to proxy for initial condition at t = 21 (t = 11, ..., 30) β0

  • .1000
  • .5239
  • .4859

.3039 .5216

  • 9.86

β1 1.0000 .4742 .4671 .1788 .5553

  • 20.80

ρ 1.0000 1.0522 1.1064 .3076 .3120 1.20 α .5000 .5839 .6139 .2299 .2448 2.58 σμ .8000 .9388 .9758 .0811 .1608 12.10 E .7500 .5795 .5714 .0615 .1812

  • 19.61

Note: The number of replications in each experiment is 50 and the number of individuals in the sample is 500. Std(c β) and RMSE refer to the sample standard deviation and the root mean square error, respectively, of the estimated parameters. The t-statistics are calculated as √ 50 µ

Mean β−β Std( β)

¶ . The model is the same as in Table 1. 2

slide-17
SLIDE 17

Table 14 Repeated Sampling Experiments Random Effects Polya Model Biased Classification Error Smooth Algorithm (20% Missing Choices and X’s, No Initial Conditions Problem) Parameter True Value Mean b β Median b β Std(c β) RMSE t-Stat Low Classification Error Bias (t = 1, ..., 10) β0

  • .1000
  • .0795
  • .0686

.0685 .0714 2.12 β1 1.0000 1.0265 1.0330 .0833 .0874 2.25 ρ 1.0000 .9466 .9374 .1410 .1508

  • 2.68

α .5000 .4409 .4360 .1038 .1195

  • 4.02

φ2 .2500 .2480 .2472 .0153 .0155

  • .91

σν .5000 .5019 .5027 .0048 .0052 2.76 σμ .8000 .8211 .8225 .0321 .0384 4.65 γ0

  • 3.5000
  • 3.3313
  • 3.2996

.2606 .3104 4.58 γ1 5.0000 4.7243 4.7334 .3014 .4084

  • 6.47

γ2 2.0000 2.1031 2.0794 .2372 .3185 3.07 Note: The number of replications is 50 and the number of individuals in the sample is 500. Std(c β) and RMSE refer to the sample standard deviation and the root mean square error, respectively,

  • f the estimated parameters. The t-statistics are calculated as

√ 50 µ

Mean β−β Std( β)

¶ . The model is the same as in Table 1. 3

slide-18
SLIDE 18

Table 17 Repeated Sampling Experiments Polya Model with Random Effects and AR (1) Errors Biased Classification Error Smooth Algorithm (20% Missing Choices and X’s, No Initial Conditions Problem) Parameter True Value Mean b β Median b β Std(c β) RMSE t-Stat Low Classification Error Bias (t = 1, ..., 10) β0

  • .1000
  • .0823
  • .0824

.0513 .0543 2.44 β1 1.0000 1.0215 1.0082 .0907 .0932 1.67 ρ 1.0000 .9782 .9948 .1459 .1475

  • 1.06

α .5000 .4709 .4931 .1092 .1130

  • 1.89

φ2 .2500 .2477 .2487 .0154 .0155

  • 1.04

σν .5000 .5020 .5028 .0048 .0052 2.89 σμ .8000 .8267 .8280 .0372 .0458 5.07 φ1 .4000 .3892 .4114 .1223 .1228

  • .62

γ0

  • 3.5000
  • 3.3261
  • 3.2815

.2645 .3165 4.65 γ1 5.0000 4.7020 4.7290 .3270 .4424

  • 6.44

γ2 2.0000 2.1233 2.1126 .2316 .3495 3.76 Note: The number of replications in each experiment is 50 and the number of individuals in the sample is 500. Std(c β) and RMSE refer to the sample standard deviation and the root mean square error, respectively, of the estimated parameters. The t-statistics are calculated as √ 50 µ

Mean β−β Std( β)

¶ . The model is: uit = β0 + β1xit +

t−1

X

τ=0

diτρτ + εit di0 = 0, ρτ = ρe−α(t−τ−1) xit = φ2xi,t−1 + νit, νit ∼ N ¡0, σ2

ν

¢ εit = μi + ξit ξit = φ1ξit−1 + ηit, ηit ∼ N(0, (1 − σ2

μ)(1 − φ2 1))

4

slide-19
SLIDE 19

Empirical Application  Estimate Polya and Markov models of female labor force participation using PSID data 1994-2003  Missing data problem because PSID went biannual after 1997 (missing choices and X’s in 1998, 2000 and 2002)  Embed AR(1) process for nonlabor income  Simulate choices from age 16 (theoretical start of process)  Test for endogeneity of fertility and nonlabor income in female labor supply behavior (as in Chamberlain 1984, Jakubson 1988 and Hyslop 1999)

slide-20
SLIDE 20

Table 18 Sample Characteristics PSID Calendar Years 1994-2003 Missing Years 1998, 2000, and 2002 (N=1310) Mean

  • Std. Dev.

(1) (2) Participation .816 .291 (avg. over 7 years) (.008) Husband’s Annual Earnings 46.40 41.18 (avg. over 7 years) (11.38) ($1000 1994)

  • No. Children aged 0-2 years

.135 .231 (avg. over 10 years) (.006)

  • No. Children aged 3-5 years

.181 .254 (avg. over 10 years) (.007)

  • No. Children aged 6-17 years

.937 .864 (avg. over 10 years) (.024) Age 36.93 8.00 (1994) (.221) Education 13.56 2.10 (maximum over 10 years) (.06) Race .198 .398 (1=Black) (.011)

Note: Means and standard errors (in parentheses) for 1310 continuously married women in the PSID between 1994 and 2003, aged 18-60 in 1994, with positive annual earnings and hours worked each non- missing year for both partners in the married couple. Earnings are in thousands of 1994 dollars. Variable definitions and sample selection criteria are the same as those chosen by Hyslop (1999) for PSID calendar years 1980-1986.

1

slide-21
SLIDE 21

Table 20 Female Labor Force Participation Decisions PSID Calendar Years 1994-2003 Missing Years 1998, 2000, and 2002 Polya Model with Random Effects and AR(1) errors Biased Classification Error Smooth Algorithm Correlated Correlated Random Effects Random Effects Random Effects Random Effects + AR(1) Errors + AR(1) Errors (1) (2) (3) (4) ln(yit)

  • .3089 (.0001)
  • .3111 (.0001)
  • .3066 (.0001)
  • .3040 (.0001)

#kids0-2t

  • .5964 (.0030)
  • .6000 (.0028)
  • .6495 (.0004)
  • .6339 (.0004)

#kids3-5t

  • .3648 (.0006)
  • .3565 (.0007)
  • .3325 (.0003)
  • .3466 (.0003)

#kids6-17t

  • .0145 (.0011)
  • .0123 (.0019)
  • .0211 (.0003)
  • .0225 (.0011)

aget/10 .7527 (.0031) .5387 (.0040) .7081 (.0001) .7263 (.0001) age2

t/100

  • .1310 (.0000)
  • .1074 (.0001)
  • .1262 (.0000)
  • .1280 (.0000)

racei .3083 (.0006) .2272 (.0005) .2945 (.0002) .2684 (.0002) educationi .0652 (.0000) .0558 (.0000) .0630 (.0000) .0611 (.0000) ρ .6363 (.0005) .7281 (.0008) .6758 (.0003) .6979 (.0003) α 1.8924 (.0020) 1.9502 (.0019) 2.1278 (.0020) 2.1457 (.0011) φ2 .9994 (.0002) .9994 (.0002) .9994 (.0001) .9994 (.0001) σν .2743 (.0003) .2736 (.0004) .2742 (.0002) .2736 (.0002) σμ .8949 (.0013) .8970 (.0013) .8952 (.0003) .8960 (.0003) γ0

  • 1.1203 (.0482)
  • .8962 (.0457)
  • .9940 (.0457)
  • .9404 (.0463)

γ1 3.8880 (.0761) 3.6738 (.0769) 3.6809 (.0760) 3.7190 (.0781) γ2 1.6520 (.0985) 1.5320 (.0977) 1.5658 (.0974) 1.6096 (.0987) φ1

  • .4606 (.0004)

.4596 (.0005) Log-Likelihood

  • 12568.10
  • 12544.89
  • 12561.69
  • 12531.88

χ2 (H0: δ = 0)

  • 46.42 (.0158)
  • 59.62 (.0005)

χ2 (Pearson GOF) 54.62 (.2075) 51.90 (.2887) 53.32 (.2442) 51.02 (.3186) N 1310 1310 1310 1310

Note: The model is:

uit = β0 + β1 ln(yit) + β0

2Xit + t−1

X

τ=0

diτρτ + εit di0 = 0, ρτ = ρe−α(t−τ−1) ln(yit) = φ2 ln(yi,t−1) + νit, νit ∼ N ¡ 0, σ2

ν

¢ εit = μi + ξit ξit = φ1ξit−1 + ηit, ηit ∼ N(0, (1 − σ2

μ)(1 − φ2 1))

lit = γ0 + γ1dit + γ2d∗

it−1 + ωit

μi =

T

P

t=1

δ0

tWit + σμζi, ζi ∼ N(0, 1)

yit is the husband’s annual earnings in year t. Xit contains year effects in addition to the fertility, race

and education covariates that appear explicitly in the table. Wit contains ln (yit) and the three fertility

  • variables. Standard errors are in parentheses (p-values for the LRT and Pearson GOF chi-square statistics).

2

slide-22
SLIDE 22

Comment on Hyslop 1999  Hyslop (1999) did not reject exogeneity of fertility and nonlabor income in CREAR(1) Markov panel probits  He used GHK with no adjustment for CE  We re-estimate his models with our algorithm and do reject exogeneity  Adjusting for CE increases the importance

  • f persistence due to permanent unobserved

heterogeneity  This leads to easier detection of relationship between individual effect and and covariates in CRE model

slide-23
SLIDE 23

Table 3 Correlated Random Effects Probit Models of Participation (SML Estimates) Hyslop Keane and Sauer No CE No CE No Persistent CE Persistent CE (1) (2) (3) (4)

ymp

  • .341
  • .336
  • .400
  • .375

(.05) (.05) (.04) (.04)

ymt

  • .099
  • .103
  • .127
  • .172

(.03) (.03) (.02) (.03)

#Kids0-2t

  • .300
  • .305
  • .290
  • .388

(.03) (.03) (.04) (.05)

#Kids3-5t

  • .247
  • .245
  • .265
  • .271

(.03) (.03) (.03) (.04)

#Kids6-17t

  • .084
  • .083
  • .090
  • .087

(.03) (.03) (.02) (.03)

V ar (ηi)

.804 .829 .938 .943 (.02) (.04) (.07) (.10)

γ0

  • 2.427
  • 2.386

(.09) (.11)

γ1

  • 6.996

5.056 (.21) (.19)

γ2

  • 2.611

(.11)

Log-Likelihood

  • 4888.38
  • 4887.75
  • 4878.27
  • 4672.62

N

1812 1812 1812 1812

δ#Kids0−2= 0

32.36(.00)∗∗ 35.31(.00)∗∗ 52.14(.00)∗∗ 57.34(.00)∗∗

δ#Kids3−5= 0

12.77(.12) 13.02(.11) 49.04(.00)∗∗ 61.04(.00)∗∗

δ#Kids6−17= 0

21.74(.01)∗∗ 23.01(.00)∗∗ 49.50(.00)∗∗ 61.19(.00)∗∗

δymt= 0

48.50(.00)∗∗ 48.71(.00)∗∗ 50.08(.00)∗∗ 62.60(.00)∗∗ Note: All specifications include number of children aged 0-2 years lagged one year, race, maximum years

  • f education over the sample period, a quadratic in age, and unrestricted year effects. Non-labor income is

measured by ymp and ymt which denote husband’s permanent (sample average) and transitory (deviations from sample average) annual earnings, respectively.

V ar(ηi) is the variance of permanent unobserved

heterogeneity and the γ’s are the classification error parameters. ∗ indicates significance at the 1% level and

∗∗ indicates significance at the 5% level.

1

slide-24
SLIDE 24

Table 4 Correlated Random Effects Probit Models of Participation with AR(1) Errors (SML Estimates) Hyslop Keane and Sauer No CE No CE No Persistent CE Persistent CE (1) (2) (3) (4)

ymp

  • .332
  • .327
  • .345
  • .345

(.05) (.04) (.00) (.00)

ymt

  • .097
  • .108
  • .112
  • .085

(.03) (.03) (.01) (.01)

#Kids0-2t

  • .272
  • .251
  • .306
  • .307

(.03) (.03) (.02) (.02)

#Kids3-5t

  • .234
  • .219
  • .265
  • .269

(.03) (.02) (.01) (.01)

#Kids6-17t

  • .077
  • .083
  • .079

.083 (.02) (.02) (.01) (.01)

V ar (ηi)

.546 .582 .830 .831 (.04) (.03) (.03) (.04)

ρ

.696 .710 .746 .748 (.04) (.05) (.00) (.00)

γ0

  • 2.650
  • 2.675

(.12) (.13)

γ1

  • 7.909

6.837 (.35) (.85)

γ2

  • 1.576

(.19)

Log-Likelihood

  • 4663.71
  • 4662.55
  • 4646.65
  • 4633.67

N

1812 1812 1812 1812

δ#Kids0−2= 0

9.65(.29) 10.27(.25) 36.05(.00)∗∗ 37.31(.00)∗∗

δ#Kids3−5= 0

9.37(.31) 10.39(.24) 43.80(.00)∗∗ 35.17(.00)∗∗

δ#Kids6−17= 0

8.04(.43) 9.44(.31) 52.44(.00)∗∗ 34.53(.00)∗∗

δymt= 0

8.22(.22) 8.91(.18) 53.84(.00)∗∗ 40.45(.00)∗∗ Note: All specifications include number of children aged 0-2 years lagged one year, race, maximum years

  • f education over the sample period, a quadratic in age, and unrestricted year effects. Non-labor income is

measured by ymp and ymt which denote husband’s permanent (sample average) and transitory (deviations from sample average) annual earnings, respectively.

V ar(ηi) is the variance of permanent unobserved

heterogeneity and the γ’s are the classification error parameters. ρ is the AR(1) serial correlation coefficient.

∗ indicates significance at the 1% level and ∗∗ indicates significance at the 5% level.

2

slide-25
SLIDE 25

Table 5 Correlated Random Effects Probit Models of Participation with AR(1) Errors and First-Order State Dependence (SML Estimates) Hyslop Keane and Sauer No CE No CE No Persistent CE Persistent CE (1) (2) (3) (4)

ymp

  • .285
  • .291
  • .362
  • .451

(.05) (.05) (.01) (.01)

ymt

  • .140
  • .137
  • .134
  • .186

(.04) (.05) (.03) (.03)

#Kids0-2t

  • .252
  • .254
  • .322
  • .420

(.05) (.05) (.05) (.05)

#Kids3-5t

  • .135
  • .131
  • .158
  • .171

(.05) (.04) (.03) (.03)

#Kids6-17t

  • .054
  • .053
  • .072
  • .110

(.04) (.04) (.02) (.03)

V ar (ηi)

.485 .519 .781 .787 (.04) (.06) (.09) (.11)

ρ

  • .213
  • .141

.619 .649 (.04) (.03) (.03) (.03)

ht−1

1.042 1.031 .733 .726 (.09) (.07) (.03) (.04)

Corr (ui0,uit)

.494 .561 .835 .853 (.03) (.09) (.18) (.21)

γ0

  • 2.684
  • 2.252

(.09) (.08)

γ1

  • 6.842

5.427 (.14) (.21)

γ2

  • 1.335

(.17)

Log-Likelihood

  • 4643.52
  • 4641.62
  • 4609.70
  • 4583.94

N

1812 1812 1812 1812

δ#Kids0−2= 0

3.39(.91) 6.02(.65) 39.80(.00)∗∗ 36.91(.00)∗∗

δ#Kids3−5= 0

3.84(.87) 6.78(.56) 35.90(.00)∗∗ 32.25(.00)∗∗

δ#Kids6−17= 0

3.34(.91) 6.89(.55) 32.97(.00)∗∗ 31.19(.00)∗∗

δymt= 0

2.92(.82) 5.92(.43) 47.70(.00)∗∗ 38.20(.00)∗∗ Note: All specifications include number of children aged 0-2 years lagged one year, race, maximum years

  • f education over the sample period, a quadratic in age, and unrestricted year effects. Non-labor income is

measured by ymp and ymt which denote husband’s permanent (sample average) and transitory (deviations from sample average) annual earnings, respectively.

V ar(ηi) is the variance of permanent unobserved

heterogeneity and the γ’s are the classification error parameters. ρ is the AR(1) serial correlation coefficient and ht−1 is lagged participation status. Corr(ui0,uit) is the error correlation relevant for the Heckman approximate solution to the initial conditions problem.

∗ indicates significance at the 1% level and ∗∗

indicates significance at the 5% level.

3

slide-26
SLIDE 26

Concluding Remarks  New SML algorithm may have a significant computational advantage over GHK, MCMC and EM in certain missing data contexts  Algorithm requires assumption of CE but we believe this is reasonable in almost all empirical applications in economics  Estimator has good small sample properties  It produces sensible results in empirical applications of female labour supply behaviour  Estimator is very easy to implement