Model: the Dutch Labour Force Survey Oksana Bollineni-Balabay, Jan - - PowerPoint PPT Presentation

β–Ά
model the dutch labour force survey
SMART_READER_LITE
LIVE PREVIEW

Model: the Dutch Labour Force Survey Oksana Bollineni-Balabay, Jan - - PowerPoint PPT Presentation

Accounting for Hyperparameter Uncertainty in SAE Based on a State-Space Model: the Dutch Labour Force Survey Oksana Bollineni-Balabay, Jan van den Brakel, Franz Palm The Dutch LFS monthly estimates for the total numbers of the unemployed


slide-1
SLIDE 1

Oksana Bollineni-Balabay, Jan van den Brakel, Franz Palm Accounting for Hyperparameter Uncertainty in SAE Based on a State-Space Model: the Dutch Labour Force Survey

slide-2
SLIDE 2

The Dutch LFS

  • monthly estimates for the total numbers of the

unemployed labour force;

  • five-wave rotating panel survey (from Oct 1999);
  • GREG estimator;
  • 1st wave net sample size β‰ˆ 6500 persons;
  • a structural time series model in production since 2010(6)
  • time span covered in this MSE study: 2001(1)-2010(6)

Oksana Bollineni-Balabay, Jan van den Brakel, Franz Palm

slide-3
SLIDE 3

Numbers of unemployed in NL: design- and model-based estimates

SE reduction: 24%

slide-4
SLIDE 4

The DLFS model

Vector 𝒁𝒖 with GREG estimates for the 5 waves: 𝒁𝒖 = 𝑍

𝑒 𝐽

𝑍

𝑒 𝐽𝐽

𝑍

𝑒 𝐽𝐽𝐽

𝑍

𝑒 π½π‘Š

𝑍

𝑒 π‘Š

= 1 1 1 1 1 ξ𝑒 + 𝑆𝐻𝐢𝑒

𝐽𝐽

𝑆𝐻𝐢𝑒

𝐽𝐽𝐽

𝑆𝐻𝐢𝑒

π½π‘Š

𝑆𝐻𝐢𝑒

π‘Š

+ 𝑓𝑒

𝐽

𝑓𝑒

𝐽𝐽

𝑓𝑒

𝐽𝐽𝐽

𝑓𝑒

π½π‘Š

𝑓𝑒

π‘Š

Oksana Bollineni-Balabay, Jan van den Brakel, Franz Palm

true population parameter: ξ𝑒 = 𝑀𝑒 + 𝑇𝑒 survey errors rotation group bias

slide-5
SLIDE 5

Stochastic components of the model

𝑀𝑒 - a stochastic trend with disturbances πœƒπ‘’~𝑂 0, πœπ‘€

2 ;

𝑇𝑒 - a trigonometric seasonal component with disturbances πœ•π‘’~𝑂 0, πœπ‘‡

2 ;

𝑆𝐻𝐢𝑒

π½π½βˆ’π‘Š - random walk with disturbances 𝜘~𝑂 0, πœπ‘†π»πΆ 2

; 𝑓𝑒

𝐽 = ν𝑒 𝐽;

𝑓𝑒

𝐽𝐽 = Οπ‘“π‘’βˆ’3 𝐽

+ ν𝑒

𝐽𝐽, etc.

Hyperparameter vector: 𝜾 = (πœπ‘€

2, πœπ‘‡ 2, πœπ‘†π»πΆ 2

, πœπœ‰π½

2 , πœπœ‰π½π½ 2 , πœπœ‰π½π½π½ 2 , πœπœ‰π½π‘Š 2 , πœπœ‰π‘Š 2 , ρ)

Oksana Bollineni-Balabay, Jan van den Brakel, Franz Palm

survey errors with πœ‰π‘’

π‘₯ ~𝑂 0, πœπœ‰π‘₯ 2

, w={1, … 5}; waves II-V as AR(1) not known, estimated

slide-6
SLIDE 6

STS Model Estimation

  • the Kalman filter extracts signals (trends…) 𝛽

𝑒|𝑒(𝜾);

  • MSE of 𝛽

𝑒|𝑒(𝜾) at time t: 𝑁𝑇𝐹𝑒|𝑒 = 𝐹𝑒[𝛽 𝑒|𝑒 𝜾 βˆ’π›½π‘’]2;

  • the true MSE that accounts for uncertainty around 𝜾

: 𝑁𝑇𝐹𝑒|𝑒 = 𝐹𝑒[𝛽 𝑒|𝑒 𝜾 βˆ’π›½π‘’]2+𝐹𝑒[𝛽 𝑒|𝑒(𝜾 )βˆ’π›½ 𝑒|𝑒 𝜾 ]2;

Oksana Bollineni-Balabay, Jan van den Brakel, Franz Palm

but 𝜾 used instead of 𝜾  𝑁𝑇𝐹𝑒|𝑒 is no longer the true MSE!

filter uncertainty hyperparameter uncertainty filter uncertainty

slide-7
SLIDE 7

Methods to Account for Hyperparameter Uncertainty

  • AA - asymptotic approximation (Hamilton (1986));

bootstraps:

  • PT1 – Pfeffermann-Tiller, parametric;
  • PT2 - Pfeffermann-Tiller, non-parametric;
  • RR1 – Rodriguez-Ruiz, parametric;
  • RR2 - Rodriguez-Ruiz, non-parametric;
  • PT: 𝐹𝑒 taken unconditionally on the data;
  • RR: 𝐹𝑒 taken conditionally on the original data set; claimed to

have better finite sample properties than PT.

Oksana Bollineni-Balabay, Jan van den Brakel, Franz Palm

(Pfeffermann and Tiller (2005)) Rodriguez and Ruiz (2012)

slide-8
SLIDE 8

Monte-Carlo Study of MSE Approximation Approaches

  • S=1000 series generated from the DLFS model;
  • B=500 draws per series s made for AA;
  • B=300 bootstrap series generated per series s for

PT1, PT2, RR1, RR2;

  • true MSE obtained as:

𝑁𝑇𝐹𝑒

π‘ˆπ‘†π‘‰πΉ = 1 50000

[𝛽 𝑛,𝑒 𝜾 βˆ’ 𝛽𝑛,𝑒]2

𝑛=50000

;

  • sample lengths: T=80, T=114, T=200 months;
  • 4 versions of the DLFS model considered:

Oksana Bollineni-Balabay, Jan van den Brakel, Franz Palm

Model 1 Model 2 Model 3 Model 4 Original model πœπ‘‡

2=0

πœπ‘†π»πΆ

2

=0 πœπ‘‡

2=πœπ‘†π»πΆ 2

=0

slide-9
SLIDE 9

Hyperparameter distribution under the DLFS model (Model 1)

ln(πœπ‘€

2)

ln(πœπ‘‡

2)

ln(πœπ‘†π»πΆ

2

) ln(πœπœ‰π½

2 )

ln(πœπœ‰π½π½

2 )

ln(πœπœ‰π½π½π½

2 )

ln(πœπœ‰π½π‘Š

2 )

ln(πœπœ‰π‘Š

2 )

slide-10
SLIDE 10

Hyperparameter distribution under Model 3

ln(πœπ‘€

2)

ln(πœπ‘‡

2)

ln(πœπœ‰π½

2 )

ln(πœπœ‰π½π½

2 )

ln(πœπœ‰π½π½π½

2 )

ln(πœπœ‰π½π‘Š

2 )

ln(πœπœ‰π‘Š

2 )

slide-11
SLIDE 11

Signal MSE comparison for Model 3, T=114 months

Naive KF bias Naive KF bias

slide-12
SLIDE 12

Signal MSE relative bias, %, averaged

  • ver time T and simulations S

T=80 T=114 T=200

Models M1 M2 πœπ‘‡

2

M3

πœπ‘†π»πΆ

2

M4

πœπ‘‡

2

πœπ‘†π»πΆ

2

M1 M2

πœπ‘‡

2

M3

πœπ‘†π»πΆ

2

M4

πœπ‘‡

2

πœπ‘†π»πΆ

2

M1 M2

πœπ‘‡

2

M3

πœπ‘†π»πΆ

2

M4

πœπ‘‡

2

πœπ‘†π»πΆ

2

KF

  • 3.0
  • 3.2
  • 2.1
  • 2.2
  • 2.1 -2.6 -2.4 -2.2 -1.3 -1.6 -1.3 -1.3

AA

NA NA NA 14.9 NA NA NA 5.2 NA NA NA 5.9

PT1

8.6 6.7 4.9 6.2 8.1 5.7 3.3 5.5 6.3 6.2 6.3 5.5

PT2

4.8 3.7 1.4 2.1 2.2 3.2 1.9 1.5 6.8 4.0 3.0 2.3

RR1

  • 7.2
  • 9.0
  • 7.3
  • 7.2
  • 8.3 -7.8 -6.4 -6.5 -8.0 -8.0 -4.9 -5.9

RR2

6.7

  • 3.5
  • 3.9
  • 3.7
  • 1.1 -6.0 -3.9 -3.5 -5.1 -5.6 -4.5 -5.0

Oksana Bollineni-Balabay, Jan van den Brakel, Franz Palm

slide-13
SLIDE 13

Signal MSE relative bias, %, averaged

  • ver time T and simulations S

T=80 T=114 T=200

Models M1 M2 πœπ‘‡

2

M3

πœπ‘†π»πΆ

2

M4

πœπ‘‡

2

πœπ‘†π»πΆ

2

M1 M2

πœπ‘‡

2

M3

πœπ‘†π»πΆ

2

M4

πœπ‘‡

2

πœπ‘†π»πΆ

2

M1 M2

πœπ‘‡

2

M3

πœπ‘†π»πΆ

2

M4

πœπ‘‡

2

πœπ‘†π»πΆ

2

KF

  • 3.0
  • 3.2
  • 2.1
  • 2.2
  • 2.1 -2.6 -2.4 -2.2 -1.3 -1.6 -1.3 -1.3

AA

NA NA NA 14.9 NA NA NA 5.2 NA NA NA 5.9

PT1

8.6 6.7 4.9 6.2 8.1 5.7 3.3 5.5 6.3 6.2 6.3 5.5

PT2

4.8 3.7 1.4 2.1 2.2 3.2 1.9 1.5 6.8 4.0 3.0 2.3

RR1

  • 7.2
  • 9.0
  • 7.3
  • 7.2
  • 8.3 -7.8 -6.4 -6.5 -8.0 -8.0 -4.9 -5.9

RR2

6.7

  • 3.5
  • 3.9
  • 3.7
  • 1.1 -6.0 -3.9 -3.5 -5.1 -5.6 -4.5 -5.0

Oksana Bollineni-Balabay, Jan van den Brakel, Franz Palm

slide-14
SLIDE 14

Signal MSE relative bias, %, averaged

  • ver time T and simulations S

T=80 T=114 T=200

Models M1 M2 πœπ‘‡

2

M3

πœπ‘†π»πΆ

2

M4

πœπ‘‡

2

πœπ‘†π»πΆ

2

M1 M2

πœπ‘‡

2

M3

πœπ‘†π»πΆ

2

M4

πœπ‘‡

2

πœπ‘†π»πΆ

2

M1 M2

πœπ‘‡

2

M3

πœπ‘†π»πΆ

2

M4

πœπ‘‡

2

πœπ‘†π»πΆ

2

KF

  • 3.0
  • 3.2
  • 2.1
  • 2.2
  • 2.1 -2.6 -2.4 -2.2 -1.3 -1.6 -1.3 -1.3

AA

NA NA NA 14.9 NA NA NA 5.2 NA NA NA 5.9

PT1

8.6 6.7 4.9 6.2 8.1 5.7 3.3 5.5 6.3 6.2 6.3 5.5

PT2

4.8 3.7 1.4 2.1 2.2 3.2 1.9 1.5 6.8 4.0 3.0 2.3

RR1

  • 7.2
  • 9.0
  • 7.3
  • 7.2
  • 8.3 -7.8 -6.4 -6.5 -8.0 -8.0 -4.9 -5.9

RR2

6.7

  • 3.5
  • 3.9
  • 3.7
  • 1.1 -6.0 -3.9 -3.5 -5.1 -5.6 -4.5 -5.0

Oksana Bollineni-Balabay, Jan van den Brakel, Franz Palm

slide-15
SLIDE 15

Conclusions

– the naive KF MSE does not have huge biases in the DLFS model ; – MSE biases become smaller with the series length; – AA may fail in models with small hyperparameters; – non-parametric bootstraps overperform the parametric

  • nes;

– RR perform consistently worse than PT-bootstraps, with negative biases larger than those of the naive Kalman filter.

Oksana Bollineni-Balabay, Jan van den Brakel, Franz Palm