Implementing Quantile Selection Models in Stata Mariel Siravegna - - PowerPoint PPT Presentation

implementing quantile selection models in stata
SMART_READER_LITE
LIVE PREVIEW

Implementing Quantile Selection Models in Stata Mariel Siravegna - - PowerPoint PPT Presentation

Implementing Quantile Selection Models in Stata Mariel Siravegna Ercio Munoz Georgetown University The Graduate Center, CUNY July 30, 2020 1 / 20 Non-random sample selection is a major issue in empirical work - A simple sample selection


slide-1
SLIDE 1

Implementing Quantile Selection Models in Stata

Mariel Siravegna Ercio Munoz Georgetown University The Graduate Center, CUNY

July 30, 2020

1 / 20

slide-2
SLIDE 2

Non-random sample selection is a major issue in empirical work

  • A simple sample selection model can be written as the latent model

Y ∗ = X ′β + µ but Y ∗ is only observed if S=1 S = 1(Z ′γ + ν ≥ 0)

  • Since the seminal work of Heckman (1979), much progress has been made in methods

that extend the original model or relax some of its assumptions

  • And recently Arellano and Bonhomme (2017) proposed a copula-based method to

correct for sample selection in quantile regression

2 / 20

slide-3
SLIDE 3

Two Recent Applications

3 / 20

slide-4
SLIDE 4

Maasoumi and Wang (JPE 2019)

  • In this paper the authors use the CPS between 1976-2013 to see how the gender

wage gap vary across the wage distribution

  • They assess how selective participation of individuals in the labor market affects the

gender gap

4 / 20

slide-5
SLIDE 5

Comparison of Female and Male Wage CDF

(Without correction) (Correcting for Selection)

5 / 20

slide-6
SLIDE 6

Bollinger et al. (JPE 2019)

  • Survey earnings response is not random
  • In this paper the authors match the survey earnings responses to administrative

records to see how response vary across the earnings distribution

  • They find that non-response rate follows an U shape across earnings and this

produces an underestimation of inequality, which can be corrected using this copula-based approach

6 / 20

slide-7
SLIDE 7

Bollinger et al. (JPE 2019)

7 / 20

slide-8
SLIDE 8

Estimation

8 / 20

slide-9
SLIDE 9

Three-step Algorithm of Arellano and Bohnomme (2017)

Given an i.i.d sample (Yi, Zi, Si), i = 1, ..., N where Zi = (Xi, Wi) and assuming that quantile functions are linear: q(τ, x) = x′βτ, for all τ ∈ (0, 1) and x ∈ X (3) the algorithm is as follows:

  • 1. Estimation of the propensity score p(z)
  • 2. Estimation of the dependence parameter or degree of selection ρ using this moment

restriction: E[I(Y ≤ X ′ ˆ βτ) − G(τ, p(z); ρ)|S = 1, Z = z] = 0

9 / 20

slide-10
SLIDE 10

Second Step

Taken to the sample by choosing a ρ that minimizes the following objective function: ˆ ρ = argminρ

N

i=1 L

l=1

Si ϕτl(zi)[I{Yi ≤ X ′

i ˜

βτl(ρ)} − G(τl, p(z′

i ); ρ)]

where . is the Euclidean norm, τ1 < τ2 < · · · < τL is a finite grid on (0, 1), and the instrument functions are defined as ϕτl(zi), G(τl, p(z′

i ); ρ) is the conditional copula

indexed by a parameter ρ, and: ˜ βτ(ρ) = argminβ

N

i=1

Si[Gτi(Yi − X ′

i β)+ + (1 − Gτ,i(Yi − X ′ i β)−]

where a+ = max{a, 0}, a− = max{−a, 0}, and Gτ,i = G(τ, p(z); ρ).

10 / 20

slide-11
SLIDE 11

Third Step

  • 3. Given the estimated ˆ

ρ, ˆ βτ can be estimated by minimizing a rotated check function of the form: ˆ βτ = argminβ

N

i=1

Si[ ˆ Gτ,i(Yi − X ′

i β)+ + (1 −

ˆ Gτ,i)(Yi − X ′

i β)−]

where ˆ βτ will be a consistent estimator of the τ-th quantile regression coefficient. Note that this step is unnecessary if the researcher is interested on the quantiles included in the finite grid of step 2.

11 / 20

slide-12
SLIDE 12

Implementing the method in Stata

12 / 20

slide-13
SLIDE 13

Syntax

qregsel depvar

  • indepvars

if in

  • , select(
  • depvarS =
  • varlistS)

quantile(#) grid min(grid minvalue) grid max(grid maxvalue) grid length(grid lengthvalue)

  • copula(copula) noconstant plot
  • 13 / 20
slide-14
SLIDE 14

Empirical Example

14 / 20

slide-15
SLIDE 15

Wages of women used in Heckman command

15 / 20

slide-16
SLIDE 16

Grid for minimization

16 / 20

slide-17
SLIDE 17

Counterfactual distribution: Corrected versus uncorrected quantiles

17 / 20

slide-18
SLIDE 18

Conclusions

18 / 20

slide-19
SLIDE 19

Conclusions

  • We have introduced a new Stata command that implements a copula-based method

to correct for sample selection in quantile regressions proposed in Arellano and Bonhomme (2017)

  • This command may be useful for Stata users doing empirical work, as we have

illustrated with the case of two recently published papers

  • The code is for now only available in our github repo
  • Questions, comments, and suggestions are welcome

19 / 20

slide-20
SLIDE 20

References

  • Arellano, M., and S. Bonhomme (2017), “Quantile Selection Models with an

Application to Understanding Changes in Wage Inequality.” Econometrica 85(1)

  • Bollinger, C., B. Hirsch, C. Hokayem, and J. Ziliak (2019), “Trouble in the Tails? What

We Know about Earnings Nonresponse Thirty Years after Lillard, Smith, and Welch.” Journal of Political Economy 127(5).

  • Maasoumi, E., and L. Wang (2019), “The Gender Gap between Earnings Distributions.”

Journal of Political Economy 127(5).

  • Munoz, E., and M. Siravegna (2020), “Implementing Quantile Selection Models in

Stata.”

20 / 20