implementing quantile selection models in stata
play

Implementing Quantile Selection Models in Stata Mariel Siravegna - PowerPoint PPT Presentation

Implementing Quantile Selection Models in Stata Mariel Siravegna Ercio Munoz Georgetown University The Graduate Center, CUNY July 30, 2020 1 / 20 Non-random sample selection is a major issue in empirical work - A simple sample selection


  1. Implementing Quantile Selection Models in Stata Mariel Siravegna Ercio Munoz Georgetown University The Graduate Center, CUNY July 30, 2020 1 / 20

  2. Non-random sample selection is a major issue in empirical work - A simple sample selection model can be written as the latent model Y ∗ = X ′ β + µ but Y ∗ is only observed if S=1 S = 1 ( Z ′ γ + ν ≥ 0 ) - Since the seminal work of Heckman (1979), much progress has been made in methods that extend the original model or relax some of its assumptions - And recently Arellano and Bonhomme (2017) proposed a copula-based method to correct for sample selection in quantile regression 2 / 20

  3. Two Recent Applications 3 / 20

  4. Maasoumi and Wang (JPE 2019) - In this paper the authors use the CPS between 1976-2013 to see how the gender wage gap vary across the wage distribution - They assess how selective participation of individuals in the labor market affects the gender gap 4 / 20

  5. Comparison of Female and Male Wage CDF (Without correction) (Correcting for Selection) 5 / 20

  6. Bollinger et al. (JPE 2019) - Survey earnings response is not random - In this paper the authors match the survey earnings responses to administrative records to see how response vary across the earnings distribution - They find that non-response rate follows an U shape across earnings and this produces an underestimation of inequality, which can be corrected using this copula-based approach 6 / 20

  7. Bollinger et al. (JPE 2019) 7 / 20

  8. Estimation 8 / 20

  9. Three-step Algorithm of Arellano and Bohnomme (2017) Given an i.i.d sample ( Y i , Z i , S i ) , i = 1 , ..., N where Z i = ( X i , W i ) and assuming that quantile functions are linear: q ( τ , x ) = x ′ β τ , for all τ ∈ ( 0 , 1 ) and x ∈ X (3) the algorithm is as follows: 1. Estimation of the propensity score p ( z ) 2. Estimation of the dependence parameter or degree of selection ρ using this moment restriction: E [ I ( Y ≤ X ′ ˆ β τ ) − G ( τ , p ( z ) ; ρ ) | S = 1 , Z = z ] = 0 9 / 20

  10. Second Step Taken to the sample by choosing a ρ that minimizes the following objective function: N L S i ϕ τ l ( z i )[ I { Y i ≤ X ′ β τ l ( ρ ) } − G ( τ l , p ( z ′ ∑ ∑ i ˜ ρ = argmin ρ � i ) ; ρ )] � ˆ i = 1 l = 1 where � . � is the Euclidean norm, τ 1 < τ 2 < · · · < τ L is a finite grid on ( 0 , 1 ) , and the instrument functions are defined as ϕ τ l ( z i ) , G ( τ l , p ( z ′ i ) ; ρ ) is the conditional copula indexed by a parameter ρ , and: N i β ) + + ( 1 − G τ , i ( Y i − X ′ S i [ G τ i ( Y i − X ′ i β ) − ] ˜ ∑ β τ ( ρ ) = argmin β i = 1 where a + = max { a , 0 } , a − = max {− a , 0 } , and G τ , i = G ( τ , p ( z ) ; ρ ) . 10 / 20

  11. Third Step ρ , ˆ 3. Given the estimated ˆ β τ can be estimated by minimizing a rotated check function of the form: N i β ) + + ( 1 − ˆ S i [ ˆ G τ , i ( Y i − X ′ G τ , i )( Y i − X ′ ˆ i β ) − ] ∑ β τ = argmin β i = 1 where ˆ β τ will be a consistent estimator of the τ -th quantile regression coefficient. Note that this step is unnecessary if the researcher is interested on the quantiles included in the finite grid of step 2. 11 / 20

  12. Implementing the method in Stata 12 / 20

  13. Syntax � � � � � � � � depvar S = varlist S ) qregsel depvar indepvars if in , select( quantile( # ) grid min(grid minvalue) grid max(grid maxvalue) � � grid length(grid lengthvalue) copula( copula ) noconstant plot 13 / 20

  14. Empirical Example 14 / 20

  15. Wages of women used in Heckman command 15 / 20

  16. Grid for minimization 16 / 20

  17. Counterfactual distribution: Corrected versus uncorrected quantiles 17 / 20

  18. Conclusions 18 / 20

  19. Conclusions - We have introduced a new Stata command that implements a copula-based method to correct for sample selection in quantile regressions proposed in Arellano and Bonhomme (2017) - This command may be useful for Stata users doing empirical work, as we have illustrated with the case of two recently published papers - The code is for now only available in our github repo - Questions, comments, and suggestions are welcome 19 / 20

  20. References - Arellano, M., and S. Bonhomme (2017), “Quantile Selection Models with an Application to Understanding Changes in Wage Inequality.” Econometrica 85(1) - Bollinger, C., B. Hirsch, C. Hokayem, and J. Ziliak (2019), “Trouble in the Tails? What We Know about Earnings Nonresponse Thirty Years after Lillard, Smith, and Welch.” Journal of Political Economy 127(5). - Maasoumi, E., and L. Wang (2019), “The Gender Gap between Earnings Distributions.” Journal of Political Economy 127(5). - Munoz, E., and M. Siravegna (2020), “Implementing Quantile Selection Models in Stata.” 20 / 20

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend