on models for small area compositions
play

On Models for Small Area Compositions Li-Chun Zhang Angela Luna - PowerPoint PPT Presentation

On Models for Small Area Compositions Li-Chun Zhang Angela Luna L.Zhang@soton.ac.uk A.Luna-Hernandez@soton.ac.uk Social Statistics & Demography University of Southampton Acknowledgements: This work has been funded by the Office for


  1. On Models for Small Area Compositions Li-Chun Zhang Angela Luna L.Zhang@soton.ac.uk A.Luna-Hernandez@soton.ac.uk Social Statistics & Demography University of Southampton Acknowledgements: This work has been funded by the Office for National Statistics - ONS and the Economic and Social Research Council - ESRC. 1 / 22

  2. Motivation Compositions Area 1 . . . Total J 1 Y 1 · 2 Y 2 · 3 Y 3 · Y aj . . . . . . A Y A · Total . . . Y · 1 Y · J Y ·· 2 / 22

  3. Motivation Compositions Area 1 . . . Total J 1 Y 1 · 2 Y 2 · 3 Y 3 · Y aj . . . . . . A Y A · Total . . . Y · 1 Y · J Y ·· Target: Estimate the within area cell counts Y aj , using proxy information and fixed row/column margins. 2 / 22

  4. Motivation Two different (fixed-effects) approaches to this problem are considered: ⊲ Structure Preserving Models: Long tradition in SAE. Assumptions about the relationship between the interactions of two compositions in the log-linear scale. (Proxy information is required). 3 / 22

  5. Motivation Two different (fixed-effects) approaches to this problem are considered: ⊲ Structure Preserving Models: Long tradition in SAE. Assumptions about the relationship between the interactions of two compositions in the log-linear scale. (Proxy information is required). ⊲ Regression (Generalized Linear) Models: Multinomial-Logistic: Assumptions about the relationship between the log-odds with respect to a reference category and a set of covariates. (Proxy information can be used as covariate). 3 / 22

  6. Motivation In this work, we: 1 Introduce a generalization of the Structure Preserving approach, covering the SPREE and GSPREE models and also the logit-multinomial (using proxy information) as particular cases. 4 / 22

  7. Motivation In this work, we: 1 Introduce a generalization of the Structure Preserving approach, covering the SPREE and GSPREE models and also the logit-multinomial (using proxy information) as particular cases. 2 Use data from 2001 and 2011 Population Censuses in England to compare the different models in terms of their Prediction Error. 4 / 22

  8. Motivation In this work, we: 1 Introduce a generalization of the Structure Preserving approach, covering the SPREE and GSPREE models and also the logit-multinomial (using proxy information) as particular cases. 2 Use data from 2001 and 2011 Population Censuses in England to compare the different models in terms of their Prediction Error. 3 Show some ongoing work on a model using a mapping matrix between the proxy and desired compositions, which allows to incorporate auxiliary information at the aggregate level. 4 / 22

  9. Outline 1 Structure Preserving Models 2 Model using a Mapping matrix 5 / 22

  10. Structure Preserving Models 6 / 22

  11. Structure Preserving Models Denote by θ X aj = X aj / X a · an auxiliary composition of exactly the same dimension as θ Y aj = Y aj / Y a · , its log-linear representation given by: γ X aj = α X 0 + α X a + α X j + α X aj where γ X aj = log θ X α X γ X α X γ X γ X α X γ X γ X ·· and aj , 0 = ¯ ·· , a = ¯ a · − ¯ ·· , j = ¯ · j − ¯ α X aj = γ X γ X γ X γ X aj − ¯ a · − ¯ · j − ¯ ·· . 6 / 22

  12. Structure Preserving Models Denote by θ X aj = X aj / X a · an auxiliary composition of exactly the same dimension as θ Y aj = Y aj / Y a · , its log-linear representation given by: γ X aj = α X 0 + α X a + α X j + α X aj where γ X aj = log θ X α X γ X α X γ X γ X α X γ X γ X ·· and aj , 0 = ¯ ·· , a = ¯ a · − ¯ ·· , j = ¯ · j − ¯ α X aj = γ X γ X γ X γ X aj − ¯ a · − ¯ · j − ¯ ·· . The log-linear representation satisfies the constraints: aj = 0 . Analogous for θ Y aj . a α X j α X a α X j α X � a = 0 , � j = 0 , � aj = � 6 / 22

  13. Structure Preserving Models Denote by θ X aj = X aj / X a · an auxiliary composition of exactly the same dimension as θ Y aj = Y aj / Y a · , its log-linear representation given by: γ X aj = α X 0 + α X a + α X j + α X aj where γ X aj = log θ X α X γ X α X γ X γ X α X γ X γ X ·· and aj , 0 = ¯ ·· , a = ¯ a · − ¯ ·· , j = ¯ · j − ¯ α X aj = γ X γ X γ X γ X aj − ¯ a · − ¯ · j − ¯ ·· . The log-linear representation satisfies the constraints: aj = 0 . Analogous for θ Y aj . a α X j α X a α X j α X � a = 0 , � j = 0 , � aj = � The modelling process is focused on the relationship between α Y aj and α X aj . a ˆ j ˆ Marginal constraints such as � Y aj = Y · j for j = 1 , . . . , J and � Y aj = Y a · for a = 1 , . . . , A can be considered using IPF without modifying the parameter estimates. Proxy information (not just covariates) is required. 6 / 22

  14. Structure Preserving Models In the context of SAE, the following Structure Preserving models have been used: 7 / 22

  15. Structure Preserving Models In the context of SAE, the following Structure Preserving models have been used: � � θ X 1. Given , { Y 1 · , . . . , Y A · } : aj 7 / 22

  16. Structure Preserving Models In the context of SAE, the following Structure Preserving models have been used: � � θ X 1. Given , { Y 1 · , . . . , Y A · } : aj Synthetic Estimator: Adapted from Gonzalez & Hoza (1978), ˆ Y aj = θ X aj Y a · The underlining model is α Y j = α X j , α Y aj = α X aj . The estimated composition is a rescaled version of the auxiliary composition. 7 / 22

  17. Structure Preserving Models � � θ X 2. Given , { Y 1 · , . . . , Y A · } , { Y · 1 , . . . , Y · J } : aj 8 / 22

  18. Structure Preserving Models � � θ X 2. Given , { Y 1 · , . . . , Y A · } , { Y · 1 , . . . , Y · J } : aj SPREE: Purcell & Kish (1980) use IPF to fit the two margins, Y (1) ˆ Y (1) Y (2) ˆ ˆ aj = θ X aj Y a · , = Y · j , ... aj aj Y (1) ˆ · j until convergency is achieved. This estimator minimizes the distance between the compositions X and ˆ Y satisfying the marginal constraints. The underlining model is α Y aj = α X aj . 8 / 22

  19. Structure Preserving Models � � � � 3. Given , { Y 1 · , . . . , Y A · } , { Y · 1 , . . . , Y · J } and an estimated : θ X θ Y aj aj 9 / 22

  20. Structure Preserving Models � � � � 3. Given , { Y 1 · , . . . , Y A · } , { Y · 1 , . . . , Y · J } and an estimated : θ X θ Y aj aj Generalized Linear Structural Model (GSPREE): Zhang & Chambers (2004) propose the model α Y aj = βα X aj . 9 / 22

  21. Structure Preserving Models � � � � 3. Given , { Y 1 · , . . . , Y A · } , { Y · 1 , . . . , Y · J } and an estimated : θ X θ Y aj aj Generalized Linear Structural Model (GSPREE): Zhang & Chambers (2004) propose the model α Y aj = βα X aj . β can be estimated using ML under the multinomial distribution, when expressing the model as: µ Y aj = λ j + βµ X aj for µ aj = log θ aj − 1 � log θ aj = α j + α aj . J l 9 / 22

  22. Structure Preserving Models � � � � 3. Given , { Y 1 · , . . . , Y A · } , { Y · 1 , . . . , Y · J } and an estimated : θ X θ Y aj aj Generalized Linear Structural Model (GSPREE): Zhang & Chambers (2004) propose the model α Y aj = βα X aj . β can be estimated using ML under the multinomial distribution, when expressing the model as: µ Y aj = λ j + βµ X aj for µ aj = log θ aj − 1 � log θ aj = α j + α aj . J l Given the sum-zero constraint of the α j , the λ j are nuisance parameters with no practical interest. 9 / 22

  23. Extension of the Structure Preserving approach All the previous models can be seen as particular cases of the more general model:     α Y α X a 1 a 1 . .  .   .   = B β B . .        α Y α X aJ aJ 10 / 22

  24. Extension of the Structure Preserving approach All the previous models can be seen as particular cases of the more general model:     α Y α X a 1 a 1 . .  .   .   = B β B . .        α Y α X aJ aJ Where B J × J = I − J − 1 11 ′ and β J × J = { β jk } contains all the parameters. 10 / 22

  25. Extension of the Structure Preserving approach All the previous models can be seen as particular cases of the more general model:     α Y α X a 1 a 1 . .  .   .   = B β B . .        α Y α X aJ aJ Where B J × J = I − J − 1 11 ′ and β J × J = { β jk } contains all the parameters. The multiplication on left and right by B ensure that the sum zero constraints are satisfied by the predicted α Y aj , as well as the uniqueness of G = B β B . Denoting by { g jk } the components of G we can write, � α Y g jk α X aj = ak . k As in the GSPREE, the estimation of β can be done using ML under the multinomial distribution writing the model as η Y a = λ + B β B η X a . 10 / 22

  26. Extension of the Structure Preserving approach Some particular cases: a) SPREE: β = I aj − 1 � α Y aj = α X α X ak J k 11 / 22

  27. Extension of the Structure Preserving approach Some particular cases: a) SPREE: β = I aj − 1 � α Y aj = α X α X ak J k b) GSPREE: With parameter φ , β = φ I aj − φ 1 α Y aj = φα X � α X ak J k 11 / 22

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend