On Models for Small Area Compositions Li-Chun Zhang Angela Luna - - PowerPoint PPT Presentation

on models for small area compositions
SMART_READER_LITE
LIVE PREVIEW

On Models for Small Area Compositions Li-Chun Zhang Angela Luna - - PowerPoint PPT Presentation

On Models for Small Area Compositions Li-Chun Zhang Angela Luna L.Zhang@soton.ac.uk A.Luna-Hernandez@soton.ac.uk Social Statistics & Demography University of Southampton Acknowledgements: This work has been funded by the Office for


slide-1
SLIDE 1

On Models for Small Area Compositions

Angela Luna

A.Luna-Hernandez@soton.ac.uk

Li-Chun Zhang

L.Zhang@soton.ac.uk

Social Statistics & Demography University of Southampton

Acknowledgements: This work has been funded by the Office for National Statistics - ONS and the Economic and Social Research Council - ESRC.

1 / 22

slide-2
SLIDE 2

Motivation

Compositions

Area 1 . . . J Total 1 Y1· 2 Y2· 3 Y3· Yaj . . . . . . A YA· Total Y·1 . . . Y·J Y··

2 / 22

slide-3
SLIDE 3

Motivation

Compositions

Area 1 . . . J Total 1 Y1· 2 Y2· 3 Y3· Yaj . . . . . . A YA· Total Y·1 . . . Y·J Y·· Target: Estimate the within area cell counts Yaj, using proxy information and fixed row/column margins.

2 / 22

slide-4
SLIDE 4

Motivation

Two different (fixed-effects) approaches to this problem are considered: ⊲ Structure Preserving Models: Long tradition in SAE. Assumptions about the relationship between the interactions

  • f two compositions in the log-linear scale. (Proxy information

is required).

3 / 22

slide-5
SLIDE 5

Motivation

Two different (fixed-effects) approaches to this problem are considered: ⊲ Structure Preserving Models: Long tradition in SAE. Assumptions about the relationship between the interactions

  • f two compositions in the log-linear scale. (Proxy information

is required). ⊲ Regression (Generalized Linear) Models: Multinomial-Logistic: Assumptions about the relationship between the log-odds with respect to a reference category and a set of covariates. (Proxy information can be used as covariate).

3 / 22

slide-6
SLIDE 6

Motivation

In this work, we:

1 Introduce a generalization of the Structure Preserving

approach, covering the SPREE and GSPREE models and also the logit-multinomial (using proxy information) as particular cases.

4 / 22

slide-7
SLIDE 7

Motivation

In this work, we:

1 Introduce a generalization of the Structure Preserving

approach, covering the SPREE and GSPREE models and also the logit-multinomial (using proxy information) as particular cases.

2 Use data from 2001 and 2011 Population Censuses in England

to compare the different models in terms of their Prediction Error.

4 / 22

slide-8
SLIDE 8

Motivation

In this work, we:

1 Introduce a generalization of the Structure Preserving

approach, covering the SPREE and GSPREE models and also the logit-multinomial (using proxy information) as particular cases.

2 Use data from 2001 and 2011 Population Censuses in England

to compare the different models in terms of their Prediction Error.

3 Show some ongoing work on a model using a mapping matrix

between the proxy and desired compositions, which allows to incorporate auxiliary information at the aggregate level.

4 / 22

slide-9
SLIDE 9

Outline

1 Structure Preserving Models 2 Model using a Mapping matrix

5 / 22

slide-10
SLIDE 10

Structure Preserving Models

6 / 22

slide-11
SLIDE 11

Structure Preserving Models

Denote by θX

aj = Xaj/Xa· an auxiliary composition of exactly the same

dimension as θY

aj = Yaj/Ya·, its log-linear representation given by:

γX

aj = αX 0 + αX a + αX j + αX aj

where γX

aj = log θX aj,

αX

0 = ¯

γX

·· ,

αX

a = ¯

γX

a· − ¯

γX

·· ,

αX

j = ¯

γX

·j − ¯

γX

·· and

αX

aj = γX aj − ¯

γX

a· − ¯

γX

·j − ¯

γX

·· .

6 / 22

slide-12
SLIDE 12

Structure Preserving Models

Denote by θX

aj = Xaj/Xa· an auxiliary composition of exactly the same

dimension as θY

aj = Yaj/Ya·, its log-linear representation given by:

γX

aj = αX 0 + αX a + αX j + αX aj

where γX

aj = log θX aj,

αX

0 = ¯

γX

·· ,

αX

a = ¯

γX

a· − ¯

γX

·· ,

αX

j = ¯

γX

·j − ¯

γX

·· and

αX

aj = γX aj − ¯

γX

a· − ¯

γX

·j − ¯

γX

·· .

The log-linear representation satisfies the constraints:

  • a αX

a = 0,

  • j αX

j = 0,

  • a αX

aj = j αX aj = 0. Analogous for θY

aj.

6 / 22

slide-13
SLIDE 13

Structure Preserving Models

Denote by θX

aj = Xaj/Xa· an auxiliary composition of exactly the same

dimension as θY

aj = Yaj/Ya·, its log-linear representation given by:

γX

aj = αX 0 + αX a + αX j + αX aj

where γX

aj = log θX aj,

αX

0 = ¯

γX

·· ,

αX

a = ¯

γX

a· − ¯

γX

·· ,

αX

j = ¯

γX

·j − ¯

γX

·· and

αX

aj = γX aj − ¯

γX

a· − ¯

γX

·j − ¯

γX

·· .

The log-linear representation satisfies the constraints:

  • a αX

a = 0,

  • j αX

j = 0,

  • a αX

aj = j αX aj = 0. Analogous for θY

aj.

The modelling process is focused on the relationship between αY

aj and αX aj.

Marginal constraints such as

a ˆ

Yaj = Y·j for j = 1, . . . , J and

j ˆ

Yaj = Ya· for a = 1, . . . , A can be considered using IPF without modifying the parameter

  • estimates. Proxy information (not just covariates) is required.

6 / 22

slide-14
SLIDE 14

Structure Preserving Models

In the context of SAE, the following Structure Preserving models have been used:

7 / 22

slide-15
SLIDE 15

Structure Preserving Models

In the context of SAE, the following Structure Preserving models have been used:

  • 1. Given
  • θX

aj

  • , {Y1·, . . . , YA·}:

7 / 22

slide-16
SLIDE 16

Structure Preserving Models

In the context of SAE, the following Structure Preserving models have been used:

  • 1. Given
  • θX

aj

  • , {Y1·, . . . , YA·}:

Synthetic Estimator: Adapted from Gonzalez & Hoza (1978), ˆ Yaj = θX

ajYa·

The underlining model is αY

j = αX j , αY aj = αX aj.

The estimated composition is a rescaled version of the auxiliary composition.

7 / 22

slide-17
SLIDE 17

Structure Preserving Models

  • 2. Given
  • θX

aj

  • , {Y1·, . . . , YA·} , {Y·1, . . . , Y·J}:

8 / 22

slide-18
SLIDE 18

Structure Preserving Models

  • 2. Given
  • θX

aj

  • , {Y1·, . . . , YA·} , {Y·1, . . . , Y·J}:

SPREE: Purcell & Kish (1980) use IPF to fit the two margins, ˆ Y (1)

aj

= θX

ajYa· ,

ˆ Y (2)

aj

= ˆ Y (1)

aj

ˆ Y (1)

·j

Y·j , ... until convergency is achieved. This estimator minimizes the distance between the compositions X and ˆ Y satisfying the marginal constraints. The underlining model is αY

aj = αX aj .

8 / 22

slide-19
SLIDE 19

Structure Preserving Models

  • 3. Given
  • θX

aj

  • , {Y1·, . . . , YA·} , {Y·1, . . . , Y·J} and an estimated
  • θY

aj

  • :

9 / 22

slide-20
SLIDE 20

Structure Preserving Models

  • 3. Given
  • θX

aj

  • , {Y1·, . . . , YA·} , {Y·1, . . . , Y·J} and an estimated
  • θY

aj

  • :

Generalized Linear Structural Model (GSPREE): Zhang & Chambers (2004) propose the model αY

aj = βαX aj .

9 / 22

slide-21
SLIDE 21

Structure Preserving Models

  • 3. Given
  • θX

aj

  • , {Y1·, . . . , YA·} , {Y·1, . . . , Y·J} and an estimated
  • θY

aj

  • :

Generalized Linear Structural Model (GSPREE): Zhang & Chambers (2004) propose the model αY

aj = βαX aj .

β can be estimated using ML under the multinomial distribution, when expressing the model as:

µY

aj = λj + βµX aj

for µaj = log θaj − 1

J

  • l

log θaj = αj + αaj .

9 / 22

slide-22
SLIDE 22

Structure Preserving Models

  • 3. Given
  • θX

aj

  • , {Y1·, . . . , YA·} , {Y·1, . . . , Y·J} and an estimated
  • θY

aj

  • :

Generalized Linear Structural Model (GSPREE): Zhang & Chambers (2004) propose the model αY

aj = βαX aj .

β can be estimated using ML under the multinomial distribution, when expressing the model as:

µY

aj = λj + βµX aj

for µaj = log θaj − 1

J

  • l

log θaj = αj + αaj .

Given the sum-zero constraint of the αj, the λj are nuisance parameters with no practical interest.

9 / 22

slide-23
SLIDE 23

Extension of the Structure Preserving approach

All the previous models can be seen as particular cases of the more general model:

    αY

a1

. . . αY

aJ

    = B β B     αX

a1

. . . αX

aJ

   

10 / 22

slide-24
SLIDE 24

Extension of the Structure Preserving approach

All the previous models can be seen as particular cases of the more general model:

    αY

a1

. . . αY

aJ

    = B β B     αX

a1

. . . αX

aJ

   

Where BJ×J = I − J−111′ and βJ×J = {βjk} contains all the parameters.

10 / 22

slide-25
SLIDE 25

Extension of the Structure Preserving approach

All the previous models can be seen as particular cases of the more general model:

    αY

a1

. . . αY

aJ

    = B β B     αX

a1

. . . αX

aJ

   

Where BJ×J = I − J−111′ and βJ×J = {βjk} contains all the parameters. The multiplication on left and right by B ensure that the sum zero constraints are satisfied by the predicted αY

aj, as well as the

uniqueness of G = BβB. Denoting by {gjk} the components of G we can write, αY

aj =

  • k

gjkαX

ak .

As in the GSPREE, the estimation of β can be done using ML under the multinomial distribution writing the model as ηY

a = λ + B β BηX a .

10 / 22

slide-26
SLIDE 26

Extension of the Structure Preserving approach

Some particular cases: a) SPREE: β = I αY

aj = αX aj − 1

J

  • k

αX

ak

11 / 22

slide-27
SLIDE 27

Extension of the Structure Preserving approach

Some particular cases: a) SPREE: β = I αY

aj = αX aj − 1

J

  • k

αX

ak

b) GSPREE: With parameter φ, β = φI αY

aj = φαX aj − φ1

J

  • k

αX

ak

11 / 22

slide-28
SLIDE 28

Extension of the Structure Preserving approach

c) Logit-Multinomial Model: The model with J-1 parameters ηY

aj = γj + φjηX aj

for ηaj = log [θaj/θaJ], can be written as a structural model in the form αY

a = B(J)βB αX a

for B(J) the J×(J-1) matrix resulting of dropping the column J from B and β a (J-1)×J matrix defined as β =

  • Diag
  • β(J)

β(J)

  • for

β(J) a vector of J-1 parameters (The category J doesn’t have a free parameter).

12 / 22

slide-29
SLIDE 29

Extension of the Structure Preserving approach

d) GSPREE with category-specific (J) parameters: β = Diag

  • β
  • αY

aj = βjαX aj − 1

J

  • k

βkαX

ak

The second term on the right hand, included to satisfy the sum-zero constrains without impose restrictions to the βj, make the predictions of this model not a line anymore.

13 / 22

slide-30
SLIDE 30

Extension of the Structure Preserving approach

d) GSPREE with category-specific (J) parameters: β = Diag

  • β
  • αY

aj = βjαX aj − 1

J

  • k

βkαX

ak

The second term on the right hand, included to satisfy the sum-zero constrains without impose restrictions to the βj, make the predictions of this model not a line anymore. e) GSPREE JxJ model: β = {βjk}, G = {gjk} = BβB αY

aj =

  • k

gjkαX

ak

13 / 22

slide-31
SLIDE 31

Some examples

Data from 2001 and 2011 Population Census in England for the Hackney Borough. Ethnicity.

  • 0.3

0.4 0.5 0.6 0.7 0.8 0.3 0.4 0.5 0.6 0.7 0.8

White

Y Proportions Y pred proportions

  • SPREE

GSPREE J model JxJ model

  • 0.03

0.04 0.05 0.06 0.07 0.08 0.09 0.03 0.05 0.07 0.09

Mixed

Y Proportions Y pred proportions

  • SPREE

GSPREE J model JxJ model

  • 0.05

0.10 0.15 0.20 0.25 0.05 0.10 0.15 0.20 0.25

Asian

Y Proportions Y pred proportions

  • SPREE

GSPREE J model JxJ model

  • 0.1

0.2 0.3 0.4 0.5 0.1 0.2 0.3 0.4 0.5

Black

Y Proportions Y pred proportions

  • SPREE

GSPREE J model JxJ model

  • 0.02

0.04 0.06 0.08 0.10 0.12 0.02 0.06 0.10

Other

Y Proportions Y pred proportions

  • SPREE

GSPREE J model JxJ model

14 / 22

slide-32
SLIDE 32

Some examples

Data from 2001 and 2011 Population Census in England for the Hackney Borough. Ethnicity.

  • SPREE

GSPREE J model JxJ model −0.10 −0.05 0.00 0.05

White

Prediction error (Proportion)

  • SPREE

GSPREE J model JxJ model −0.015 −0.005 0.005

Mixed

Prediction error (Proportion)

  • SPREE

GSPREE J model JxJ model −0.06 −0.02 0.02

Asian

Prediction error (Proportion)

  • SPREE

GSPREE J model JxJ model −0.04 0.00 0.04 0.08

Black

Prediction error (Proportion) SPREE GSPREE J model JxJ model −0.02 0.00 0.02 0.04

Other

Prediction error (Proportion)

15 / 22

slide-33
SLIDE 33

Future work

Extending the general model to include random effects

0.04 0.06 0.08 0.10 0.12 0.04 0.06 0.08 0.10 0.12

  • Other. n = 464

Y Proportions Y pred proportions

JxJ model JxJ model+RE

0.04 0.06 0.08 0.10 0.12 0.04 0.06 0.08 0.10 0.12

  • Other. n = 93

Y Proportions Y pred proportions

JxJ model JxJ model+RE

0.04 0.06 0.08 0.10 0.12 0.04 0.06 0.08 0.10 0.12

  • Other. n = 31

Y Proportions Y pred proportions

JxJ model JxJ model+RE

16 / 22

slide-34
SLIDE 34

In summary...

17 / 22

slide-35
SLIDE 35

In summary...

  • Having an auxiliary composition (register/census data), an

estimated (updated) composition and the current margins, we extend the GSPREE model from one to a maximum of J × J parameters.

17 / 22

slide-36
SLIDE 36

In summary...

  • Having an auxiliary composition (register/census data), an

estimated (updated) composition and the current margins, we extend the GSPREE model from one to a maximum of J × J parameters.

  • According to our preliminary exercises, the new models show

less bias than SPREE and GSPREE models (fixed effects approach).

17 / 22

slide-37
SLIDE 37

In summary...

  • Having an auxiliary composition (register/census data), an

estimated (updated) composition and the current margins, we extend the GSPREE model from one to a maximum of J × J parameters.

  • According to our preliminary exercises, the new models show

less bias than SPREE and GSPREE models (fixed effects approach).

  • We are still working on the extension to include cell-specific

random effects. As expected, for a big sample size the estimative obtained using a mixed model gets closer to the direct estimate, however, as it is borrowing strength from the auxiliary composition, it would be more stable.

17 / 22

slide-38
SLIDE 38

In summary...

  • Having an auxiliary composition (register/census data), an

estimated (updated) composition and the current margins, we extend the GSPREE model from one to a maximum of J × J parameters.

  • According to our preliminary exercises, the new models show

less bias than SPREE and GSPREE models (fixed effects approach).

  • We are still working on the extension to include cell-specific

random effects. As expected, for a big sample size the estimative obtained using a mixed model gets closer to the direct estimate, however, as it is borrowing strength from the auxiliary composition, it would be more stable.

  • MSE estimation is still need to be addressed.

17 / 22

slide-39
SLIDE 39

Outline

1 Structure Preserving Models 2 Model using a Mapping matrix

18 / 22

slide-40
SLIDE 40

Model using a Mapping matrix

Motivation

Denote by P = {Pij} the gross flow from the composition X to Y, i.e., assume that for each area:

     θY

a1

. . . θY

aJ

     =      P11 . . . P1J . . . ... . . . PJ1 . . . PJJ           θX

a1

. . . θX

aJ

    

The column sum of P is 1.

19 / 22

slide-41
SLIDE 41

Model using a Mapping matrix

Motivation

Denote by P = {Pij} the gross flow from the composition X to Y, i.e., assume that for each area:

     θY

a1

. . . θY

aJ

     =      P11 . . . P1J . . . ... . . . PJ1 . . . PJJ           θX

a1

. . . θX

aJ

    

The column sum of P is 1. What is the effect of the mapping matrix P in the log-linear representation of Y ? θY

a = PθX a ?

→ log θY

a ≈ M log θX a

19 / 22

slide-42
SLIDE 42

Model using a Mapping matrix

Using a First Order Taylor approximation over ln

  • θY

aj

  • = ln
  • l

θX

alPjl

  • as function of ln
  • θX

aj

  • , around the distribution of X at an

aggregate level, denoted by ˜ θX, we obtain ln θY

aj − ln ˜

θY

j ≈

  • l
  • qjl − τj ˜

θX

l

ln θX

al − ln ˜

θX

l

  • 20 / 22
slide-43
SLIDE 43

Model using a Mapping matrix

Using a First Order Taylor approximation over ln

  • θY

aj

  • = ln
  • l

θX

alPjl

  • as function of ln
  • θX

aj

  • , around the distribution of X at an

aggregate level, denoted by ˜ θX, we obtain ln θY

aj − ln ˜

θY

j ≈

  • l
  • qjl − τj ˜

θX

l

ln θX

al − ln ˜

θX

l

  • where ˜

θY = P ˜ θX, qjl = Pjl ˜ θX

l

˜ θY

j

is the reverse flow and τj = Pjrj ˜ θY

j

for Pjrj one cell specifically chosen for the j category.

20 / 22

slide-44
SLIDE 44

Model using a Mapping matrix

Applying the link function µaj = log θaj − 1 J

  • l

log θaj = αj + αaj

21 / 22

slide-45
SLIDE 45

Model using a Mapping matrix

Applying the link function µaj = log θaj − 1 J

  • l

log θaj = αj + αaj we can obtain the relationship αY

aj ≈

  • l

(qjl − ¯ q.l) αX

al −

  • l

(τj − ¯ τ) ˜ θX

l αX al.

According to our empirical studies, the leading term in the approximation is the term associated with the reverse flow. In this sense, a model involving also auxiliary information on the reverse flow at an aggregate level could be of interest.

21 / 22

slide-46
SLIDE 46

Thanks!

22 / 22