Small Area Estimation for Multivariate Repeated Measures Data - - PowerPoint PPT Presentation

small area estimation for multivariate repeated measures
SMART_READER_LITE
LIVE PREVIEW

Small Area Estimation for Multivariate Repeated Measures Data - - PowerPoint PPT Presentation

Small Area Estimation for Multivariate Repeated Measures Data Innocent Ngaruye Department of Mathematics, University of Rwanda Department of Mathematics, Link oping University 2nd Science day-Stockholm 13th December 2016 1 / 11 My


slide-1
SLIDE 1

Small Area Estimation for Multivariate Repeated Measures Data Innocent Ngaruye

Department of Mathematics, University of Rwanda Department of Mathematics, Link¨

  • ping University

2nd Science day-Stockholm 13th December 2016

1 / 11

slide-2
SLIDE 2

My Supervisors

Martin Singull Dietrich von Rosen

Main Supervisor Co-supervisor Link¨

  • ping University

Swedish University of Agricultural Sciences

2 / 11

slide-3
SLIDE 3

Introduction

Small Area Estimation (SAE) theory is concerned with solving the following problems

1 How to produce reliable estimates of characteristics of interest,

(total, means, quantiles, etc...) for small areas or domains, based on small samples or even no samples taken from these areas.

2 How to assess the estimation or prediction error

3 / 11

slide-4
SLIDE 4

Introduction (cont’d)

Censuses and surveys have limited scope and often provide very little information for subpopulations (domains) such that direct survey estimates on a target small area are not reliable due to a small sample size connected to this area We propose a multivariate linear regression model for repeated measurements in SAE settings to get a model which borrows strength across both small areas and over time. We consider repeated measurements on variable of interest y for p time points from the finite population of size N partitioned into m disjoint subpopulations or domains called small areas of sizes Ni, i = 1, ..., m.

4 / 11

slide-5
SLIDE 5

The working model

The corresponding model at small area level is given by Yi =ABCi + 1pγ′Xi + uiz′

i + Ei,

ui ∼ Np(0, Σu), Ei ∼ Np,Ni(0, Σe, INi). The working model combining all small areas is expressed by Y = ABHC + 1pγ′X + UZ + E, (1) E ∼ Np,N(0, Σe, I N), U ∼ Np,m(0, Σu, I m),

Ngaruye et al. (2016). Small Area Estimation under a Multivariate Linear Model for Repeated measures Data. Communications in Statistics - Theory and Methods

5 / 11

slide-6
SLIDE 6

Estimation of model parameters

Theorem (Ngaruye et al. (2016)) Consider the corresponding model for sample data for model defined in (1). Then, the likelihood based estimators of γ, B and Σu can be expressed as

  • γ

= 1 p (PX ′)−PY ′1p + (PX ′)o t2,

  • B

= (A′A)−1A′Y R2K ′

2(K 2K ′ 2)−

−1 p (A′A)−1A′1p1′

pY P′(XP′) −XR2K ′ 2(K 2K ′ 2)−

−(A′A)−1A′1p t

′ 2(PX ′)o′XR2K ′ 2(K 2K ′ 2)− +

T 1(K 2K ′

2)o′,

  • Σu

= 1 m(V 3 − A T 1 C 1 − 1p t

′ 2

C 2)(V 3 − A T 1 C 1 − 1p t

′ 2

C 2)′ − Σe,

6 / 11

slide-7
SLIDE 7

Theorem (cont.) where

K 1 = H(CC ′)1/2Γ1, K 2 = H(CC ′)1/2Γ2, R1 = C ′(CC ′)−1/2Γ1, R2 = C ′(CC ′)−1/2Γ2, P = XC ′o(C ′o)

′ + XR2R′ 2 − XR2K ′ 2(K 2K ′ 2)−K 2R′ 2,

  • T 1 = (A′S−1

2 A)−1A′S−1 2 (V 3 − 1p

t

′ 2

C 2) C

′ 1(

C 1 C

′ 1)− + A′T 11

C

  • 1,
  • t2 = (1′

pS−1 1 1p)−1(

C 2Q

C

′ 1

  • C

′ 2)−

C 2Q

C

′ 1V ′

3S−1 1 1p + (

C 2Q

C

′ 1)o′t′

211p,

S1 = V 3Q(

C

′ 1:

C

′ 2)V ′

3,

S2 = S1 + Q1p,S−1

1 V 3PQ C′ 1

  • C

′ 2V ′

3Q′ 1p,S−1

1 .

  • C 1 = (K 2K ′

2)o′K 1,

  • C 2 = (PX ′)o′X(I − R2K ′

2(K 2K ′ 2)−)R1.

7 / 11

slide-8
SLIDE 8

Prediction of random effects and small area means

Theorem (Ngaruye et al. (2016)) Consider the model defined by (1). Then, the prediction of random effects and target small area means at each time point for each group and across all time points are given by

  • U =
  • Σe

Σ

−1 u

+ I p −1 (Y − A BHC − 1p γ′X)Z ′

  • µig = 1

Nig

  • Y (s)

ig 1nig +

  • A

βg1′

Nig−nig + 1p

γ′X (r)

ig +

uiz(r)′

ig

  • 1Nig−nig
  • ,
  • µig =1

p1′

p

µig, g = 1, . . . , k, t = 1, . . . , p.

8 / 11

slide-9
SLIDE 9

Crop yield estimation at district level for SAS 2014 in Rwanda

The variable of interest is crop yield. We are interested in estimating average yield for beans (bush beans and climbing beans varieties) at district level during two agricultural seasons 2014, A and B. Note that in Rwanda, there are 30 districts.

Figure: Grain beans, Bush beans, Climbing beans Ngaruye, I., von Rosen, D., and Singull, M. (2016). Crop yield estimation at district level for agricultural seasons 2014 in Rwanda. African Journal of Applied Statistics, 3(1):69-90.

9 / 11

slide-10
SLIDE 10

Figure: Distribution of average beans yield estimates at district level during agricultural seasons A and B, 2104

10 / 11

slide-11
SLIDE 11

Thank you for your attention!

11 / 11