[PPT] - Probability Sampling Approach to Editing Maiki Ilves 1 , Prof. PowerPoint Presentation

SLIDE 1

Probability Sampling Approach to Editing

Maiki Ilves1, Prof. Thomas Laitila2

1 Department of Statistics, ¨

Orebro University, Sweden

2Department of Statistics, ¨

Orebro University and Statistics Sweden.

SLIDE 2

Introduction

The role of editing:

1. To assess the quality of data
2. To improve the survey by identifying error sources
3. Correct errors

Probability Sampling Approach to Editing – p.1/16

SLIDE 3

Different ways of editing

Traditional micro-editing Automated editing Selective editing Macro-editing

Probability Sampling Approach to Editing – p.2/16

SLIDE 4

Selective editing - 1

Purpose: prioritize suspicious responses according to their influence to the survey estimates and edit only the most influential responses.

▽Probability Sampling Approach to Editing – p.3/16

SLIDE 5

Selective editing - 1

Purpose: prioritize suspicious responses according to their influence to the survey estimates and edit only the most influential responses. Three stages:

▽Probability Sampling Approach to Editing – p.3/16

SLIDE 6

Selective editing - 1

Purpose: prioritize suspicious responses according to their influence to the survey estimates and edit only the most influential responses. Three stages:

1. Find out suspicious responses
editing rules

▽Probability Sampling Approach to Editing – p.3/16

SLIDE 7

Selective editing - 1

Purpose: prioritize suspicious responses according to their influence to the survey estimates and edit only the most influential responses. Three stages:

1. Find out suspicious responses
editing rules
2. Prioritize
score function i.e. function of measured value and

expected amended value. Local score, global score.

▽Probability Sampling Approach to Editing – p.3/16

SLIDE 8

Selective editing - 1

Purpose: prioritize suspicious responses according to their influence to the survey estimates and edit only the most influential responses. Three stages:

1. Find out suspicious responses
editing rules
2. Prioritize
score function i.e. function of measured value and

expected amended value. Local score, global score.

3. Determine cut-off point
in simulation study based on fully edited dataset

Probability Sampling Approach to Editing – p.3/16

SLIDE 9

Selective editing - 2

Evaluation: relative pseudo-bias

ˆ

θq − ˆ θ100 se(ˆ θ100)

q - percentage of suspicious responses pursued.

Probability Sampling Approach to Editing – p.4/16

SLIDE 10

Selective editing - 3

Advantages + Reduced costs + Reduced response burden + Gain in timeliness

▽Probability Sampling Approach to Editing – p.5/16

SLIDE 11

Selective editing - 3

Advantages + Reduced costs + Reduced response burden + Gain in timeliness Disadvantages

How to take into account the effect of editing in the

estimation stage?

Influence of edited data when used in different

statistical analysis is not known.

So far used only on quantitative variables.

Probability Sampling Approach to Editing – p.5/16

SLIDE 12

Estimating measurement bias

Literature: Madow (1965), Lessler and Kalsbeck (1992), Rao and Sitter (1997) Bias estimation through double sampling or two-phase

sampling. For all subsampled units the true values are

recorded and the difference between true values and

bserved values is used for bias estimation.

Probability Sampling Approach to Editing – p.6/16

SLIDE 13

Probability sampling approach

Our idea: Combine selective editing with bias estimation and derive unbiased estimator and its variance for this approach.

▽Probability Sampling Approach to Editing – p.7/16

SLIDE 14

Probability sampling approach

Our idea: Combine selective editing with bias estimation and derive unbiased estimator and its variance for this approach.

U

▽Probability Sampling Approach to Editing – p.7/16

SLIDE 15

Probability sampling approach

Our idea: Combine selective editing with bias estimation and derive unbiased estimator and its variance for this approach.

U

✬ ✫ ✩ ✪

sa

▽Probability Sampling Approach to Editing – p.7/16

SLIDE 16

Probability sampling approach

Our idea: Combine selective editing with bias estimation and derive unbiased estimator and its variance for this approach.

U

✬ ✫ ✩ ✪

sa U1 U2 sa1 sa2

▽Probability Sampling Approach to Editing – p.7/16

SLIDE 17

Probability sampling approach

Our idea: Combine selective editing with bias estimation and derive unbiased estimator and its variance for this approach.

U

✬ ✫ ✩ ✪

sa U1 U2 sa1 sa2

✫✪ ✬✩

s2

Probability Sampling Approach to Editing – p.7/16

SLIDE 18

Unbiased estimator for edited data

Notation:

zk, k ∈ U1 - true value xk, k ∈ U2 - observed value yk = Iedit

k

zk + (1 − Iedit

k

)xk, k ∈ U - observed value

after selective editing

▽Probability Sampling Approach to Editing – p.8/16

SLIDE 19

Unbiased estimator for edited data

Notation:

zk, k ∈ U1 - true value xk, k ∈ U2 - observed value yk = Iedit

k

zk + (1 − Iedit

k

)xk, k ∈ U - observed value

after selective editing We want to estimate tz =

U zk.

▽Probability Sampling Approach to Editing – p.8/16

SLIDE 20

Unbiased estimator for edited data

Notation:

zk, k ∈ U1 - true value xk, k ∈ U2 - observed value yk = Iedit

k

zk + (1 − Iedit

k

)xk, k ∈ U - observed value

after selective editing We want to estimate tz =

U zk.

HT-estimator ˆ

ty =

sa yk/πak is biased.

▽Probability Sampling Approach to Editing – p.8/16

SLIDE 21

Unbiased estimator for edited data

Notation:

zk, k ∈ U1 - true value xk, k ∈ U2 - observed value yk = Iedit

k

zk + (1 − Iedit

k

)xk, k ∈ U - observed value

after selective editing We want to estimate tz =

U zk.

HT-estimator ˆ

ty =

sa yk/πak is biased.

Estimator of bias is

ˆ B(ˆ ty) =

s2

ek πakπk|sa2 , ek = xk − zk.

▽Probability Sampling Approach to Editing – p.8/16

SLIDE 22

Unbiased estimator for edited data

Notation:

zk, k ∈ U1 - true value xk, k ∈ U2 - observed value yk = Iedit

k

zk + (1 − Iedit

k

)xk, k ∈ U - observed value

after selective editing We want to estimate tz =

U zk.

HT-estimator ˆ

ty =

sa yk/πak is biased.

Estimator of bias is

ˆ B(ˆ ty) =

s2

ek πakπk|sa2 , ek = xk − zk.

Bias corrected estimator is ˆ

tz = ˆ ty − ˆ B(ˆ ty).

Probability Sampling Approach to Editing – p.8/16

SLIDE 23

Precision of the estimators

MSE(ˆ ty) = V (ˆ ty) + B2(ˆ ty). MSE(ˆ tz) = V (ˆ ty) + V ( ˆ B(ˆ ty)) − 2C(ˆ ty, ˆ B(ˆ ty))

▽Probability Sampling Approach to Editing – p.9/16

SLIDE 24

Precision of the estimators

MSE(ˆ ty) = V (ˆ ty) + B2(ˆ ty). MSE(ˆ tz) = V (ˆ ty) + V ( ˆ B(ˆ ty)) − 2C(ˆ ty, ˆ B(ˆ ty))

where

V (ˆ ty) =

U

∆akl yk πak yl πal ,

(1)

▽Probability Sampling Approach to Editing – p.9/16

SLIDE 25

Precision of the estimators

MSE(ˆ ty) = V (ˆ ty) + B2(ˆ ty). MSE(ˆ tz) = V (ˆ ty) + V ( ˆ B(ˆ ty)) − 2C(ˆ ty, ˆ B(ˆ ty))

where

V ( ˆ B(ˆ ty)) =

U2

∆akl ek πak el πal +

(2)

+Ea

U2

∆kl|sa2IakIal ek πakπk|sa2 el πalπl|sa2

,

▽Probability Sampling Approach to Editing – p.9/16

SLIDE 26

Precision of the estimators

MSE(ˆ ty) = V (ˆ ty) + B2(ˆ ty). MSE(ˆ tz) = V (ˆ ty) + V ( ˆ B(ˆ ty)) − 2C(ˆ ty, ˆ B(ˆ ty))

where

C(ˆ ty, ˆ B(ˆ ty)) =

U
U2

∆akl yk πak el πal .

Probability Sampling Approach to Editing – p.9/16

SLIDE 27

One example

One specific two-phase design is considered. First phase sampling design: SI of size na, second phase sampling design: Poisson with inclusion probability πk|sa2.

▽Probability Sampling Approach to Editing – p.10/16

SLIDE 28

One example

Then,

ˆ V (ˆ ty) = CS2

ysa,

ˆ V ( ˆ B(ˆ ty)) = C

S2

ˇ es2 +

1 N − na

s2

(1 − πk|sa2)ˇ e2

k

,

ˆ C(ˆ ty, ˆ B(ˆ ty)) = C na

s2

xkˇ ek − 1 na − 1

sa

yk

s2

ˇ ek

,

where C = (1−fa)N 2

na

, ˇ

ek = ek/πk|sa2 and S2

ˇ es2 = 1/(na − 1)( s2 ˇ

e2

k − 1/na( s2 ˇ

ek)2).

Probability Sampling Approach to Editing – p.10/16

SLIDE 29

Simulation study: purpose

To compare survey estimates under two editing approaches:

▽Probability Sampling Approach to Editing – p.11/16

SLIDE 30

Simulation study: purpose

To compare survey estimates under two editing approaches: Approach 1 - editing procedure where selective editing procedure is applied;

▽Probability Sampling Approach to Editing – p.11/16

SLIDE 31

Simulation study: purpose

To compare survey estimates under two editing approaches: Approach 1 - editing procedure where selective editing procedure is applied; Approach 2 - editing procedure where in addition to selective editing bias correction is carried out.

Probability Sampling Approach to Editing – p.11/16

SLIDE 32