During and after fieldwork: Fieldwork monitoring, quality control and several ways to assess nonresponse bias
Joost Kappelhof
During and after fieldwork: Fieldwork monitoring, quality control - - PowerPoint PPT Presentation
During and after fieldwork: Fieldwork monitoring, quality control and several ways to assess nonresponse bias Joost Kappelhof Overview Paradata & monitoring fieldwork Nonresponse and nonresponse bias Ways to assess nonresponse
Joost Kappelhof
INGRID 2016
INGRID 2016
and/ or auxiliary information, such as sample frame data, interviewer
and late contacts, Compare easy and difficult respondents, final disposition codes.
INGRID 2016
Example of how and what type of paradata is collected in the ESS
frames
1+ ): Kish grid or first/ last birthday
countries)
INGRID 2016
INGRID 2016
ineligible)
someone else)
INGRID 2016
INGRID 2016
independent of observable variables, such as gender, and the unobservable parameter of interest, such as income
mechanism does not depend on the unobserved data)
missingness has nothing to do with income level after you adjust for gender
unknown way with the level of the income (and not on observed variables such as gender)
INGRID 2016
INGRID 2016
Fixed Response Model
N R k k k R k 1
1 y a R Y n
=
=
R R
E(y ) Y
=
NR R R R NR
N B(y ) Y Y (Y Y ) QK N
= − = − =
The Nonresponse Problem (Jelke Bethlehem, selection)
Random Response Model
where is the mean response probability.
The bias is large if
=
ρ ≈ ≡ ρ
N * k k r k 1
Y 1 E(y ) Y , N
ρ
Y Y R R
R S S B(y ) E(y ) Y
ρ ρ
= − = ρ
N R k k k R k 1
1 y a R Y n
=
=
The Nonresponse Problem (Jelke Bethlehem, selection)
INGRID 2016
INGRID 2016
INGRID 2016
target population (e.g., age, race, gender, urbanicity subgroups). Generally, the researcher asserts that there is no evidence of “nonresponse bias” if the response rates are similar across subgroups.
nonresponse biases in estimates based on other survey variables)
rates of demographic subgroups as evidence of nonresponse bias in survey estimates. Journal of Official Statistics, 25, 193 – 201.
INGRID 2016
sociodemographic variables among respondents with those from gold standard (high-quality) survey or population statistics
external source
survey are not completely known
INGRID 2016
contactability) or cooperative and reluctant respondent.
diverse populations, on diverse topics.
survey (assumption of continuum of resistance)
contactability and reluctance can be tricky
INGRID 2016
INGRID 2016
Number of calls Number of calls to first contact
INGRID 2016
Call (nr, time, mode) First contact
INGRID 2016
(from start or after first call)
INGRID 2016
INGRID 2016
Sturgis and Purdon, 1997; Loosveldt and Storms, 2001
?)
INGRID 2016
10 20 30 40 50 60 70 80 90 100
Poland Turkey Portugal France Ukraine Greece Czech republic Netherlands Bulgaria Norway Cyprus Belgium Latvia Estonia Russia Romania Spain Israel Ireland Hungary Switzerland Slovak Republic United Kingdom Finland Denmark Slovenia Croatia Sweden bad timing not interested subject unknown/ difficult waste time waste money privacy never do surveys co-operate too often no trust surveys bad experience dislike subject no approval not admit strangers
INGRID 2016
INGRID 2016
(individual, regional),doorstep questionnaire, nr follow-up survey
nonrespondents are availabe and accurate estimates of nonresponse bias on those frame or external data variables can be constructed.
variables can be measured among respondents, to address in a partial way the likely nonresponse biases of survey variables.
INGRID 2016
estimates utilizing some weighting class adjustments.
population parameter can be compared based on different assumptions and in case of similar results between estimates, one can have more confidence in the size and direction of nonresponse
yield different estimates?)
INGRID 2016
INGRID 2016
Adjustment weighting (Jelke Bethlehem) 29
Example Two auxiliary variables: Gender x Age. Weight of young female = 0.209 / 0.150 = 1.393. Now the weighted response is representative with respect to gender and age!
Response (n= 100) Population (N= 1000) Weights Male Female Male Female Male Female Young 23 15 Young 226 209 Young 0.983 1.393 Middle 16 17 Middle 152 144 Middle 0.950 0.847 Old 13 16 Old 133 136 Old 1.023 0.850
INGRID 2016
Uniform for all countries Adjust all nonresponse (refusal + noncontact) At least an idea of direction of nonresponse bias
Also sampling errors ViF Not effective when target vars do not correlate with PS vars Measurement errors of PS variables in ESS Different availability and quality of population info across countries Only minor differences in regression parameters found
INGRID 2016
Characteristics of Responsive survey designs (Groves & Heeringa, 2006) :
incidental deviations. Characteristics of Adaptive survey designs (Schouten et. al, 2013) :
In fact, any design phase of responsive survey design, i.e. any period in between interventions, could be adaptive (Schouten & Schlomo, 2013).
INGRID 2016
varied;
costs);
step 7;
step 6;
INGRID 2016
absolute bias, CV of response propensity, FMI, balance indicators and proxy pattern-mixture analysis.
respondents and nonrespondents on characteristics that are
relationship between these fully observed covariates and the survey outcome of interest.
sample frame data)
INGRID 2016
but only with respect to the (complete frame) variables in the model.
average response propensities (In essence one can view it as a measure that uses the variability between nonresponse adjustment weights).
(upper bound nonresponse bias for a hypothetical item), cv of response propensities and/ or (un/ conditional) partial R-indicators.
INGRID 2016
Nonresponse.” Journal of Official Statistics 27: 153-180.
. Cobben, and B. Schouten. 2011. Handbook of Nonresponse in Household Surveys. Hoboken, New Jersey: John Wiley & Sons, Inc.
Categorical Auxiliary Variables.” International Statistical Review 79: 233-254. DOI: http: / / dx.doi.org/ 10.1111/ j.1751-5823.2011.00142.x
Vectors to Reduce Nonresponse Bias.” Survey Methodology 36: 131-144.
Representativity of Survey Response Through R-Indicators and Partial R-Indicators.” International Statistical Review 80: 382-399.
. Cobben, and J. Bethlehem. 2009. “Indicators of Representativeness
Quality of Survey Data.” Public Opinion Quarterly 74: 223-243. DOI: http: / / dx.doi.org/ 10.1093/ poq/ nfq007