Multiple imputation methods for incomplete longitudinal ordinal - - PowerPoint PPT Presentation

multiple imputation methods for incomplete longitudinal
SMART_READER_LITE
LIVE PREVIEW

Multiple imputation methods for incomplete longitudinal ordinal - - PowerPoint PPT Presentation

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study Anne-Francoise DONNEAU Medical Informatics and Biostatistics School of Public Health University of Li` ege Promotor: Pr. A. Albert 14 September 2012


slide-1
SLIDE 1

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study

Anne-Francoise DONNEAU

Medical Informatics and Biostatistics School of Public Health University of Li` ege Promotor: Pr. A. Albert

14 September 2012

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 1 / 21

slide-2
SLIDE 2

Outline of the presentation

◮ Introduction ◮ Methods for (incomplete) Non-Gaussian longitudinal data

Generalized Estimating Equations (GEE) Multiple imputation based GEE (MI-GEE)

◮ Simulation plan ◮ Results ◮ Conclusions

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 2 / 21

slide-3
SLIDE 3

Introduction Ordinal longitudinal data

Analysis of ordinal longitudinal data

Units: Subjects, objects (i = 1, · · · , N)

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 3 / 21

slide-4
SLIDE 4

Introduction Ordinal longitudinal data

Analysis of ordinal longitudinal data

Units: Subjects, objects (i = 1, · · · , N) Outcome: Ordinal variable Y with K levels

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 3 / 21

slide-5
SLIDE 5

Introduction Ordinal longitudinal data

Analysis of ordinal longitudinal data

Units: Subjects, objects (i = 1, · · · , N) Outcome: Ordinal variable Y with K levels Measurement: Repeated at T time points, Yi = (Yi1, · · · , YiT)′

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 3 / 21

slide-6
SLIDE 6

Introduction Ordinal longitudinal data

Analysis of ordinal longitudinal data

Units: Subjects, objects (i = 1, · · · , N) Outcome: Ordinal variable Y with K levels Measurement: Repeated at T time points, Yi = (Yi1, · · · , YiT)′ Covariates: T × p covariates matrix Xi = (xi1, · · · , xiT)′ Time, gender, age ...

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 3 / 21

slide-7
SLIDE 7

Introduction Ordinal longitudinal data

Analysis of ordinal longitudinal data

Units: Subjects, objects (i = 1, · · · , N) Outcome: Ordinal variable Y with K levels Measurement: Repeated at T time points, Yi = (Yi1, · · · , YiT)′ Covariates: T × p covariates matrix Xi = (xi1, · · · , xiT)′ Time, gender, age ... Methods: Methods for Non-Gaussian Longitudinal Data

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 3 / 21

slide-8
SLIDE 8

Introduction Ordinal longitudinal data

Analysis of ordinal longitudinal data

Units: Subjects, objects (i = 1, · · · , N) Outcome: Ordinal variable Y with K levels Measurement: Repeated at T time points, Yi = (Yi1, · · · , YiT)′ Covariates: T × p covariates matrix Xi = (xi1, · · · , xiT)′ Time, gender, age ... Methods: Methods for Non-Gaussian Longitudinal Data Generalized estimating equations (GEE)

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 3 / 21

slide-9
SLIDE 9

Introduction Ordinal longitudinal data

Analysis of ordinal longitudinal data

Units: Subjects, objects (i = 1, · · · , N) Outcome: Ordinal variable Y with K levels Measurement: Repeated at T time points, Yi = (Yi1, · · · , YiT)′ Covariates: T × p covariates matrix Xi = (xi1, · · · , xiT)′ Time, gender, age ... Methods: Methods for Non-Gaussian Longitudinal Data Generalized estimating equations (GEE) Problem: Missing data

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 3 / 21

slide-10
SLIDE 10

Missing data Missingness

Missingness

Missing data patterns:

◮ Drop out / attrition ◮ Non-monotone missingness

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 4 / 21

slide-11
SLIDE 11

Missing data Missingness

Missingness

Missing data patterns:

◮ Drop out / attrition ◮ Non-monotone missingness

Missing data mechanism (Little and Rubin, 1987)

MCAR - Missing completely at random

◮ independent of (both observed and unobserved) measurements

MAR - Missing at random

◮ conditional on observed measurements, independent of

unobserved measurements MNAR - Missing not at random

◮ dependent on unobserved and (also possibly) observed

measurements

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 4 / 21

slide-12
SLIDE 12

Analysis - GEE Methods for Non-Gaussian Longitudinal Data

GEE

◮ GEE - extension of Generalized Linear Models to longitudinal data

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 5 / 21

slide-13
SLIDE 13

Analysis - GEE Methods for Non-Gaussian Longitudinal Data

GEE

◮ GEE - extension of Generalized Linear Models to longitudinal data ◮ Ordinal data (proportional odds model) - needs some transformations

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 5 / 21

slide-14
SLIDE 14

Analysis - GEE Methods for Non-Gaussian Longitudinal Data

GEE

◮ GEE - extension of Generalized Linear Models to longitudinal data ◮ Ordinal data (proportional odds model) - needs some transformations ◮ Define of a (K − 1) expanded vector of binary responses

Y∗

ij = (Y ∗ ij1, ...,Y ∗ ij,(K−1))’ where Y ∗ ijk = 1 if Yij = k and 0 otherwise ◮ logit[Pr(Yij ≤ k)] = logit[Pr(Y ∗ ijk = 1)] = β0k + x′ ijβ

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 5 / 21

slide-15
SLIDE 15

Analysis - GEE Methods for Non-Gaussian Longitudinal Data

GEE

◮ GEE - extension of Generalized Linear Models to longitudinal data ◮ Ordinal data (proportional odds model) - needs some transformations ◮ Define of a (K − 1) expanded vector of binary responses

Y∗

ij = (Y ∗ ij1, ...,Y ∗ ij,(K−1))’ where Y ∗ ijk = 1 if Yij = k and 0 otherwise ◮ logit[Pr(Yij ≤ k)] = logit[Pr(Y ∗ ijk = 1)] = β0k + x′ ijβ N

  • i=1

∂πi′ ∂β W−1

i

(Y∗

i − πi) = 0

where Y∗

i = (Y∗ i1, ..., Y∗ iT)′, πi = E(Y∗ i ) and Wi = V1/2 i

RiV1/2

i

with Vi the diagonal matrix of the variance of the element of Y∗

i . The matrix Ri is the

’working’ correlation matrix that expresses the dependence among repeated

  • bservations over the subjects.
  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 5 / 21

slide-16
SLIDE 16

Analysis - GEE Methods for Non-Gaussian Longitudinal Data

GEE - Large sample properties

√ N(ˆ β − β) N(0, I −1 I1I −1 )

◮ ˆ

β are consistent even if working correlation matrix is incorrect

◮ uncorrected specification of the correlation structure affects efficiency of ˆ

β

◮ valid only under MCAR

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 6 / 21

slide-17
SLIDE 17

Analysis - GEE Methods for Non-Gaussian Longitudinal Data

GEE - Large sample properties

√ N(ˆ β − β) N(0, I −1 I1I −1 )

◮ ˆ

β are consistent even if working correlation matrix is incorrect

◮ uncorrected specification of the correlation structure affects efficiency of ˆ

β

◮ valid only under MCAR ◮ What if not MCAR?

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 6 / 21

slide-18
SLIDE 18

Analysis - GEE Methods for Non-Gaussian Longitudinal Data

GEE - Large sample properties

√ N(ˆ β − β) N(0, I −1 I1I −1 )

◮ ˆ

β are consistent even if working correlation matrix is incorrect

◮ uncorrected specification of the correlation structure affects efficiency of ˆ

β

◮ valid only under MCAR ◮ What if not MCAR? ◮ Solution: Use Multiple Imputation (MI) as a preliminary step

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 6 / 21

slide-19
SLIDE 19

Multiple imputation Multiple imputation

Multiple imputation

Idea Replace each missing value by a set of M > 1 plausible values drawn from conditional distribution of unobserved values given observed ones

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 7 / 21

slide-20
SLIDE 20

Multiple imputation Multiple imputation

Multiple imputation

Idea Replace each missing value by a set of M > 1 plausible values drawn from conditional distribution of unobserved values given observed ones How

  • 1. Imputation stage - Y missing

ij

⇒ Y 1

ij , · · · , Y M ij

  • 2. Analysis stage - Analyze the M completed datasets using GEE

ˆ βm, ˆ var( ˆ βm)

  • , m = 1, · · · , M
  • 3. Pooling stage - Combination of the M results

ˆ β

∗ = 1

M

M

  • m=1

ˆ βm T = W +

  • 1 + 1

M

  • B

where W =

1 M

M

m=1 ˆ

var( ˆ βm) and B =

1 M−1

M

m=1( ˆ

βm − ˆ β

∗)( ˆ

βm − ˆ β

∗)′

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 7 / 21

slide-21
SLIDE 21

Multiple imputation Multiple imputation

Multiple imputation

Idea Replace each missing value by a set of M > 1 plausible values drawn from conditional distribution of unobserved values given observed ones How

  • 1. Imputation stage - Y missing

ij

⇒ Y 1

ij , · · · , Y M ij

  • 2. Analysis stage - Analyze the M completed datasets using GEE

ˆ βm, ˆ var( ˆ βm)

  • , m = 1, · · · , M
  • 3. Pooling stage - Combination of the M results

ˆ β

∗ = 1

M

M

  • m=1

ˆ βm T = W +

  • 1 + 1

M

  • B

where W =

1 M

M

m=1 ˆ

var( ˆ βm) and B =

1 M−1

M

m=1( ˆ

βm − ˆ β

∗)( ˆ

βm − ˆ β

∗)′

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 8 / 21

slide-22
SLIDE 22

Multiple imputation Imputation methods

Imputation mechanism

Any monotone response pattern can be written as Y = (Yo, Ymissing). Let θ represents the parameter vector of the distribution of the response Y. The idea is to impute missing data using f (Ymissing|Yo, θ).

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 9 / 21

slide-23
SLIDE 23

Multiple imputation Imputation methods

Imputation mechanism

Any monotone response pattern can be written as Y = (Yo, Ymissing). Let θ represents the parameter vector of the distribution of the response Y. The idea is to impute missing data using f (Ymissing|Yo, ˆ θ).

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 10 / 21

slide-24
SLIDE 24

Multiple imputation Imputation methods

Imputation mechanism

Any monotone response pattern can be written as Y = (Yo, Ymissing). Let θ represents the parameter vector of the distribution of the response Y. The idea is to impute missing data using f (Ymissing|Yo, ˆ θ). Imputation mechanisms based on :

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 10 / 21

slide-25
SLIDE 25

Multiple imputation Imputation methods

Imputation mechanism

Any monotone response pattern can be written as Y = (Yo, Ymissing). Let θ represents the parameter vector of the distribution of the response Y. The idea is to impute missing data using f (Ymissing|Yo, ˆ θ). Imputation mechanisms based on :

  • Markov chain Monte Carlo (MCMC)
  • Stochastic regression (ordinal logistic regression (OIM))
  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 10 / 21

slide-26
SLIDE 26

Multiple imputation Imputation methods

Imputation methods - MCMC

Assuming data arise from a multivariate Normal distribution, use an iterative imputation method based on MCMC (Schafer, 1997).

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 11 / 21

slide-27
SLIDE 27

Multiple imputation Imputation methods

Imputation methods - MCMC

Assuming data arise from a multivariate Normal distribution, use an iterative imputation method based on MCMC (Schafer, 1997).

  • 1. I-step Given starting values for θ, θ(0), values for Ymissing are simulated by

randomly drawing a value from f (Ymissing|Yo, θ(0)).

  • 2. P-step New value for θ, θ(j), is drawn from a transition distribution,

considering the previous value θ(j) ≈ hs(θ(j−1)). Both steps are iterated long enough to provide a stationary Markov chain (Ymissing

(1)

, θ(1)), (Ymissing

(2)

, θ(2)), · · · and last iteration is used to impute Ymissing in the dataset. Repeat to obtain M sets of imputed values.

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 11 / 21

slide-28
SLIDE 28

Multiple imputation Imputation methods

Imputation methods - MCMC

Assuming data arise from a multivariate Normal distribution, use an iterative imputation method based on MCMC (Schafer, 1997).

  • 1. I-step Given starting values for θ, θ(0), values for Ymissing are simulated by

randomly drawing a value from f (Ymissing|Yo, θ(0)).

  • 2. P-step New value for θ, θ(j), is drawn from a transition distribution,

considering the previous value θ(j) ≈ hs(θ(j−1)). Both steps are iterated long enough to provide a stationary Markov chain (Ymissing

(1)

, θ(1)), (Ymissing

(2)

, θ(2)), · · · and last iteration is used to impute Ymissing in the dataset. Repeat to obtain M sets of imputed values.

Problem when applied to ordinal data

◮ Normality assumption fails ◮ Imputed values are no longer integers between 1 and K → rounding

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 11 / 21

slide-29
SLIDE 29

Multiple imputation Imputation methods

Imputation methods - OIM

Ordinal imputation model: logit[Pr(Yij ≤ k)|x∗

ij]

= γ0k + x′∗

ijγ

(1) where the covariates typically include Xij, possible auxiliary covariates Aij, and the previous outcomes ˜ Yij = (Yi1, ..., Yi,j−1).

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 12 / 21

slide-30
SLIDE 30

Multiple imputation Imputation methods

Imputation methods - OIM

Ordinal imputation model: logit[Pr(Yij ≤ k)|x∗

ij]

= γ0k + x′∗

ijγ

(1) where the covariates typically include Xij, possible auxiliary covariates Aij, and the previous outcomes ˜ Yij = (Yi1, ..., Yi,j−1).

  • 1. Draw new values for parameters ˆ

Γ = (γ′

0, γ′)′,

Γ∗ = ˆ Γ + V′

hiZ

where Vhi is the upper triangular matrix of the Cholesky decomposition of V (ˆ Γ) and Z is a [(K − 1) + q]−vector of independent random Normal variates.

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 12 / 21

slide-31
SLIDE 31

Multiple imputation Imputation methods

Imputation methods - OIM

Ordinal imputation model: logit[Pr(Yij ≤ k)|x∗

ij]

= γ0k + x′∗

ijγ

(1) where the covariates typically include Xij, possible auxiliary covariates Aij, and the previous outcomes ˜ Yij = (Yi1, ..., Yi,j−1).

  • 1. Draw new values for parameters ˆ

Γ = (γ′

0, γ′)′,

Γ∗ = ˆ Γ + V′

hiZ

where Vhi is the upper triangular matrix of the Cholesky decomposition of V (ˆ Γ) and Z is a [(K − 1) + q]−vector of independent random Normal variates.

  • 2. For each missing observation, Y missing

ij

, compute the expected probabilities πk = P[Y missing

ij

= k|x∗

ij] (k = 1, ..., K), using (Eq. 1)

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 12 / 21

slide-32
SLIDE 32

Multiple imputation Imputation methods

Imputation methods - OIM

Ordinal imputation model: logit[Pr(Yij ≤ k)|x∗

ij]

= γ0k + x′∗

ijγ

(1) where the covariates typically include Xij, possible auxiliary covariates Aij, and the previous outcomes ˜ Yij = (Yi1, ..., Yi,j−1).

  • 1. Draw new values for parameters ˆ

Γ = (γ′

0, γ′)′,

Γ∗ = ˆ Γ + V′

hiZ

where Vhi is the upper triangular matrix of the Cholesky decomposition of V (ˆ Γ) and Z is a [(K − 1) + q]−vector of independent random Normal variates.

  • 2. For each missing observation, Y missing

ij

, compute the expected probabilities πk = P[Y missing

ij

= k|x∗

ij] (k = 1, ..., K), using (Eq. 1)

  • 3. Draw a random variate from a multinomial distribution with probabilities

derived in step 2.

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 12 / 21

slide-33
SLIDE 33

Multiple imputation Imputation methods

Imputation methods - OIM

Ordinal imputation model: logit[Pr(Yij ≤ k)|x∗

ij]

= γ0k + x′∗

ijγ

(1) where the covariates typically include Xij, possible auxiliary covariates Aij, and the previous outcomes ˜ Yij = (Yi1, ..., Yi,j−1).

  • 1. Draw new values for parameters ˆ

Γ = (γ′

0, γ′)′,

Γ∗ = ˆ Γ + V′

hiZ

where Vhi is the upper triangular matrix of the Cholesky decomposition of V (ˆ Γ) and Z is a [(K − 1) + q]−vector of independent random Normal variates.

  • 2. For each missing observation, Y missing

ij

, compute the expected probabilities πk = P[Y missing

ij

= k|x∗

ij] (k = 1, ..., K), using (Eq. 1)

  • 3. Draw a random variate from a multinomial distribution with probabilities

derived in step 2.

  • 4. Repeat steps 1 to 3 to obtain M sets of imputed values.
  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 12 / 21

slide-34
SLIDE 34

Simulation Data setup

Simulation plan

Longitudinal ordinal data model: logit[Pr(Yij ≤ k|xi, tj)] = β0k + βxxi + βttj + βtxxitj (k = 1, · · · K − 1) with a binary group effect (x = 0 or 1), an assessment time (t) and an interaction term between group and time.

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 13 / 21

slide-35
SLIDE 35

Simulation Data setup

Simulation plan

Longitudinal ordinal data model: logit[Pr(Yij ≤ k|xi, tj)] = β0k + βxxi + βttj + βtxxitj (k = 1, · · · K − 1) with a binary group effect (x = 0 or 1), an assessment time (t) and an interaction term between group and time. MAR missingness generation: logit[Pr(Di = j|xi, Yi,(j−1))] = ψ0 + ψxxi + ψprevYi,(j−1)

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 13 / 21

slide-36
SLIDE 36

Simulation Data setup

Simulation plan

Longitudinal ordinal data model: logit[Pr(Yij ≤ k|xi, tj)] = β0k + βxxi + βttj + βtxxitj (k = 1, · · · K − 1) with a binary group effect (x = 0 or 1), an assessment time (t) and an interaction term between group and time. MAR missingness generation: logit[Pr(Di = j|xi, Yi,(j−1))] = ψ0 + ψxxi + ψprevYi,(j−1) Model simulation parameters (Well-balanced data): K = 2, 3, 4, 5 and 7 T = 3, 5 N = 100, 300, 500 Missingness = 10%, 30%, 50% → 90 different combination patterns. For each pattern, 500 random samples were generated.

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 13 / 21

slide-37
SLIDE 37

Simulation Results - Global

Simulation results

Relative bias (%) Relative bias (Mean ± SD) MCMC OIM Difference βx 89.4 ± 13.1 99.5 ± 15.5

  • 10.1 ± 8.91

βt 84.6 ± 10.4 100.9 ± 8.95

  • 16.4 ± 9.58

βtx 90.6 ± 5.73 99.7 ± 5.37

  • 9.10 ± 4.60
  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 14 / 21

slide-38
SLIDE 38

Simulation Results - Global

Simulation results

Relative bias (%) Relative bias (Mean ± SD) MCMC OIM Difference βx 89.4 ± 13.1 99.5 ± 15.5

  • 10.1 ± 8.91

βt 84.6 ± 10.4 100.9 ± 8.95

  • 16.4 ± 9.58

βtx 90.6 ± 5.73 99.7 ± 5.37

  • 9.10 ± 4.60

Mean square error (MSE): similar for MCMC and OIM.

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 14 / 21

slide-39
SLIDE 39

Simulation Results - Global

Simulation results

Relative bias (%) Relative bias (Mean ± SD) MCMC OIM Difference βx 89.4 ± 13.1 99.5 ± 15.5

  • 10.1 ± 8.91

βt 84.6 ± 10.4 100.9 ± 8.95

  • 16.4 ± 9.58

βtx 90.6 ± 5.73 99.7 ± 5.37

  • 9.10 ± 4.60

Mean square error (MSE): similar for MCMC and OIM.

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 15 / 21

slide-40
SLIDE 40

Simulation Results - βtx

Simulation results - Relative bias βtx

Number of levels K K MCMC OIM Difference 2 92.9 ± 5.18 101.2 ± 2.93

  • 8.35 ± 4.29

3 94.1 ± 2.98 103.4 ± 4.23

  • 9.35 ± 4.34

4 88.0 ± 6.71 99.1 ± 6.05

  • 11.1 ± 4.66

5 89.1 ± 5.36 99.5 ± 3.09

  • 10.4 ± 4.70

7 88.7 ± 5.56 95.0 ± 6.12

  • 6.34 ± 3.87

< 0.0001 < 0.0001 0.034

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 16 / 21

slide-41
SLIDE 41

Simulation Results - βtx

Simulation results - Relative bias βtx

Number of levels K K MCMC OIM Difference 2 92.9 ± 5.18 101.2 ± 2.93

  • 8.35 ± 4.29

3 94.1 ± 2.98 103.4 ± 4.23

  • 9.35 ± 4.34

4 88.0 ± 6.71 99.1 ± 6.05

  • 11.1 ± 4.66

5 89.1 ± 5.36 99.5 ± 3.09

  • 10.4 ± 4.70

7 88.7 ± 5.56 95.0 ± 6.12

  • 6.34 ± 3.87

< 0.0001 < 0.0001 0.034 Number of time points T T MCMC OIM Difference 3 91.7 ± 5.82 100.9 ± 5.34

  • 9.26 ± 4.73

5 89.4 ± 5.47 98.4 ± 5.14

  • 8.94 ± 4.51

0.007 0.009 0.61

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 16 / 21

slide-42
SLIDE 42

Simulation Results - βtx

Simulation results - Relative bias βtx

Sample size N MCMC OIM Difference 100 90.5 ± 6.60 97.7 ± 6.73

  • 7.22 ± 4.18

300 90.9 ± 5.37 100.8 ± 4.77

  • 9.88 ± 4.48

500 90.2 ± 5.29 100.4 ± 3.85

  • 10.2 ± 4.67

0.74 0.027 0.0002

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 17 / 21

slide-43
SLIDE 43

Simulation Results - βtx

Simulation results - Relative bias βtx

Sample size N MCMC OIM Difference 100 90.5 ± 6.60 97.7 ± 6.73

  • 7.22 ± 4.18

300 90.9 ± 5.37 100.8 ± 4.77

  • 9.88 ± 4.48

500 90.2 ± 5.29 100.4 ± 3.85

  • 10.2 ± 4.67

0.74 0.027 0.0002 Rate of missingness Missingness MCMC OIM Difference 10% 95.4 ± 2.65 100.1 ± 2.47

  • 4.64 ± 0.94

30% 89.9 ± 3.23 99.9 ± 3.57

  • 9.94 ± 2.21

50% 86.3 ± 6.29 99.0 ± 8.31

  • 12.7 ± 4.92

< 0.0001 0.37 < 0.0001

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 17 / 21

slide-44
SLIDE 44

Conclusions Conclusions

Conclusions

Relative bias

◮ MCMC yields highly underestimated model parameters ◮ The estimates derived under the OIM method are almost unbiased.

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 18 / 21

slide-45
SLIDE 45

Conclusions Conclusions

Conclusions

Relative bias

◮ MCMC yields highly underestimated model parameters ◮ The estimates derived under the OIM method are almost unbiased. ◮

K N T Missingness βx MCMC ↑ OIM ↑ ↓ ↑ βt MCMC ↑ ↑ OIM ↑ ↓ ↑ βtx MCMC ↑ ↑ ↑ OIM ↑ ↓ ↑ ↑ Absolute bias increases ↓ Absolute bias decreases

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 18 / 21

slide-46
SLIDE 46

Conclusions Conclusions

Conclusions

Relative bias

◮ MCMC yields highly underestimated model parameters ◮ The estimates derived under the OIM method are almost unbiased. ◮

K N T Missingness βx MCMC ↑ OIM ↑ ↓ ↑ βt MCMC ↑ ↑ OIM ↑ ↓ ↑ βtx MCMC ↑ ↑ ↑ OIM ↑ ↓ ↑ ↑ Absolute bias increases ↓ Absolute bias decreases

MSE

◮ MCMC and OIM were similar

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 18 / 21

slide-47
SLIDE 47

Conclusions Conclusions

Conclusion - General

MCMC is not really recommended to impute longitudinal ordinal data.

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 19 / 21

slide-48
SLIDE 48

Conclusions Conclusions

Conclusion - General

MCMC is not really recommended to impute longitudinal ordinal data. Advisable to impute missing ordinal data using appropriate method.

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 19 / 21

slide-49
SLIDE 49

Conclusions Conclusions

Thank you.

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 20 / 21

slide-50
SLIDE 50

Simulation Results - Skewed data

Simulation results - Relative bias βtx - Skewed

No relationship between the OIM relative bias and the modeling parameters MCMC relative bias increased with K (p = 0.0002) and the rate of missingness (p = 0.0005)

Numbers of categories Relative bias (%) 2 3 4 5 7 95 105 115 125 Rate of missingness Relative bias (%) 10 30 50 95 105 115 125

Figure: Relative bias (%) of βtx according to the number of categories and the rate of

missingness (MCMC= shaded boxplot - OIM=empty boxplot)

  • AFr. Donneau (ULg)

Multiple imputation methods for incomplete longitudinal ordinal data: a simulation study 21 / 21