Flexible Discriminant Analysis Using Multivariate Mixed Models
- D. Hughes
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
Flexible Discriminant Analysis Using Motivation MGLMM Multivariate - - PowerPoint PPT Presentation
Flexible Discriminant Analysis Using Multivariate Mixed Models D. Hughes Flexible Discriminant Analysis Using Motivation MGLMM Multivariate Mixed Models Discriminant Analysis ISDR Example Conclusions David Hughes 2015 Flexible
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Complex data.
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Complex data.
◮ Longitudinal
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Complex data.
◮ Longitudinal ◮ Multivariate
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Complex data.
◮ Longitudinal ◮ Multivariate ◮ Different types of data
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Complex data.
◮ Longitudinal ◮ Multivariate ◮ Different types of data ◮ Complicated correlation structure
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Complex data.
◮ Longitudinal ◮ Multivariate ◮ Different types of data ◮ Complicated correlation structure
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Univariate models using a classical linear mixed model (e.g
◮ Fails to account properly for the dependence between markers
◮ Multivariate Models for continuous markers using multivariate
◮ Not applicable if some of the markers are not continuous.
◮ Pairwise models for continuous and binary markers (Fieuws et
◮ This method in principle is suitable for our purposes but in this
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Typical assumption about the random effects distribution can
◮ This methodology only considers three continuous markers.
◮ Cluster Analysis with continuous, binary and count variables
◮ In Cluster Analysis the groups are unknown whereas in our case
◮ Software is available in the mixAK package in R created by
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Yi,r,j is the j‘th observation of the r‘th marker for patient i and
◮ We consider r = 1, . . . , R markers on i = 1, . . . , N patients. ◮ Yi,r is a vector containing all observations of marker r for
◮ Yi is a stacked vector containing all the observations of all
◮ Distribution of each marker may depend on additional
◮ It is possible for each marker to be measured at different time
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ To allow for different types of marker we model each marker
r
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ To allow for different types of marker we model each marker
r
◮ hr is a link function used depending on the type of longitudinal
◮ αr is a vector of fixed parameters for marker r. ◮ bi,r is a vector of random effects for patient i for marker r (i.e
◮ X and Z are matrices containing covariate information for each
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ The dependence between markers is captured by the joint
◮ The most common assumption is that the random effects
◮ This assumption can be difficult to verify and additional
K
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ We need to estimate the following parameters.
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ We need to estimate the following parameters.
◮ Fixed effects α = (α1, . . . , αR) ◮ Possible dispersion parameters φ = (φ1, . . . , φR) ◮ Mixture weights w = (w1, . . . , wK) ◮ Mean vector of random effects µ = (µ1, . . . , µK) ◮ Covariance matrix of random effects (vec(D1), . . . , vec(DK))
◮ In all, we need to estimate,
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Full maximum likelihood estimates are difficult to obtain due
◮ We instead use a Bayesian approach based on MCMC. ◮ We utilise weakly informative priors and a block Gibbs sampler. ◮ A benefit of this method, not explored in this talk is that
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Fit MGLMM to data in each diagnostic group g, g = 1, . . . , G
◮ Use the fitted GLMM model to derive the discriminant rule
◮ Let ˆ
◮ The prior probability of being in group g is denoted πg. ◮ Using Bayes rule it can be seen that
h=0 πhˆ
◮ Assign new patients to disease group if ˆ
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Marginal Prediction
g,new = p(ynew|θg)
◮ Conditional Prediction
g,new = p(ynew|bnew = ˜
◮ Random Effects Prediction
g,new = p(˜
◮ These values are calculated using numerical integration
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Our motivation comes from the ISDR cohort study. ◮ We consider 12,628 patients with diabetes who were screened
◮ Various markers measured over time, HbA1c and Cholesterol
◮ 600 patients had positive screening event within the
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ We consider two groups, 600 patients with a positive screening
◮ 80% of the patients in each group to train MGLMMs (one for
◮ 20% of patients to test the classification accuracy. ◮ End goal is to identify patients who will have a positive
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
Posterior Mean Standard Error Posterior Median 95% Credible Interval No STDR Group α1
1.48e-06
(-3.03e-03,-2.46e-03) α2
3.73e-05
(-1.25e-02,2.07e-03) ) α3
1.84e-06
(-3.65e-03,-2.93e-03) ) α4
4.35e-05
(-9.61e-02,-7.9e-02) α5
2.24e-08
(-2.45e-05,-1.57e-05) α6 3.66e-03 3.97e-06 3.67e-03 (2.86e-03,4.44e-03) α7
1.01e-04
(-3.64e-02,3.03e-03) α8 2.99e-04 9.87e-08 2.99e-04 (2.8e-04,3.18e-04) α9 9.24e-03 2.6e-05 9.2e-03 (4.2e-03,1.45e-02) α10 1.2e-01 6.4e-04 1.2e-01 (-5.64e-03,2.45e-01) STDR Group α1
8.62e-06
(-8.4e-03,-5.05e-03) α2
2.45e-04
(-8.03e-02,1.49e-02) α3
7.47e-06
(-4.54e-03,-1.59e-03) α4
2.3e-04
(-1.27e-01,-3.79e-02) α5
1.71e-07
(-5.7e-05,1.05e-05) α6 9.07e-03 2.21e-05 9.1e-03 (4.77e-03,1.33e-02) α7
6.13e-04
(-1.46e-01,9.26e-02) α8 4.78e-04 5.97e-07 4.78e-04 (3.61e-04,5.95e-04) α9
1.09e-04
(-2.63e-02,1.58e-02) α10
3.43e-03
(-8.16e-01,5.33e-01)
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
Posterior Mean Standard Error Posterior Median 95% Credible Interval No STDR Group E[b0] 4.15 1.02e-04 4.15 (4.13,4.17) E[b1] 6.07e-06 3.37e-08 6.11e-06 (-5.18e-07,1.26e-05) E[b2] 1.68 1.24e-04 1.68 (1.65,1.7) E[b3] 5.1e-01 2.77e-04 5.1e-01 (4.55e-01,5.65e-01) E[b4]
1.97e-03
(-3.35,-2.56) E[b5]
6.69e-07
(-4.65e-04,-2.03e-04) SD[b0] 2.71e-01 4.39e-05 2.71e-01 (2.63e-01,2.8e-01) SD[b1] 2.2e-04 5.85e-08 2.2e-04 (2.09e-04,2.32e-04) SD[b2] 1.83e-01 1.76e-05 1.83e-01 (1.8e-01,1.87e-01) SD[b3] 2.27e-01 7.16e-05 2.27e-01 (2.13e-01,2.41e-01) SD[b4] 2.54 1.27e-03 2.54 (2.3e,2.79) SD[b5] 9.46e-04 1.3e-06 9.5e-04 (6.91e-04,1.2e-03) STDR Group E[b0] 4.62 5.79e-04 4.62 (4.51,4.73) E[b1] 3.61e-05 1.88e-07 3.61e-05 (-8.23e-07,7.26e-05) E[b2] 1.65 4.96e-04 1.65 (1.55,1.74) E[b3] 3.05e-01 1.45e-03 3.01e-01 (2.17e-02,5.96e-01) E[b4] 3.81 7.94e-03 3.76 (2.38,5.44) E[b5] 8.66e-04 2.09e-06 8.88e-04 (2.82e-04,1.3e-03) SD[b0] 3.05e-01 2.11e-04 3.04e-01 (2.64e-01,3.47e-01) SD[b1] 1.75e-04 2.55e-07 1.75e-04 (1.24e-04,2.24e-04) SD[b2] 2.06e-01 1.05e-04 2.05e-01 (1.85e-01,2.26e-01) SD[b3] 3.15e-01 3.69e-04 3.16e-01 (2.4e-01,3.86e-01) SD[b4] 2.75 3.63e-03 2.71 (2.16,3.59) SD[b5] 8.94e-04 1.59e-06 8.59e-04 (6.43e-04,1.3e-03)
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
−2000 −1500 −1000 −500 3.5 4.0 4.5 5.0 Time (days) Log(HbA1c)
No STDR Group
−2000 −1500 −1000 −500 3.5 4.0 4.5 5.0 Time (days)
STDR Group
−2000 −1500 −1000 −500 0.5 1.0 1.5 2.0 2.5 Time (days) Log(Cholesterol) −2000 −1500 −1000 −500 0.5 1.0 1.5 2.0 2.5 Time (days)
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
−2000 −1500 −1000 −500 5 10 15 Time (days) Number of GP Visits
No STDR Group
−2000 −1500 −1000 −500 5 10 15 Time (days)
STDR Group
−2000 −1500 −1000 −500 0.0 0.2 0.4 0.6 0.8 1.0 Time (days) Grading −2000 −1500 −1000 −500 0.0 0.2 0.4 0.6 0.8 1.0 Time (days)
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
ROC Plot for Methods of Group Prediction
1−Specificity Sensitivity Marginal Conditional Random Effects LDA QDA
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ There is a definite advantage to using longitudinal information
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ There is a definite advantage to using longitudinal information
◮ The marginal prediction method gives the best classification for
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ There is a definite advantage to using longitudinal information
◮ The marginal prediction method gives the best classification for
◮ Our methodology is able to obtain promising classification
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Can we make more use of the credible intervals that are readily
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Can we make more use of the credible intervals that are readily
◮ Can we identify the ideal timing of the next screening interval?
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Can we make more use of the credible intervals that are readily
◮ Can we identify the ideal timing of the next screening interval? ◮ Can we include categorical longitudinal outcomes within this
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Joint work with Arnoˇ
◮ We are grateful for the support of the ISDR team. ◮ We acknowledge support from the Medical Research Council
◮ Garc´
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Brant, L.J., Sheng S.L., Morrell, C.H., Verbeke, G. N., Lesaffre, E.
◮ Fieuws, S., Verbeke, G., Maes, B., and Vanrenterghem, Y. (2008)
◮ Kom´
◮ Kom´
Flexible Discriminant Analysis Using Multivariate Mixed Models
Motivation MGLMM Discriminant Analysis ISDR Example Conclusions
◮ Lix, L.M., and Sajobi, T.T. (2010)
◮ Marshall, G., De la Cruz-Mes´
◮ Morrell, C.H., Brant, L.J., Sheng, S.L., and Metter, E. J. (2012)
◮ Tomasko, L., Helms, R.W. and Snapinn, S.M. (1999)
◮ Wernecke, K-D., Kalb, G., Schink T., and Wegner, B. (2004)