computational methods for nonlinear mixed models
play

Computational Methods for Nonlinear Mixed Models Douglas Bates - PowerPoint PPT Presentation

Computational Methods for Nonlinear Mixed Models Douglas Bates University of Wisconsin - Madison <Bates@Wisc.edu> Slides for this presentation are available at lme4.R-forge.R-project.org/slides/ Joint Statistical Meetings Washington, DC


  1. Computational Methods for Nonlinear Mixed Models Douglas Bates University of Wisconsin - Madison <Bates@Wisc.edu> Slides for this presentation are available at lme4.R-forge.R-project.org/slides/ Joint Statistical Meetings Washington, DC August 6, 2009 Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 1 / 14

  2. Outline Introduction 1 Model definition and an example 2 The penalized least squares problem 3 Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 2 / 14

  3. Outline Introduction 1 Model definition and an example 2 The penalized least squares problem 3 Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 2 / 14

  4. Outline Introduction 1 Model definition and an example 2 The penalized least squares problem 3 Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 2 / 14

  5. Outline Introduction 1 Model definition and an example 2 The penalized least squares problem 3 Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 3 / 14

  6. Introduction Population pharmacokinetics data are often modeled using nonlinear mixed-effects models (NLMMs). These are nonlinear because pharmacokinetic parameters - rate constants, clearance rates, etc. - occur nonlinearly in the model function. In statistical terms these are mixed-effects models because they involve both fixed-effects parameters , applying to the entire population or well-defined subsets of the population, and random effects associated with particular experimental or observational units under study. Many algorithms for obtaining parameter estimates, usually the maximum likelihood estimates (MLEs), for such models have been proposed and implemented. Comparing different algorithms is not easy. Even understanding the definition of the model and the proposed algorithm is not easy. We begin by defining the model. Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 4 / 14

  7. Outline Introduction 1 Model definition and an example 2 The penalized least squares problem 3 Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 5 / 14

  8. Theophylline pharmacokinetics 0 5 10 15 20 25 0 5 10 15 20 25 0 5 10 15 20 25 4 9 12 10 1 5 ● ● ● 10 ●● ● ●● ● ● ● ● ●● ● 8 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● Serum concentration (mg/l) ● 6 ● ● ● ●● ● ● ● ● ● ● ● ●● 4 ● ● ● ● ● ● ● 2 ● ● ● ● ● ● ● ● 0 ● ● ● ● 6 7 8 11 3 2 10 ● ● 8 ● ● ●● ● ● ● ●●● ● ● ● ● ● ● ● ● 6 ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● 4 ● ● ● ● ● ● ● ● ● ● ● ● ● 2 ● ● ● ● ● ● ● ● ● 0 ● ● ● ● ● ● 0 5 10 15 20 25 0 5 10 15 20 25 0 5 10 15 20 25 Time since drug administration (hr) These are serum concentration profiles for 12 volunteers after injestion of an oral dose of Theophylline, as described in Pinheiro and Bates (2000). Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 6 / 14

  9. Modeling pharmacokinetic data with a nonlinear model These are longitudinal repeated measures data. For such data the time pattern of an individual’s response is determined by pharmacokinetic parameters (e.g. rate constants) that occur nonlinearly in the expression for the expected response. The form of the nonlinear model is determined by the pharmacokinetic theory, not derived from the data. d · k e · k a · C e − k e t − e − k a t k a − k e These pharmacokinetic parameters vary over the population. We wish to characterize typical values in the population and the extent of the variation. Thus, we associate random effects with the parameters, k a , k e and C in the nonlinear model. Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 7 / 14

  10. Linear and nonlinear mixed-effects models For both linear and nonlinear mixed-effects models, we consider the n -dimensional response random variable, Y , whose value, y , is observed, and the q -dimensional, unobserved random effects variable, B . In the models we will consider B ∼ N ( 0 , Σ θ ) . The variance-covariance matrix Σ θ can be huge but it is completely determined by a small number of variance-component parameters , θ . The conditional distribution of the response, Y , is � � µ Y | B , σ 2 I n ( Y | B = b ) ∼ N The conditional mean, µ Y | B , depends on b and on the fixed-effects parameters, β , through a linear predictor expression, Zb + Xβ . For a linear mixed model (LMM), µ Y | B is exactly the linear predictor. For an NLMM the linear predictor determines the parameter values in the nonlinear model function which then determines the mean. Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 8 / 14

  11. Transforming to orthogonal random effects We never really form Σ θ ; we always work with the relative covariance factor , Λ θ , defined so that Σ θ = σ 2 Λ θ Λ ⊺ θ . Note that we must allow for Λ θ to be less that full rank. We define a q -dimensional “spherical” or “unit” random-effects vector, U , such that � � 0 , σ 2 I q , B = Λ θ U ⇒ Var ( B ) = σ 2 Λ θ Λ ⊺ U ∼ N θ = Σ θ . Setting U θ = Z Λ θ , the linear predictor expression becomes Zb + Xβ = Z Λ θ u + Xβ = U θ u + Xβ . Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 9 / 14

  12. The conditional mode, ˜ u θ,β Although the probability model is defined from ( Y | U = u ) , we observe y , not u (or b ) so we want to work with the other conditional distribution, ( U | Y = y ) . The joint distribution of Y and U is Gaussian with density f Y , U ( y , u ) = f Y | U ( y | u ) f U ( u ) = exp( − 1 2 σ 2 � y − µ Y | U � 2 ) exp( − 1 2 σ 2 � u � 2 ) (2 πσ 2 ) n/ 2 (2 πσ 2 ) q/ 2 � � y − µ Y | U � 2 + � u � 2 � / (2 σ 2 )) = exp( − (2 πσ 2 ) ( n + q ) / 2 u θ,β , of the conditional distribution ( U | Y = y ) (also the The mode, ˜ mean in this case of an LMM) is �� � 2 + � u � 2 � � � y − µ Y | U u θ,β = arg min ˜ u Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 10 / 14

  13. Outline Introduction 1 Model definition and an example 2 The penalized least squares problem 3 Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 11 / 14

  14. Minimizing a penalized sum of squared residuals � � � 2 + � u � 2 is called a penalized sum of � y − µ Y | U An expression like � � � 2 is a sum of squared residuals � y − µ Y | U squared residuals because and � u � 2 is a penalty on the size of the vector u . Determining ˜ u θ,β as the minimizer of this expression is a penalized least squares (PLS) problem. For an LMM it is a penalized linear least squares problem that can be solved directly (i.e. without iterating). For an NLMM it is a penalized nonlinear least squares problem . One way to determine the solution in an LMM is to rephrase it as a linear least squares problem for an extended residual vector � � y − Xβ � � U θ � � 2 � � � � u θ,β = arg min ˜ − u � � 0 I q u This is sometimes called a pseudo-data approach because we create the effect of the penalty term, � u � 2 , by adding “pseudo-observations” to y and to the predictor. Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 12 / 14

  15. The profiled deviance for LMMs � ˜ � We can see that ˜ u θ,β satisfies U ⊺ θ U θ + I q u θ,β = U ⊺ θ ( y − Xβ ) which we solve using the sparse Cholesky decomposition � � L θ L ⊺ θ = P U ⊺ θ U θ + I q P ⊺ P is a permutation matrix that has practical importance but does not affect the theory. The matrix L θ is the sparse, lower-triangular factor. Let r 2 ( θ , β ) be the minimum penalized residual sum of squares, then ℓ ( θ , β , σ | y ) = log L ( θ , β , σ | y ) can be written − 2 ℓ ( θ , β , σ | y ) = n log(2 πσ 2 ) + r 2 ( θ , β ) + log( | L θ | 2 ) σ 2 The conditional estimate of σ 2 is σ 2 ( θ , β ) = r 2 ( θ , β ) � n producing the profiled deviance � � 2 πr 2 ( θ , β ) �� − 2˜ ℓ ( θ , β | y ) = log( | L θ | 2 ) + n 1 + log n Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 13 / 14

  16. Profiling the deviance with respect to β for LMMs In a LMM the deviance depends on β only through r 2 ( θ , β ) we can obtain the conditional estimate, � β θ , by extending the PLS problem to � � y − Xβ − U θ u � 2 + � u � 2 � r 2 ( θ ) = min u , β with the solution satisfying the equations � U ⊺ � � ˜ � � U ⊺ � U ⊺ θ U θ + I q u θ θ X θ y = � X ⊺ U θ X ⊺ X X ⊺ y . β θ The profiled deviance, which is a function of θ only, is � � 2 πr 2 ( θ ) �� − 2˜ ℓ ( θ ) = log( | L θ | 2 ) + n 1 + log n Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 14 / 14

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend