Computational Methods for Nonlinear Mixed Models Douglas Bates - PowerPoint PPT Presentation

Computational Methods for Nonlinear Mixed Models Douglas Bates University of Wisconsin - Madison <Bates@Wisc.edu> Slides for this presentation are available at lme4.R-forge.R-project.org/slides/ Joint Statistical Meetings Washington, DC August 6, 2009 Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 1 / 14

Outline Introduction 1 Model definition and an example 2 The penalized least squares problem 3 Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 2 / 14

Introduction Population pharmacokinetics data are often modeled using nonlinear mixed-effects models (NLMMs). These are nonlinear because pharmacokinetic parameters - rate constants, clearance rates, etc. - occur nonlinearly in the model function. In statistical terms these are mixed-effects models because they involve both fixed-effects parameters , applying to the entire population or well-defined subsets of the population, and random effects associated with particular experimental or observational units under study. Many algorithms for obtaining parameter estimates, usually the maximum likelihood estimates (MLEs), for such models have been proposed and implemented. Comparing different algorithms is not easy. Even understanding the definition of the model and the proposed algorithm is not easy. We begin by defining the model. Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 4 / 14

Theophylline pharmacokinetics 0 5 10 15 20 25 0 5 10 15 20 25 0 5 10 15 20 25 4 9 12 10 1 5 ● ● ● 10 ●● ● ●● ● ● ● ● ●● ● 8 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● Serum concentration (mg/l) ● 6 ● ● ● ●● ● ● ● ● ● ● ● ●● 4 ● ● ● ● ● ● ● 2 ● ● ● ● ● ● ● ● 0 ● ● ● ● 6 7 8 11 3 2 10 ● ● 8 ● ● ●● ● ● ● ●●● ● ● ● ● ● ● ● ● 6 ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● 4 ● ● ● ● ● ● ● ● ● ● ● ● ● 2 ● ● ● ● ● ● ● ● ● 0 ● ● ● ● ● ● 0 5 10 15 20 25 0 5 10 15 20 25 0 5 10 15 20 25 Time since drug administration (hr) These are serum concentration profiles for 12 volunteers after injestion of an oral dose of Theophylline, as described in Pinheiro and Bates (2000). Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 6 / 14

Modeling pharmacokinetic data with a nonlinear model These are longitudinal repeated measures data. For such data the time pattern of an individual’s response is determined by pharmacokinetic parameters (e.g. rate constants) that occur nonlinearly in the expression for the expected response. The form of the nonlinear model is determined by the pharmacokinetic theory, not derived from the data. d · k e · k a · C e − k e t − e − k a t k a − k e These pharmacokinetic parameters vary over the population. We wish to characterize typical values in the population and the extent of the variation. Thus, we associate random effects with the parameters, k a , k e and C in the nonlinear model. Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 7 / 14

Linear and nonlinear mixed-effects models For both linear and nonlinear mixed-effects models, we consider the n -dimensional response random variable, Y , whose value, y , is observed, and the q -dimensional, unobserved random effects variable, B . In the models we will consider B ∼ N ( 0 , Σ θ ) . The variance-covariance matrix Σ θ can be huge but it is completely determined by a small number of variance-component parameters , θ . The conditional distribution of the response, Y , is � � µ Y | B , σ 2 I n ( Y | B = b ) ∼ N The conditional mean, µ Y | B , depends on b and on the fixed-effects parameters, β , through a linear predictor expression, Zb + Xβ . For a linear mixed model (LMM), µ Y | B is exactly the linear predictor. For an NLMM the linear predictor determines the parameter values in the nonlinear model function which then determines the mean. Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 8 / 14

Transforming to orthogonal random effects We never really form Σ θ ; we always work with the relative covariance factor , Λ θ , defined so that Σ θ = σ 2 Λ θ Λ ⊺ θ . Note that we must allow for Λ θ to be less that full rank. We define a q -dimensional “spherical” or “unit” random-effects vector, U , such that � � 0 , σ 2 I q , B = Λ θ U ⇒ Var ( B ) = σ 2 Λ θ Λ ⊺ U ∼ N θ = Σ θ . Setting U θ = Z Λ θ , the linear predictor expression becomes Zb + Xβ = Z Λ θ u + Xβ = U θ u + Xβ . Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 9 / 14

The conditional mode, ˜ u θ,β Although the probability model is defined from ( Y | U = u ) , we observe y , not u (or b ) so we want to work with the other conditional distribution, ( U | Y = y ) . The joint distribution of Y and U is Gaussian with density f Y , U ( y , u ) = f Y | U ( y | u ) f U ( u ) = exp( − 1 2 σ 2 � y − µ Y | U � 2 ) exp( − 1 2 σ 2 � u � 2 ) (2 πσ 2 ) n/ 2 (2 πσ 2 ) q/ 2 � � y − µ Y | U � 2 + � u � 2 � / (2 σ 2 )) = exp( − (2 πσ 2 ) ( n + q ) / 2 u θ,β , of the conditional distribution ( U | Y = y ) (also the The mode, ˜ mean in this case of an LMM) is �� 2 + � u � 2 � � � y − µ Y | U u θ,β = arg min ˜ u Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 10 / 14

Minimizing a penalized sum of squared residuals � � � 2 + � u � 2 is called a penalized sum of � y − µ Y | U An expression like � � � 2 is a sum of squared residuals � y − µ Y | U squared residuals because and � u � 2 is a penalty on the size of the vector u . Determining ˜ u θ,β as the minimizer of this expression is a penalized least squares (PLS) problem. For an LMM it is a penalized linear least squares problem that can be solved directly (i.e. without iterating). For an NLMM it is a penalized nonlinear least squares problem . One way to determine the solution in an LMM is to rephrase it as a linear least squares problem for an extended residual vector � � y − Xβ � � U θ � � 2 � � � � u θ,β = arg min ˜ − u � � 0 I q u This is sometimes called a pseudo-data approach because we create the effect of the penalty term, � u � 2 , by adding “pseudo-observations” to y and to the predictor. Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 12 / 14

The profiled deviance for LMMs � ˜ � We can see that ˜ u θ,β satisfies U ⊺ θ U θ + I q u θ,β = U ⊺ θ ( y − Xβ ) which we solve using the sparse Cholesky decomposition � � L θ L ⊺ θ = P U ⊺ θ U θ + I q P ⊺ P is a permutation matrix that has practical importance but does not affect the theory. The matrix L θ is the sparse, lower-triangular factor. Let r 2 ( θ , β ) be the minimum penalized residual sum of squares, then ℓ ( θ , β , σ | y ) = log L ( θ , β , σ | y ) can be written − 2 ℓ ( θ , β , σ | y ) = n log(2 πσ 2 ) + r 2 ( θ , β ) + log( | L θ | 2 ) σ 2 The conditional estimate of σ 2 is σ 2 ( θ , β ) = r 2 ( θ , β ) � n producing the profiled deviance � � 2 πr 2 ( θ , β ) �� − 2˜ ℓ ( θ , β | y ) = log( | L θ | 2 ) + n 1 + log n Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 13 / 14

Profiling the deviance with respect to β for LMMs In a LMM the deviance depends on β only through r 2 ( θ , β ) we can obtain the conditional estimate, � β θ , by extending the PLS problem to � � y − Xβ − U θ u � 2 + � u � 2 � r 2 ( θ ) = min u , β with the solution satisfying the equations � U ⊺ � � ˜ � � U ⊺ � U ⊺ θ U θ + I q u θ θ X θ y = � X ⊺ U θ X ⊺ X X ⊺ y . β θ The profiled deviance, which is a function of θ only, is � � 2 πr 2 ( θ ) �� − 2˜ ℓ ( θ ) = log( | L θ | 2 ) + n 1 + log n Douglas Bates (U.Wisc) Nonlinear mixed models August 6, 2009 14 / 14

Computational Methods for Nonlinear Mixed Models Douglas Bates - PowerPoint PPT Presentation

Computational Methods for Nonlinear Mixed Models Douglas Bates University of Wisconsin - Madison <Bates@Wisc.edu> Slides for this presentation are available at lme4.R-forge.R-project.org/slides/ Joint Statistical Meetings Washington, DC

Mixed models in R using the lme4 package Part 6: Nonlinear mixed models Douglas Bates Madison

Nonlinear mixed-effects models using Stata Yulia Marchenko Executive Director of Statistics

Nonlinear mixed-effects models using Stata Yulia Marchenko Executive Director of Statistics

Regression 2: Mixed Models Marco Baroni Practical Statistics in R Outline Mixed models with

Mixing it up with random effects Joshua Loftus Mixed models Intro to mixed models What is a

Mixed Methodological Analysis David F. Feldon Utah State University May 8, 2018 Mixed Methods

Mixed models in R using the lme4 package Part 8: Nonlinear mixed models Douglas Bates University

Nonlinear Control Lecture # 31 Nonlinear Observers Nonlinear Control Lecture # 31 Nonlinear

Nonlinear Control Lecture # 22 Special nonlinear Forms Nonlinear Control Lecture # 22 Special

Nonlinear Control Lecture # 21 Special nonlinear Forms Nonlinear Control Lecture # 21 Special

Mixed models in R using the lme4 package Part 9: Nonlinear mixed models Douglas Bates University

Mixed models in R using the lme4 package Part 8: Nonlinear mixed models Douglas Bates University

Mixed Oxides in Selective Mixed Oxides in Selective Mixed Oxides in Selective Mixed Oxides in

Mixed Precision Training PAI Overview What is mixed-precision

Nonlinear Control Lecture # 8 Special nonlinear Forms Nonlinear Control Lecture # 8 Special

Nonlinear Control Lecture # 12 Nonlinear Observers and Output Feedback Stabilization Nonlinear

Linear regression Linear regression is a simple approach to supervised learning. It assumes

Regression Regression is a predictive method (like the nearest neighbour algorithm) The

CS 4495 Computer Vision Tracking 2: Particle Filters Aaron Bobick School of Interactive

Biased Resampling Strategies for Imbalanced Spatio-temporal Forecasting M A R I AN A O L IV EI R

Abstract/Methodology: ARMA International (a.k.a. the Association of Records Managers and

The Nonprofit Racial Leadership Gap in Massachusetts Presented by: Sean Thomas-Breitfeld and

Index of Aggregate Work Hours per Person Index, 2007=100 Monthly data available as of Feb-2016

The Fitted Response Surface Graph the fitted surface and its standard error: response.R 1 / 17