Nonlinear regression analysis Peter Dalgaard (orig. Lene Theil - PowerPoint PPT Presentation

Nonlinear regression analysis Peter Dalgaard (orig. Lene Theil Skovgaard) Department of Biostatistics University of Copenhagen Variance & Regression, May 2008

Nonlinear regression ◮ Simple kinetic model ◮ Compartment models ◮ Michaelis Menten reaction ◮ Dose-response relationships

How can we model non-linear effects? ◮ Polynomials tend to be very wiggly (use ’ i=rq ’ or ’ i=rc ’ in the symbol -statement) ◮ Splines piecewise interpolations, often linear or cubic

Polynomial regression i + · · · + β p x p Y i = β 0 + β 1 x i + β 2 x 2 i With the new covariates Z 1 = X , Z 2 = X 2 , · · · , Z p = X p this is just a linear multiple regression Y i = β 0 + β 1 z 1 i + β 2 z 2 i + · · · + β p z pi The model is linear in the parameters !

How can this work? The covariates Z 1 , · · · , Z p are of course correlated , but they are not linearly dependent. What do we use polynomial regression for? ◮ as model checking for the linear model ◮ as a smoothing method ◮ rarely as a ‘final model’ for publications

Linear splines ◮ Subdivide age into groups, using appropriate thresholds ◮ Fit linear effect of age in each age group ◮ Make the linear pieces ’meet’ at the thresholds The result is a bent line:

Example Oxygen consumption (from earlier exercise) days 1 105 97 104 106 2 136 161 151 153 3 173 179 174 174 5 195 182 201 172 7 207 194 206 213 10 218 193 235 229 We want to give a description of the oxygen consumption ( boc ) over time ( days )

The above plot shows that boc as a function of days is certainly not linear .

The biologists claim that the relation between boc and days can be described by a relation of the form boc = γ exp ( − β/ days ) This relation is obviously nonlinear, but may be transformed to linearity using the (natural) logarithm: log ( boc ) = log ( γ ) − β/ days

With y = logboc = log ( boc ) x = invdays = 1 / days and α = log ( γ ) we may write this equation as y = α − β x i.e., a linear relation , just with a minus sign on the slope.

A scatter plot of these new variables This plot seems reasonably linear. We get logboc = 5 . 431 − 0 . 808 × invdays

The linear regression model gives us the estimates: intercept: ˆ α = log (ˆ γ ) = 5 . 431 ( 0 . 019 ) slope: ˆ β = − 0 . 808 ( 0 . 039 ) Noting that boc ( ∞ ) = γ = exp ( α ) , we find the estimate of boc ( ∞ ) to be exp ( 5 . 431 ) = 228 . 38 with the 95% confidence interval ( exp ( 5 . 392 ) , exp ( 5 . 471 )) = ( 219 . 6 , 237 . 7 )

Residual plot and fitted curve

Why use non-linear regression? ◮ Transformation is necessary to obtain variance homogeneity, but transformation destroys linearity. ◮ Linearity does not fit, and the transformation seems to destroy other parts of the model assumptions, e.g. the assumption of variance homogeneity. ◮ Theoretical knowledge (e.g. from kinetics or physiology) indicates that the proper relation is intrinsically non-linear. ◮ Interest is in functions of the parameters that do not enter linearly in the model (e.g. kinetic rate constants or ED 50 in dose-response studies)

Untransformed: After a logarithmic trans- Linearity, but with increasing formation: Non-linearity, but variance with constant variance

Example Quantification of the Reticuloendothelial cell system ( RES ) of the liver: Concentration measurements y i over the liver, following a bolus injection of radioactive tracer First order kinetics implies c ( t ) = β ( 1 − e − γ t ) No transformation to linearity possible! y i = β ( 1 − e − γ t i ) + ε i , ε i ∼ N ( 0 , σ 2 )

Least squares method Minimize the sums of squares ( y i − β ( 1 − e − γ t i )) 2 = � � ε 2 SS ( β, γ ) = i This requires an iterative procedure

Example: RES in the liver Starting values: ◮ c ( ∞ ) = β ≈ 2000 ◮ dc dt ( 0 ) = βγ ≈ 100

Residual sum of squares, SS, as a function of only one parameter (hypothetical) We have to determine the minimum

In case of linear regression:

Residual sum of squares, SS, as a function of two parameters (hypothetical) There may be problems with convergence of the iteration procedure and the solution may be a local minimum

SAS program data reticulo; infile ’kw_res.txt’; input tid conc; run; proc nlin data=reticulo; parms beta=2000 gamma=0.05; model conc=beta*(1-exp(-gamma*tid)); run;

Output Dependent Variable conc Iterative Phase Sum of Iter beta gamma Squares 0 2000.0 0.0500 4370990 1 1958.6 0.0824 382995 2 2169.2 0.0769 26355.0 3 2174.2 0.0772 25286.7 4 2174.0 0.0773 25286.7 5 2174.0 0.0773 25286.7 NOTE: Convergence criterion met. Sum of Mean Approx Source DF Squares Square F Value Pr > F Regression 2 63048136 31524068 29920.0 <.0001 Residual 24 25286.7 1053.6 Uncorrected Total 26 63073423 Corrected Total 25 3455129 Approx Parameter Estimate Std Error Approximate 95% Confidence Limits beta 2174.0 28.3459 2115.5 2232.5 gamma 0.0773 0.00226 0.0726 0.0819

Example The fit becomes: Estimates: ◮ ˆ β = 2174 . 0 ( 28 . 3 ) , CI=(2115.5, 2232.5) ◮ ˆ γ = 0 . 0773 ( 0 . 0023 ) , CI=(0.0726, 0.0819)

Residuals Residual pattern shows systematic behaviour: so the model does not fit particularly well

Asymptotic normality of parameter estimates ◮ The distribution of the estimates is in general unknown, we only have approximate normality ◮ The approximations may be very poor for small samples Confidence regions: ◮ Asymptotically (i.e. for large sample sizes), the estimates are normally distributed ◮ The confidence areas will therefore be approximately elliptic ◮ For small sample sizes, this can become extremely misleading!

Alternative procedure Determine confidence regions directly from the sum of squares SS , i.e. as those values of the parameter, which makes SS sufficiently small.

Determining the cutoff What is “sufficently small” SS to obtain a 95% confidence region? Or: How do we determine the coverage probability for a given confidence region? This is rather technical. . . Good approximation to a (1- α )-sized region: p { θ | SS ( θ ) ≤ SS (ˆ θ )( 1 + n − pF α ( p , n − p )) }

For linear regression, the situation is: and confidence intervals become symmetric/elliptic

Example: RES in the liver Residual sum of Confidence region squares, SS : based upon SS :

c ( t ) = β ( 1 − e − γ t ) Same model as before: Simulated data, ( β = 1 , γ = 1 , σ = 0 . 05): The design is not adequate, we only see the linear part of the concentration curve!

Confidence regions based on SS ( β, γ ) :

If we spread out the x’s:

Enlarged picture, with superimposed normal approximation:

’ Optimal ’ design (in terms of a normal confidence region):

Effect of parametrization: y i = β ( 1 − e − γ t i ) + ε i Reparametrization: α = βγ y i = α γ ( 1 − e − γ t i ) + ε i

Parameters β and γ : Approx Parameter Estimate Std Error Approx. 95% Confidence Limits beta 1.0559 0.2407 0.3875 1.7244 gamma 1.0376 0.5202 -0.4067 2.4818 Approximate Correlation Matrix beta gamma beta 1.0000000 -0.9592479 gamma -0.9592479 1.0000000

Parameters α and γ : Approx Parameter Estimate Std Error Approx. 95% Confidence Limits alfa 1.0956 0.3176 0.2138 1.9774 gamma 1.0376 0.5202 -0.4067 2.4818 Approximate Correlation Matrix alfa gamma alfa 1.0000000 0.9749950 gamma 0.9749950 1.0000000

Compartment models Differential equations ⇒ multi-exponential curves Often only measurements from a single compartment! Identification problems

There may be problems with the identification of parameters, even with good quality data. This will give rise to very unprecise and extremely correlated estimates.

Compartment models yield solutions as a sum of exponential curves: f ( t ) = α 1 exp ( − λ 1 t ) + α 2 exp ( − λ 2 t ) α 1 λ 1 α 2 λ 2 1 1 1 7 11 2 7 1 1 2 11.78 6.06 3 . 1 9 . 4 Rule of thumb: There must be a ratio of at least 5 in λ 1 λ 2

Example: Dose-effect Effect of isoprenalin on heart rate ◮ E: Increase in heart rate ◮ D: Dose of isoprenalin Michaelis-Menten relation : E = E max D k d + D

Linearization (Lineweaver-Burk) E = k d + D 1 1 k d 1 E max D = + E max E max D Linear relation between the inverses: E = α + β 1 1 D with the reparametrisation: α = 1 / E max β = k d / E max E max = 1 /α k d = β/α Ex. Isoprenalin: α : 0.0165 (0.0004), β : 0.0202 (0.0004) E max : 1 / 0 . 0165 = 60 . 6 ( 57 . 8 , 63 . 7 ) k d : 0 . 0202 / 0 . 0165 = 1 . 22

Lineweaver-Burk plot The model fits nicely ???

Model fit using direct non- Model fit using Lineweaver- linear regression: Burk linearisation:

Estimates, isoprenalin E max k d 1/E linear 60.6 1.22 E, non-linear 62.67 (2.12) 1.33 (0.14) log(E), non-linear 61.59 (1.82) 1.26 (0.08)

Isoprenalin following metropolol: E max k d 1/E linear -228.2 -45.62 E, non-linear 56.97 (3.47) 5.46 (0.85) log(E), non-linear 80.73 (30.93) 11.71 (6.09)

Isoprenalin following Lineweaver-Burk plot: metropolol:

Nonlinear regression analysis Peter Dalgaard (orig. Lene Theil - PowerPoint PPT Presentation

Nonlinear regression analysis Peter Dalgaard (orig. Lene Theil Skovgaard) Department of Biostatistics University of Copenhagen Variance & Regression, May 2008 Nonlinear regression Simple kinetic model Compartment models

Nonlinear Control Lecture # 31 Nonlinear Observers Nonlinear Control Lecture # 31 Nonlinear

Nonlinear Control Lecture # 22 Special nonlinear Forms Nonlinear Control Lecture # 22 Special

Nonlinear Control Lecture # 21 Special nonlinear Forms Nonlinear Control Lecture # 21 Special

Nonlinear Control Lecture # 8 Special nonlinear Forms Nonlinear Control Lecture # 8 Special

Nonlinear Control Lecture # 12 Nonlinear Observers and Output Feedback Stabilization Nonlinear

Nonlinear Control Lecture # 20 Special nonlinear Forms Nonlinear Control Lecture # 20 Special

Business Statistics CONTENTS Multiple regression Dummy regressors Assumptions of regression

Regression 3: Logistic Regression Marco Baroni Practical Statistics in R Outline Logistic

Regression Methods 1. Linear Regression and Logistic Regression: definitions, and a common

Nonlinear models Posterior Gradient Ascent Adaptive Step Size Approach to Limit Example Will

Machine Learning 2: Nonlinear Regression Stefano Ermon April 13, 2016 Stefano Ermon April 13,

Nonlinear Control Lecture # 1 Introduction Nonlinear Control Lecture # 1 Introduction Nonlinear

Numerical Proofs in Nonlinear Control Sicun Gao, UCSD Nonlinear control working Nonlinear

Analysis of variance and regression Other types of regression models Other types of regression

Logistic Regression James H. Steiger Department of Psychology and Human Development Vanderbilt

Regression 1: Linear Regression Marco Baroni Practical Statistics in R Outline Classic linear

LUX PLR Overview Shaun Alsum 1 LUX Goals Discover WIMPs Necessary requirement:

How willing are you to be wrong? Type I and Type II Errors Type 1, Type II Errors and Power

Memory Errors Bits in memory can be flipped Hard error The chip is broken E.g.,

Machine Learning and Data Mining Ensembles of Learners Kalev Kask HW4 Download data from

Chapter 5 Initial-Value Problems for Ordinary Differential Equations Per-Olof Persson

The Galactic Center The Galactic Center Source as seen by Fermi Source as seen by Fermi Johann

Lecture Notes for Chapter 4 Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler Look for

Multiple Comparisons & Type-I Error Paul Gribble Winter, 2019 . . . . . . . . . .

Nonlinear regression analysis Peter Dalgaard (orig. Lene Theil - PowerPoint PPT Presentation

Nonlinear regression analysis Peter Dalgaard (orig. Lene Theil Skovgaard) Department of Biostatistics University of Copenhagen Variance & Regression, May 2008 Nonlinear regression Simple kinetic model Compartment models

Nonlinear Control Lecture # 31 Nonlinear Observers Nonlinear Control Lecture # 31 Nonlinear

Nonlinear Control Lecture # 22 Special nonlinear Forms Nonlinear Control Lecture # 22 Special

Nonlinear Control Lecture # 21 Special nonlinear Forms Nonlinear Control Lecture # 21 Special

Nonlinear Control Lecture # 8 Special nonlinear Forms Nonlinear Control Lecture # 8 Special

Nonlinear Control Lecture # 12 Nonlinear Observers and Output Feedback Stabilization Nonlinear

Nonlinear Control Lecture # 20 Special nonlinear Forms Nonlinear Control Lecture # 20 Special

Business Statistics CONTENTS Multiple regression Dummy regressors Assumptions of regression

Regression 3: Logistic Regression Marco Baroni Practical Statistics in R Outline Logistic

Regression Methods 1. Linear Regression and Logistic Regression: definitions, and a common

Nonlinear models Posterior Gradient Ascent Adaptive Step Size Approach to Limit Example Will

Machine Learning 2: Nonlinear Regression Stefano Ermon April 13, 2016 Stefano Ermon April 13,

Nonlinear Control Lecture # 1 Introduction Nonlinear Control Lecture # 1 Introduction Nonlinear

Numerical Proofs in Nonlinear Control Sicun Gao, UCSD Nonlinear control working Nonlinear

Analysis of variance and regression Other types of regression models Other types of regression

Logistic Regression James H. Steiger Department of Psychology and Human Development Vanderbilt

Regression 1: Linear Regression Marco Baroni Practical Statistics in R Outline Classic linear

LUX PLR Overview Shaun Alsum 1 LUX Goals Discover WIMPs Necessary requirement:

How willing are you to be wrong? Type I and Type II Errors Type 1, Type II Errors and Power

Memory Errors Bits in memory can be flipped Hard error The chip is broken E.g.,

Machine Learning and Data Mining Ensembles of Learners Kalev Kask HW4 Download data from

Chapter 5 Initial-Value Problems for Ordinary Differential Equations Per-Olof Persson

The Galactic Center The Galactic Center Source as seen by Fermi Source as seen by Fermi Johann

Lecture Notes for Chapter 4 Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler Look for

Multiple Comparisons &amp; Type-I Error Paul Gribble Winter, 2019 . . . . . . . . . .

Multiple Comparisons & Type-I Error Paul Gribble Winter, 2019 . . . . . . . . . .