rlrsim testing for random effects or nonparametric
play

RLRsim : Testing for Random Effects or Nonparametric Regression - PowerPoint PPT Presentation

Background & Problem Description Implementation & Application Examples Simulation Study RLRsim : Testing for Random Effects or Nonparametric Regression Functions in Additive Mixed Models Fabian Scheipl 1 joint work with Sonja Greven 1 ,


  1. Background & Problem Description Implementation & Application Examples Simulation Study RLRsim : Testing for Random Effects or Nonparametric Regression Functions in Additive Mixed Models Fabian Scheipl 1 joint work with Sonja Greven 1 , 2 and Helmut K¨ uchenhoff 1 1 Department of Statistics, LMU M¨ unchen, Germany 2 Department of Biostatistics, Johns Hopkins University, USA useR! 2008 August 13, 2008

  2. Background & Problem Description Implementation & Application Examples Simulation Study Outline Background & Problem Description Implementation & Application Examples Simulation Study

  3. Background & Problem Description Implementation & Application Examples Simulation Study Linear Mixed Models L � y = X β + Z l b l + ε l =1 b l ∼ N K l ( 0 , λ l σ 2 ε Σ l ) , b l ⊥ b s ∀ l � = s ε ∼ N n ( 0 , σ 2 ε I n ) ,

  4. Background & Problem Description Implementation & Application Examples Simulation Study Linear Mixed Models L � y = X β + Z l b l + ε l =1 b l ∼ N K l ( 0 , λ l σ 2 ε Σ l ) , b l ⊥ b s ∀ l � = s ε ∼ N n ( 0 , σ 2 ε I n ) , We want to test H 0 , l : λ l = 0 versus H A , l : λ l > 0 ⇔ H 0 , l : Var( b l ) = 0 versus H A , l : Var( b l ) > 0 Application examples: ◮ testing for equality of means between groups/subjects ◮ testing for linearity of a smooth function

  5. Background & Problem Description Implementation & Application Examples Simulation Study Additive Models as Linear Mixed Models Simple additive model: y = f ( x ) + ε J � f ( x i ) ≈ δ j B j ( x i ) j =1 � y − B δ � 2 + 1 ◮ fit via PLS: min � λ δ ′ P δ � δ ◮ reparametrize s.t. PLS-estimation is equivalent to (RE)ML-estimation

  6. Background & Problem Description Implementation & Application Examples Simulation Study Additive Models as Linear Mixed Models Simple additive model: y = f ( x ) + ε J � f ( x i ) ≈ δ j B j ( x i ) j =1 � y − B δ � 2 + 1 ◮ fit via PLS: min � λ δ ′ P δ � δ ◮ reparametrize s.t. PLS-estimation is equivalent to (RE)ML-estimation given λ in a LMM with ◮ fixed effects for the unpenalized part of f ( x ) i . i . d . ◮ random effects ( ∼ N (0 , λσ 2 ε )) for the deviations from the unpenalized part (Brumback, Ruppert, Wand, 1999; Fahrmeir, Kneib, Lang, 2004) ◮ In R: mgcv::gamm(), lmeSplines

  7. Background & Problem Description Implementation & Application Examples Simulation Study Problem: Likelihood Ratio Tests for Zero Variance Components General Case: i . i . d . ◮ y 1 , . . . , y n ∼ f ( y | θ ); θ = ( θ 1 , . . . , θ p ) ◮ Test: H 0 : θ i = θ 0 i versus H A : θ i � = θ 0 i θ 0 | y ) n →∞ ◮ LRT = 2 log L ( ˆ θ | y ) − 2 log L ( ˆ χ 2 ∼ 1

  8. Background & Problem Description Implementation & Application Examples Simulation Study Problem: Likelihood Ratio Tests for Zero Variance Components General Case: i . i . d . ◮ y 1 , . . . , y n ∼ f ( y | θ ); θ = ( θ 1 , . . . , θ p ) ◮ Test: H 0 : θ i = θ 0 i versus H A : θ i � = θ 0 i θ 0 | y ) n →∞ ◮ LRT = 2 log L ( ˆ θ | y ) − 2 log L ( ˆ χ 2 ∼ 1 Problem for testing H 0 : Var( b l ) = 0 Underlying assumptions for asymptotics violated: ◮ data in LMM not independent ◮ θ 0 not an interior point of the parameter space Θ

  9. Background & Problem Description Implementation & Application Examples Simulation Study Previous Results: ◮ Stram, Lee (1994); Self, Liang (1987): for i . i . d . observations/subvectors, testing on the boundary of Θ: LRT as ∼ 0 . 5 δ 0 : 0 . 5 χ 2 1

  10. Background & Problem Description Implementation & Application Examples Simulation Study Previous Results: ◮ Stram, Lee (1994); Self, Liang (1987): for i . i . d . observations/subvectors, testing on the boundary of Θ: LRT as ∼ 0 . 5 δ 0 : 0 . 5 χ 2 1 ◮ Crainiceanu, Ruppert (2004): ◮ Stram/Lee mixture very conservative for non-i . i . d . data, small samples ◮ LRT often with large point mass at zero, restricted LRT ( RLRT ) more useful ◮ derive exact finite sample distributions of LRT and RLRT in LMMs with one variance component

  11. Background & Problem Description Implementation & Application Examples Simulation Study Previous Results: ◮ Stram, Lee (1994); Self, Liang (1987): for i . i . d . observations/subvectors, testing on the boundary of Θ: LRT as ∼ 0 . 5 δ 0 : 0 . 5 χ 2 1 ◮ Crainiceanu, Ruppert (2004): ◮ Stram/Lee mixture very conservative for non-i . i . d . data, small samples ◮ LRT often with large point mass at zero, restricted LRT ( RLRT ) more useful ◮ derive exact finite sample distributions of LRT and RLRT in LMMs with one variance component ◮ Greven et al. (2007): pseudo-ML arguments to justify application of results in Crainiceanu, Ruppert (2004) to models with multiple variance components

  12. Background & Problem Description Implementation & Application Examples Simulation Study RLRsim : Algorithm � K � � 1 + N n ( λ ) � � RLRT n ∼ sup ( n − p ) log − log (1 + λµ k , n ) , D n ( λ ) λ ≥ 0 k =1 K K n − p w 2 λµ k , n � w 2 � � w 2 k N n ( λ ) = k ; D n ( λ ) = + k 1 + λµ k , n 1 + λµ k , n k =1 k =1 k = K +1 w k ∼ N (0 , 1); µ : eigenvalues of Σ 1 / 2 Z ′ ( I n − X(X ′ X) − 1 X ) ZΣ 1 / 2

  13. Background & Problem Description Implementation & Application Examples Simulation Study RLRsim : Algorithm � K � � 1 + N n ( λ ) � � RLRT n ∼ sup ( n − p ) log − log (1 + λµ k , n ) , D n ( λ ) λ ≥ 0 k =1 K K n − p w 2 λµ k , n � w 2 � � w 2 k N n ( λ ) = k ; D n ( λ ) = + k 1 + λµ k , n 1 + λµ k , n k =1 k =1 k = K +1 w k ∼ N (0 , 1); µ : eigenvalues of Σ 1 / 2 Z ′ ( I n − X(X ′ X) − 1 X ) ZΣ 1 / 2 Rapid simulation from this distribution: ◮ do eigenvalue decomposition to get µ ◮ repeat: ◮ draw ( K + 1) χ 2 variates ◮ one-dimensional maximization in λ (via grid search)

  14. Background & Problem Description Implementation & Application Examples Simulation Study RLRsim : Algorithm � K � � 1 + N n ( λ ) � � RLRT n ∼ sup ( n − p ) log − log (1 + λµ k , n ) , D n ( λ ) λ ≥ 0 k =1 K K n − p w 2 λµ k , n � w 2 � � w 2 k N n ( λ ) = k ; D n ( λ ) = + k 1 + λµ k , n 1 + λµ k , n k =1 k =1 k = K +1 w k ∼ N (0 , 1); µ : eigenvalues of Σ 1 / 2 Z ′ ( I n − X(X ′ X) − 1 X ) ZΣ 1 / 2 Rapid simulation from this distribution: ◮ do eigenvalue decomposition to get µ ◮ repeat: ◮ draw ( K + 1) χ 2 variates ◮ one-dimensional maximization in λ (via grid search) → computational cost depends on K , not n → implemented in C ⇒ quasi-instantaneous → easy extension to models with L > 1

  15. Background & Problem Description Implementation & Application Examples Simulation Study Example: One Variance Component Test for random intercept ( nlme::lme ): > m0 <- lme(distance ~ age + Sex, data = Orthodont, random = ~ 1) > system.time(print( exactRLRT(m0) ), gcFirst=T) simulated finite sample distribution of RLRT. (p-value based on 10000 simulated values) RLRT = 47.0114, p-value < 2.2e-16 user system elapsed 0.42 0.00 0.42 > system.time(simulate.lme(m0,nsim=10000,method= ✬ REML ✬ ), gcFirst=T) user system elapsed 55.00 0.03 55.48

  16. Background & Problem Description Implementation & Application Examples Simulation Study Example: Two Variance Components Test for random slope with nuisance random intercept ( lme4::lmer ): > m0 <- lmer(distance ~ age + Sex + (1|Subject), data = Orthodont) > mA <- update(m0, .~. + (0 + age|Subject)) > mSlope <- update(mA, .~. - (1|Subject)) > exactRLRT(mSlope, mA, m0) simulated finite sample distribution of RLRT. (p-value based on 10000 simulated values) RLRT = 0.8672, p-value = 0.1603

  17. Background & Problem Description Implementation & Application Examples Simulation Study Example: Testing for Linearity of a Smooth Function > library(mgcv); data(trees) > m1 <- gamm(I(log(Volume)) ~ Height + s(Girth, m = 2), + data = trees)$lme s(Girth, m=2) 1.0 0.5 s(Girth,2.72) 0.0 −0.5 8 10 12 14 16 18 20 Girth Significant deviations from linearity?

  18. Background & Problem Description Implementation & Application Examples Simulation Study Example: Testing for Linearity of a Smooth Function > library(mgcv); data(trees) > m1 <- gamm(I(log(Volume)) ~ Height + s(Girth, m = 2), + data = trees)$lme > exactRLRT(ml) simulated finite sample distribution of RLRT. (p-value based on 10000 simulated values) RLRT = 5.4561, p-value = 0.0052

  19. Background & Problem Description Implementation & Application Examples Simulation Study Simulation Study: Settings H 0 tested VC nuisance VCs equality of group means random intercept - random slope uni-/bivariate smooth equality of group trends random slope random intercept no effect / linearity univariate smooth - random intercept uni-/bivariate smooth additivity bivariate smooth 2 univariate smooths Goal: compare size & power of tests for zero variance components ◮ sample sizes n = 50 , 100 , 500 ◮ mildly unbalanced group sizes for K = 5 , 20 ◮ details: Scheipl, Greven, K¨ uchenhoff (2007)

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend