RLRsim : Testing for Random Effects or Nonparametric Regression - PowerPoint PPT Presentation

Background & Problem Description Implementation & Application Examples Simulation Study RLRsim : Testing for Random Effects or Nonparametric Regression Functions in Additive Mixed Models Fabian Scheipl 1 joint work with Sonja Greven 1 , 2 and Helmut K¨ uchenhoff 1 1 Department of Statistics, LMU M¨ unchen, Germany 2 Department of Biostatistics, Johns Hopkins University, USA useR! 2008 August 13, 2008

Background & Problem Description Implementation & Application Examples Simulation Study Outline Background & Problem Description Implementation & Application Examples Simulation Study

Background & Problem Description Implementation & Application Examples Simulation Study Linear Mixed Models L � y = X β + Z l b l + ε l =1 b l ∼ N K l ( 0 , λ l σ 2 ε Σ l ) , b l ⊥ b s ∀ l � = s ε ∼ N n ( 0 , σ 2 ε I n ) ,

Background & Problem Description Implementation & Application Examples Simulation Study Linear Mixed Models L � y = X β + Z l b l + ε l =1 b l ∼ N K l ( 0 , λ l σ 2 ε Σ l ) , b l ⊥ b s ∀ l � = s ε ∼ N n ( 0 , σ 2 ε I n ) , We want to test H 0 , l : λ l = 0 versus H A , l : λ l > 0 ⇔ H 0 , l : Var( b l ) = 0 versus H A , l : Var( b l ) > 0 Application examples: ◮ testing for equality of means between groups/subjects ◮ testing for linearity of a smooth function

Background & Problem Description Implementation & Application Examples Simulation Study Additive Models as Linear Mixed Models Simple additive model: y = f ( x ) + ε J � f ( x i ) ≈ δ j B j ( x i ) j =1 � y − B δ � 2 + 1 ◮ fit via PLS: min � λ δ ′ P δ � δ ◮ reparametrize s.t. PLS-estimation is equivalent to (RE)ML-estimation

Background & Problem Description Implementation & Application Examples Simulation Study Additive Models as Linear Mixed Models Simple additive model: y = f ( x ) + ε J � f ( x i ) ≈ δ j B j ( x i ) j =1 � y − B δ � 2 + 1 ◮ fit via PLS: min � λ δ ′ P δ � δ ◮ reparametrize s.t. PLS-estimation is equivalent to (RE)ML-estimation given λ in a LMM with ◮ fixed effects for the unpenalized part of f ( x ) i . i . d . ◮ random effects ( ∼ N (0 , λσ 2 ε )) for the deviations from the unpenalized part (Brumback, Ruppert, Wand, 1999; Fahrmeir, Kneib, Lang, 2004) ◮ In R: mgcv::gamm(), lmeSplines

Background & Problem Description Implementation & Application Examples Simulation Study Problem: Likelihood Ratio Tests for Zero Variance Components General Case: i . i . d . ◮ y 1 , . . . , y n ∼ f ( y | θ ); θ = ( θ 1 , . . . , θ p ) ◮ Test: H 0 : θ i = θ 0 i versus H A : θ i � = θ 0 i θ 0 | y ) n →∞ ◮ LRT = 2 log L ( ˆ θ | y ) − 2 log L ( ˆ χ 2 ∼ 1

Background & Problem Description Implementation & Application Examples Simulation Study Problem: Likelihood Ratio Tests for Zero Variance Components General Case: i . i . d . ◮ y 1 , . . . , y n ∼ f ( y | θ ); θ = ( θ 1 , . . . , θ p ) ◮ Test: H 0 : θ i = θ 0 i versus H A : θ i � = θ 0 i θ 0 | y ) n →∞ ◮ LRT = 2 log L ( ˆ θ | y ) − 2 log L ( ˆ χ 2 ∼ 1 Problem for testing H 0 : Var( b l ) = 0 Underlying assumptions for asymptotics violated: ◮ data in LMM not independent ◮ θ 0 not an interior point of the parameter space Θ

Background & Problem Description Implementation & Application Examples Simulation Study Previous Results: ◮ Stram, Lee (1994); Self, Liang (1987): for i . i . d . observations/subvectors, testing on the boundary of Θ: LRT as ∼ 0 . 5 δ 0 : 0 . 5 χ 2 1

Background & Problem Description Implementation & Application Examples Simulation Study Previous Results: ◮ Stram, Lee (1994); Self, Liang (1987): for i . i . d . observations/subvectors, testing on the boundary of Θ: LRT as ∼ 0 . 5 δ 0 : 0 . 5 χ 2 1 ◮ Crainiceanu, Ruppert (2004): ◮ Stram/Lee mixture very conservative for non-i . i . d . data, small samples ◮ LRT often with large point mass at zero, restricted LRT ( RLRT ) more useful ◮ derive exact finite sample distributions of LRT and RLRT in LMMs with one variance component

Background & Problem Description Implementation & Application Examples Simulation Study Previous Results: ◮ Stram, Lee (1994); Self, Liang (1987): for i . i . d . observations/subvectors, testing on the boundary of Θ: LRT as ∼ 0 . 5 δ 0 : 0 . 5 χ 2 1 ◮ Crainiceanu, Ruppert (2004): ◮ Stram/Lee mixture very conservative for non-i . i . d . data, small samples ◮ LRT often with large point mass at zero, restricted LRT ( RLRT ) more useful ◮ derive exact finite sample distributions of LRT and RLRT in LMMs with one variance component ◮ Greven et al. (2007): pseudo-ML arguments to justify application of results in Crainiceanu, Ruppert (2004) to models with multiple variance components

Background & Problem Description Implementation & Application Examples Simulation Study RLRsim : Algorithm � K � � 1 + N n ( λ ) � � RLRT n ∼ sup ( n − p ) log − log (1 + λµ k , n ) , D n ( λ ) λ ≥ 0 k =1 K K n − p w 2 λµ k , n � w 2 � � w 2 k N n ( λ ) = k ; D n ( λ ) = + k 1 + λµ k , n 1 + λµ k , n k =1 k =1 k = K +1 w k ∼ N (0 , 1); µ : eigenvalues of Σ 1 / 2 Z ′ ( I n − X(X ′ X) − 1 X ) ZΣ 1 / 2

Background & Problem Description Implementation & Application Examples Simulation Study RLRsim : Algorithm � K � � 1 + N n ( λ ) � � RLRT n ∼ sup ( n − p ) log − log (1 + λµ k , n ) , D n ( λ ) λ ≥ 0 k =1 K K n − p w 2 λµ k , n � w 2 � � w 2 k N n ( λ ) = k ; D n ( λ ) = + k 1 + λµ k , n 1 + λµ k , n k =1 k =1 k = K +1 w k ∼ N (0 , 1); µ : eigenvalues of Σ 1 / 2 Z ′ ( I n − X(X ′ X) − 1 X ) ZΣ 1 / 2 Rapid simulation from this distribution: ◮ do eigenvalue decomposition to get µ ◮ repeat: ◮ draw ( K + 1) χ 2 variates ◮ one-dimensional maximization in λ (via grid search)

Background & Problem Description Implementation & Application Examples Simulation Study RLRsim : Algorithm � K � � 1 + N n ( λ ) � � RLRT n ∼ sup ( n − p ) log − log (1 + λµ k , n ) , D n ( λ ) λ ≥ 0 k =1 K K n − p w 2 λµ k , n � w 2 � � w 2 k N n ( λ ) = k ; D n ( λ ) = + k 1 + λµ k , n 1 + λµ k , n k =1 k =1 k = K +1 w k ∼ N (0 , 1); µ : eigenvalues of Σ 1 / 2 Z ′ ( I n − X(X ′ X) − 1 X ) ZΣ 1 / 2 Rapid simulation from this distribution: ◮ do eigenvalue decomposition to get µ ◮ repeat: ◮ draw ( K + 1) χ 2 variates ◮ one-dimensional maximization in λ (via grid search) → computational cost depends on K , not n → implemented in C ⇒ quasi-instantaneous → easy extension to models with L > 1

Background & Problem Description Implementation & Application Examples Simulation Study Example: One Variance Component Test for random intercept ( nlme::lme ): > m0 <- lme(distance ~ age + Sex, data = Orthodont, random = ~ 1) > system.time(print( exactRLRT(m0) ), gcFirst=T) simulated finite sample distribution of RLRT. (p-value based on 10000 simulated values) RLRT = 47.0114, p-value < 2.2e-16 user system elapsed 0.42 0.00 0.42 > system.time(simulate.lme(m0,nsim=10000,method= ✬ REML ✬ ), gcFirst=T) user system elapsed 55.00 0.03 55.48

Background & Problem Description Implementation & Application Examples Simulation Study Example: Two Variance Components Test for random slope with nuisance random intercept ( lme4::lmer ): > m0 <- lmer(distance ~ age + Sex + (1|Subject), data = Orthodont) > mA <- update(m0, .~. + (0 + age|Subject)) > mSlope <- update(mA, .~. - (1|Subject)) > exactRLRT(mSlope, mA, m0) simulated finite sample distribution of RLRT. (p-value based on 10000 simulated values) RLRT = 0.8672, p-value = 0.1603

Background & Problem Description Implementation & Application Examples Simulation Study Example: Testing for Linearity of a Smooth Function > library(mgcv); data(trees) > m1 <- gamm(I(log(Volume)) ~ Height + s(Girth, m = 2), + data = trees)$lme s(Girth, m=2) 1.0 0.5 s(Girth,2.72) 0.0 −0.5 8 10 12 14 16 18 20 Girth Significant deviations from linearity?

Background & Problem Description Implementation & Application Examples Simulation Study Example: Testing for Linearity of a Smooth Function > library(mgcv); data(trees) > m1 <- gamm(I(log(Volume)) ~ Height + s(Girth, m = 2), + data = trees)$lme > exactRLRT(ml) simulated finite sample distribution of RLRT. (p-value based on 10000 simulated values) RLRT = 5.4561, p-value = 0.0052

Background & Problem Description Implementation & Application Examples Simulation Study Simulation Study: Settings H 0 tested VC nuisance VCs equality of group means random intercept - random slope uni-/bivariate smooth equality of group trends random slope random intercept no effect / linearity univariate smooth - random intercept uni-/bivariate smooth additivity bivariate smooth 2 univariate smooths Goal: compare size & power of tests for zero variance components ◮ sample sizes n = 50 , 100 , 500 ◮ mildly unbalanced group sizes for K = 5 , 20 ◮ details: Scheipl, Greven, K¨ uchenhoff (2007)

RLRsim : Testing for Random Effects or Nonparametric Regression - PowerPoint PPT Presentation

Background & Problem Description Implementation & Application Examples Simulation Study RLRsim : Testing for Random Effects or Nonparametric Regression Functions in Additive Mixed Models Fabian Scheipl 1 joint work with Sonja Greven 1 ,

Random Numbers RANDOM VS PSEUDO RANDOM Truly Random numbers From Wolfram: A random number

Effects and State Liam OConnor CSE, UNSW (and Data61) Term 2 2019 1 Effects State IO

Nonparametric Regression Splines for Nonparametric Regression Splines for Regional Atmospheric

Nonparametric Sequential Change Detection for High-Dimensional Problems Yasin Ylmaz Electrical

The np package np : A Package for Nonparametric Kernel The np package implements a variety of

Nonparametric analysis of CMB Nonparametric analysis of CMB power spectrum data and consistency

Levels of Testing Chapter 12 Beyond unit testing Developer Testing stages Unit testing

Testing Terminology System testing Types of errors Function testing Structure

PS 4 Panel Models 11 December 2014 PS 4 Panel Models Pooled OLS vs Fixed Effects Pooled OLS vs

Why Mixed Effects Models? Mixed Effects Models Recap/Intro Three issues with ANOVA

Property-Based Testing Matt Bachmann @mattbachmann Testing is Important Testing is Important

Software Testing Overview What is software testing? General testing criteria Testing

Software testing Software Testing Introduction Testing levels Automated testing Principles and

1. Test page This page is for testing. This page is for testing. This page is for testing.

Chapter 2: Random Variables In this chapter we will cover: 1. Discrete Random variables, ( 2.1

Random Numbers, Files, and Onwards Random Numbers Computers cannot produce truly random numbers.

Global Ferronickel Holdings, Inc. FY2018 Analysts Briefing March 22, 2019 Dis isclaimer The

The behavior of base m etals prices So Paulo Novem ber 5 , 2 0 0 9 1 Agenda Unique

CUMBERLAND COUNTY BOARD OF COMMISSIONERS MAY 31, 2012 6:45 PM 117 DICK STREET, 1 ST FLOOR,

Aluminium for the world Q3 2011 IR PRESENTATION albasmelter.com Aluminium for the world

MERIDIAN ENERGY . NZX RETAIL INVESTOR EVENING MANAGEMENT PRESENTATION NZ electricity sector.

FIFTH AVENUE PASSENGER LOADING ZONE IMPROVING THE GASLAMP VISITOR EXPERIENCE WEEKEND CONGESTION

Manhattan Community Board 6 Monday, May 6, 2019 New York City Transit Background M14A/D is

DINKYTOWN MIXED-USE On December 9, 2019, the Planning Commission took action on

Sambuz

Useful Links

Newsletter

Mail Us