analysing geoadditive regression data a mixed model
play

Analysing geoadditive regression data: a mixed model approach - PowerPoint PPT Presentation

Analysing geoadditive regression data: a mixed model approach Thomas Kneib Institut f ur Statistik, Ludwig-Maximilians-Universit at M unchen Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Thomas Kneib Spatio-temporal


  1. Analysing geoadditive regression data: a mixed model approach Thomas Kneib Institut f¨ ur Statistik, Ludwig-Maximilians-Universit¨ at M¨ unchen Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005

  2. Thomas Kneib Spatio-temporal regression data Spatio-temporal regression data • Regression in a general sense: – Generalised linear models, – Multivariate (categorical) generalised linear models, – Regression models for survival times (Cox-type models, AFT models). • Common structure: Model a quantity of interest in terms of categorical and continuous covariates, e.g. E ( y | u ) = h ( u ′ γ ) (GLM) or λ ( t | u ) = λ 0 ( t ) exp( u ′ γ ) (Cox model) • Spatio-temporal data: Temporal and spatial information as additional covariates. Analysing geoadditive regression data: a mixed model approach 1

  3. Thomas Kneib Spatio-temporal regression data • Spatio-temporal regression models should allow – to account for spatial and temporal correlations, – for time- and space-varying effects, – for non-linear effects of continuous covariates, – for flexible interactions, – to account for unobserved heterogeneity. Analysing geoadditive regression data: a mixed model approach 2

  4. Thomas Kneib Example I: Forest health data Example I: Forest health data • Yearly forest health inventories carried out from 1983 to 2004. • 83 beeches within a 15 km times 10 km area. • Response: defoliation degree of beech i in year t , measured in three ordered categories: y it = 1 no defoliation, y it = 2 defoliation 25% or less, y it = 3 defoliation above 25%. • Covariates: calendar time, t site of the beech, s i age of the tree in years, a it further (mostly categorical) covariates. u it Analysing geoadditive regression data: a mixed model approach 3

  5. Thomas Kneib Example I: Forest health data 1.0 no damage Empirical time trends. medium damage severe damage 0.8 0.6 0.4 0.2 0.0 1985 1990 1995 2000 calendar time Empirical spatial effect. 0 1 Analysing geoadditive regression data: a mixed model approach 4

  6. Thomas Kneib Example I: Forest health data • Cumulative probit model: � θ ( r ) − η it � P ( y it ≤ r ) = Φ with standard normal cdf Φ , thresholds −∞ = θ (0) < θ (1) < θ (2) < θ (3) = ∞ and η it = f 1 ( t ) + f 2 ( age it ) + f 3 ( t, age it ) + f spat ( s i ) + u ′ it γ θ ( 1 ) θ ( 2 ) θ ( 3 ) θ ( 1 ) θ ( 2 ) θ ( 3 ) η η Analysing geoadditive regression data: a mixed model approach 5

  7. Thomas Kneib Example I: Forest health data Analysing geoadditive regression data: a mixed model approach 6

  8. Thomas Kneib Example I: Forest health data 3 0 −3 1.5 −6 1.0 5 30 55 80 105 130 155 180 205 230 0.5 age in years 0.0 2 −0.5 −1.0 1 −1.5 200 0 2000 150 a g e 1995 i n 100 y calendar time e a r s 1990 50 −1 1985 −2 1983 1990 1997 2004 calendar time Analysing geoadditive regression data: a mixed model approach 7

  9. Thomas Kneib Example I: Forest health data • Category-specific trends: � θ ( r ) − f ( r ) � 1 ( t ) − f 2 ( age it ) − f spat ( s i ) − u ′ P ( y it ≤ r ) = Φ it γ • More complicated constraints: −∞ < θ (1) − f (1) 1 ( t ) < θ (2) − f (2) 1 ( t ) < ∞ for all t. time trend 1 time trend 2 2 2 1 1 0 0 −1 −1 −2 −2 1983 1990 1997 2004 1983 1990 1997 2004 calendar time calendar time Analysing geoadditive regression data: a mixed model approach 8

  10. Thomas Kneib Structured additive regression Structured additive regression • General Idea: Replace usual parametric predictor with a flexible semiparametric predictor containing – Nonparametric effects of time scales and continuous covariates, – Spatial effects, – Interaction surfaces, – Varying coefficient terms (continuous and spatial effect modifiers), – Random intercepts and random slopes. • All effects can be cast into one general framework. Analysing geoadditive regression data: a mixed model approach 9

  11. Thomas Kneib Structured additive regression • Penalised splines. – Approximate f ( x ) by a weighted sum of B-spline basis functions. – Employ a large number of basis functions to enable flexibility. – Penalise differences between parameters of adjacent basis functions to ensure smoothness. 2 1 2 0 1 −1 2 0 1 −2 −1 −3 −1.5 0 1.5 3 0 −2 −1 −3 −1.5 0 1.5 3 −2 −3 −1.5 0 1.5 3 Analysing geoadditive regression data: a mixed model approach 10

  12. Thomas Kneib Structured additive regression • Bivariate penalised splines. ❡ ❡ ❡ ❡ ❡ ❡ ❡ ✉ ❡ ❡ ❡ ❡ ✉ ❡ ❡ ✉ ❡ ❡ ❡ ❡ ✉ ❡ ❡ ❡ ❡ ❡ ❡ ❡ ❡ • Varying coefficient models. – Effect of covariate x varies smoothly over the domain of a second covariate z : f ( x, z ) = x · g ( z ) – Spatial effect modifier ⇒ Geographically weighted regression. Analysing geoadditive regression data: a mixed model approach 11

  13. Thomas Kneib Structured additive regression • Spatial effect for regional data: Markov random fields. – Bivariate extension of a first order random walk on the real line. – Define appropriate neighbourhoods for the regions. – Assume that the expected value of f spat ( s ) is the average of the function evaluations of adjacent sites. f(t+1) E[f(t)|f(t−1),f(t+1)] τ 2 2 f(t−1) t−1 t t+1 Analysing geoadditive regression data: a mixed model approach 12

  14. Thomas Kneib Structured additive regression • Spatial effect for point-referenced data: Stationary Gaussian random fields. – Well-known as Kriging in the geostatistics literature. – Spatial effect follows a zero mean stationary Gaussian stochastic process. – Correlation of two arbitrary sites is defined by an intrinsic correlation function. – Can be interpreted as a basis function approach with radial basis functions. Analysing geoadditive regression data: a mixed model approach 13

  15. Thomas Kneib Mixed model based inference Mixed model based inference • Each term in the predictor is associated with a vector of regression coefficients with multivariate Gaussian prior / random effects distribution: � � − 1 p ( ξ j | τ 2 ξ ′ j ) ∝ exp j K j ξ j 2 τ 2 j • K j is a penalty matrix, τ 2 j a smoothing parameter. • In most cases K j is rank-deficient. ⇒ Reparametrise the model to obtain a mixed model with proper distributions. Analysing geoadditive regression data: a mixed model approach 14

  16. Thomas Kneib Mixed model based inference • Decompose ξ j = X j β j + Z j b j , where b j ∼ N (0 , τ 2 p ( β j ) ∝ const and j I ) . ⇒ β j is a fixed effect and b j is an i.i.d. random effect. • This yields the variance components model η = x ′ β + z ′ b, where in turn p ( β ) ∝ const and b ∼ N (0 , Q ) . Analysing geoadditive regression data: a mixed model approach 15

  17. Thomas Kneib Mixed model based inference • Obtain empirical Bayes estimates / penalised likelihood estimates via iterating – Penalised maximum likelihood for the regression coefficients β and b . – Restricted Maximum / Marginal likelihood for the variance parameters in Q : � L ( Q ) = L ( β, b, Q ) p ( b ) dβdb → max Q . Analysing geoadditive regression data: a mixed model approach 16

  18. Thomas Kneib Software Software • Implemented in the software package BayesX. • Available from http://www.stat.uni-muenchen.de/~bayesx Analysing geoadditive regression data: a mixed model approach 17

  19. Thomas Kneib Childhood mortality in Nigeria Childhood mortality in Nigeria • Data from the 2003 Demographic and Health Survey (DHS) in Nigeria. • Retrospective questionnaire on the health status of women in reproductive age and their children. • Survival time of n = 5323 children. • Numerous covariates including spatial information. • Analysis based on the Cox model: λ ( t ; u ) = λ 0 ( t ) exp( u ′ γ ) . Analysing geoadditive regression data: a mixed model approach 18

  20. Thomas Kneib Childhood mortality in Nigeria • Limitations of the classical Cox model: – Restricted to right censored observations. – Post-estimation of the baseline hazard. – Proportional hazards assumption. – Parametric form of the predictor. – No spatial correlations. ⇒ Geoadditive hazard regression. Analysing geoadditive regression data: a mixed model approach 19

  21. Thomas Kneib Interval censored survival times Interval censored survival times • In theory, survival times should be available in days. • Retrospective questionnaire ⇒ most uncensored survival times are rounded (Heaping). 300 250 200 150 100 50 0 0 6 12 18 24 30 36 42 48 54 • In contrast: censoring times are given in days. ⇒ Treat survival times as interval censored. Analysing geoadditive regression data: a mixed model approach 20

  22. Thomas Kneib Interval censored survival times interval censored right censored uncensored T upper 0 C T lower T Analysing geoadditive regression data: a mixed model approach 21

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend