Analysing competing risks data using flexible parametric survival - PowerPoint PPT Presentation

Analysing competing risks data using flexible parametric survival models: what tools are available in Stata, which ones to use and when? Sarwar Islam Mozumder (sarwar.islam@le.ac.uk) Biostatistics Research Group, Dept. of Health Sciences, University of Leicester 2018 Nordic and Baltic SUGM | Oslo | 12 September 2018

1/25 h cs h cs Competing risks Death 1 ( t ) Alive from cause k = 1 2 ( t ) Death from cause k = 2 Cause-specific hazard (CSH) rate, h cs k ( t ) Instantaneous mortality (failure) rate from cause k , given that the individual is still alive up to time t

k t k u d u K h cs 0 t 1 k 2/25 S cs 1 k K S t CSH relationship with cause-specific CIF Cause-specific CIF, F k ( t ) Probability a patient will die from cause D = k by time t whilst also being at risk of dying from other competing causes of death

2/25 K h cs 0 0 K S cs CSH relationship with cause-specific CIF Cause-specific CIF, F k ( t ) ∫ t F k ( t ) = S ( u ) h cs k ( u ) d u ( ) ∫ t ∏ ∑ S ( t ) = k ( t ) = exp − k ( u ) d u k = 1 k = 1

2/25 Approaches for modelling (all) CSHs in Stata

• Can also easily include time-dependent effects (TDE) lk m lk x lk 3/25 E t s k Cause-specifjc log-cumulative PH FPM 1 l Flexible parametric survival models (FPMs) [Royston and Parmar, 2002] • Models and more accurately captures complex shapes of the (log-cumulative) baseline hazard function • A generalisation of the Weibull distribution is used with restricted cubic splines (RCS) that allows for more �exibility ln ( H cs k ( t | x k )) = s k (ln t ; γ γ k , m 0 k ) + β γ β β cs k x k γ k , m 0 k ) : baseline restricted cubic spline function on log-time s k (ln t ; γ γ

3/25 Cause-specifjc log-cumulative non-PH FPM E Flexible parametric survival models (FPMs) [Royston and Parmar, 2002] • Models and more accurately captures complex shapes of the (log-cumulative) baseline hazard function • A generalisation of the Weibull distribution is used with restricted cubic splines (RCS) that allows for more �exibility • Can also easily include time-dependent effects (TDE) ∑ ln ( H cs k ( t | x k )) = s k (ln t ; γ γ γ k , m 0 k ) + β β β cs k x k + s k (ln t ; α α α lk , m lk ) x lk l = 1 α lk , m lk ) x lk : interaction between spline variables and covariates for TDEs s k (ln t ; α α

4/25 60.28 100.00 506 Total 100.00 11.86 60 Other 88.14 27.87 141 CVD 30.63 155 Cancer 29.64 29.64 150 Censor Cum. Percent Freq. status . tab status . use "http://www.stata-journal.com/software/sj4-2/st0059/prostatecancer", clear Example dataset Load public-use prostate cancer dataset:

5/25 0.394 -12.40 .0272468 .229559 _cons 1.056594 .9785387 0.85 .1819129 .0199075 1.016818 _rcs4 1.146942 .9854806 0.114 0.000 .2896844 .0411503 treatment .9197894 .4740025 0.014 -2.45 .1116672 .6602897 [95% Conf. Interval] Note: Estimates are transformed only in the first equation. P>|z| z Std. Err. Haz. Ratio _t . stcox treatment, nolog noshow 1.58 1.06315 . stset time, failure(status == 1) id(id) scale(12) exit(time 60) Std. Err. .6594084 treatment xb [95% Conf. Interval] P>|z| z exp(b) -2.46 506 = Number of obs -440.316 Log likelihood = . stpm2 treatment, scale(hazard) df(4) eform nolog .111509 0.014 _rcs3 _rcs2 1.041871 .7567963 0.145 -1.46 .0724157 .8879662 4.336179 .4733827 2.649838 0.000 9.72 .4258797 3.389716 _rcs1 .9185368 stpm2 [Lambert and Royston, 2009]

5/25 0.185 -12.95 .0237024 .17767 _cons 1.070644 .986915 1.33 .1367912 .0213538 1.027927 _rcs4 1.017663 .8923696 0.151 0.000 .2307651 .0319403 treatment 1.679937 .8619538 0.277 1.09 .2048509 1.20334 [95% Conf. Interval] Note: Estimates are transformed only in the first equation. P>|z| z Std. Err. Haz. Ratio _t . stcox treatment, nolog noshow -1.44 .9529595 . stset time, failure(status == 2) id(id) scale(12) exit(time 60) z .2047249 1.202808 treatment xb [95% Conf. Interval] P>|z| Std. Err. 0.278 exp(b) 506 = Number of obs Log likelihood = -448.73758 . stpm2 treatment, scale(hazard) df(4) eform nolog 1.08 .8616223 _rcs3 _rcs2 .9820878 .7681357 0.025 -2.25 .0544436 .8685486 3.397384 1.679097 2.355841 0.000 11.13 .2642265 2.82908 _rcs1 stpm2 [Lambert and Royston, 2009]

5/25 0.341 -12.69 .0179093 .097687 _cons 1.09413 .9693337 0.95 .0681998 .031817 1.029843 _rcs4 1.033209 .8497164 0.192 0.000 .1399235 .0467358 treatment 1.096964 .3804893 0.106 -1.62 .1745103 .6460519 [95% Conf. Interval] Note: Estimates are transformed only in the first equation. P>|z| z Std. Err. Haz. Ratio _t . stcox treatment, nolog noshow -1.30 .9369818 . stset time, failure(status == 3) id(id) scale(12) exit(time 60) z .1737196 .6432149 treatment xb [95% Conf. Interval] P>|z| Std. Err. 0.102 exp(b) 506 = Number of obs Log likelihood = -231.45608 . stpm2 treatment, scale(hazard) df(4) eform nolog -1.63 .3788467 _rcs3 _rcs2 .9160589 .683647 0.002 -3.13 .0590788 .7913665 3.384628 1.092066 2.057219 0.000 7.64 .3351586 2.638735 _rcs1 stpm2 [Lambert and Royston, 2009]

Must be obtained by numerical approximation: • Trapezoid method - stpm2cif [Hinchliffe and Lambert, 2013] • Gauss-Legendre quadrature - stpm2cr [Mozumder et al., 2017] 6/25 0 0 h cs h cs K Estimating cause-specific CIFs after fitting FPMs Cause-specific CIF, F k ( t ) ( ) ∫ t ∫ t ∑ F k ( t ) = exp − k ( u ) d u k ( u ) d u k = 1

6/25 K h cs h cs 0 0 Estimating cause-specific CIFs after fitting FPMs Cause-specific CIF, F k ( t ) ( ) ∫ t ∫ t ∑ F k ( t ) = exp − k ( u ) d u k ( u ) d u k = 1 Must be obtained by numerical approximation: • Trapezoid method - stpm2cif [Hinchliffe and Lambert, 2013] • Gauss-Legendre quadrature - stpm2cr [Mozumder et al., 2017]

. expand 3 // augment data k = 3 times . bysort id: gen _cause=_n . //create dummy variables for each cause of death . gen _cvd=_cause==2 . gen _other=_cause==3 . gen _cancer=_cause==1 . //create cause of death event indicator variable . gen _event=(_cause==status) . label values _cause status . foreach cause in _cancer _cvd _other { 2. gen treatment ̀ cause ́ = treatment* ̀ cause ́ 3. } 7/25 stpm2cif: Data setup

7/25 1 CVD 3 7. 0 3 0 Cancer 1 2 6. 0 2 0 1 40 1 2 9. 0 3 1 40 CVD 3 1 0 2 1 40 CVD 3 8. Cancer 5. . list id status time treatment _cause _event in 1/9, sep(9) 1 2. 0 1 0 72 Censor 1. Censor _event _cause treatm~t time status id 1 72 1 0 1 0 1 Cancer 2 4. 3 0 0 72 Censor 1 3. 0 2 stpm2cif: Data setup

7/25 local bhknots ̀ cause ́ ̀ e(bhknots) ́ 10. } local k = ̀ k ́ + 1 9. local bknotstvc_opt ̀ bknotstvc_opt ́ ̀ cause ́ ̀ boundknots ̀ cause ́ ́ 8. local knotstvc_opt ̀ knotstvc_opt ́ ̀ cause ́ ̀ bhknots ̀ cause ́ ́ 7. local boundknots ̀ cause ́ ̀ e(boundary_knots) ́ 6. 5. . local knotstvc_opt estimates store stpm2 ̀ cause ́ 4. cap stpm2 treatment, df(4) scale(h) eform nolog 3. stset time, failure(status == ̀ k ́ ) exit(time 60) scale(12) 2. . foreach cause in _cancer _cvd _other { . local k = 1 . local bknotstvc_opt stpm2cif: Data setup

8/25 .1737196 0.278 .8616223 1.679097 _cvd .17767 .0237024 -12.95 0.000 .1367912 .2307651 treatment_other .6432149 -1.63 .2047249 0.102 .3788467 1.092066 _other .097687 .0179093 -12.69 0.000 .0681998 .1399235 ( output omitted ) Note: Estimates are transformed only in the first equation. 1.08 1.202808 . stset time, failure(_event == 1) exit(time 60) scale(12) xb . stpm2 treatment_cancer _cancer treatment_cvd _cvd treatment_other _other /// > , scale(h) knotstvc( ̀ knotstvc_opt ́ ) bknotstvc( ̀ bknotstvc_opt ́ ) /// > tvc(_cancer _cvd _other) rcsbaseoff nocons eform nolog Log likelihood = -1120.5192 Number of obs = 1,518 exp(b) Std. Err. z P>|z| [95% Conf. Interval] treatment_cancer treatment_cvd .6593781 .111504 -2.46 0.014 .4733607 .9184951 _cancer .2295677 .0272475 -12.40 0.000 .1819204 .2896945 stpm2cif: Fitting the model

9/25 . stpm2cif cancer cvd other, cause1(treatment_cancer 1 _cancer 1) /// > cause2(treatment_cvd 1 _cvd 1) cause3(treatment_other 1 _other 1) ci stpm2cif: Post-estimation stpm2cif Patients on treatment 1.00 Cancer Other 0.80 CVD Probability of death 0.60 0.40 0.20 0.00 0 1 2 3 4 5 Years since diagnosis

. stset time, failure(status == 1,2,3) exit(time 60) scale(12) . stpm2cr [cancer: treatment, scale(hazard) df(4)] /// > [cvd: treatment, scale(hazard) df(4)] /// > [other: treatment, scale(hazard) df(4)], /// > events(status) cause(1 2 3) cens(0) eform model(csh) 10/25 stpm2cr

11/25 . range newt 0 5 100 . predict cifgq_trt1, cif at(treatment 1) timevar(newt) ci stpm2cr: Post-estimation stpm2cif vs stpm2cr Patients on treatment 0.40 Cancer (stpm2cif) Cancer (stpm2cr) Other (stpm2cif) Other (stpm2cr) CVD (stpm2cif) CVD (stpm2cr) 0.30 Probability of death 0.20 0.10 0.00 0 1 2 3 4 5 Years since diagnosis

. expand 500 //now 253,000 observations . replace time = time + runiform()*0.0001 . replace id = _n variable id was int now long 12/25 Note on computational time Time (secs) stpm2cr model 52.60 stpm2 (stacked data) 76.59 stpm2cr predict (w/ CIs) 2.56 stpm2cif (w/ CIs) 11.10

Analysing competing risks data using flexible parametric survival - PowerPoint PPT Presentation

Analysing competing risks data using flexible parametric survival models: what tools are available in Stata, which ones to use and when? Sarwar Islam Mozumder (sarwar.islam@le.ac.uk) Biostatistics Research Group, Dept. of Health Sciences,

Analysing competing risks data using flexible parametric survival models: what tools are

MLSE Channel Estimation MLSE Channel Estimation MLSE Channel Estimation Parametric or Non-

13. hypothesis testing 1 competing hypotheses 2 competing hypotheses 3 competing hypotheses

Semi-parametric and response setup non-parametric approaches to Parametric models

GLOBAL RISKS GLOBAL RISKS GLOBAL RISKS - GLOBAL RISKS - - - GLOBAL RISKS GLOBAL RISKS

Week 5 Kullmann Analysing BFS Depth-first search Depth-first search Analysing DFS

Session Five Five Session Session Five Competing in a global world: a Competing in a global

The The Beverly Beverly Middle Middle School School Flexible Flexible Learning Learning

Using Groove for analysing RPGame models Model Driven Engineering Brent van Bladel University of

Flexible parametric joint modelling of longitudinal and survival data Workshop on Flexible Models

Personalized Learning Flexible Seating and Space Flexible Seating and Space Flexible Seating and

Parametric Bootstrapping 18.05 Spring 2017 Parametric bootstrapping Use the estimated parameter

Outline Introduction Background Progress on the implementation of the MTSF

Monitoring and analysing multilingual media reports Monitoring and analysing multilingual media

Analysing Object-Capability Security Toby Murray Oxford University Computing Laboratory

Analysing Kauffman Boolean Networks PAVEL EMELYANOV Institute of Informatics Systems and

Fa Farma maci e nuovi farma maci nel carcinoma ma della

Pharma-toxicological and phytochemical investigations on Harpagophytum procumbens DC. ex Meisn.

The New Common Rule Changes to the regulations for human subjects research Laurie Herraiz, RD,

High Hopes for CER Better information about the costs and benefits of different treatment

ImprovingPsychologicalOutcomes* forMenwithProstateCancer* ! Professor!Suzanne!Chambers! !

Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infrared Spectroscopic

PROACT: Iterative Design of a Patient-Centered Visualization for Effective Prostate Cancer

Manifold Mixup Alex Lamb, Vikas Verma, Christopher Beckham, Amir Najafi, Ioannis Mitliagkas,

Sambuz

Useful Links

Newsletter

Mail Us

Analysing competing risks data using flexible parametric survival - PowerPoint PPT Presentation

Analysing competing risks data using flexible parametric survival models: what tools are available in Stata, which ones to use and when? Sarwar Islam Mozumder (sarwar.islam@le.ac.uk) Biostatistics Research Group, Dept. of Health Sciences,

Analysing competing risks data using flexible parametric survival models: what tools are

MLSE Channel Estimation MLSE Channel Estimation MLSE Channel Estimation Parametric or Non-

13. hypothesis testing 1 competing hypotheses 2 competing hypotheses 3 competing hypotheses

Semi-parametric and response setup non-parametric approaches to Parametric models

GLOBAL RISKS GLOBAL RISKS GLOBAL RISKS - GLOBAL RISKS - - - GLOBAL RISKS GLOBAL RISKS

Week 5 Kullmann Analysing BFS Depth-first search Depth-first search Analysing DFS

Session Five Five Session Session Five Competing in a global world: a Competing in a global

The The Beverly Beverly Middle Middle School School Flexible Flexible Learning Learning

Using Groove for analysing RPGame models Model Driven Engineering Brent van Bladel University of

Flexible parametric joint modelling of longitudinal and survival data Workshop on Flexible Models

Personalized Learning Flexible Seating and Space Flexible Seating and Space Flexible Seating and

Parametric Bootstrapping 18.05 Spring 2017 Parametric bootstrapping Use the estimated parameter

Outline Introduction Background Progress on the implementation of the MTSF

Monitoring and analysing multilingual media reports Monitoring and analysing multilingual media

Analysing Object-Capability Security Toby Murray Oxford University Computing Laboratory

Analysing Kauffman Boolean Networks PAVEL EMELYANOV Institute of Informatics Systems and

Fa Farma maci e nuovi farma maci nel carcinoma ma della

Pharma-toxicological and phytochemical investigations on Harpagophytum procumbens DC. ex Meisn.

The New Common Rule Changes to the regulations for human subjects research Laurie Herraiz, RD,

High Hopes for CER Better information about the costs and benefits of different treatment

Improving*Psychological*Outcomes* for*Men*with*Prostate*Cancer* ! Professor!Suzanne!Chambers! !

Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infrared Spectroscopic

PROACT: Iterative Design of a Patient-Centered Visualization for Effective Prostate Cancer

Manifold Mixup Alex Lamb*, Vikas Verma*, Christopher Beckham, Amir Najafi, Ioannis Mitliagkas,

Sambuz

Useful Links

Newsletter

Mail Us

ImprovingPsychologicalOutcomes* forMenwithProstateCancer* ! Professor!Suzanne!Chambers! !

Manifold Mixup Alex Lamb, Vikas Verma, Christopher Beckham, Amir Najafi, Ioannis Mitliagkas,