Models for time-to-event data
From Cox’s proportional hazards model to deep learning Sebastian Pölsterl
Artificial Intelligence in Medical Imaging | Ludwig Maximilian Universität Munich
Models for time-to-event data From Coxs proportional hazards model - - PowerPoint PPT Presentation
Models for time-to-event data From Coxs proportional hazards model to deep learning Sebastian Plsterl Artificial Intelligence in Medical Imaging | Ludwig Maximilian Universitt Munich October 2 nd 2018 cole Centrale de Nantes Outline 1
Artificial Intelligence in Medical Imaging | Ludwig Maximilian Universität Munich
1 What is Survival Analysis? 2 Parametric Survival Models 3 Semiparametric Survival Models 4 Non-Linear Survival Models 5 Survival Analysis with Deep Learning 6 Conclusion
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 2 of 49
Alzheimer’s disease progression
Source: Jack et al. (2013)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 3 of 49
Remaining useful life of equipment
Source: MathWorks
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 4 of 49
Customer relationship management
FIRST VALUE (successfull trial) GROW VALUE (deployment) START CHURN CHURN CHURN DECREASE VALUE DECREASE VALUE
INCREASE USERS INCREASE USAGE EXPAND FUNC- TIONALITY Source: For Entrepreneurs
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 5 of 49
1 What is Survival Analysis? 2 Parametric Survival Models 3 Semiparametric Survival Models 4 Non-Linear Survival Models 5 Survival Analysis with Deep Learning 6 Conclusion
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 6 of 49
2 4 6 8 10 12 Time in months End of study A Lost B † C Dropped out D † E 1 2 3 4 5 6 Time since enrollment in months A Lost B † C Dropped out D † E
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 7 of 49
2 4 6 8 10 12 Time in months End of study A Lost B † C Dropped out D † E 1 2 3 4 5 6 Time since enrollment in months A Lost B † C Dropped out D † E
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 7 of 49
2 4 6 8 10 12 Time in months End of study A Lost B † C Dropped out D † E 1 2 3 4 5 6 Time since enrollment in months A Lost B † C Dropped out D † E
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 7 of 49
i
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 8 of 49
i
i
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 8 of 49
i
i
i; τ r i ]
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 8 of 49
i
i
i; τ r i ]
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 8 of 49
∞
t
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 9 of 49
∞
t
∆t→0
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 9 of 49
∞
t
∆t→0
t
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 9 of 49
Time t 5 10 15 0.2 0.4 0.6 0.8 1.0 Survival Probability S(t) Time t 5 10 15 0.2 0.4 0.6 0.8 1.0 1.2 1.4 Hazard h(t)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 10 of 49
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 11 of 49
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 11 of 49
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 11 of 49
1 What is Survival Analysis? 2 Parametric Survival Models 3 Semiparametric Survival Models 4 Non-Linear Survival Models 5 Survival Analysis with Deep Learning 6 Conclusion
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 12 of 49
i=1
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 13 of 49
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 14 of 49
Time t
Θ
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 15 of 49
Time t
Θ
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 16 of 49
Time t
Θ
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 17 of 49
Time t
i
i
Θ
i < T ≤ τ r i ; Θ | xi) =
τ r
i
τ l
i
i; Θ | xi) − S(τ r i ; Θ | xi)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 18 of 49
Putting it all together
Θ
i; Θ | xi) − S(τ r i ; Θ | xi)
October 2nd 2018 École Centrale de Nantes 19 of 49
1 2 3 4 5 Time t 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 Hazard h(t) Exponential Weibull Log logistic Gamma Gompertz Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 20 of 49
1 What is Survival Analysis? 2 Parametric Survival Models 3 Semiparametric Survival Models 4 Non-Linear Survival Models 5 Survival Analysis with Deep Learning 6 Conclusion
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 21 of 49
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 22 of 49
covariates.
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 22 of 49
October 2nd 2018 École Centrale de Nantes 23 of 49
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 23 of 49
October 2nd 2018 École Centrale de Nantes 23 of 49
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 23 of 49
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 24 of 49
.
β n
x⊤
i β − log
j∈Ri
j β)
,
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 25 of 49
1 2 3 4 5 6 Time since enrollment in months A ? (δA = 0) B † (δB = 1) C ? (δC = 0) D † (δD = 1) E ? (δE = 0)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 26 of 49
1 2 3 4 5 6 Time since enrollment in months A ? (δA = 0) B † (δB = 1) C ? (δC = 0) D † (δD = 1) E ? (δE = 0)
Comparable (tB > tD)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 26 of 49
1 2 3 4 5 6 Time since enrollment in months A ? (δA = 0) B † (δB = 1) C ? (δC = 0) D † (δD = 1) E ? (δE = 0)
Incomparable (tA > tC or tC > tA?)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 26 of 49
1 2 3 4 5 6 Time since enrollment in months A ? (δA = 0) B † (δB = 1) C ? (δC = 0) D † (δD = 1) E ? (δE = 0)
Comparable (tC > tD)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 26 of 49
1 2 3 4 5 6 Time since enrollment in months A ? (δA = 0) B † (δB = 1) C ? (δC = 0) D † (δD = 1) E ? (δE = 0)
Incomparable (tB > tC or tC > tB?)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 26 of 49
1 2 3 4 5 6 Time since enrollment in months A ? (δA = 0) B † (δB = 1) C ? (δC = 0) D † (δD = 1) E ? (δE = 0)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 26 of 49
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 27 of 49
Example
1 |P|
f(xi)< ˆ f(xj)) 1 2 3 4 5 6 Time since enrollment in months A ? (δA = 0) B † (δB = 1) C ? (δC = 0) D † (δD = 1) E ? (δE = 0) 0.25 0.5 0.75 1 ˆ f(x)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 28 of 49
Example
1 |P|
f(xi)< ˆ f(xj)) 1 2 3 4 5 6 Time since enrollment in months A ? (δA = 0) B † (δB = 1) C ? (δC = 0) D † (δD = 1) E ? (δE = 0) 0.25 0.5 0.75 1 ˆ f(x)
f(xB) < ˆ f(xD)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 28 of 49
Example
1 |P|
f(xi)< ˆ f(xj)) 1 2 3 4 5 6 Time since enrollment in months A ? (δA = 0) B † (δB = 1) C ? (δC = 0) D † (δD = 1) E ? (δE = 0) 0.25 0.5 0.75 1 ˆ f(x)
f(xC) ≮ ˆ f(xD)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 28 of 49
Example
1 |P|
f(xi)< ˆ f(xj)) 1 2 3 4 5 6 Time since enrollment in months A ? (δA = 0) B † (δB = 1) C ? (δC = 0) D † (δD = 1) E ? (δE = 0) 0.25 0.5 0.75 1 ˆ f(x)
f(xA) < ˆ f(xD)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 28 of 49
Example
1 |P|
f(xi)< ˆ f(xj)) 1 2 3 4 5 6 Time since enrollment in months A ? (δA = 0) B † (δB = 1) C ? (δC = 0) D † (δD = 1) E ? (δE = 0) 0.25 0.5 0.75 1 ˆ f(x)
f(xE) < ˆ f(xD)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 28 of 49
Example
1 |P|
f(xi)< ˆ f(xj)) 1 2 3 4 5 6 Time since enrollment in months A ? (δA = 0) B † (δB = 1) C ? (δC = 0) D † (δD = 1) E ? (δE = 0) 0.25 0.5 0.75 1 ˆ f(x)
f(xE) ≮ ˆ f(xB)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 28 of 49
Example
1 |P|
f(xi)< ˆ f(xj)) 1 2 3 4 5 6 Time since enrollment in months A ? (δA = 0) B † (δB = 1) C ? (δC = 0) D † (δD = 1) E ? (δE = 0) 0.25 0.5 0.75 1 ˆ f(x)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 28 of 49
1 What is Survival Analysis? 2 Parametric Survival Models 3 Semiparametric Survival Models 4 Non-Linear Survival Models 5 Survival Analysis with Deep Learning 6 Conclusion
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 29 of 49
i β with an
2010)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 30 of 49
w
2 + γ
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 31 of 49
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 32 of 49
Hidden layer Input layer Output layer Cox PH loss argmax
β n
δi
i β
j∈Ri
j β)
,
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 33 of 49
Hidden layer Input layer Output layer Cox PH loss argmin
Θ n
δi
j∈Ri
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 33 of 49
Problems
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 34 of 49
Biganzoli et al. (1998) and Liestøl et al. (1994)
1 2 3 4 5 6 Time since enrollment in months A ? B † C ? D † E ?
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 35 of 49
Biganzoli et al. (1998) and Liestøl et al. (1994)
1 2 3 4 5 6 Time since enrollment in months A ? Event in k-th interval? δA1 = 0, δA2 = 0, δA3 = 0 Time spent in k-th interval: ˜ yA1 = 2, ˜ yA2 = 1, ˜ yA3 = 0 B † C ? D † E ?
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 35 of 49
Biganzoli et al. (1998) and Liestøl et al. (1994)
1 2 3 4 5 6 Time since enrollment in months A ? B † Event in k-th interval? δB1 = 0, δB2 = 0, δB3 = 1 Time spent in k-th interval: ˜ yB1 = 2, ˜ yB2 = 2, ˜ yB3 = 0.5 C ? D † E ?
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 35 of 49
Biganzoli et al. (1998) and Liestøl et al. (1994)
1 2 3 4 5 6 Time since enrollment in months A ? B † C ? Event in k-th interval? δC1 = 0, δC2 = 0, δC3 = 0, Time spent in k-th interval: ˜ yC1 = 2, ˜ yC2 = 1.5, ˜ yC3 = 0 D † E ?
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 35 of 49
Biganzoli et al. (1998) and Liestøl et al. (1994)
1 2 3 4 5 6 Time since enrollment in months A ? B † C ? D † Event in k-th interval? δD1 = 1, δD2 = 0, δD3 = 0, Time spent in k-th interval: ˜ yD1 = 2, ˜ yD2 = 0, ˜ yD3 = 0 E ?
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 35 of 49
Biganzoli et al. (1998) and Liestøl et al. (1994)
l−1
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 36 of 49
Biganzoli et al. (1998) and Liestøl et al. (1994)
l−1
{λ1,...,λL} n
L
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 36 of 49
Biganzoli et al. (1998) and Liestøl et al. (1994)
l−1
{λ1,...,λL} n
L
baseline =bias term
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 36 of 49
1 What is Survival Analysis? 2 Parametric Survival Models 3 Semiparametric Survival Models 4 Non-Linear Survival Models 5 Survival Analysis with Deep Learning 6 Conclusion
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 37 of 49
1excluding work using Deep Gaussian Processes
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 38 of 49
Mobadersany et al. (2018)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 39 of 49
Mobadersany et al. (2018)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 40 of 49
Grob et al. (2018)
v(t)hj
past
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 41 of 49
Grob et al. (2018)
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 42 of 49
1 What is Survival Analysis? 2 Parametric Survival Models 3 Semiparametric Survival Models 4 Non-Linear Survival Models 5 Survival Analysis with Deep Learning 6 Conclusion
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 43 of 49
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 44 of 49
Benner, A. (2002). “Application of “Aggregated Classifiers” in Survival Time Studies”. In:
Biganzoli, E., P. Boracchi, L. Mariani, and E. Marubini (May 1998). “Feed forward neural networks for the analysis of censored survival data: a partial logistic regression approach”. In: Stat. Med. 17.10, pp. 1169–1186. issn: 0277-6715. Binder, H. and M. Schumacher (2008). “Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models”. In: BMC Bioinformatics 9,
Breiman, L., J. H. Friedman, C. J. Stone, and R. A. Ohlsen (1984). Classification and Regression Trees. Wadsworth International Group. Bühlmann, P. and B. Yu (2003). “Boosting With the L2 Loss”. In: J Am Stat Assoc 98.462, pp. 324–339. Eleuteri, A. (2008). “Support vector survival regression”. In: 4th IET International Conference on Advances in Medical, Signal and Information Processing, pp. 1–4.
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 45 of 49
Eleuteri, A. and A. F. Taktak (2012). “Support Vector Machines for Survival Regression”. In: Computational Intelligence Methods for Bioinformatics and Biostatistics. Ed. by
Faraggi, D. and R. Simon (Jan. 1995). “A neural network model for survival data”. In:
Grob, G. L., A. Cardoso, C. H. B. Liu, D. A. Little, and B. P. Chamberlain (2018). “A Recurrent Neural Network Survival Model: Predicting Web User Return Time”. In: Eur.
Hothorn, T., P. Bühlmann, S. Dudoit, A. Molinaro, and M. J. van der Laan (2006). “Survival ensembles”. In: Biostatistics 7.3, pp. 355–373. Jack, C. R., D. S. Knopman, W. J. Jagust, R. C. Petersen, M. W. Weiner, et al. (Feb. 2013). “Tracking pathophysiological processes in Alzheimer’s disease: an updated hypothetical model of dynamic biomarkers”. In: The Lancet Neurology 12.2,
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 46 of 49
Khan, F. M. and V. B. Zubek (2008). “Support Vector Regression for Censored Data (SVRc): A Novel Tool for Survival Analysis”. In: 8th IEEE International Conference on Data Mining, pp. 863–868. Li, H. and Y. Luan (2005). “Boosting proportional hazards models using smoothing splines, with applications to high-dimensional microarray data”. In: Bioinformatics 21.10, pp. 2403–2409. Liestøl, K., P. K. Andersen, and U. Andersen (June 1994). “Survival analysis and neural nets”. In: Stat. Med. 13.12, pp. 1189–1200. issn: 02776715. Mayr, A. and M. Schmid (2014). “Boosting the concordance index for survival data – a unified framework to derive and evaluate biomarker combinations”. In: PLoS One 9.1, e84483. Mobadersany, P., S. Yousefi, M. Amgad, D. A. Gutman, J. S. Barnholtz-Sloan,
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 47 of 49
Pölsterl, S., N. Navab, and A. Katouzian (2015). “Fast Training of Support Vector Machines for Survival Analysis”. In: Machine Learning and Knowledge Discovery in
and C. Soares. Lecture Notes in Computer Science, pp. 243–259. – (Sept. 2016). “An Efficient Training Algorithm for Kernel Survival Support Vector Machines”. In: 3rd Workshop on Machine Learning in Life Sciences. Ridgeway, G. (1999). “The state of boosting”. In: Computing Science and Statistics,
Schmid, M. and T. Hothorn (2008). “Flexible boosting of accelerated failure time models”. In: BMC Bioinformatics 9, p. 269. Shivaswamy, P. K., W. Chu, and M. Jansche (2007). “A Support Vector Approach to Censored Targets”. In: 7th IEEE International Conference on Data Mining, pp. 655–660. Van Belle, V., K. Pelckmans, J. A. K. Suykens, and S. Van Huffel (2008). “Survival SVM: a practical scalable algorithm”. In: ESANN, pp. 89–94.
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 48 of 49
Wang, Z. and C. Wang (2010). “Buckley-James Boosting for Survival Analysis with High-Dimensional Biomarker Data”. In: Statistical Applications in Genetics and Molecular Biology 9.1.
Sebastian Pölsterl (AI-Med) October 2nd 2018 École Centrale de Nantes 49 of 49