Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality
Garrick Aden-Buie, Yun Chen, Rashad Kayal, Gina Romero, Hui Yang
- Dept. of Industrial and Management Sciences Engineering
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality - - PowerPoint PPT Presentation
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality Garrick Aden-Buie, Yun Chen, Rashad Kayal, Gina Romero, Hui Yang Dept. of Industrial and Management Sciences Engineering College of Engineering University of South Florida, T
Motivation MIMIC II Clinical Data Methods Results
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 1
Motivation MIMIC II Clinical Data Methods Results
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 1
Motivation MIMIC II Clinical Data Methods Results
◮ Despite 12.2% decrease in hospitals with critical
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 2 Halpern, Neil A, and Stephen M Pastores, 2010 “Critical Care Medicine in the United States 2000-2005”
Motivation MIMIC II Clinical Data Methods Results
◮ APACHE ◮ SAPS ◮ MPM ◮ SOFA
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 3
Motivation MIMIC II Clinical Data Methods Results
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 4
Motivation MIMIC II Clinical Data Methods Results
◮ Uses heterogeneous, routinely-collected data ◮ Requires minimal preprocessing ◮ Effectively addresses sampling and missing
◮ Accurately predicts in-hospital mortality
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 5
Motivation MIMIC II Clinical Data Methods Results
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 5
Motivation MIMIC II Clinical Data Methods Results
◮ Set A: Training ◮ Set B: Validation ◮ Set C: T
◮ Age ≥ 16 years ◮ Initial ICU stay ≥ 48hrs
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 6 http://www.physionet.org/challenge/2012/
Motivation MIMIC II Clinical Data Methods Results
◮ Set A: Training ◮ Set B: T
◮ Age ≥ 16 years ◮ Initial ICU stay ≥ 48hrs
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 6 http://www.physionet.org/challenge/2012/
Motivation MIMIC II Clinical Data Methods Results
◮ 5 general descriptors ◮ 36 time series variables
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 7
Motivation MIMIC II Clinical Data Methods Results
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 8
Motivation MIMIC II Clinical Data Methods Results
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 9
ALP ALT AST Bilirubin Cholesterol SaO2 TroponinI TroponinT MechVent BUN Creatinine Glucose HCO3 HCT K Lactate Mg Na PaCO2 PaO2 pH Platelets WBC DiasABP NIDiasABP MAP NIMAP SysABP NISysABP GCS HR Temp Urine.Sum FiO2 Weight 3.50 3.75 4.00 4.25 4.50 57.50 57.75 58.00 58.25 58.50 58.50 58.75 59.00 59.25 59.50 119.50 119.75 120.00 120.25 120.50 0.00 0.25 0.50 0.75 1.00 −1.50 −1.25 −1.00 −0.75 −0.50 97 98 99 100 −1.50 −1.25 −1.00 −0.75 −0.50 2.6 2.8 3.0 3.2 −1.50 −1.25 −1.00 −0.75 −0.50 8 10 12 14 0.700 0.725 0.750 0.775 0.800 90 93 96 99 21 22 23 24 25 26 36 37 38 39 3.6 3.7 3.8 3.9 4.0 4.1 −1.50 −1.25 −1.00 −0.75 −0.50 1.6 1.7 1.8 1.9 2.0 2.1 135 136 137 138 139 140 −1.50 −1.25 −1.00 −0.75 −0.50 −1.50 −1.25 −1.00 −0.75 −0.50 −1.50 −1.25 −1.00 −0.75 −0.50 160 180 200 220 8 9 10 11 55 60 65 70 75 80 20 40 60 80 65 70 75 80 85 90 25 50 75 75 80 85 90 95 100 50 100 14.50 14.75 15.00 15.25 15.50 70 80 90 100 110 36.00 36.25 36.50 36.75 20 40 60 80 −1.50 −1.25 −1.00 −0.75 −0.50 99.50 99.75 100.00 100.25 100.50 1000 2000 1000 2000 1000 2000 1000 2000 1000 2000 1000 2000
Time
Patient 133659 −− Outcome: 0 Female Age: 46 Weight: 220lbs Height: 5' 10" BMI: 31.63 kg/m2 ICUType: 1:Coronary Care
ALP ALT AST Bilirubin Cholesterol SaO2 TroponinI TroponinT MechVent BUN Creatinine Glucose HCO3 HCT K Lactate Mg Na PaCO2 PaO2 pH Platelets WBC DiasABP NIDiasABP MAP NIMAP SysABP NISysABP GCS HR Temp Urine.Sum FiO2 Weight 3.00 3.25 3.50 3.75 71.50 71.75 72.00 72.25 72.50 498.50 498.75 499.00 499.25 499.50 67.50 67.75 68.00 68.25 68.50 0.25 0.50 0.75 1.00 −1.50 −1.25 −1.00 −0.75 −0.50 85 90 95 0.4 0.6 0.8 1.0 1.2 1.4 −1.50 −1.25 −1.00 −0.75 −0.50 0.50 0.75 1.00 1.25 1.50 24 28 32 36 0.5 0.6 0.7 0.8 100 105 110 115 120 125 28 30 32 34 32 36 40 3.2 3.4 3.6 3.8 4.0 −1.50 −1.25 −1.00 −0.75 −0.50 2.100 2.125 2.150 2.175 2.200 140 141 142 143 144 50 60 70 100 200 300 7.25 7.30 7.35 7.40 205 210 9.5 10.0 10.5 40 50 60 70 80 −1.50 −1.25 −1.00 −0.75 −0.50 60 80 100 −1.50 −1.25 −1.00 −0.75 −0.50 100 150 −1.50 −1.25 −1.00 −0.75 −0.50 8 10 12 14 60 70 80 90 100 35 36 37 38 10 20 30 0.40 0.45 0.50 0.55 0.60 51.50 51.75 52.00 52.25 52.50 1000 2000 1000 2000 1000 2000 1000 2000 1000 2000 1000 2000
Time
Patient 142106 −− Outcome: 1 Male Age: 70 Weight: 115lbs Height: 5' 2" BMI: 20.96 kg/m2 ICUType: 1:Coronary Care
Motivation MIMIC II Clinical Data Methods Results
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 11
Motivation MIMIC II Clinical Data Methods Results
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 12
DiasABP HR MAP SysABP BUN Creatinine GCS HCT Temp Platelets WBC Na HCO3 K Mg Glucose Urine.Sum pH PaCO2 PaO2 FiO2 MechVent Lactate SaO2 AST ALT Bilirubin ALP Albumin RespRate TroponinT Cholesterol TroponinI 25 50 75 100 Type Infrequent Time Series
Consistency of variable inclusion
Motivation MIMIC II Clinical Data Methods Results
◮ Are included in ≤ 45% training set patients
◮ 0 = Not recorded ◮ 1 = Recorded & within normal range ◮ 2 = Recorded & abnormal
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 14
SysABP DiasABP MAP HR Weight Urine.Sum Temp GCS FiO2 pH PaO2 PaCO2 HCT WBC Platelets Na Mg K HCO3 Glucose Creatinine BUN Lactate 20 40 60 80 Type Low−Freq Time Series Full Time Series
Number of observations per variable per patient
Motivation MIMIC II Clinical Data Methods Results
◮ < 10 observations for ≥ 75% training set patients
◮ Variables not meeting the above criteria
◮ Impute from normal distribution representing
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 16
Motivation MIMIC II Clinical Data Methods Results
◮ Requires 5, 5, 10 observations
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 17
Motivation MIMIC II Clinical Data Methods Results
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 18 Peng, H, Fulmi Long, and C Ding, 2005. “Feature Selection Based on Mutual Information Criteria of Max-Dependency, Max-Relevance, and Min-Redundancy.”
Motivation MIMIC II Clinical Data Methods Results
◮ Natural handling of heterogeneous data ◮ Non-linear ◮ Minimal preprocessing
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 19 Schapire, Robert E. "The strength of weak learnability." Machine learning 5, no. 2 (1990)
Motivation MIMIC II Clinical Data Methods Results
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 20
Motivation MIMIC II Clinical Data Methods Results
◮ Select a random subsample of training data, ˜
◮ Search for a decision stump h(x) that
◮ Best fit is determined by maximized
◮ gm+1(x) ← gm(x) + λh(x)
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 21 Generalized Boosted Regression Models, Greg Ridgeway R package version 2.0-8 – http://cran.R-project.org/package=gbm
Motivation MIMIC II Clinical Data Methods Results
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 21
Motivation MIMIC II Clinical Data Methods Results
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 22
Motivation MIMIC II Clinical Data Methods Results
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 23
Motivation MIMIC II Clinical Data Methods Results
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 24
Summary
◮ Extend our model to provide and update
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 25
Summary
Boosted Tree Ensembles for Predicting Postsurgical ICU Mortality 26