

SLIDE 1

Evaluating forecasts of infectious disease spread

Sebastian Meyer Institute of Medical Informatics, Biometry, and Epidemiology Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany 21 March 2019 Based on joint work with Leonhard Held (University of Zurich):

Held and Meyer (2019). Forecasting Based on Surveillance Data. In: Handbook of Infectious Disease Data Analysis. Chapman & Hall/CRC. arXiv:1809.03735

SLIDE 2

Epidemics are hard to predict

World Health Organization (2014)

Forecasting disease outbreaks is still in its infancy, however, unlike weather forecasting, where substantial progress has been made in recent years.

Sebastian Meyer | IMBE | Evaluating forecasts of infectious disease spread 21 March 2019 1

SLIDE 3


Meanwhile . . .

  • Epidemic Prediction Initiative (Centers for Disease Control and Prevention, 2016): online platform collecting real-time forecasts by various research groups
  • Adoption of forecast assessment techniques from weather forecasting (Held, Meyer, & Bracher, 2017)
  • Integration of social contact patterns (Meyer & Held, 2017), human mobility data (Pei, Kandula, Yang, & Shaman, 2018), and internet data (Osthus, Daughton, & Priedhorsky, 2019)


SLIDE 4

CDC FluSight challenge (https://predict.cdc.gov/)

Multiple forecasting targets for influenza-like illness (ILI):

  • short-term doctor visits: 1 to 4 weeks ahead
  • seasonal targets: onset week, peak week, peak incidence


SLIDE 5

“Forecasts should be probabilistic” (Gneiting & Katzfuss, 2014)


[Figure: example of one-week-ahead forecasts of infectious disease counts, no. infected by week, with the point forecast shown as the mean of the predictive distribution]

SLIDE 8

Case study: Weekly ILI counts in Switzerland, 2000–2016

[Figure: weekly ILI counts over time, 2001–2017, on a log scale from 10 to 100 000]

  1. Rolling one-week-ahead forecasts in the test period (from December 2012)
  2. Seasonal forecasts of the epidemic curve (30 weeks ahead from December)

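The rolling scheme can be sketched in Python (illustrative only; the talk's actual implementations use the R packages listed later). The forecast origin advances one week at a time and the model is refit on all data observed so far; here the mean of the last four training weeks stands in for a real model:

```python
import statistics

def rolling_one_week_ahead(counts, test_start):
    """Rolling one-week-ahead forecasts: for each week t in the test
    period, train on counts[:t], predict counts[t], then advance the
    origin by one week.  The 'model' (mean of the last four weeks) is
    a placeholder for a real forecasting model."""
    results = []
    for t in range(test_start, len(counts)):
        train = counts[:t]                    # all data observed so far
        point = statistics.mean(train[-4:])   # toy point forecast
        results.append((point, counts[t]))    # (forecast, observation)
    return results
```

The same loop yields the 213 one-week-ahead forecasts evaluated below when run over the test period from December 2012.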

SLIDE 11

Evaluating forecasts

  • Goal: compare predictive performance of different models
    1. We evaluate point forecasts by RMSE or MAE, not correlation between point predictions and observations
    2. We assess the whole distribution of probabilistic forecasts
  • Paradigm: maximize sharpness subject to calibration
    • Calibration: statistical consistency of forecast F and observation y
    • Sharpness: width of prediction intervals
  • Assessment techniques:
    • Histogram of PIT = F(y) values to informally check calibration
    • Proper scoring rules S(F, y) as summary measures of predictive performance, addressing both calibration and sharpness

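The first point can be made concrete with a toy example (hypothetical numbers): a forecast that always doubles the truth is perfectly correlated with the observations, yet RMSE and MAE expose its large errors.

```python
import math

def rmse(pred, obs):
    # root mean squared error over a test set of point forecasts
    return math.sqrt(sum((p - o) ** 2 for p, o in zip(pred, obs)) / len(obs))

def mae(pred, obs):
    # mean absolute error over a test set of point forecasts
    return sum(abs(p - o) for p, o in zip(pred, obs)) / len(obs)

obs = [100, 200, 300, 400]
biased = [2 * y for y in obs]  # Pearson correlation with obs is exactly 1,
                               # yet MAE = 250 and RMSE is about 274
```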

SLIDE 12

Proper scoring rules

  • Scoring rule S(F, y) quantifies the discrepancy between forecast and observation → something we would like to minimize
  • Propriety: forecasting with the true distribution is optimal
  • Simple example: squared error score SES(F, y) = (y − µF)²
  • Compute average score over a test set of forecasts, e.g., (R)MSE
  • We will use the following scoring rules:
    • Logarithmic score: LS(F, y) = −log f(y)
    • Dawid–Sebastiani score: DSS(F, y) = log(σF²) + (y − µF)² / σF²
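Both scores are one-liners once the predictive mean µF and variance σF² are available. A minimal sketch in Python, assuming a normal predictive density for the log score (an illustrative choice; the case study applies these scores to count distributions such as the negative binomial):

```python
import math

def dss(mu, sigma2, y):
    # Dawid-Sebastiani score: needs only the first two moments of F
    return math.log(sigma2) + (y - mu) ** 2 / sigma2

def log_score_normal(mu, sigma2, y):
    # LS(F, y) = -log f(y), here with f a normal density
    return 0.5 * math.log(2 * math.pi * sigma2) + (y - mu) ** 2 / (2 * sigma2)
```

Propriety means that, averaged over many forecasts, the data-generating distribution achieves the lowest expected score, so models can be ranked by their mean DSS or LS over a test set.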

SLIDE 14

“Naive” forecast stratified by calendar week

[Figure: log-normal forecast distribution estimated from the same calendar week in past years, shown as a 1%/25%/50%/75%/99% quantile fan over the observed ILI counts, 2013–2016; scores: DSS (mean: 14.90), LS (mean: 8.06)]

  • Wide prediction intervals, RMSE = 5010 cases
  • Well calibrated? The PIT histogram summarizes where the observations fall within the forecast fan

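The naive reference forecast can be sketched as follows (illustrative Python; `past_counts`, a mapping from calendar week to the counts observed in that week in previous years, is a hypothetical stand-in for the Swiss ILI data):

```python
import math
import statistics

def naive_weekly_forecast(past_counts, week):
    """Fit a log-normal distribution to the counts seen in the same
    calendar week of past years; return its median and mean."""
    logs = [math.log(c) for c in past_counts[week]]
    mu = statistics.mean(logs)
    sigma = statistics.stdev(logs)
    median = math.exp(mu)                  # 50% quantile of the fan
    mean = math.exp(mu + sigma ** 2 / 2)   # log-normal mean
    return median, mean
```

Quantiles of the fitted log-normal give the 1%/25%/75%/99% fan shown in the figure; the few past observations per calendar week explain the wide intervals.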

SLIDE 17

PIT histogram of the 213 one-week-ahead forecasts

[Figure: PIT histogram with the uniform density 1 as reference; annotations mark the signatures of overestimation, underdispersed forecasts, and overdispersed forecasts]

  • Counts tend to be lower than predicted (mild overestimation)
  • No clear-cut evidence of miscalibration

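The PIT check itself is simple: compute u = F(y) for each forecast and bin the values. A sketch in Python (for count data, the setting here, a non-randomized PIT adjustment is typically used; the binning step shown below is the same either way):

```python
def pit_density(pit_values, bins=10):
    """Bin PIT values u = F(y) into equal-width cells and rescale to a
    density: calibrated forecasts give values near 1 in every cell,
    a U-shape signals underdispersion, a hump overdispersion."""
    counts = [0] * bins
    for u in pit_values:
        counts[min(int(u * bins), bins - 1)] += 1  # u == 1.0 -> last bin
    n = len(pit_values)
    return [c * bins / n for c in counts]
```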

SLIDE 18

Useful statistical models to forecast epidemic spread

  • Can we do better with more sophisticated time series models?
  • Scope: well-documented open-source R implementations
  • We compare four different models:
    • forecast::auto.arima() for log-counts → ARMA(2,2)
    • glarma::glarma() → NegBin-ARMA(4,4)
    • surveillance::hhh4(): “endemic-epidemic” NegBin model
    • prophet::prophet() for log-counts: linear regression model
  • All models account for yearly seasonality and a Christmas effect


SLIDE 19

Performance of rolling one-week-ahead forecasts

Average scores and runtime based on 213 one-week-ahead forecasts:

  Method   RMSE    DSS    LS   runtime [s]
  arima    2287  13.78  7.73          0.51
  glarma   2450  13.59  7.71          1.49
  hhh4     1769  13.58  7.71          0.02
  prophet  5614  15.00  8.03          3.01
  naive    5010  14.90  8.06          0.00

  • All methods are reasonably fast
  • The two NegBin models score best
  • prophet does not outperform the naive approach


SLIDE 20

hhh4-based one-week-ahead forecasts

[Figure: surveillance::hhh4() one-week-ahead forecasts as a 1%/25%/50%/75%/99% quantile fan over the observed ILI counts, 2013–2016; scores: DSS (mean: 13.58), LS (mean: 7.71)]

  • Sharper than the naive forecast (a drawback in the wiggly off-season of 2016)
  • Seasonal autoregressive effect adapts to yearly peaks


SLIDE 21

PIT histogram for hhh4-based one-week-ahead forecasts

[Figure: PIT histogram for the hhh4-based one-week-ahead forecasts, with the uniform density 1 as reference]

  • Calibration similar to naive forecasts
  • Off-season counts in lower tail of forecast distribution


SLIDE 22

Performance of seasonal forecasts of the epidemic curve

Average scores and runtime based on four 30-weeks-ahead forecasts:

  Method   RMSE    DSS    LS   runtime [s]
  arima    8471  16.43  8.88          0.48
  glarma   5558  19.61  9.12          4.13
  hhh4     8749  16.13  9.25          0.46
  prophet  7627  16.44  8.91          0.92
  naive    6527  15.99  8.86          0.00

  • None of the sophisticated models outperforms the naive approach
  • DSS and LS rank the models differently
  • Large DSS for glarma is due to large uncertainty
  • Forecasting the epidemic curve right from the season start is truly ambitious


SLIDE 24

Discussion

  • Key requirements to forecast infectious disease incidence:
    1. Routine public health surveillance data (notifiable diseases)
    2. Forecasting targets and evaluation methods
    3. Useful statistical models to forecast epidemic spread
  • The case study exemplified the necessary steps
  • Data and reproduction code: https://HIDDA.github.io/forecasting/
  • Of course, different rankings might result with other time series
  • Combination of different forecasting methods (ensemble forecasts)
  • Extension of models and evaluation techniques for multivariate forecasts by region or age group → incorporate travel and social contact patterns
  • Incorporation of reporting delays and underreporting


SLIDE 25

References

Centers for Disease Control and Prevention. (2016). Flu activity forecasting website launched.

Gneiting, T., & Katzfuss, M. (2014). Probabilistic forecasting. Annual Review of Statistics and Its Application, 1(1), 125–151. https://doi.org/10.1146/annurev-statistics-062713-085831

Held, L., & Meyer, S. (2019). Forecasting based on surveillance data. In L. Held, N. Hens, P. D. O’Neill, & J. Wallinga (Eds.), Handbook of infectious disease data analysis. Chapman & Hall/CRC.

Held, L., Meyer, S., & Bracher, J. (2017). Probabilistic forecasting in infectious disease epidemiology: The 13th Armitage lecture. Statistics in Medicine, 36(22), 3443–3460. https://doi.org/10.1002/sim.7363

Meyer, S., & Held, L. (2017). Incorporating social contact data in spatio-temporal models for infectious disease spread. Biostatistics, 18(2), 338–351. https://doi.org/10.1093/biostatistics/kxw051

Osthus, D., Daughton, A. R., & Priedhorsky, R. (2019). Even a good influenza forecasting model can benefit from internet-based nowcasts, but those benefits are limited. PLOS Computational Biology, 15(2), 1–19. https://doi.org/10.1371/journal.pcbi.1006599

Pei, S., Kandula, S., Yang, W., & Shaman, J. (2018). Forecasting the spatial transmission of influenza in the United States. Proceedings of the National Academy of Sciences of the United States of America, 115(11), 2752–2757. https://doi.org/10.1073/pnas.1708856115

World Health Organization. (2014). Anticipating epidemics. Weekly Epidemiological Record, 89(22), 244. Retrieved from http://www.who.int/wer
