Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Aug 28;11(8):e1004382.
doi: 10.1371/journal.pcbi.1004382. eCollection 2015 Aug.

Flexible Modeling of Epidemics with an Empirical Bayes Framework

Affiliations

Flexible Modeling of Epidemics with an Empirical Bayes Framework

Logan C Brooks et al. PLoS Comput Biol. .

Abstract

Seasonal influenza epidemics cause consistent, considerable, widespread loss annually in terms of economic burden, morbidity, and mortality. With access to accurate and reliable forecasts of a current or upcoming influenza epidemic's behavior, policy makers can design and implement more effective countermeasures. This past year, the Centers for Disease Control and Prevention hosted the "Predict the Influenza Season Challenge", with the task of predicting key epidemiological measures for the 2013-2014 U.S. influenza season with the help of digital surveillance data. We developed a framework for in-season forecasts of epidemics using a semiparametric Empirical Bayes framework, and applied it to predict the weekly percentage of outpatient doctors visits for influenza-like illness, and the season onset, duration, peak time, and peak height, with and without using Google Flu Trends data. Previous work on epidemic modeling has focused on developing mechanistic models of disease behavior and applying time series tools to explain historical data. However, tailoring these models to certain types of surveillance data can be challenging, and overly complex models with many parameters can compromise forecasting ability. Our approach instead produces possibilities for the epidemic curve of the season of interest using modified versions of data from previous seasons, allowing for reasonable variations in the timing, pace, and intensity of the seasonal epidemics, as well as noise in observations. Since the framework does not make strict domain-specific assumptions, it can easily be applied to some other diseases with seasonal epidemics. This method produces a complete posterior distribution over epidemic curves, rather than, for example, solely point predictions of forecasting targets. We report prospective influenza-like-illness forecasts made for the 2013-2014 U.S. influenza season, and compare the framework's cross-validated prediction error on historical data to that of a variety of simpler baseline predictors.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Examples of possible peak week, peak height, and pacing transformations, and different noise levels.
Thick black, original curve; red, possible peak week transformations; thick red, a random peak week transformation; purple, possible peak height transformations; thick purple, a random peak height transformation; blue, possible pacing transformations; thick blue, a random pacing transformation; dotted green, 5th and 95th (pointwise) percentiles of noise distribution for possible noise levels; dashed green, percentiles for a random noise level; thick green, one possible trajectory for the selected transformations and noise level. (A) Peak week transformations. Peak weeks of historical smoothed curves occurred between weeks 51 and week 10 of the next year, so we limit transformations to give peak weeks roughly within this range. (B) Peak height transformations. Peak heights of historical smoothed curves were between 2% and 8%, so we limit transformations to give peak heights roughly within this range. (C) Pacing transformations. We stretch the curve by a factor between 75% and 125% about the peak week. (D) Noise levels. We randomly select one of 15 noise levels from the fitting procedure and add this level of Gaussian noise to the transformed curve.
Fig 2
Fig 2. 2013–2014 national forecast, retrospectively, using the final revisions of wILI values, using revised wILI data through epidemiological weeks (A) 47, (B) 51, (C) 1, and (D) 7.
Fig 3
Fig 3. Point predictions and observed values of the forecasting targets for the 2013–2014 season.
Black, observed target value; blue, our current framework’s predictions using revised ILINet wILI data; red, our submitted point predictions using ILINet data only; green, our submitted point predictions that used both ILINet and GFT data. Historical target value ranges exclude the 2009–2010 and 2013–2014 seasons.
Fig 4
Fig 4. Cross-validated mean absolute error estimates and standard error bars for point predictions for (A) onset, (B) peak week, (C) peak height, and (D) duration.
(The onset and duration were defined based on the 2% national threshold set by CDC for the 2013–2014 season.)

References

    1. Molinari NAM, Ortega-Sanchez IR, Messonnier ML, Thompson WW, Wortley PM, Weintraub E, et al. The annual impact of seasonal influenza in the US: measuring disease burden and costs. Vaccine. 2007;25(27):5086–5096. - PubMed
    1. Laporte RE. How to improve monitoring and forecasting of disease patterns. BMJ: British Medical Journal. 1993;307(6919):1573–1574. Available from: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1697783/pdf/bmj00052-0002.pdf. 10.1136/bmj.307.6919.1573 - DOI - PMC - PubMed
    1. Centers for Disease Control and Prevention. Announcement of Requirements and Registration for the Predict the Influenza Season Challenge. Federal Register. 2013. November;78(227):70303–70305. Available from: https://www.federalregister.gov/articles/2013/11/25/2013-28198/announcem....
    1. Hethcote HW. The mathematics of infectious diseases. SIAM review. 2000;42(4):599–653. 10.1137/S0036144500371907 - DOI
    1. Laneri K, Bhadra A, Ionides EL, Bouma M, Dhiman RC, Yadav RS, et al. Forcing versus feedback: epidemic malaria and monsoon rains in northwest India. PLoS computational biology. 2010;6(9):e1000898 10.1371/journal.pcbi.1000898 - DOI - PMC - PubMed

Publication types