Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jan 10;2(1):56-70.
doi: 10.1016/j.idm.2016.12.004. eCollection 2017 Feb.

Model selection for seasonal influenza forecasting

Affiliations

Model selection for seasonal influenza forecasting

Alexander E Zarebski et al. Infect Dis Model. .

Abstract

Epidemics of seasonal influenza inflict a huge burden in temperate climes such as Melbourne (Australia) where there is also significant variability in their timing and magnitude. Particle filters combined with mechanistic transmission models for the spread of influenza have emerged as a popular method for forecasting the progression of these epidemics. Despite extensive research it is still unclear what the optimal models are for forecasting influenza, and how one even measures forecast performance. In this paper, we present a likelihood-based method, akin to Bayes factors, for model selection when the aim is to select for predictive skill. Here, "predictive skill" is measured by the probability of the data after the forecasting date, conditional on the data from before the forecasting date. Using this method we choose an optimal model of influenza transmission to forecast the number of laboratory-confirmed cases of influenza in Melbourne in each of the 2010-15 epidemics. The basic transmission model considered has the susceptible-exposed-infectious-recovered structure with extensions allowing for the effects of absolute humidity and inhomogeneous mixing in the population. While neither of the extensions provides a significant improvement in fit to the data they do differ in terms of their predictive skill. Both measurements of absolute humidity and a sinusoidal approximation of those measurements are observed to increase the predictive skill of the forecasts, while allowing for inhomogeneous mixing reduces the skill. We discuss how our work could be integrated into a forecasting system and how the model selection method could be used to evaluate forecasts when comparing to multiple surveillance systems providing disparate views of influenza activity.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
(Top) Time series of the number of laboratory confirmed cases of influenza in Melbourne for 2010–15 aggregated by week. (Bottom) Scaled time series of the measurements of absolute humidity in Melbourne for 2010–15 in grey, with cubic spline smoothing in green and a sinusoidal approximation in blue. The minimum and maximum values over the whole six years were set to 1 and 1 respectively.
Fig. 2
Fig. 2
Graphical representation of the hidden Markov model in which the hidden state, Xt, represents the state of the SEIR transmission model at time t and the observations, Yt, the number of notifications over the week prior. The absolute humidity signal, AHt, is assumed to be a deterministic function of time. The arrows indicate that: the evolution of the hidden state is dependent on its current state and the AH signal, and the observations are dependent on the current state of the hidden state and its state at the previous measurement.
Fig. 3
Fig. 3
Simulation periods for 2015. The first portion of the data (circles) is used to estimate the background notification rate via the exponentially weighted moving average (solid line). The second portion of the data is the target of the filtering and forecasting. The solid circles indicate the dates at which a forecast was generated.
Fig. 4
Fig. 4
The 50% and 95% credible interval for the observations under the filtering distribution for the null and sinusoidally forced models for the 2015 notification data. These running summaries of the observation distribution demonstrate both the null and sinusoidally forced models have near identical ability to assimilate new data, i.e., they have equal now-casting capabilities.
Fig. 5
Fig. 5
The logarithm of the aggregate Bayes factor (across all the epidemics 2010–15) for each of the alternative transmission models. The solid horizontal line indicates parity with the null model, anything above this line is an improvement in model fit over the null, and below the fit is weaker. The dashed horizontal lines indicate the significance threshold.
Fig. 6
Fig. 6
Comparison of the forecasts from the null and sinusoidally forced transmission models using increasing amounts of data from the 2015 epidemic. The solid points represent “observed” data used to fit the model and the hollow points represent the “future” data, the target of the forecast. The logarithms of the Bayes factors reported describe the improvement in forecast performance by the sinusoidally forced model over the null for each of the forecasts generated.
Fig. 7
Fig. 7
The logarithm of the aggregate forecast Bayes factor (across all the epidemics 2010–15) for each of the alternative transmission models. The solid horizontal line indicates parity with the null model, anything above this line is an improvement in predictive skill. The dashed horizontal lines indicate the significance threshold.
Fig. 8
Fig. 8
Forecast error plotted against the size of the observation being forecast. Each point represents the error in attempts to forecast a single observation (averaged over the forecasts made at different points in the season). A point at (x,y) indicates that when forecasting an observation of x cases the average error in the prediction was y, so negative and positive values of y indicate underestimation and overestimation respectively. The colour of each point indicates which model was used to generate the forecast. The solid lines represents a LOESS smoothing of the data.

Similar articles

Cited by

References

    1. Allen Edward J., Allen Linda J.S., Arciniega Armando, Greenwood Priscilla E. Construction of equivalent stochastic differential equation models. Stochastic Analysis and Applications. 2008;26(2):274–297.
    1. Allen Linda J.S., Brauer Fred, van den Driessche Pauline, Wu Jianhong. Springer; 2008. Mathematical epidemiology.
    1. Anderson Roy M., Robert May M. Vol. 28. Wiley Online Library; 1992. (Infectious diseases of Humans: Dynamics and control).
    1. Beauchemin Catherine A.A., Handel Andreas. A review of mathematical models of influenza A infections within a host or cell culture: Lessons learned and challenges ahead. BMC Public Health. 2011;11(1):1. - PMC - PubMed
    1. Bock Axelsen Jacob, Yaari Rami, Grenfell Bryan T., Stone Lewi. Multiannual forecasting of seasonal influenza dynamics reveals climatic and evolutionary drivers. Proceedings of the National Academy of Sciences. 2014;111(26):9538–9542. - PMC - PubMed

LinkOut - more resources