. 2017 Jan 10;2(1):56-70.

doi: 10.1016/j.idm.2016.12.004. eCollection 2017 Feb.

Model selection for seasonal influenza forecasting

Alexander E Zarebski¹, Peter Dawson², James M McCaw^{1

3

4}, Robert Moss³

Affiliations

¹ School of Mathematics and Statistics, The University of Melbourne, Melbourne, Australia.
² Land Personnel Protection Branch, Land Division, Defence Science and Technology Organisation, Melbourne, Australia.
³ Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, Australia.
⁴ Modelling & Simulation, Murdoch Childrens Research Institute, Royal Childrens Hospital, Melbourne, Australia.

PMID: 29928729
PMCID: PMC5963331
DOI: 10.1016/j.idm.2016.12.004

Model selection for seasonal influenza forecasting

Alexander E Zarebski et al. Infect Dis Model. 2017.

. 2017 Jan 10;2(1):56-70.

doi: 10.1016/j.idm.2016.12.004. eCollection 2017 Feb.

Authors

Alexander E Zarebski¹, Peter Dawson², James M McCaw^{1

3

4}, Robert Moss³

Affiliations

¹ School of Mathematics and Statistics, The University of Melbourne, Melbourne, Australia.
² Land Personnel Protection Branch, Land Division, Defence Science and Technology Organisation, Melbourne, Australia.
³ Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, Australia.
⁴ Modelling & Simulation, Murdoch Childrens Research Institute, Royal Childrens Hospital, Melbourne, Australia.

PMID: 29928729
PMCID: PMC5963331
DOI: 10.1016/j.idm.2016.12.004

Abstract

Epidemics of seasonal influenza inflict a huge burden in temperate climes such as Melbourne (Australia) where there is also significant variability in their timing and magnitude. Particle filters combined with mechanistic transmission models for the spread of influenza have emerged as a popular method for forecasting the progression of these epidemics. Despite extensive research it is still unclear what the optimal models are for forecasting influenza, and how one even measures forecast performance. In this paper, we present a likelihood-based method, akin to Bayes factors, for model selection when the aim is to select for predictive skill. Here, "predictive skill" is measured by the probability of the data after the forecasting date, conditional on the data from before the forecasting date. Using this method we choose an optimal model of influenza transmission to forecast the number of laboratory-confirmed cases of influenza in Melbourne in each of the 2010-15 epidemics. The basic transmission model considered has the susceptible-exposed-infectious-recovered structure with extensions allowing for the effects of absolute humidity and inhomogeneous mixing in the population. While neither of the extensions provides a significant improvement in fit to the data they do differ in terms of their predictive skill. Both measurements of absolute humidity and a sinusoidal approximation of those measurements are observed to increase the predictive skill of the forecasts, while allowing for inhomogeneous mixing reduces the skill. We discuss how our work could be integrated into a forecasting system and how the model selection method could be used to evaluate forecasts when comparing to multiple surveillance systems providing disparate views of influenza activity.

PubMed Disclaimer

Figures

**Fig. 1**
(Top) Time series of the number of laboratory confirmed cases of influenza in Melbourne for 2010–15 aggregated by week. (Bottom) Scaled time series of the measurements of absolute humidity in Melbourne for 2010–15 in grey, with cubic spline smoothing in green and a sinusoidal approximation in blue. The minimum and maximum values over the whole six years were set to $- 1$ and $1$ respectively.

**Fig. 2**
Graphical representation of the hidden Markov model in which the hidden state, $X_{t}$ , represents the state of the SEIR transmission model at time t and the observations, $Y_{t}$ , the number of notifications over the week prior. The absolute humidity signal, ${AH}_{t}$ , is assumed to be a deterministic function of time. The arrows indicate that: the evolution of the hidden state is dependent on its current state and the AH signal, and the observations are dependent on the current state of the hidden state and its state at the previous measurement.

**Fig. 3**
Simulation periods for 2015. The first portion of the data (circles) is used to estimate the background notification rate via the exponentially weighted moving average (solid line). The second portion of the data is the target of the filtering and forecasting. The solid circles indicate the dates at which a forecast was generated.

**Fig. 4**
The $50 %$ and $95 %$ credible interval for the observations under the filtering distribution for the null and sinusoidally forced models for the 2015 notification data. These running summaries of the observation distribution demonstrate both the null and sinusoidally forced models have near identical ability to assimilate new data, i.e., they have equal now-casting capabilities.

**Fig. 5**
The logarithm of the aggregate Bayes factor (across all the epidemics 2010–15) for each of the alternative transmission models. The solid horizontal line indicates parity with the null model, anything above this line is an improvement in model fit over the null, and below the fit is weaker. The dashed horizontal lines indicate the significance threshold.

**Fig. 6**
Comparison of the forecasts from the null and sinusoidally forced transmission models using increasing amounts of data from the 2015 epidemic. The solid points represent “observed” data used to fit the model and the hollow points represent the “future” data, the target of the forecast. The logarithms of the Bayes factors reported describe the improvement in forecast performance by the sinusoidally forced model over the null for each of the forecasts generated.

**Fig. 7**
The logarithm of the aggregate forecast Bayes factor (across all the epidemics 2010–15) for each of the alternative transmission models. The solid horizontal line indicates parity with the null model, anything above this line is an improvement in predictive skill. The dashed horizontal lines indicate the significance threshold.

**Fig. 8**
Forecast error plotted against the size of the observation being forecast. Each point represents the error in attempts to forecast a single observation (averaged over the forecasts made at different points in the season). A point at $(x, y)$ indicates that when forecasting an observation of x cases the average error in the prediction was y, so negative and positive values of y indicate underestimation and overestimation respectively. The colour of each point indicates which model was used to generate the forecast. The solid lines represents a LOESS smoothing of the data.

See this image and copyright information in PMC

Cited by

A dynamic pandemic model evaluating reopening strategies amid COVID-19.
Zhong L. Zhong L. PLoS One. 2021 Mar 26;16(3):e0248302. doi: 10.1371/journal.pone.0248302. eCollection 2021. PLoS One. 2021. PMID: 33770097 Free PMC article.
Machine learning forecasts for seasonal epidemic peaks: Lessons learnt from an atypical respiratory syncytial virus season.
Morbey RA, Todkill D, Watson C, Elliot AJ. Morbey RA, et al. PLoS One. 2023 Sep 22;18(9):e0291932. doi: 10.1371/journal.pone.0291932. eCollection 2023. PLoS One. 2023. PMID: 37738241 Free PMC article.
Forecasting national and regional influenza-like illness for the USA.
Ben-Nun M, Riley P, Turtle J, Bacon DP, Riley S. Ben-Nun M, et al. PLoS Comput Biol. 2019 May 23;15(5):e1007013. doi: 10.1371/journal.pcbi.1007013. eCollection 2019 May. PLoS Comput Biol. 2019. PMID: 31120881 Free PMC article.
Forecasting hospital demand in metropolitan areas during the current COVID-19 pandemic and estimates of lockdown-induced 2nd waves.
Capistran MA, Capella A, Christen JA. Capistran MA, et al. PLoS One. 2021 Jan 22;16(1):e0245669. doi: 10.1371/journal.pone.0245669. eCollection 2021. PLoS One. 2021. PMID: 33481925 Free PMC article.
Accounting for Healthcare-Seeking Behaviours and Testing Practices in Real-Time Influenza Forecasts.
Moss R, Zarebski AE, Carlson SJ, McCaw JM. Moss R, et al. Trop Med Infect Dis. 2019 Jan 11;4(1):12. doi: 10.3390/tropicalmed4010012. Trop Med Infect Dis. 2019. PMID: 30641917 Free PMC article.

See all "Cited by" articles

References

1. Allen Edward J., Allen Linda J.S., Arciniega Armando, Greenwood Priscilla E. Construction of equivalent stochastic differential equation models. Stochastic Analysis and Applications. 2008;26(2):274–297.
1. Allen Linda J.S., Brauer Fred, van den Driessche Pauline, Wu Jianhong. Springer; 2008. Mathematical epidemiology.
1. Anderson Roy M., Robert May M. Vol. 28. Wiley Online Library; 1992. (Infectious diseases of Humans: Dynamics and control).
1. Beauchemin Catherine A.A., Handel Andreas. A review of mathematical models of influenza A infections within a host or cell culture: Lessons learned and challenges ahead. BMC Public Health. 2011;11(1):1. - PMC - PubMed
1. Bock Axelsen Jacob, Yaari Rami, Grenfell Bryan T., Stone Lewi. Multiannual forecasting of seasonal influenza dynamics reveals climatic and evolutionary drivers. Proceedings of the National Academy of Sciences. 2014;111(26):9538–9542. - PMC - PubMed

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Model selection for seasonal influenza forecasting

Affiliations

Model selection for seasonal influenza forecasting

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

LinkOut - more resources

Full Text Sources

Other Literature Sources

Abstract

Figures

Similar articles

Cited by

References

Related information

LinkOut - more resources

Full Text Sources

Other Literature Sources