PLoS Comput Biol. 2023 Aug 28;19(8):e1011392.

doi: 10.1371/journal.pcbi.1011392. eCollection 2023 Aug.

Neural network models for influenza forecasting with associated uncertainty using Web search activity trends

Michael Morris¹, Peter Hayes¹, Ingemar J Cox¹,², Vasileios Lampos¹

Affiliations

¹ University College London, Centre for Artificial Intelligence, Department of Computer Science, London, United Kingdom.
² University of Copenhagen, Department of Computer Science, Copenhagen, Denmark.

PMID: 37639427
PMCID: PMC10491400
DOI: 10.1371/journal.pcbi.1011392

Michael Morris et al. PLoS Comput Biol. 2023.

Abstract

Influenza affects millions of people every year. It causes a considerable number of medical visits and hospitalisations as well as hundreds of thousands of deaths. Accurate forecasts of influenza prevalence can significantly help public health agencies react in a timely manner to seasonal or novel strain epidemics. Although significant progress has been made, influenza forecasting remains a challenging modelling task. In this paper, we propose a methodological framework that improves over the state-of-the-art forecasting accuracy of influenza-like illness (ILI) rates in the United States. We achieve this by using Web search activity time series in conjunction with historical ILI rates as observations for training neural network (NN) architectures. The proposed models incorporate Bayesian layers to produce uncertainty intervals for their forecast estimates, positioning themselves as legitimate complementary solutions to more conventional approaches. The best performing NN, referred to as the iterative recurrent neural network (IRNN) architecture, reduces mean absolute error by 10.3% and improves skill by 17.1% on average in nowcasting and forecasting tasks across 4 consecutive flu seasons.

Copyright: © 2023 Morris et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

**Fig 1. Negative log-likelihood (NLL) and mean absolute error (MAE) for each NN model averaged over all four test flu seasons (2015/16 to 2018/19).**
Scores for different forecast horizons (γ) are shown. Lower values are better. We also provide a comparison with IRNN trained without using any Web search activity data (IRNN₀), and a simple persistence model (PER). Note that NLL cannot be determined for PER as it does not provide an associated uncertainty. S1 Fig shows the results for all metrics.
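
The two scores reported in Fig 1 can be illustrated with a minimal Python sketch. It assumes each forecast is a Gaussian with mean `mu` and standard deviation `sigma`, which is an assumption made here for illustration only, since this page does not state the form of the predictive distribution. The function names and toy numbers are hypothetical, and the persistence baseline is taken to repeat the last available ILI rate, a common definition that the caption does not spell out.

```python
import math

def mae(y_true, y_pred):
    """Mean absolute error between observed and predicted ILI rates."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def gaussian_nll(y_true, mu, sigma):
    """Average negative log-likelihood under N(mu_i, sigma_i^2) (assumed form)."""
    total = 0.0
    for t, m, s in zip(y_true, mu, sigma):
        total += 0.5 * math.log(2 * math.pi * s * s) + (t - m) ** 2 / (2 * s * s)
    return total / len(y_true)

def persistence_forecast(history, n_targets):
    """Repeat the last observed value for every target (assumed PER definition)."""
    return [history[-1]] * n_targets

# Toy numbers, not data from the paper.
observed = [0.020, 0.025, 0.030]
mu = [0.021, 0.024, 0.033]
sigma = [0.002, 0.002, 0.004]

model_mae = mae(observed, mu)
per_mae = mae(observed, persistence_forecast([0.015, 0.018], len(observed)))
model_nll = gaussian_nll(observed, mu, sigma)
```

The persistence forecast has no `sigma`, so only `mae` can be applied to it, which matches the caption's remark that NLL cannot be determined for PER.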

**Fig 2. IRNN forecasts for all 4 test seasons (2015/16 to 2018/19) and forecasting horizons (γ = 7, 14, 21, and 28).**
Confidence intervals (uncertainty estimates) are shown at 50% and 90% levels, and are visually distinguished by darker and lighter colour overlays respectively. The influenza-like illness (ILI) rate (ground truth) is shown by the black line.

**Fig 3. Calibration plots for the forecasts made by the three NN models (FF, SRNN, and IRNN) averaged over the four test periods (2015/16 to 2018/19) and shown for the 4 forecasting horizons (γ).**
The lines show how frequently the ground truth falls within a confidence interval (CI) of the same level. To be more precise, a point (x, y) denotes that the proportion y ∈ [0, 1] of the forecasts when combined with a CI at the x × 100% level include the ground truth (successful forecasts). The optimal calibration is shown by the diagonal black line. Points above or below the diagonal indicate an over- or under-estimation of uncertainty, and hence an under- or over-confident model, respectively. The shadows show the upper and lower quartile of the calibration curves when the models are trained multiple times with different initialisation seeds. The plot broken out into separate test periods is shown in the Supporting Information (S11 Fig).
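
The calibration points described above can be computed as in the following sketch. It assumes Gaussian forecasts and central (symmetric) confidence intervals; both are illustrative assumptions, and the function names and toy values are hypothetical rather than taken from the paper.

```python
from statistics import NormalDist

def central_interval(mu, sigma, level):
    """Central CI of a Gaussian forecast at the given level (0 < level < 1)."""
    z = NormalDist().inv_cdf(0.5 + level / 2.0)
    return mu - z * sigma, mu + z * sigma

def calibration_curve(y_true, mu, sigma, levels):
    """For each CI level x, return (x, y) where y is the fraction of
    forecasts whose CI at that level contains the ground truth."""
    curve = []
    for level in levels:
        hits = 0
        for t, m, s in zip(y_true, mu, sigma):
            lo, hi = central_interval(m, s, level)
            hits += lo <= t <= hi
        curve.append((level, hits / len(y_true)))
    return curve

# Toy values, not data from the paper.
observed = [0.020, 0.025, 0.030, 0.028]
mu = [0.021, 0.024, 0.033, 0.028]
sigma = [0.002, 0.002, 0.002, 0.002]
curve = calibration_curve(observed, mu, sigma, levels=[0.5, 0.9])
```

A point with y greater than x lies above the diagonal, meaning the intervals cover the truth more often than their nominal level and the model is under-confident; a point with y less than x indicates the opposite.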

**Fig 4. Diagram of the IRNN architecture where for the recurrent layers (RNN) we have used a Gated Recurrent Unit.**
An ILI rate, F ∈ [0, 1], and m search query frequencies, $Q \in \mathbb{R}_{\geq 0}^{m}$, beginning from time point (day) t₀ − τ are fed into the network a day at a time. τ denotes the window size of past observations that we consider (τ + 1 = 56 days). The reporting delay of the ILI rates means that when ILI rates are available up to day t₀, search query frequencies are available up to day t₀ + δ, where δ = 14 days in our experiments. Dashed arrow lines denote that the model is called for multiple time steps (where a time step is a day). For days t₀ − τ to t₀, IRNN enters a warm-up phase where it sets the hidden states in the RNN layer without making any predictions. For days t₀ to t₀ + δ, we can observe search query frequencies, but we cannot observe ILI rates. At this stage, IRNN performs nowcasting with respect to input Q. During nowcasting the estimated ILI rate ${\hat{F}}_{t}$ is combined with the true search frequencies Q_t and used as the input for the next time step. The query search frequency estimates which are not used (as they are known to us) are shown by a faded box. For days t₀ + δ + 1 to t₀ + γ, where γ denotes the forecasting horizon, IRNN conducts pure forecasting as neither search query frequencies nor ILI rates are known for that period. Forecasted values for both of them are used as inputs for subsequent time steps. The full sequence of both predicted ILI rates and search query frequencies is used in the training loss.
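
The three input phases in the caption (warm-up, nowcasting, forecasting) can be sketched as a plain Python loop. This is only an illustration of the input schedule under stated assumptions, not the authors' implementation: `step` is a hypothetical stand-in for the GRU-based network, each call is assumed to consume one day and emit estimates for the next day, day t₀ is treated as the last warm-up day, and the Bayesian layers and training loss are omitted.

```python
def rollout(step, ili_obs, query_obs, delta, gamma):
    """Iterate a one-day model over warm-up, nowcasting and forecasting days.

    step(hidden, ili_in, query_in) -> (hidden, ili_hat, query_hat) is a
    hypothetical one-step model.
    ili_obs   : true ILI rates for days t0 - tau .. t0 (length tau + 1)
    query_obs : true query vectors for days t0 - tau .. t0 + delta
    Returns estimated ILI rates for days t0 + 1 .. t0 + gamma.
    """
    n_warm = len(ili_obs)                  # tau + 1 days with both signals
    assert len(query_obs) == n_warm + delta
    hidden, ili_hat, query_hat = None, None, None
    predictions = []
    for i in range(n_warm + gamma - 1):
        if i < n_warm:                     # warm-up: true ILI and true queries
            ili_in, query_in = ili_obs[i], query_obs[i]
        elif i < n_warm + delta:           # nowcasting: estimated ILI, true queries
            ili_in, query_in = ili_hat, query_obs[i]
        else:                              # forecasting: both inputs are estimates
            ili_in, query_in = ili_hat, query_hat
        hidden, ili_hat, query_hat = step(hidden, ili_in, query_in)
        if i >= n_warm - 1:                # keep outputs from day t0 onwards
            predictions.append(ili_hat)
    return predictions

def toy_step(hidden, ili_in, query_in):
    """Placeholder dynamics chosen only so the schedule is easy to trace."""
    return hidden, ili_in + query_in[0], [0]

# Toy integers (not real rates): tau + 1 = 3, delta = 2, gamma = 4.
preds = rollout(
    toy_step,
    ili_obs=[1, 2, 3],
    query_obs=[[10], [10], [10], [20], [20]],
    delta=2,
    gamma=4,
)
```

With these toy dynamics the first kept estimate uses the true ILI rate for day t₀, the next two reuse the model's own ILI estimate with observed queries, and the last one is driven entirely by the model's own outputs.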
