Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Aug 28;19(8):e1011392.
doi: 10.1371/journal.pcbi.1011392. eCollection 2023 Aug.

Neural network models for influenza forecasting with associated uncertainty using Web search activity trends

Affiliations

Neural network models for influenza forecasting with associated uncertainty using Web search activity trends

Michael Morris et al. PLoS Comput Biol. .

Abstract

Influenza affects millions of people every year. It causes a considerable amount of medical visits and hospitalisations as well as hundreds of thousands of deaths. Forecasting influenza prevalence with good accuracy can significantly help public health agencies to timely react to seasonal or novel strain epidemics. Although significant progress has been made, influenza forecasting remains a challenging modelling task. In this paper, we propose a methodological framework that improves over the state-of-the-art forecasting accuracy of influenza-like illness (ILI) rates in the United States. We achieve this by using Web search activity time series in conjunction with historical ILI rates as observations for training neural network (NN) architectures. The proposed models incorporate Bayesian layers to produce associated uncertainty intervals to their forecast estimates, positioning themselves as legitimate complementary solutions to more conventional approaches. The best performing NN, referred to as the iterative recurrent neural network (IRNN) architecture, reduces mean absolute error by 10.3% and improves skill by 17.1% on average in nowcasting and forecasting tasks across 4 consecutive flu seasons.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Negative log-likelihood (NLL) and mean absolute error (MAE) for each NN model averaged over all four test flu seasons (2015/16 to 2018/19).
Scores for different forecast horizons (γ) are shown. Lower values are better. We also provide a comparison with IRNN trained without using any Web search activity data (IRNN0), and a simple persistence model (PER). Note that NLL cannot be determined for PER as it does not provide an associated uncertainty. S1 Fig shows the results for all metrics.
Fig 2
Fig 2. IRNN forecasts for all 4 test seasons (2015/16 to 2018/19) and forecasting horizons (γ = 7, 14, 21, and 28).
Confidence intervals (uncertainty estimates) are shown at 50% and 90% levels, and are visually distinguished by darker and lighter colour overlays respectively. The influenza-like illness (ILI) rate (ground truth) is shown by the black line.
Fig 3
Fig 3. Calibration plots for the forecasts made by the three NN models (FF, SRNN, and IRNN) averaged over the four test periods (2015/16 to 2018/19) and shown for the 4 forecasting horizons (γ).
The lines show how frequently the ground truth falls within a confidence interval (CI) of the same level. To be more precise, a point (x, y) denotes that the proportion y ∈ [0, 1] of the forecasts when combined with a CI at the x × 100% level include the ground truth (successful forecasts). The optimal calibration is shown by the diagonal black line. Points above or below the diagonal indicate an over- or under-estimation of uncertainty, and hence an under- or over-confident model, respectively. The shadows show the upper and lower quartile of the calibration curves when the models are trained multiple times with different initialisation seeds. The plot broken out into separate test periods is shown in the Supporting Information (S11 Fig).
Fig 4
Fig 4. Diagram of the IRNN architecture where for the recurrent layers (RNN) we have used a Gated Recurrent Unit.
An ILI rate, F ∈ [0, 1], and m search query frequencies, QR0m, beginning from time point (day) t0τ are fed into the network a day at a time. τ denotes the window size of past observations that we consider (τ + 1 = 56 days). The reporting delay of the ILI rates means that when an ILI rates are available up to day t0, search query frequencies are available up to day t0 + δ, where δ = 14 days in our experiments. Dashed arrow lines denote that the model is called for multiple time-steps (where a time step is a day). For days t0τ to t0, IRNN enters a warm-up phase where it sets the hidden states in the RNN layer without making any predictions. For days t0 to t0 + δ, we can observe search query frequencies, but we cannot observe ILI rates. At this stage, IRNN performs nowcasting with respect to input Q. During nowcasting the estimated ILI rate F^t is combined with the true search frequencies Qt use as the input for the next time step. The query search frequency estimates which are not used (as they are known to us) are shown by a faded box. For days t0 + δ + 1 to t0 + γ, where γ denotes the forecasting horizon, IRNN conducts pure forecasting as neither search query frequencies nor ILI rates are known for that period. Forecasted values for both of them are used as inputs for subsequent time steps. The full sequence of both predicted ILI rates and search query frequencies is used in the training loss.

References

    1. Ferguson NM, Laydon D, Nedjati-Gilani G, Imai N, Ainslie K, Baguelin M, et al.. Impact of non-pharmaceutical interventions (NPIs) to reduce COVID-19 mortality and healthcare demand. Imperial College COVID-19 Response Team London. 2020;.
    1. Birrell P, Blake J, Van Leeuwen E, Gent N, De Angelis D. Real-time nowcasting and forecasting of COVID-19 dynamics in England: the first wave. Philos Trans R Soc B. 2021;376(1829):20200279. doi: 10.1098/rstb.2020.0279 - DOI - PMC - PubMed
    1. Ioannidis JPA, Cripps S, Tanner MA. Forecasting for COVID-19 has failed. Int J Forecast. 2022;38(2):423–438. doi: 10.1016/j.ijforecast.2020.08.004 - DOI - PMC - PubMed
    1. Shaman J, Karspeck A. Forecasting seasonal outbreaks of influenza. PNAS. 2012;109(50):20425–20430. doi: 10.1073/pnas.1208772109 - DOI - PMC - PubMed
    1. Nsoesie E, Mararthe M, Brownstein J. Forecasting peaks of seasonal influenza epidemics. PLoS Curr. 2013;5. doi: 10.1371/currents.outbreaks.bb1e879a23137022ea79a8c508b030bc - DOI - PMC - PubMed

Publication types