Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr 21:12:e81916.
doi: 10.7554/eLife.81916.

Predictive performance of multi-model ensemble forecasts of COVID-19 across European nations

Katharine Sherratt  1 Hugo Gruson  1 Rok Grah  2 Helen Johnson  2 Rene Niehus  2 Bastian Prasse  2 Frank Sandmann  2 Jannik Deuschel  3 Daniel Wolffram  3 Sam Abbott  1 Alexander Ullrich  4 Graham Gibson  5 Evan L Ray  5 Nicholas G Reich  5 Daniel Sheldon  5 Yijin Wang  5 Nutcha Wattanachit  5 Lijing Wang  6 Jan Trnka  7 Guillaume Obozinski  8 Tao Sun  8 Dorina Thanou  8 Loic Pottier  9 Ekaterina Krymova  10 Jan H Meinke  11 Maria Vittoria Barbarossa  12 Neele Leithauser  13 Jan Mohring  13 Johanna Schneider  13 Jaroslaw Wlazlo  13 Jan Fuhrmann  14 Berit Lange  15 Isti Rodiah  15 Prasith Baccam  16 Heidi Gurung  16 Steven Stage  17 Bradley Suchoski  16 Jozef Budzinski  18 Robert Walraven  19 Inmaculada Villanueva  20 Vit Tucek  21 Martin Smid  22 Milan Zajicek  22 Cesar Perez Alvarez  23 Borja Reina  23 Nikos I Bosse  1 Sophie R Meakin  1 Lauren Castro  24 Geoffrey Fairchild  24 Isaac Michaud  24 Dave Osthus  24 Pierfrancesco Alaimo Di Loro  25 Antonello Maruotti  25 Veronika Eclerova  26 Andrea Kraus  26 David Kraus  26 Lenka Pribylova  26 Bertsimas Dimitris  27 Michael Lingzhi Li  27 Soni Saksham  27 Jonas Dehning  28 Sebastian Mohr  28 Viola Priesemann  28 Grzegorz Redlarski  29 Benjamin Bejar  30 Giovanni Ardenghi  31 Nicola Parolini  31 Giovanni Ziarelli  31 Wolfgang Bock  32 Stefan Heyder  33 Thomas Hotz  33 David E Singh  34 Miguel Guzman-Merino  34 Jose L Aznarte  35 David Morina  36 Sergio Alonso  37 Enric Alvarez  37 Daniel Lopez  37 Clara Prats  37 Jan Pablo Burgard  38 Arne Rodloff  39 Tom Zimmermann  39 Alexander Kuhlmann  40 Janez Zibert  41 Fulvia Pennoni  42 Fabio Divino  43 Marti Catala  44 Gianfranco Lovison  45 Paolo Giudici  46 Barbara Tarantino  46 Francesco Bartolucci  47 Giovanna Jona Lasinio  48 Marco Mingione  48 Alessio Farcomeni  49 Ajitesh Srivastava  50 Pablo Montero-Manso  51 Aniruddha Adiga  52 Benjamin Hurt  52 Bryan Lewis  52 Madhav Marathe  52 Przemyslaw Porebski  52 Srinivasan Venkatramanan  52 Rafal P Bartczuk  53 Filip Dreger  53 Anna Gambin  53 Krzysztof Gogolewski  53 Magdalena Gruziel-Slomka  53 Bartosz Krupa  53 Antoni Moszyński  53 Karol Niedzielewski  53 Jedrzej Nowosielski  53 Maciej Radwan  53 Franciszek Rakowski  53 Marcin Semeniuk  53 Ewa Szczurek  53 Jakub Zielinski  53 Jan Kisielewski  53   54 Barbara Pabjan  55 Kirsten Holger  56 Yuri Kheifetz  56 Markus Scholz  56 Biecek Przemyslaw  57 Marcin Bodych  58 Maciej Filinski  58 Radoslaw Idzikowski  58 Tyll Krueger  58 Tomasz Ozanski  58 Johannes Bracher  3 Sebastian Funk  1
Affiliations

Predictive performance of multi-model ensemble forecasts of COVID-19 across European nations

Katharine Sherratt et al. Elife. .

Abstract

Background: Short-term forecasts of infectious disease burden can contribute to situational awareness and aid capacity planning. Based on best practice in other fields and recent insights in infectious disease epidemiology, one can maximise the predictive performance of such forecasts if multiple models are combined into an ensemble. Here, we report on the performance of ensembles in predicting COVID-19 cases and deaths across Europe between 08 March 2021 and 07 March 2022.

Methods: We used open-source tools to develop a public European COVID-19 Forecast Hub. We invited groups globally to contribute weekly forecasts for COVID-19 cases and deaths reported by a standardised source for 32 countries over the next 1-4 weeks. Teams submitted forecasts from March 2021 using standardised quantiles of the predictive distribution. Each week we created an ensemble forecast, where each predictive quantile was calculated as the equally-weighted average (initially the mean and then from 26th July the median) of all individual models' predictive quantiles. We measured the performance of each model using the relative Weighted Interval Score (WIS), comparing models' forecast accuracy relative to all other models. We retrospectively explored alternative methods for ensemble forecasts, including weighted averages based on models' past predictive performance.

Results: Over 52 weeks, we collected forecasts from 48 unique models. We evaluated 29 models' forecast scores in comparison to the ensemble model. We found a weekly ensemble had a consistently strong performance across countries over time. Across all horizons and locations, the ensemble performed better on relative WIS than 83% of participating models' forecasts of incident cases (with a total N=886 predictions from 23 unique models), and 91% of participating models' forecasts of deaths (N=763 predictions from 20 models). Across a 1-4 week time horizon, ensemble performance declined with longer forecast periods when forecasting cases, but remained stable over 4 weeks for incident death forecasts. In every forecast across 32 countries, the ensemble outperformed most contributing models when forecasting either cases or deaths, frequently outperforming all of its individual component models. Among several choices of ensemble methods we found that the most influential and best choice was to use a median average of models instead of using the mean, regardless of methods of weighting component forecast models.

Conclusions: Our results support the use of combining forecasts from individual models into an ensemble in order to improve predictive performance across epidemiological targets and populations during infectious disease epidemics. Our findings further suggest that median ensemble methods yield better predictive performance more than ones based on means. Our findings also highlight that forecast consumers should place more weight on incident death forecasts than incident case forecasts at forecast horizons greater than 2 weeks.

Funding: AA, BH, BL, LWa, MMa, PP, SV funded by National Institutes of Health (NIH) Grant 1R01GM109718, NSF BIG DATA Grant IIS-1633028, NSF Grant No.: OAC-1916805, NSF Expeditions in Computing Grant CCF-1918656, CCF-1917819, NSF RAPID CNS-2028004, NSF RAPID OAC-2027541, US Centers for Disease Control and Prevention 75D30119C05935, a grant from Google, University of Virginia Strategic Investment Fund award number SIF160, Defense Threat Reduction Agency (DTRA) under Contract No. HDTRA1-19-D-0007, and respectively Virginia Dept of Health Grant VDH-21-501-0141, VDH-21-501-0143, VDH-21-501-0147, VDH-21-501-0145, VDH-21-501-0146, VDH-21-501-0142, VDH-21-501-0148. AF, AMa, GL funded by SMIGE - Modelli statistici inferenziali per governare l'epidemia, FISR 2020-Covid-19 I Fase, FISR2020IP-00156, Codice Progetto: PRJ-0695. AM, BK, FD, FR, JK, JN, JZ, KN, MG, MR, MS, RB funded by Ministry of Science and Higher Education of Poland with grant 28/WFSN/2021 to the University of Warsaw. BRe, CPe, JLAz funded by Ministerio de Sanidad/ISCIII. BT, PG funded by PERISCOPE European H2020 project, contract number 101016233. CP, DL, EA, MC, SA funded by European Commission - Directorate-General for Communications Networks, Content and Technology through the contract LC-01485746, and Ministerio de Ciencia, Innovacion y Universidades and FEDER, with the project PGC2018-095456-B-I00. DE., MGu funded by Spanish Ministry of Health / REACT-UE (FEDER). DO, GF, IMi, LC funded by Laboratory Directed Research and Development program of Los Alamos National Laboratory (LANL) under project number 20200700ER. DS, ELR, GG, NGR, NW, YW funded by National Institutes of General Medical Sciences (R35GM119582; the content is solely the responsibility of the authors and does not necessarily represent the official views of NIGMS or the National Institutes of Health). FB, FP funded by InPresa, Lombardy Region, Italy. HG, KS funded by European Centre for Disease Prevention and Control. IV funded by Agencia de Qualitat i Avaluacio Sanitaries de Catalunya (AQuAS) through contract 2021-021OE. JDe, SMo, VP funded by Netzwerk Universitatsmedizin (NUM) project egePan (01KX2021). JPB, SH, TH funded by Federal Ministry of Education and Research (BMBF; grant 05M18SIA). KH, MSc, YKh funded by Project SaxoCOV, funded by the German Free State of Saxony. Presentation of data, model results and simulations also funded by the NFDI4Health Task Force COVID-19 (https://www.nfdi4health.de/task-force-covid-19-2) within the framework of a DFG-project (LO-342/17-1). LP, VE funded by Mathematical and Statistical modelling project (MUNI/A/1615/2020), Online platform for real-time monitoring, analysis and management of epidemic situations (MUNI/11/02202001/2020); VE also supported by RECETOX research infrastructure (Ministry of Education, Youth and Sports of the Czech Republic: LM2018121), the CETOCOEN EXCELLENCE (CZ.02.1.01/0.0/0.0/17-043/0009632), RECETOX RI project (CZ.02.1.01/0.0/0.0/16-013/0001761). NIB funded by Health Protection Research Unit (grant code NIHR200908). SAb, SF funded by Wellcome Trust (210758/Z/18/Z).

Keywords: COVID-19; Europe; ensemble; epidemiology; forecast; global health; modelling; none; prediction.

PubMed Disclaimer

Conflict of interest statement

KS, HG, RG, HJ, RN, BP, FS, JD, DW, SA, AU, GG, ER, NR, DS, YW, NW, LW, JT, GO, TS, DT, LP, EK, JM, MB, NL, JM, JS, JW, JF, BL, IR, JB, RW, IV, VT, MS, MZ, CP, BR, NB, SM, LC, GF, IM, DO, PA, AM, VE, AK, DK, LP, BD, ML, SS, JD, SM, VP, GR, BB, GA, NP, GZ, WB, SH, TH, DS, MG, JA, DM, SA, EA, DL, CP, JB, AR, TZ, AK, JZ, FP, FD, MC, GL, PG, BT, FB, GJ, MM, AF, AS, PM, AA, BH, BL, MM, PP, SV, RB, FD, AG, KG, MG, BK, AM, KN, JN, MR, FR, MS, ES, JZ, JK, BP, KH, YK, MS, BP, MB, MF, RI, TK, TO, JB, SF No competing interests declared, PB, HG, SS, BS Affiliated with IEM, Inc. The author has no financial interests to declare

Figures

Figure 1.
Figure 1.. Total number of forecasts included in evaluation, by target location, week ahead horizon, and variable.
Figure 2.
Figure 2.. Ensemble forecasts of weekly incident cases in Germany over periods of increasing SARS-CoV-2 variants Delta (B.1.617.2, left) and Omicron (B.1.1.529, right).
Black indicates observed data. Coloured ribbons represent each weekly forecast of 1–4 weeks ahead (showing median, 50%, and 90% probability). For each variant, forecasts are shown over an x-axis bounded by the earliest dates at which 5% and 99% of sequenced cases were identified as the respective variant of concern, while vertical dotted lines indicate the approximate date that the variant reached dominance (>50% sequenced cases).
Figure 3.
Figure 3.. Performance of short-term forecasts aggregated across all individually submitted models and the Hub ensemble, by horizon, forecasting cases (left) and deaths (right).
Performance measured by relative weighted interval score scaled against a baseline (dotted line, 1), and coverage of uncertainty at the 50% and 95% levels. Boxplot, with width proportional to number of observations, show interquartile ranges with outlying scores as faded points. The target range for each set of scores is shaded in yellow.
Figure 4.
Figure 4.. Performance of short-term forecasts across models and median ensemble (asterisk), by country, forecasting cases (top) and deaths (bottom) for 2-week ahead forecasts, according to the relative weighted interval score.
Boxplots show interquartile ranges, with outliers as faded points, and the ensemble model performance is marked by an asterisk. y-axis is cut-off to an upper bound of 4 for readability.

Update of

  • doi: 10.1101/2022.06.16.22276024

Similar articles

Cited by

References

    1. Adib K, Hancock PA, Rahimli A, Mugisa B, Abdulrazeq F, Aguas R, White LJ, Hajjeh R, Al Ariqi L, Nabeth P. A participatory modelling approach for investigating the spread of covid-19 in countries of the eastern Mediterranean region to support public health decision-making. BMJ Global Health. 2021;6:e005207. doi: 10.1136/bmjgh-2021-005207. - DOI - PMC - PubMed
    1. Agosto A, Giudici P. A poisson autoregressive model to understand COVID-19 contagion dynamics. Risks. 2020;8:77. doi: 10.3390/risks8030077. - DOI
    1. Agosto A, Campmas A, Giudici P, Renda A. Monitoring COVID-19 contagion growth. Statistics in Medicine. 2021;40:4150–4160. doi: 10.1002/sim.9020. - DOI - PMC - PubMed
    1. Aguas R, White L, Hupert N, Shretta R, Pan-Ngum W, Celhay O, Moldokmatova A, Arifi F, Mirzazadeh A, Sharifi H, Adib K, Sahak MN, Franco C, Coutinho R, CoMo Consortium Modelling the COVID-19 pandemic in context: an international participatory approach. BMJ Global Health. 2020;5:e003126. doi: 10.1136/bmjgh-2020-003126. - DOI - PMC - PubMed
    1. Alene M, Yismaw L, Assemie MA, Ketema DB, Gietaneh W, Birhan TY. Serial interval and incubation period of COVID-19: a systematic review and meta-analysis. BMC Infectious Diseases. 2021;21:257. doi: 10.1186/s12879-021-05950-x. - DOI - PMC - PubMed

Publication types