Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Mar 29;17(3):e1008837.
doi: 10.1371/journal.pcbi.1008837. eCollection 2021 Mar.

Pandemic velocity: Forecasting COVID-19 in the US with a machine learning & Bayesian time series compartmental model

Affiliations

Pandemic velocity: Forecasting COVID-19 in the US with a machine learning & Bayesian time series compartmental model

Gregory L Watson et al. PLoS Comput Biol. .

Abstract

Predictions of COVID-19 case growth and mortality are critical to the decisions of political leaders, businesses, and individuals grappling with the pandemic. This predictive task is challenging due to the novelty of the virus, limited data, and dynamic political and societal responses. We embed a Bayesian time series model and a random forest algorithm within an epidemiological compartmental model for empirically grounded COVID-19 predictions. The Bayesian case model fits a location-specific curve to the velocity (first derivative) of the log transformed cumulative case count, borrowing strength across geographic locations and incorporating prior information to obtain a posterior distribution for case trajectories. The compartmental model uses this distribution and predicts deaths using a random forest algorithm trained on COVID-19 data and population-level characteristics, yielding daily projections and interval estimates for cases and deaths in U.S. states. We evaluated the model by training it on progressively longer periods of the pandemic and computing its predictive accuracy over 21-day forecasts. The substantial variation in predicted trajectories and associated uncertainty between states is illustrated by comparing three unique locations: New York, Colorado, and West Virginia. The sophistication and accuracy of this COVID-19 model offer reliable predictions and uncertainty estimates for the current trajectory of the pandemic in the U.S. and provide a platform for future predictions as shifting political and societal responses alter its course.

PubMed Disclaimer

Conflict of interest statement

I have read the journal’s policy and the authors of this manuscript have the following competing interests: GLW, JAZ, PS, TB, and report personal fees from Private Health Management during the conduct of the study. CMR reports grants and personal fees from Private Health Management. MAS reports grants from US National Institutes of Health, grants from IQVIA, personal fees from Janssen Research and Development, and personal fees from Private Health Management during the conduct of the study. DX, LZ, JS, and AWR declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig 1
Fig 1. Log cumulative cases and its velocity.
The cumulative case count (a), the log cumulative confirmed case count (b) and its velocity (c), i.e., first derivative with respect to time, for three example states, New York (NY), Colorado (CO), and West Virginia (WV) since 100 or more confirmed cases were reported.
Fig 2
Fig 2. Death model covariate importance.
Covariate importance scores on the log scale for the random forest death model as the mean decrease in MSE associated with permutation of the variable’s values.
Fig 3
Fig 3. The SIRD model.
Each of the four compartments quantifies the number of population members with that disease status: S for susceptible, I for infectious, R for recovered and D for dead. The arrows indicate possible transitions between disease states.
Fig 4
Fig 4. Predictive accuracy.
The median and interquartile range (IQR) of MASE across all 50 states on each day of the 21-day prediction periods for new confirmed cases (a) and deaths (b). A MASE of 1 indicates equivalent accuracy to a one-day random walk forecast in the training data.
Fig 5
Fig 5. Predicted cumulative cases, active infections, deaths, and effective Rt.
Projected cumulative case count, active confirmed infections, and daily deaths through April 1, 2021, for New York, Colorado, and West Virginia. The grey dots indicate observed data, which are not available for active infections and Rt.

Similar articles

  • Challenges of COVID-19 Case Forecasting in the US, 2020-2021.
    Lopez VK, Cramer EY, Pagano R, Drake JM, O'Dea EB, Adee M, Ayer T, Chhatwal J, Dalgic OO, Ladd MA, Linas BP, Mueller PP, Xiao J, Bracher J, Castro Rivadeneira AJ, Gerding A, Gneiting T, Huang Y, Jayawardena D, Kanji AH, Le K, Mühlemann A, Niemi J, Ray EL, Stark A, Wang Y, Wattanachit N, Zorn MW, Pei S, Shaman J, Yamana TK, Tarasewicz SR, Wilson DJ, Baccam S, Gurung H, Stage S, Suchoski B, Gao L, Gu Z, Kim M, Li X, Wang G, Wang L, Wang Y, Yu S, Gardner L, Jindal S, Marshall M, Nixon K, Dent J, Hill AL, Kaminsky J, Lee EC, Lemaitre JC, Lessler J, Smith CP, Truelove S, Kinsey M, Mullany LC, Rainwater-Lovett K, Shin L, Tallaksen K, Wilson S, Karlen D, Castro L, Fairchild G, Michaud I, Osthus D, Bian J, Cao W, Gao Z, Lavista Ferres J, Li C, Liu TY, Xie X, Zhang S, Zheng S, Chinazzi M, Davis JT, Mu K, Pastore Y Piontti A, Vespignani A, Xiong X, Walraven R, Chen J, Gu Q, Wang L, Xu P, Zhang W, Zou D, Gibson GC, Sheldon D, Srivastava A, Adiga A, Hurt B, Kaur G, Lewis B, Marathe M, Peddireddy AS, Porebski P, Venkatramanan S, Wang L, Prasad PV, Walker JW, Webber AE, Slayton RB, Biggerstaff M, Reich NG, Johansson MA. Lopez VK, et al. PLoS Comput Biol. 2024 May 6;20(5):e1011200. doi: 10.1371/journal.pcbi.1011200. eCollection 2024 May. PLoS Comput Biol. 2024. PMID: 38709852 Free PMC article.
  • A comparison of five epidemiological models for transmission of SARS-CoV-2 in India.
    Purkayastha S, Bhattacharyya R, Bhaduri R, Kundu R, Gu X, Salvatore M, Ray D, Mishra S, Mukherjee B. Purkayastha S, et al. BMC Infect Dis. 2021 Jun 7;21(1):533. doi: 10.1186/s12879-021-06077-9. BMC Infect Dis. 2021. PMID: 34098885 Free PMC article.
  • Dynamic causal models in infectious disease epidemiology-an assessment of their predictive validity based on the COVID-19 epidemic in the UK 2020 to 2024.
    Bowie C, Friston K. Bowie C, et al. Front Public Health. 2025 May 2;13:1573783. doi: 10.3389/fpubh.2025.1573783. eCollection 2025. Front Public Health. 2025. PMID: 40385629 Free PMC article.
  • A particle swarm optimization approach for predicting the number of COVID-19 deaths.
    Haouari M, Mhiri M. Haouari M, et al. Sci Rep. 2021 Aug 16;11(1):16587. doi: 10.1038/s41598-021-96057-5. Sci Rep. 2021. PMID: 34400735 Free PMC article. Review.
  • Investigating and forecasting infectious disease dynamics using epidemiological and molecular surveillance data.
    Chowell G, Skums P. Chowell G, et al. Phys Life Rev. 2024 Dec;51:294-327. doi: 10.1016/j.plrev.2024.10.011. Epub 2024 Oct 24. Phys Life Rev. 2024. PMID: 39488136 Review.

Cited by

References

    1. Li R, Pei S, Chen B, Song Y, Zhang T, Yang W, et al.. Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV2). Science. 2020. - PMC - PubMed
    1. Prem K, Liu Y, Russell TW, Kucharski AJ, Eggo RM, Davies N, et al.. The effect of control strategies to reduce social mixing on outcomes of the COVID-19 epidemic in Wuhan, China: a modelling study. Lancet Public Health. 2020;. 10.1016/S2468-2667(20)30073-6 - DOI - PMC - PubMed
    1. Walker PG, Whittaker C, Watson O, Baguelin M, Ainslie K, Bhatia S, et al.. The global impact of COVID-19 and strategies for mitigation and suppression. Imperial College London. 2020; 10.1126/science.abc0035 - DOI
    1. Lin Q, Zhao S, Gao D, Lou Y, Yang S, Musa SS, et al.. A conceptual model for the coronavirus disease 2019 (COVID-19) outbreak in Wuhan, China with individual reaction and governmental action. Int J Infect Dis. 2020;93:211–216. 10.1016/j.ijid.2020.02.058 - DOI - PMC - PubMed
    1. Mandal S, Bhatnagar T, Arinaminpathy N, Agarwal A, Chowdhury A, Murhekar M, et al.. Prudent public health intervention strategies to control the coronavirus disease 2019 transmission in India: A mathematical model-based approach. Indian J Med Res. 2020;151. 10.4103/ijmr.IJMR_504_20 - DOI - PMC - PubMed

Publication types