Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Nov 10;17(11):e0277154.
doi: 10.1371/journal.pone.0277154. eCollection 2022.

Application of machine learning for multi-community COVID-19 outbreak predictions with wastewater surveillance

Affiliations
Review

Application of machine learning for multi-community COVID-19 outbreak predictions with wastewater surveillance

Yuehan Ai et al. PLoS One. .

Abstract

The potential of wastewater-based epidemiology (WBE) as a surveillance and early warning tool for the COVID-19 outbreak has been demonstrated. For areas with limited testing capacity, wastewater surveillance can provide information on the disease dynamic at a community level. A predictive model is a key to generating quantitative estimates of the infected population. Modeling longitudinal wastewater data can be challenging as biomarkers in wastewater are susceptible to variations caused by multiple factors associated with the wastewater matrix and the sewersheds characteristics. As WBE is an emerging trend, the model should be able to address the uncertainties of wastewater from different sewersheds. We proposed exploiting machine learning and deep learning techniques, which are supported by the growing WBE data. In this article, we reviewed the existing predictive models, among which the emerging machine learning/deep learning models showed great potential. However, most models are built for individual sewersheds with few features extracted from the wastewater. To fulfill the research gap, we compared different time-series and non-time-series models for their short-term predictive performance of COVID-19 cases in 9 diverse sewersheds. The time-series models, long short-term memory (LSTM) and Prophet, outcompeted the non-time-series models. Besides viral (SARS-CoV-2) loads and location identity, domain-specific features like biochemical parameters of wastewater, geographical parameters of the sewersheds, and some socioeconomic parameters of the communities can contribute to the models. With proper feature engineering and hyperparameter tuning, we believe machine learning models like LSTM can be a feasible solution for the COVID-19 trend prediction via WBE. Overall, this is a proof-of-concept study on the application of machine learning in COVID-19 WBE. Future studies are needed to deploy and maintain the model in more real-world applications.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Concept of applying machine learning for multi-community COVID-19 outbreak predictions with wastewater surveillance.
Fig 2
Fig 2. LSTM model flow.
a) Input type and sliding window; b) LSTM model architecture.
Fig 3
Fig 3. Comparison of model performance.
a) RMSEs of five machine/deep learning models; b) Person’s correlation coefficient of the predicted (LSTM and Prophet model) vs. observed COVID-19 case numbers (15 days rolling average) for Athens (‘bad case’); c) LSTM model performance on the data from Athens sewershed in Ohio (Overlaid area plot of the predicted vs. observed COVID-19 case numbers. Potential undertesting was observed); and d) Prophet model on all sewersheds. The shaded area indicated the 95% credible interval of the model parameters. True observations are shown in solid dots.

Similar articles

Cited by

References

    1. Bivins A., North D., Ahmad A., Ahmed W., Alm E., Been F., et al.. (2020). Wastewater-Based Epidemiology: Global Collaborative to Maximize Contributions in the Fight Against COVID-19. Environmental Science & Technology, 54(13), 7754–7757. doi: 10.1021/acs.est.0c02388 - DOI - PubMed
    1. Polo D., Quintela-Baluja M., Corbishley A., Jones D. L., Singer A. C., Graham D. W., et al.. (2020). Making waves: Wastewater-based epidemiology for COVID-19 –approaches and challenges for surveillance and prediction. Water Research, 186, 116404. doi: 10.1016/j.watres.2020.116404 - DOI - PMC - PubMed
    1. Ahmed W., Angel N., Edson J., Bibby K., Bivins A., O’Brien J. W., et al.. (2020). First confirmed detection of SARS-CoV-2 in untreated wastewater in Australia: A proof of concept for the wastewater surveillance of COVID-19 in the community. Science of The Total Environment, 728, 138764. doi: 10.1016/j.scitotenv.2020.138764 - DOI - PMC - PubMed
    1. Randazzo W., Cuevas-Ferrando E., Sanjuán R., Domingo-Calap P., & Sánchez G. (2020). Metropolitan wastewater analysis for COVID-19 epidemiological surveillance. International Journal of Hygiene and Environmental Health, 230, 113621. doi: 10.1016/j.ijheh.2020.113621 - DOI - PMC - PubMed
    1. Wu F., Zhang J., Xiao A., Gu X., Lee W. L., Armas F., et al.. (2020). SARS-CoV-2 Titers in Wastewater Are Higher than Expected from Clinically Confirmed Cases. MSystems, 5(4). doi: 10.1128/mSystems.00614-20 - DOI - PMC - PubMed

Publication types