Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Meta-Analysis
. 2021 Oct 1:789:147947.
doi: 10.1016/j.scitotenv.2021.147947. Epub 2021 May 23.

Data-driven estimation of COVID-19 community prevalence through wastewater-based epidemiology

Affiliations
Meta-Analysis

Data-driven estimation of COVID-19 community prevalence through wastewater-based epidemiology

Xuan Li et al. Sci Total Environ. .

Abstract

Wastewater-based epidemiology (WBE) has been regarded as a potential tool for the prevalence estimation of coronavirus disease 2019 (COVID-19) in the community. However, the application of the conventional back-estimation approach is currently limited due to the methodological challenges and various uncertainties. This study systematically performed meta-analysis for WBE datasets and investigated the use of data-driven models for the COVID-19 community prevalence in lieu of the conventional WBE back-estimation approach. Three different data-driven models, i.e. multiple linear regression (MLR), artificial neural network (ANN), and adaptive neuro fuzzy inference system (ANFIS) were applied to the multi-national WBE dataset. To evaluate the robustness of these models, predictions for sixteen scenarios with partial inputs were compared against the actual prevalence reports from clinical testing. The performance of models was further validated using unseen data (data sets not included for establishing the model) from different stages of the COVID-19 outbreak. Generally, ANN and ANFIS models showed better accuracy and robustness over MLR models. Air and wastewater temperature played a critical role in the prevalence estimation by data-driven models, especially MLR models. With unseen datasets, ANN model reasonably estimated the prevalence of COVID-19 (cumulative cases) at the initial phase and forecasted the upcoming new cases in 2-4 days at the post-peak phase of the COVID-19 outbreak. This study provided essential information about the feasibility and accuracy of data-driven estimation of COVID-19 prevalence through the WBE approach.

Keywords: Artificial neural network; COVID-19; Data-driven models; SARS-CoV-2; Wastewater-based epidemiology.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Unlabelled Image
Graphical abstract
Fig. 1
Fig. 1
Pairwise Pearson's correlation plot between prevalence data (PWBE) and the nine explanatory factors. The color and size of the circles indicate the strength of Pearson's correlation coefficient (bigger circle = stronger link; blue = positive correlation and red = negative correlation) (A). The db-RDA diagram showing the relationship between the prevalence data and explanatory factors. Prevalence data from 7 publications were identified with different colors (P1-P7), and the countries of the 7 publications were differentiated with shapes. The % value in the RDA axes indicates the % of the total variation explained by each RDA axes (B). (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 2
Fig. 2
The outputs of the ANN model (A) and ANFIS model (B), and their correlations with the actual prevalence reported from clinical tests using all of the datasets. Target is the prevalence of active COVID-19 cases reported from the clinical testing. The output is the value obtained from the model predicting the SARS-CoV-2 prevalence using input parameters. The Y = T line is where the y-axis value equals the target value.
Fig. 3
Fig. 3
Comparison of the output from ANN model and prevalence determined by cumulative cases (Pcum), daily new cases (Pday), weekly new cases (Pweek) and upcoming new cases in the following 2 or 4 days (P2d, P4d) for the initial (pre-peak) stage of an outbreak (A) and post-peak stage of an outbreak (B). Y = X line is where the y-axis value equals the x-axis value.

Similar articles

Cited by

References

    1. Ahmed W., Angel N., Edson J., Bibby K., Bivins A., O’Brien J.W., Choi P.M., Kitajima M., Simpson S.L., Li J., Tscharke B., Verhagen R., Smith W.J.M., Zaugg J., Dierens L., Hugenholtz P., Thomas K.V., Mueller J.F. First confirmed detection of SARS-CoV-2 in untreated wastewater in Australia: a proof of concept for the wastewater surveillance of COVID-19 in the community. Sci. Total Environ. 2020;728 - PMC - PubMed
    1. Ahmed W., Bertsch P.M., Bibby K., Haramoto E., Hewitt J., Huygens F., Gyawali P., Korajkic A., Riddell S., Sherchan S.P., Simpson S.L., Sirikanchana K., Symonds E.M., Verhagen R., Vasan S.S., Kitajima M., Bivins A. Decay of SARS-CoV-2 and surrogate murine hepatitis virus RNA in untreated wastewater to inform application in wastewater-based epidemiology. Environ. Res. 2020;191 - PMC - PubMed
    1. Ahmed W., Bertsch P.M., Bivins A., Bibby K., Farkas K., Gathercole A., Haramoto E., Gyawali P., Korajkic A., McMinn B.R., Mueller J.F., Simpson S.L., Smith W.J.M., Symonds E.M., Thomas K.V., Verhagen R., Kitajima M. Comparison of virus concentration methods for the RT-qPCR-based recovery of murine hepatitis virus, a surrogate for SARS-CoV-2 from untreated wastewater. Sci. Total Environ. 2020;739 - PMC - PubMed
    1. Alygizakis N., Markou A.N., Rousis N.I., Galani A., Avgeris M., Adamopoulos P.G., Scorilas A., Lianidou E.S., Paraskevis D., Tsiodras S., Tsakris A., Dimopoulos M.A., Thomaidis N.S. Analytical methodologies for the detection of SARS-CoV-2 in wastewater: protocols and future perspectives. Trends Anal. Chem. 2021;134 - PMC - PubMed
    1. Ausati S., Amanollahi J. Assessing the accuracy of ANFIS, EEMD-GRNN, PCR, and MLR models in predicting PM2. 5. Atmos. Environ. 2016;142:465–474.

Publication types