Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2021 Jan 25:2021.01.22.21250320.
doi: 10.1101/2021.01.22.21250320.

High-throughput sequencing of SARS-CoV-2 in wastewater provides insights into circulating variants

Affiliations

High-throughput sequencing of SARS-CoV-2 in wastewater provides insights into circulating variants

Rafaela S Fontenele et al. medRxiv. .

Update in

  • High-throughput sequencing of SARS-CoV-2 in wastewater provides insights into circulating variants.
    Fontenele RS, Kraberger S, Hadfield J, Driver EM, Bowes D, Holland LA, Faleye TOC, Adhikari S, Kumar R, Inchausti R, Holmes WK, Deitrick S, Brown P, Duty D, Smith T, Bhatnagar A, Yeager RA 2nd, Holm RH, von Reitzenstein NH, Wheeler E, Dixon K, Constantine T, Wilson MA, Lim ES, Jiang X, Halden RU, Scotch M, Varsani A. Fontenele RS, et al. Water Res. 2021 Oct 15;205:117710. doi: 10.1016/j.watres.2021.117710. Epub 2021 Sep 25. Water Res. 2021. PMID: 34607084 Free PMC article.

Abstract

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emerged from a zoonotic spill-over event and has led to a global pandemic. The public health response has been predominantly informed by surveillance of symptomatic individuals and contact tracing, with quarantine, and other preventive measures have then been applied to mitigate further spread. Non-traditional methods of surveillance such as genomic epidemiology and wastewater-based epidemiology (WBE) have also been leveraged during this pandemic. Genomic epidemiology uses high-throughput sequencing of SARS-CoV-2 genomes to inform local and international transmission events, as well as the diversity of circulating variants. WBE uses wastewater to analyse community spread, as it is known that SARS-CoV-2 is shed through bodily excretions. Since both symptomatic and asymptomatic individuals contribute to wastewater inputs, we hypothesized that the resultant pooled sample of population-wide excreta can provide a more comprehensive picture of SARS-CoV-2 genomic diversity circulating in a community than clinical testing and sequencing alone. In this study, we analysed 91 wastewater samples from 11 states in the USA, where the majority of samples represent Maricopa County, Arizona (USA). With the objective of assessing the viral diversity at a population scale, we undertook a single-nucleotide variant (SNV) analysis on data from 52 samples with >90% SARS-CoV-2 genome coverage of sequence reads, and compared these SNVs with those detected in genomes sequenced from clinical patients. We identified 7973 SNVs, of which 5680 were novel SNVs that had not yet been identified in the global clinical-derived data as of 17th June 2020 (the day after our last wastewater sampling date). However, between 17th of June 2020 and 20th November 2020, almost half of the SNVs have since been detected in clinical-derived data. Using the combination of SNVs present in each sample, we identified the more probable lineages present in that sample and compared them to lineages observed in North America prior to our sampling dates. The wastewater-derived SARS-CoV-2 sequence data indicates there were more lineages circulating across the sampled communities than represented in the clinical-derived data. Principal coordinate analyses identified patterns in population structure based on genetic variation within the sequenced samples, with clear trends associated with increased diversity likely due to a higher number of infected individuals relative to the sampling dates. We demonstrate that genetic correlation analysis combined with SNVs analysis using wastewater sampling can provide a comprehensive snapshot of the SARS-CoV-2 genetic population structure circulating within a community, which might not be observed if relying solely on clinical cases.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: E.M.D and R.U.H. are cofounders of AquaVitas, LLC, 9260 E. Raintree, Ste 130, Scottsdale, AZ 85260, USA, an ASU start-up company providing commercial services in wastewater-based epidemiology. R.U.H. is the founder of OneWaterOneHealth, a non-profit project of the Arizona State University Foundation.

Figures

Figure 1:
Figure 1:
A. Map of the United States of America with states where wastewater samples were collected for this study highlighted in grey. B. SARS-CoV-2 RT-qPCR Ct detection value for each sample and the corresponding SARS-CoV-2 genome coverage uniformity from the tiling amplicon-based HTS. C. SARS-CoV-2 genome coverage of the high-throughput sequencing of all the wastewater samples (cyan) and those with >90% coverage (red). * indicates that these sites have a coverage depth of 1.
Figure 2:
Figure 2:
A. Number of single nucleotide variants (SNV) per sample across 10 states (each state is represented by a different colour). B. Regression analysis, with 95% confidence interval, of the number of wastewater-derived SARS-CoV-2 SNVs detected versus the mean depth for each of the 52 samples with >90% coverage that were analysed. The colour code indicates the states in which the samples were collected.
Figure 3:
Figure 3:
Novel SARS-CoV-2 SNVs (i.e. not yet detected in clinical-derived samples as of 17th June 2020) identified in the 52 wastewater samples analysed. On the y-axis are the number of samples containing the SNV and on the x-axis is the relative position of SNV in the SARS-CoV-2 genome. Positions with multiple variants are marked in red and those marked with grey circles represent the SNVs that have been detected up until 20th November 2020 in clinical samples.
Figure 4:
Figure 4:
Publicly available genomes from clinically derived data deposited in GISAID, grouped by PANGOLIN, whose mutations were consistent with those observed in wastewater samples. A. Heatmap showing the number of days between sample collection and when supported lineages were first observed in clinical data. Each wastewater sample (52 samples across 10 states) contained support for different clinical samples which are grouped here by PANGOLIN, some of which have only been observed outside North America (indicated as “global only”). B. Clinical genomes reported in USA states and territories which were assigned to PANGOLIN supported by at least one environmental sample. Black borders indicate lineages supported in environmental samples from the respective location.
Figure 5:
Figure 5:
Principal coordinate analysis (PCoA) of SARS-CoV-2 sequence data derived from wastewater samples. A. Distribution of sequences from samples collected in ten states (each represented by a different colour) in the USA showing pairwise distance based on genomic composition between viral populations present in each sample. B. Timeline representation (shown by the colour gradient) of samples taken from the sample locations across ten USA states between April-June 2020 with pairwise distance based on genomic composition between viral populations present in each sample. C. Spatial representation of SARS-CoV-2 sequences from samples collected from various regions within Arizona (represented by different symbols) comparative to those from other states. D. Sampling catchments in Tempe, Guadalupe and Gilbert, Arizona.

References

    1. Adams E.R., Ainsworth M., Anand R., Andersson M.I., Auckland K., Baillie J.K., Barnes E., Beer S., Bell J.I. and Berry T. 2020. Antibody testing for COVID-19: A report from the National COVID Scientific Advisory Panel. Wellcome Open Research 5(139), 139.10.12688/wellcomeopenres.15927.1. - DOI - PMC - PubMed
    1. Ahmed W., Angel N., Edson J., Bibby K., Bivins A., O’Brien J.W., Choi P.M., Kitajima M., Simpson S.L., Li J., Tscharke B., Verhagen R., Smith W.J.M., Zaugg J., Dierens L., Hugenholtz P., Thomas K.V. and Mueller J.F. 2020. First confirmed detection of SARS-CoV-2 in untreated wastewater in Australia: A proof of concept for the wastewater surveillance of COVID-19 in the community. Sci Total Environ 728, 138764.10.1016/j.scitotenv.2020.138764. - DOI - PMC - PubMed
    1. Andersen K.G., Rambaut A., Lipkin W.I., Holmes E.C. and Garry R.F. 2020. The proximal origin of SARS-CoV-2. Nat Med 26(4), 450–452.10.1038/s41591-020-0820-9. - DOI - PMC - PubMed
    1. Balboa S., Mauricio-Iglesias M., Rodriguez S., Martínez-Lamas L., Vasallo F.J., Regueiro B. and Lema J.M. 2020. The fate of SARS-CoV-2 in wastewater treatment plants points out the sludge line as a suitable spot for incidence monitoring. medRxiv 10.1101/2020.05.25.2011270610.1101/2020.05.25.20112706. - DOI - DOI - PMC - PubMed
    1. Becker M., Strengert M., Junker D., Kerrinnes T., Kaiser P.D., Traenkle B., Dinter H., Haering J., Zeck A., Weise F., Peter A., Hoerber S., Fink S., Ruoff F., Bakchoul T., Baillot A., Lohse S., Cornberg M., Illig T., Gottlieb J., Smola S., Karch A., Berger K., Rammensee H.-G., Schenke-Layland K., Nelde A., Maerklin M., Heitmann J.S., Walz J.S., Templin M.F., Joos T.O., Rothbauer U., Krause G. and Schneiderhan-Marra N. 2020. Going beyond clinical routine in SARS-CoV-2 antibody testing - A multiplex corona virus antibody test for the evaluation of cross-reactivity to endemic coronavirus antigens. medRxiv 2020.07.17.20156000.https://doi.org/2020.07.17.20156000.

Publication types

LinkOut - more resources