Environ Res. 2022 Mar;204(Pt A):111909.
doi: 10.1016/j.envres.2021.111909. Epub 2021 Aug 20.

CovidPhy: A tool for phylogeographic analysis of SARS-CoV-2 variation

Xabier Bello et al. Environ Res. 2022 Mar.

Abstract

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the pathogen responsible for the coronavirus disease 2019 (COVID-19) pandemic. SARS-CoV-2 genomes have been sequenced on a massive scale worldwide and are now available in several public genome repositories. There is much interest in developing bioinformatic tools capable of analyzing and interpreting SARS-CoV-2 variation. We have designed CovidPhy (http://covidphy.eu), a web interface that can process SARS-CoV-2 genome sequences supplied as plain-text FASTA or through identity codes from the Global Initiative on Sharing Avian Influenza Data (GISAID) or GenBank. CovidPhy aggregates information available in the large GISAID database (>1.49 M genomes). Sequences are first aligned against the reference sequence; the interface then provides several kinds of information, including automatic classification of genomes into a pre-computed phylogeny, phylogeographic information, haplogroup/lineage frequencies, and sequence variation, and also indicates whether the genome contains known variants of concern (VOC). Additionally, CovidPhy allows searching for user-supplied variants and haplotypes, and includes a list of genomes that are good candidates for having seeded large outbreaks worldwide, most likely mediated by important superspreading events, together with their possible geographic epicenters and their relative impact as recorded in the GISAID database.

Keywords: COVID-19; Phylogeny; RNA; SARS-CoV-2; Superspreading events; Variants of concern.
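
The haplogroup/lineage frequencies mentioned in the abstract can be illustrated with a short sketch. This is not CovidPhy's code: the function name `haplogroup_frequencies`, the `(region, haplogroup)` record layout, and the labels `H1` and `H2` are hypothetical placeholders chosen for the example, and the records are toy data rather than GISAID entries.

```python
from collections import Counter, defaultdict
from typing import Dict, Iterable, Tuple


def haplogroup_frequencies(
    records: Iterable[Tuple[str, str]],
) -> Dict[str, Dict[str, float]]:
    """Return, for each region, the relative frequency of each haplogroup.

    `records` is an iterable of (region, haplogroup) pairs; this layout is
    an assumption made for the illustration.
    """
    counts: Dict[str, Counter] = defaultdict(Counter)
    for region, haplogroup in records:
        counts[region][haplogroup] += 1

    frequencies: Dict[str, Dict[str, float]] = {}
    for region, counter in counts.items():
        total = sum(counter.values())
        frequencies[region] = {h: n / total for h, n in counter.items()}
    return frequencies


# Toy data with placeholder haplogroup labels.
toy_records = [
    ("Spain", "H1"),
    ("Spain", "H1"),
    ("Spain", "H2"),
    ("Spain", "H2"),
    ("Italy", "H1"),
]
toy_freqs = haplogroup_frequencies(toy_records)
```

With this toy input, each of the two haplogroups accounts for half of the genomes in the first region, and the single genome in the second region gives its haplogroup a frequency of 1.0.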

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Pipeline of CovidPhy. CovidPhy offers three interfaces: a web interface, a CLI and a GUI. All three accept a FASTA file (top left), which is aligned against the Reference (402,125) using libdistfast.so and scanned for differences that allow the genome to be classified in a precomputed phylogeny (top, red square marked “core”). The output varies by program: the CLI and the GUI output only the haplogroup and the variants found (bottom black square), while the web interface offers additional information: haplogroup frequencies in regions (e.g. countries), candidates for important outbreaks as inferred from database searches, and VOCs. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)
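
The "core" step in the legend (scan an aligned genome for differences from the reference, then place it in a precomputed phylogeny) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the CovidPhy implementation: it does not use libdistfast.so, it assumes the query is already aligned to the reference with equal length and no insertions, and the names `call_variants`, `classify`, and the haplogroup labels are hypothetical. The classification rule used here (pick the haplogroup with the most defining variants all present in the query) is a simple stand-in for whatever rule the tool actually applies.

```python
from typing import Dict, FrozenSet, List, Tuple

# A variant is (1-based position, reference base, observed base).
Variant = Tuple[int, str, str]


def call_variants(reference: str, aligned_query: str) -> List[Variant]:
    """List substitutions between a reference and an already-aligned query.

    Assumes both strings have the same length (no insertions relative to
    the reference). Gaps ('-') and ambiguous bases ('N') in the query are
    skipped rather than reported as variants.
    """
    if len(reference) != len(aligned_query):
        raise ValueError("sequences must be aligned to the same length")
    variants: List[Variant] = []
    for pos, (ref_base, obs_base) in enumerate(
        zip(reference.upper(), aligned_query.upper()), start=1
    ):
        if obs_base in "N-":
            continue
        if ref_base != obs_base:
            variants.append((pos, ref_base, obs_base))
    return variants


def classify(
    variants: List[Variant],
    phylogeny: Dict[str, FrozenSet[Variant]],
    root: str = "root",
) -> str:
    """Return the haplogroup whose defining variants are all present.

    Among matching haplogroups, the one with the most defining variants
    (the most specific one) wins; `root` is returned if none match.
    """
    found = set(variants)
    best_name, best_size = root, 0
    for name, defining in phylogeny.items():
        if defining <= found and len(defining) > best_size:
            best_name, best_size = name, len(defining)
    return best_name


# Toy 10-base example with placeholder haplogroup names.
toy_reference = "ACGTACGTAC"
toy_query = "ATGTACGGAC"
toy_phylogeny = {
    "H1": frozenset({(2, "C", "T")}),
    "H1a": frozenset({(2, "C", "T"), (8, "T", "G")}),
    "H2": frozenset({(5, "A", "G")}),
}
toy_variants = call_variants(toy_reference, toy_query)
toy_haplogroup = classify(toy_variants, toy_phylogeny)
```

In the toy example the query differs from the reference at positions 2 and 8, so it satisfies both `H1` and `H1a` and is assigned to the more specific `H1a`.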
