Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Dec 30;18(1):494.
doi: 10.1186/s12967-020-02675-4.

Design of a companion bioinformatic tool to detect the emergence and geographical distribution of SARS-CoV-2 Spike protein genetic variants

Affiliations

Design of a companion bioinformatic tool to detect the emergence and geographical distribution of SARS-CoV-2 Spike protein genetic variants

Alice Massacci et al. J Transl Med. .

Abstract

Background: Tracking the genetic variability of Severe Acute Respiratory Syndrome CoronaVirus 2 (SARS-CoV-2) is a crucial challenge. Mainly to identify target sequences in order to generate robust vaccines and neutralizing monoclonal antibodies, but also to track viral genetic temporal and geographic evolution and to mine for variants associated with reduced or increased disease severity. Several online tools and bioinformatic phylogenetic analyses have been released, but the main interest lies in the Spike protein, which is the pivotal element of current vaccine design, and in the Receptor Binding Domain, that accounts for most of the neutralizing the antibody activity.

Methods: Here, we present an open-source bioinformatic protocol, and a web portal focused on SARS-CoV-2 single mutations and minimal consensus sequence building as a companion vaccine design tool. Furthermore, we provide immunogenomic analyses to understand the impact of the most frequent RBD variations.

Results: Results on the whole GISAID sequence dataset at the time of the writing (October 2020) reveals an emerging mutation, S477N, located on the central part of the Spike protein Receptor Binding Domain, the Receptor Binding Motif. Immunogenomic analyses revealed some variation in mutated epitope MHC compatibility, T-cell recognition, and B-cell epitope probability for most frequent human HLAs.

Conclusions: This work provides a framework able to track down SARS-CoV-2 genomic variability.

Keywords: Bioinformatic workflow; COVID mutations; Docker; SARS-CoV-2 genome; SARS-CoV-2 mutation; SARS-CoV-2 vaccine.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
a Workflow of the Covid-Miner package. The execution is fully automatized and encapsulated in a Docker container. b LolliPlot representation of SARS-CoV-2 variants at 1% frequency for the whole SARS-CoV-2 genome and from 0.5 to 10% for the Spike protein locus, showing the most frequent RBD variants at codon 447 (7%) and 439 (0.7%). c Barplot from occurrences of most frequent RBD variations
Fig. 2
Fig. 2
a Heatmap of Receptor Binding Domain variants geographical distribution by normalized frequency. Countries with low occurrences of RBD variants (less than 5) were removed. b Multiple Correspondence Analysis (MCA) of Country-RBD variant association. c Evolution of S447N and N439K in space and time. Y-axis represents the relative frequency of that mutation in every country isolate set for each month. X-axis: months in 2020, as annotated in GISAID sequences. Every bar is annotated with absolute counts
Fig. 3
Fig. 3
a Heatmap representing T-cell immunogenicity scores for wild-type and mutated RBD epitopes. b Screenshot of covid-miner web portal homepage. c Donut plot of RBD mutational impact of overall sequences and September 2020 sequences. d Location of the most frequent RBD mutated amino acids N439 and S477

References

    1. Du L, He Y, Zhou Y, Liu S, Zheng BJ, Jiang S. The spike protein of SARS-CoV—a target for vaccine and therapeutic development. Nat Rev Microbiol. 2009;7:226–236. doi: 10.1038/nrmicro2090. - DOI - PMC - PubMed
    1. Yu J, Tostanoski LH, Peter L, Mercado NB, McMahan K, Mahrokhian SH, et al. DNA vaccine protection against SARS-CoV-2 in rhesus macaques. Science. 2020;69:806. doi: 10.1126/science.abc6284. - DOI - PMC - PubMed
    1. Shang J, Ye G, Shi K, Wan Y, Luo C, Aihara H, et al. Structural basis of receptor recognition by SARS-CoV-2. Nature. 2020;581:221. doi: 10.1038/s41586-020-2179-y. - DOI - PMC - PubMed
    1. Koyama T, Platt DE, Parida L. Variant analysis of COVID-19 genomes. J Bull World Heal Organ. 2020;2:1–21. https://www.researchgate.net/publication/339461351_Variant_analysis_of_C... - PMC - PubMed
    1. Chiara M, Horner DS, Pesole G. Comparative genomics suggests limited variability and similar evolutionary patterns between major clades of SARS-Cov-2. bioRxiv. 2020 doi: 10.1101/2020.03.30.016790. - DOI

Publication types

MeSH terms