Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Dec 28;60(12):5853-5865.
doi: 10.1021/acs.jcim.0c00501. Epub 2020 Jun 25.

Decoding SARS-CoV-2 Transmission and Evolution and Ramifications for COVID-19 Diagnosis, Vaccine, and Medicine

Affiliations

Decoding SARS-CoV-2 Transmission and Evolution and Ramifications for COVID-19 Diagnosis, Vaccine, and Medicine

Rui Wang et al. J Chem Inf Model. .

Abstract

Tremendous effort has been given to the development of diagnostic tests, preventive vaccines, and therapeutic medicines for coronavirus disease 2019 (COVID-19) caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Much of this development has been based on the reference genome collected on January 5, 2020. Based on the genotyping of 15 140 genome samples collected up to June 1, 2020, we report that SARS-CoV-2 has undergone 8309 single mutations which can be clustered into six subtypes. We introduce mutation ratio and mutation h-index to characterize the protein conservativeness and unveil that SARS-CoV-2 envelope protein, main protease, and endoribonuclease protein are relatively conservative, while SARS-CoV-2 nucleocapsid protein, spike protein, and papain-like protease are relatively nonconservative. In particular, we have identified mutations on 40% of nucleotides in the nucleocapsid gene in the population level, signaling potential impacts on the ongoing development of COVID-19 diagnosis, vaccines, and antibody and small-molecular drugs.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
Pie chart plot of six distinct clusters in the world. The light blue, dark blue, green, red, purple, and yellow represent clusters I, II, III, IV, V, and VI, respectively. The base color of each country is decided by the color of the dominant cluster.
Figure 2
Figure 2
Pie chart plot of four distinct clusters in the US. The blue, red, yellow, and green colors represent clusters A, B, C, and D. The base color of each state corresponds to its dominant cluster.
Figure 3
Figure 3
Distribution of SNP mutations of SARS-CoV-2 isolates from 15 140 genome samples in the world with respect to the reference genome of January 5, 2020 (GenBank access number: NC_045512.2).
Figure 4
Figure 4
Frequencies of the single SNP mutations of SARS-CoV-2 on the genome samples in the world with respect to the reference genome of January 5, 2020 (GenBank access number: NC_045512.2).
Figure 5
Figure 5
Illustration of SARS-CoV-2 spike protein mutations using 6VXX as a template.
Figure 6
Figure 6
Illustration of SARS-CoV-2 spike-protein receptor binding domain (RBD) mutation using 6M0J as a template. It is noted that nearly half of the residues in RBD have undergone mutations in the few months.
Figure 7
Figure 7
Illustration of SARS-CoV-2 main protease mutations using 6LU7 as a template.
Figure 8
Figure 8
Illustration of SARS-CoV-2 main protease binding domain (BD) mutations of 6LU7.
Figure 9
Figure 9
Illustration of SARS-CoV-2 papain-like protease mutations using 6W9C as a template.
Figure 10
Figure 10
Illustration of SARS-CoV-2 RNA-polymerase mutations using 6M71 as a template.
Figure 11
Figure 11
Illustration of SARS-CoV-2 Endoribo-nuclease protein mutations using 6VWW as a template.
Figure 12
Figure 12
Illustration of SARS-CoV-2 envelope protein mutations using 5X29 as a template.
Figure 13
Figure 13
Illustration of SARS-CoV-2 nucleocapsid phosphoprotein mutations using 6VYO as a template.

References

    1. Cao Y.; Su B.; Guo X.; Sun W.; Deng Y.; Bao L.; Zhu Q.; Zhang X.; Zheng Y.; Geng C.; Chai X.; He R.; Li X.; Lv Q.; Zhu H.; Deng W.; Xu Y.; Wang Y.; Xie X. Potent neutralizing antibodies against SARS-CoV-2 identified by high-throughput single-cell sequencing of convalescent patients b cells. Cell 2020, 10.1016/j.cell.2020.05.025. - DOI - PMC - PubMed
    1. Chang C.; Michalska K.; Jedrzejczak R.; Maltseva N.; Endres M.; Godzik A.; Kim Y.; Joachimiak A.. Crystal structure of RNA binding domain of nucleocapsid phosphoprotein from SARS coronavirus 2. Wordwide PDB; 2020;10.2210/pdb6VYO/pdb. - DOI
    1. Corman V. M.; Landt O.; Kaiser M.; Molenkamp R.; Meijer A.; Chu D. K.; Bleicker T.; Brnink S.; Schneider J.; Schmidt M. L.; Mulders D. G.; Haagmans B. L.; van der Veer B.; van den Brink S.; Wijsman L.; Goderski G.; Romette J.-L.; Ellis J.; Zambon M.; Peiris M.; Goossens H.; Reusken C.; Koopmans M. P.; Drosten C. Detection of 2019 novel coronavirus (2019-ncov) by real-time RT-PCR. Eurosurveillance 2020, 10.2807/1560-7917.ES.2020.25.3.2000045. - DOI - PMC - PubMed
    1. DeDiego M. L.; Nieto-Torres J. L.; Jiménez-Guardeño J. M.; Regla-Nava J. A.; Alvarez E.; Oliveros J. C.; Zhao J.; Fett C.; Perlman S.; Enjuanes L. Severe acute respiratory syndrome coronavirus envelope protein regulates cell stress response and apoptosis. PLoS Pathog. 2011, 7 (10), e1002315.10.1371/journal.ppat.1002315. - DOI - PMC - PubMed
    1. Ding C.; He X.. K-means clustering via principal component analysis. In Proceedings of the twenty-first international conference on Machine learning, 2004, p 29.

Publication types

MeSH terms