Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jun:23:101064.
doi: 10.1016/j.genrep.2021.101064. Epub 2021 Mar 1.

Analysis and comparison of genetic variants and mutations of the novel coronavirus SARS-CoV-2

Affiliations

Analysis and comparison of genetic variants and mutations of the novel coronavirus SARS-CoV-2

Zaid Almubaid et al. Gene Rep. 2021 Jun.

Abstract

We present an analysis and comparison study of genetic variants and mutations of about 1200 genomes of SARS-CoV-2 virus sampled across the first seven months of 2020. The study includes 12 sets of about 100 genomes each collected between January and September. We analyzed the mutations, mutation frequency and count trends over time, and genomes trends over time from January through September. We show that certain mutations in the SARS-CoV-2 genome are not occurring randomly as it has been commonly believed. This finding is in agreement with other recently published research in this domain. Therefore, this validates other findings in this direction. This study includes approximately 1000 genomes and was able to identify over 35 different mutations most of which are common to almost all genomes groups. Some mutations' ratios (frequency percentage) fluctuate over time to adapt the virus to various environmental factors, climate, and populations. One of the interesting findings in this paper is that the coding region, at the nucleotide level for NSP13 protein is relatively conserved compared with other protein regions in the ORF1ab gene which makes this protein a good candidate for developing drug targets and treatment for the COVID-19 disease. Although this outcome was already reported by other researchers, we corroborated their result with our work in a different approach and another experimental setting with almost one thousand complete genome sequences. We presented and discussed all these results and findings with tables of results and illustrating figures.

Keywords: 2019 H-CoV 2; Genetic variants; Novel coronavirus; SARS-CoV-2; SARS-CoV-2 genetic variants; SARS-CoV-2 genome.

PubMed Disclaimer

Conflict of interest statement

No conflict of interest.

Figures

Fig. 1
Fig. 1
Structure of the SARS-CoV-2 genome in three different views: (a) basic illustration of the complete genome structure (b) illustration of coronavirus 2 isolate Wuhan-Hu-1, NC_045512 (complete genome 29,903 bp) as presented in the GenBank/NCBI https://www.ncbi.nlm.nih.gov/nuccore/NC_045512.2?report=graph (c) another view of the NC_045512.2 reference genome from NCBI showing all location of all genes. {Note: (b) and (c) are both courtesy of The US National Center for Biotechnology Information NCBI www.ncbi.nlm.nih.org.}
Fig. 2
Fig. 2
This figure shows the divisions of the genome into genes/proteins and highlights the ORF1ab and Spike protein.
Fig. 3
Fig. 3
Visualization of the genomic sequences showing the alignment and genetic variants. (a) This shows 23406A>G using NCBI msa viewer (this is the dataset S5 (this is only partial view showing position 23354 to position 23463 with a sample of 30 genomes). (b) Visualization of the multiple sequence alignment (using Jalview tool). (c) S1c-100-genome (aligned with clustal msa)>>partial view with Jalview. Note: Y is C or T; R is A or G; W is A or T. Note: (a) is courtesy of the US National Center for Biotechnology Information NCBI. (b) and (c) are courtesy of Jalview tool (Waterhouse et al., n.d.): https://www.jalview.org/.
Fig. 4
Fig. 4
Illustration of the percentage of five mutations in eight genomic sets (sets S1S8) collected between 1 January and 31 May.

References

    1. Anastassopoulou C., Russo L., Tsakris A., Siettos C. Data-based analysis, modelling and forecasting of the COVID-19 outbreak. PLoS one. 2020;15(3):1–10. - PMC - PubMed
    1. Andersen Kristian G., Rambaut Andrew, Lipkin W. Ian, Holmes Edward C., Garry Robert F. The proximal origin of SARS-CoV-2. Nat. Med. 2020;26:450–452. doi: 10.1038/s41591-020-0820-9. - DOI - PMC - PubMed
    1. Biological Sciences, National Institutes of Health (US) National Institutes of Health; Bethesda (MD): 2007. Curriculum Study.
    1. Center for Disease Control and Prevention (CDC) H5N1 genetic changes. https://www.cdc.gov/flu/avianflu/h5n1/inventory-qa.htm
    1. Dorp Lucyvan, Acmana Mislav, Richard Damien, Shaw Liam P., Ford Charlotte E. Emergence of genomic diversity and recurrent mutations in SARS-CoV-2. Infection, Genetics and Evolution. 2020;83 doi: 10.1016/j.meegid.2020.104351. (Sept. 2020) - DOI - PMC - PubMed

LinkOut - more resources