The establishment of reference sequence for SARS-CoV-2 and variation analysis
- PMID: 32167180
- PMCID: PMC7228400
- DOI: 10.1002/jmv.25762
Abstract
Starting around December 2019, an epidemic of pneumonia, which was named COVID-19 by the World Health Organization, broke out in Wuhan, China, and is spreading throughout the world. A new coronavirus, named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by the Coronavirus Study Group of the International Committee on Taxonomy of Viruses, was soon found to be the cause. At present, the sensitivity of clinical nucleic acid detection is limited, and it is still unclear whether this limitation is related to genetic variation. In this study, we retrieved 95 full-length genomic sequences of SARS-CoV-2 strains from the National Center for Biotechnology Information and GISAID databases, established the reference sequence by conducting multiple sequence alignment and phylogenetic analyses, and analyzed sequence variations along the SARS-CoV-2 genome. The homology among all viral strains was generally high: 99.99% (99.91%-100%) at the nucleotide level and 99.99% (99.79%-100%) at the amino acid level. Although overall variation in open-reading frame (ORF) regions is low, 13 variation sites in the 1a, 1b, S, 3a, M, 8, and N regions were identified, among which positions nt28144 in ORF 8 and nt8782 in ORF 1a showed mutation rates of 30.53% (29/95) and 29.47% (28/95), respectively. These findings suggest that there may be selective mutations in SARS-CoV-2, and that it is necessary to avoid certain regions when designing primers and probes. Establishment of the reference sequence for SARS-CoV-2 could benefit not only biological study of this virus but also diagnosis, clinical monitoring, and intervention of SARS-CoV-2 infection in the future.
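The per-site mutation rates quoted above are simple proportions over the 95 aligned genomes (29/95 and 28/95). As an illustration only, the following Python sketch shows one way such a tally could be computed from an already aligned set of sequences. It is not the authors' pipeline: the majority-rule consensus, the function names `consensus_and_variation` and `percent`, and the toy alignment are all assumptions made for this example, and gaps or ambiguous bases are treated as ordinary characters.

```python
from collections import Counter


def consensus_and_variation(aligned):
    """Return (consensus, variable_sites) for equal-length aligned sequences.

    Assumption: the consensus is the most frequent character per column
    (majority rule). Each variable site is reported as
    (1-based position, consensus base, minority count, minority fraction).
    """
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be aligned to the same length")

    consensus = []
    sites = []
    for pos in range(length):
        column = [seq[pos] for seq in aligned]
        major, major_n = Counter(column).most_common(1)[0]
        consensus.append(major)
        minor_n = len(column) - major_n
        if minor_n:
            sites.append((pos + 1, major, minor_n, minor_n / len(column)))
    return "".join(consensus), sites


def percent(count, total):
    """Proportion expressed as a percentage rounded to two decimals."""
    return round(100 * count / total, 2)


# Hypothetical toy alignment, not real SARS-CoV-2 data.
toy = ["ACGTAC", "ACGTAC", "ACATAC", "ACGTAT"]
consensus, sites = consensus_and_variation(toy)

# Reproduce the arithmetic behind the rates reported in the abstract.
print(percent(29, 95))  # 30.53
print(percent(28, 95))  # 29.47
```

Columns whose minority fraction is high, as at nt28144 and nt8782, are the kind of positions the abstract recommends avoiding when placing primers and probes.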
Keywords: SARS-CoV-2; homology; nucleotide; reference sequence; variation.
© 2020 Wiley Periodicals, Inc.
Conflict of interest statement
The authors declare that there are no conflicts of interest.