Variable number tandem repeats of a 9-base insertion in the N-terminal domain of severe acute respiratory syndrome coronavirus 2 spike gene
- PMID: 36687631
- PMCID: PMC9846035
- DOI: 10.3389/fmicb.2022.1089399
Variable number tandem repeats of a 9-base insertion in the N-terminal domain of severe acute respiratory syndrome coronavirus 2 spike gene
Abstract
Introduction: The world is still struggling against the pandemic of coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), in 2022. The pandemic has been facilitated by the intermittent emergence of variant strains, which has been explained and classified mainly by the patterns of point mutations of the spike (S) gene. However, the profiles of insertions/deletions (indels) in SARS-CoV-2 genomes during the pandemic remain largely unevaluated yet.
Methods: In this study, we first screened for the genome regions of polymorphic indel sites by performing multiple sequence alignment; then, NCBI BLAST search and GISAID database search were performed to comprehensively investigate the indel profiles at the polymorphic indel hotspot and elucidate the emergence and spread of the indels in time and geographical distribution.
Results: A polymorphic indel hotspot was identified in the N-terminal domain of the S gene at approximately 22,200 nucleotide position, corresponding to 210-215 amino acid positions of SARS-CoV-2 S protein. This polymorphic hotspot was comprised of adjacent 3-base deletion (5'-ATT-3'; Spike_N211del) and 9-base insertion (5'-AGCCAGAAG-3'; Spike_ins214EPE). By performing NCBI BLAST search and GISAID database search, we identified several types of tandem repeats of the 9-base insertion, creating an 18-base insertion (Spike_ins214EPEEPE, Spike_ins214EPDEPE). The results of the searches suggested that the two-cycle tandem repeats of the 9-base insertion were created in November 2021 in Central Europe, whereas the emergence of the original one-cycle 9-base insertion (Spike_ins214EPE) would date back to the middle of 2020 and was away from the Central Europe. The identified 18-base insertions based on 2-cycle tandem repeat of the 9-base insertion were collected between November 2021 and April 2022, suggesting that these mutations could not survive and have been already eliminated.
Discussion: The GISAID database search implied that this polymorphic indel hotspot to be with one of the highest tolerability for incorporating indels in SARS-CoV-2 S gene. In summary, the present study identified a variable number of tandem repeat of 9-base insertion in the N-terminal domain of SARS-CoV-2 S gene, and the repeat could have occurred at different time from the insertion of the original 9-base insertion.
Keywords: BLAST search; GISAID; N-terminal domain; insertions/deletions; severe acute respiratory syndrome coronavirus 2; spike gene; variable number tandem repeats.
Copyright © 2023 Akaishi, Fujiwara and Ishii.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures



Similar articles
-
Insertion/deletion hotspots in the Nsp2, Nsp3, S1, and ORF8 genes of SARS-related coronaviruses.BMC Ecol Evol. 2022 Oct 28;22(1):123. doi: 10.1186/s12862-022-02078-7. BMC Ecol Evol. 2022. PMID: 36307763 Free PMC article.
-
Insertion-and-Deletion Mutations between the Genomes of SARS-CoV, SARS-CoV-2, and Bat Coronavirus RaTG13.Microbiol Spectr. 2022 Jun 29;10(3):e0071622. doi: 10.1128/spectrum.00716-22. Epub 2022 Jun 6. Microbiol Spectr. 2022. PMID: 35658573 Free PMC article.
-
Emergence, evolution, and vaccine production approaches of SARS-CoV-2 virus: Benefits of getting vaccinated and common questions.Saudi J Biol Sci. 2022 Apr;29(4):1981-1997. doi: 10.1016/j.sjbs.2021.12.020. Epub 2021 Dec 13. Saudi J Biol Sci. 2022. PMID: 34924802 Free PMC article. Review.
-
Insertion and deletion mutations preserved in SARS-CoV-2 variants.Arch Microbiol. 2023 Mar 31;205(4):154. doi: 10.1007/s00203-023-03493-0. Arch Microbiol. 2023. PMID: 37000302 Free PMC article.
-
Genetic Recombination Sites Away from the Insertion/Deletion Hotspots in SARS-Related Coronaviruses.Tohoku J Exp Med. 2022 Dec 13;259(1):17-26. doi: 10.1620/tjem.2022.J093. Epub 2022 Nov 10. Tohoku J Exp Med. 2022. PMID: 36351613
References
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous