Infect Genet Evol. 2021 Aug:92:104872.
doi: 10.1016/j.meegid.2021.104872. Epub 2021 Apr 24.

SARS-CoV-2 genomic surveillance in Costa Rica: Evidence of a divergent population and an increased detection of a spike T1117I mutation

Jose Arturo Molina-Mora et al. Infect Genet Evol. 2021 Aug.

Abstract

Genome sequencing is a key strategy in the surveillance of SARS-CoV-2, the virus responsible for the COVID-19 pandemic. Latin America is the hardest-hit region of the world, accumulating almost 20% of COVID-19 cases worldwide. In Costa Rica, almost 170,000 cases were reported from the first detected case on March 6th to December 31st, 2020. We analyzed the genomic variability during the SARS-CoV-2 pandemic in Costa Rica using 185 sequences: 52 from the first months of the pandemic and 133 from the current wave. Three GISAID clades (G, GH, and GR) and three PANGOLIN lineages (B.1, B.1.1, and B.1.291) were predominant, suggesting multiple re-introductions from other regions. The whole-genome variant calling analysis identified a total of 283 distinct nucleotide variants whose frequencies followed a power-law distribution: 190 single-nucleotide mutations were each found in a single sequence, and only 16 mutations were found in >5% of sequences. These mutations were distributed throughout the whole genome. The prevalence of the worldwide-found variants Spike D614G (98.9% in Costa Rica) and ORF8 L84S (1.1%) is similar to what is found elsewhere. Interestingly, the frequency of the Spike mutation T1117I has increased in Costa Rica during the current pandemic wave, which began in May 2020, reaching 29.2% detection in the full-genome analyses in November 2020. This variant was observed in less than 1% of the sequences reported to GISAID worldwide in 2020. Structural modeling of the Spike protein with the T1117I mutation suggests a potential effect on the viral oligomerization needed for cell infection, but no differences from other genomes in transmissibility, severity, or vaccine effectiveness are predicted.
In conclusion, genome analyses of SARS-CoV-2 sequences over the course of the COVID-19 pandemic in Costa Rica suggest the introduction of lineages from other countries and reveal mutations in line with other studies, while highlighting the local increase in the detection of the Spike T1117I variant. The genomic features of this virus need to be monitored and studied in further analyses as part of the surveillance program during the pandemic.
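The frequency figures quoted in the abstract come from tallying, for each variant, the share of genomes that carry it. The sketch below illustrates that kind of tally; it is not the authors' pipeline. The function names, the variant labels, and the four-genome toy dataset are all hypothetical. Only the counts 183, 54, and 185 come from the abstract and figure captions.

```python
from collections import Counter


def variant_frequencies(genomes):
    """Map each variant to the fraction of genomes that carry it.

    `genomes` maps a genome ID to the set of variants called in it
    (a hypothetical input layout, not the format used in the study).
    """
    n = len(genomes)
    counts = Counter(v for variants in genomes.values() for v in set(variants))
    return {variant: count / n for variant, count in counts.items()}


def common_variants(freqs, threshold=0.05):
    """Return the variants whose frequency is strictly above `threshold`."""
    return sorted(v for v, f in freqs.items() if f > threshold)


def percent(count, total):
    """Express count/total as a percentage rounded to one decimal place."""
    return round(100 * count / total, 1)


# Hypothetical toy data: four genomes, three variants.
toy = {
    "g1": {"S:D614G", "S:T1117I"},
    "g2": {"S:D614G"},
    "g3": {"S:D614G", "ORF8:L84S"},
    "g4": {"S:D614G", "S:T1117I"},
}
freqs = variant_frequencies(toy)

# Counts reported in the source: 183 and 54 genomes out of 185.
d614g_pct = percent(183, 185)
t1117i_pct = percent(54, 185)
```

With the reported counts, `percent(183, 185)` and `percent(54, 185)` reproduce the 98.9% and 29.2% stated for the most frequent variants and for Spike T1117I, respectively.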

Keywords: COVID-19; Costa Rica; Genomic surveillance; Pandemic; SARS-CoV-2.

Conflict of interest statement

The authors declare that there is no conflict of interest.

Figures

Graphical abstract
Fig. 1
Dynamic, geographic and temporal distribution of SARS-CoV-2 genomes from Costa Rican cases. (A) An exponential increase in COVID-19 cases has been reported in Costa Rica since March 2020, with a similar profile for reported deaths (B). Samples for sequencing were obtained from the whole country, mainly from the Central Valley, the most populated region of the country (C). The image in (C) was obtained from the Microreact tool (https://microreact.org/project/r7tcnUYgWMRJ5Fdssvv7VZ).
Fig. 2
Variant calling analysis of SARS-CoV-2 genomes from Costa Rican cases of COVID-19. (A) Presence/absence of 283 different variants among 185 genomes. A few variants are widely distributed among genomes (*F = Frequency), and many variants are present in only a single genome. (B) Distribution and cumulative percentage of variant frequency among genomes. Most variants are low-frequency mutations, and only 16 variants are present in at least 5% of genomes (9). The most frequent variants (Spike D614G and ORF1a P4715L) are present in 183 (98.9%) genomes (arrow). The 16 variants are distributed along the SARS-CoV-2 genome, as shown in (C).
Fig. 3
Frequency of T1117I in the spike over time in Costa Rica and the world. A notable increase in the mutation has been reported in Costa Rica since May 2020, in contrast with its prevalence around the world, which remains relatively constant and low (A-B). The map in (B) was obtained from the GISAID database.
Fig. 4
Structural modeling of the spike protein of SARS-CoV-2. Variant D614G is present in 98.6% of the genomes in this study and is also predominant worldwide (>90%, GISAID). D614G could affect the interaction with the host, as well as the immune response (vaccines), but its real effects remain unclear. The variant T1117I is very rarely reported worldwide (0.08%, GISAID), but its frequency in Costa Rica is 29.2%. The possible effect of this variant on the function of the spike is unknown.
Fig. 5
Phylogenetic tree of SARS-CoV-2 genomes circulating in Costa Rica. Three GISAID clades (G, GH and GR) and three Pangolin lineages (B.1, B.1.1, and B.1.291) are dominant in Costa Rican cases. Deaths are similarly distributed across clades. Variant T1117I in the spike is present in 54 genomes, which belong to a separate monophyletic cluster (dark blue). Other variants are shown in different colors.
