Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jan 8;49(D1):D706-D714.
doi: 10.1093/nar/gkaa808.

GESS: a database of global evaluation of SARS-CoV-2/hCoV-19 sequences

Affiliations

GESS: a database of global evaluation of SARS-CoV-2/hCoV-19 sequences

Shuyi Fang et al. Nucleic Acids Res. .

Abstract

The COVID-19 outbreak has become a global emergency since December 2019. Analysis of SARS-CoV-2 sequences can uncover single nucleotide variants (SNVs) and corresponding evolution patterns. The Global Evaluation of SARS-CoV-2/hCoV-19 Sequences (GESS, https://wan-bioinfo.shinyapps.io/GESS/) is a resource to provide comprehensive analysis results based on tens of thousands of high-coverage and high-quality SARS-CoV-2 complete genomes. The database allows user to browse, search and download SNVs at any individual or multiple SARS-CoV-2 genomic positions, or within a chosen genomic region or protein, or in certain country/area of interest. GESS reveals geographical distributions of SNVs around the world and across the states of USA, while exhibiting time-dependent patterns for SNV occurrences which reflect development of SARS-CoV-2 genomes. For each month, the top 100 SNVs that were firstly identified world-widely can be retrieved. GESS also explores SNVs occurring simultaneously with specific SNVs of user's interests. Furthermore, the database can be of great help to calibrate mutation rates and identify conserved genome regions. Taken together, GESS is a powerful resource and tool to monitor SARS-CoV-2 migration and evolution according to featured genomic variations. It provides potential directive information for prevalence prediction, related public health policy making, and vaccine designs.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Overview of GESS database. (A) The infrastructure of GESS database. The central part is individual SNV analysis and concurrence of SNVs. (B) Home page of GESS database. Besides a brief description of the website, the ‘Home’ page includes distributions of viral genome numbers around the world, or in the USA, or within each month. Several subpages contain different functions under the ‘Home’ page, in addition to the ‘Tutorial’ with the introduction to the use of GESS.
Figure 2.
Figure 2.
An example to show the process of data mining via GESS. (A) World map of distribution of the mutation A23403G (S:D614G). (B) Table including event counts of A23403G in different counties/areas. (C) Temporal occurrence ratios of A23403G along the time. (D) SNVs in concurrence with A23403G (identified in at least 1,000 samples with the ratio larger than 0.9).
Figure 3.
Figure 3.
Function of ‘SNV birth query’. (A) Word cloud of new SNVs in March 2020. (B) Table listing corresponding information for new SNVs in March 2020. (C) Word cloud of new SNVs and (D) information of new SNVs in June 2020.
Figure 4.
Figure 4.
Examples to use GESS to calibrate sequence conservation for vaccine design. (A) Helper T lymphocyte-based epitope LLLQYGSFCTQLNRA on the genomic region: 23 816 – 23 860 (Spike). The y-axis in the dot plot represents the numbers (base-10 log scale) of viral genomes with point mutations at each position. Nucleotides and amino acids are marked, respectively, under the dot plot at corresponding genomic locations. (B) B lymphocyte epitope, QGEIKDATPSDF, within the viral genome: 25 441 – 25 476 (ORF3a). The screenshots were taken from GESS.

References

    1. Poh C.M., Carissimo G., Wang B., Amrun S.N., Lee C.Y., Chee R.S., Fong S.W., Yeo N.K., Lee W.H., Torres-Ruesta A. et al.. Two linear epitopes on the SARS-CoV-2 spike protein that elicit neutralising antibodies in COVID-19 patients. Nat. Commun. 2020; 11:2806. - PMC - PubMed
    1. Yang X., Dong N., Chan E.W., Chen S.. Genetic cluster analysis of SARS-CoV-2 and the identification of those responsible for the major outbreaks in various countries. Emerg. Microbes Infect. 2020; 9:1287–1299. - PMC - PubMed
    1. Hu Y., Riley L.W.. Dissemination and co-circulation of SARS-CoV2 subclades exhibiting enhanced transmission associated with increased mortality in Western Europe and the United States. 2020; medRxiv doi:15 July 2020, preprint: not peer reviewed 10.1101/2020.07.13.20152959. - DOI
    1. Kim S.J., Nguyen V.G., Park Y.H., Park B.K., Chung H.C.. A novel synonymous mutation of SARS-CoV-2: is this possible to affect their antigenicity and immunogenicity. Vaccines (Basel). 2020; 8:220. - PMC - PubMed
    1. Daniloski Z., Guo X., Sanjana N.E.. The D614G mutation in SARS-CoV-2 Spike increases transduction of multiple human cell types. 2020; bioRxiv doi:15 June 2020, preprint: not peer reviewed 10.1101/2020.06.14.151357. - DOI - PMC - PubMed

Publication types

MeSH terms