Glycobiology. 2021 Aug 7;31(7):719-723.
doi: 10.1093/glycob/cwab003.

O-GlcNAcAtlas: A database of experimentally identified O-GlcNAc sites and proteins

Junfeng Ma et al. Glycobiology.

Abstract

O-linked β-N-acetylglucosamine (O-GlcNAc) is a post-translational modification (i.e., O-GlcNAcylation) on the serine/threonine residues of proteins. As a unique intracellular monosaccharide modification, protein O-GlcNAcylation plays important roles in almost all biochemical processes examined. Aberrant O-GlcNAcylation underlies the etiologies of a number of chronic diseases. With the tremendous improvement of techniques, thousands of proteins along with their O-GlcNAc sites have been reported. However, until now, there have been few databases dedicated to accommodating the rapid accumulation of such information. Thus, O-GlcNAcAtlas was created to integrate all experimentally identified O-GlcNAc sites and proteins. O-GlcNAcAtlas consists of two datasets (Dataset-I and Dataset-II, for unambiguously identified sites and ambiguously identified sites, respectively), representing a total of 4571 O-GlcNAc-modified proteins from all species studied from 1984 to 31 Dec 2019. For each protein, comprehensive information (including species, sample type, gene symbol, modified peptides and/or modification sites, site mapping methods and literature references) is provided. To resolve the heterogeneity among the data collected from different sources, the sequence identity of these reported O-GlcNAc peptides is mapped to the UniProtKB protein entries. To our knowledge, O-GlcNAcAtlas is a highly comprehensive and rigorously curated database encapsulating all O-GlcNAc sites and proteins identified in the past 35 years. We expect that O-GlcNAcAtlas will be a useful resource to facilitate O-GlcNAc studies and computational analyses of protein O-GlcNAcylation. The public version of the web interface to O-GlcNAcAtlas can be found at http://oglcnac.org/.

Keywords: O-GlcNAc; database; proteomics.
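
The abstract states that reported O-GlcNAc peptides are mapped to UniProtKB protein entries to reconcile data from different sources. The abstract does not describe how that mapping is performed, so the following is only a minimal sketch of one plausible approach: exact substring matching of a peptide against a protein sequence, converting peptide-relative offsets of modified residues into 1-based protein positions. The names `MappedSite` and `map_peptide_sites` and the toy sequence are hypothetical and are not taken from O-GlcNAcAtlas or UniProtKB.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class MappedSite:
    """A modified residue located on a protein sequence (hypothetical type)."""
    residue: str   # one-letter amino acid code, expected to be S or T
    position: int  # 1-based position within the protein sequence


def map_peptide_sites(protein_seq: str, peptide: str,
                      site_offsets: List[int]) -> List[MappedSite]:
    """Map modified residues from peptide coordinates to protein coordinates.

    Assumption: mapping is done by exact substring match. `site_offsets`
    holds 0-based offsets of modified residues within `peptide`.
    Every occurrence of the peptide in the protein is reported, so a
    peptide that matches more than once yields more than one candidate
    position per site. An unmatched peptide yields an empty list.
    """
    for offset in site_offsets:
        if not 0 <= offset < len(peptide):
            raise ValueError(f"offset {offset} is outside the peptide")
        if peptide[offset] not in "ST":
            raise ValueError(
                f"residue {peptide[offset]!r} at offset {offset} "
                "is not serine or threonine"
            )

    sites: List[MappedSite] = []
    start = protein_seq.find(peptide)
    while start != -1:
        for offset in site_offsets:
            sites.append(MappedSite(peptide[offset], start + offset + 1))
        # Continue searching one residue later to catch overlapping matches.
        start = protein_seq.find(peptide, start + 1)
    return sites


# Toy sequence for illustration only; it is not a real protein.
toy_protein = "MAGSTPLKSVTR"
print(map_peptide_sites(toy_protein, "TPLKS", [4]))
# [MappedSite(residue='S', position=9)]
```

Exact matching keeps the sketch short; a real curation pipeline would also need to handle isoforms, sequence revisions, and peptides shared by several proteins, none of which are covered here.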

Figures

Fig 1
(A) Assembly of experimentally identified O-GlcNAc sites and proteins for a comprehensive database O-GlcNAcAtlas. (B) A snapshot for searching O-GlcNAcAtlas, with “microtubule-associated protein tau” as an example. Shown here are tabular results for all the matched entries with links to UniProtKB (right panel) and the main display page with detailed annotation and links to PubMed (left panel).
