Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 13;32(8):679-690.
doi: 10.1093/glycob/cwac022.

CarboGrove: a resource of glycan-binding specificities through analyzed glycan-array datasets from all platforms

Affiliations

CarboGrove: a resource of glycan-binding specificities through analyzed glycan-array datasets from all platforms

Zachary L Klamer et al. Glycobiology. .

Abstract

Glycan arrays continue to be the primary resource for determining the glycan-binding specificity of proteins. The volume and diversity of glycan-array data are increasing, but no common method and resource exist to analyze, integrate, and use the available data. To meet this need, we developed a resource of analyzed glycan-array data called CarboGrove. Using the ability to process and interpret data from any type of glycan array, we populated the database with the results from 35 types of glycan arrays, 13 glycan families, 5 experimental methods, and 19 laboratories or companies. In meta-analyses of glycan-binding proteins, we observed glycan-binding specificities that were not uncovered from single sources. In addition, we confirmed the ability to efficiently optimize selections of glycan-binding proteins to be used in experiments for discriminating between closely related motifs. Through descriptive reports and a programmatically accessible Application Programming Interface, CarboGrove yields unprecedented access to the wealth of glycan-array data being produced and powerful capabilities for both experimentalists and bioinformaticians.

Keywords: binding specificity; database; glycan-binding protein; lectin; microarray.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Diversity of glycan-array technology and content. A) Several technologies in addition to the planar array are now used to probe glycan arrays. The arrays differ in their modes of glycan or protein presentation and in methods of quantification. B) The sets of glycans contained in the arrays represent diverse types of structures and organisms. C) The rate of development of new arrays has increased since 2016, punctuated by significant advances in glycan-synthesis technology (red text). Monosaccharide symbols follow the SNFG (Symbol Nomenclature for Glycans) system (PMID 26543186, Glycobiology 25: 1323–1324, 2015) details at NCBI.
Fig. 2
Fig. 2
Standardized analysis and output. A) MotifFinder analyzes glycan-array data from multiple incubation concentrations, where available, of a given glycan-binding protein. B) The program identifies the family of motifs that represents the specificity of the protein, organized into primary motifs and the subtrees of fine-specificity motifs. C) Among the several visualizations in the output are tabular descriptions and binding curves for each motif. Monosaccharide symbols follow the SNFG (Symbol Nomenclature for Glycans) system (PMID 26543186, Glycobiology 25: 1323–1324, 2015) details at NCBI.
Fig. 3
Fig. 3
Breadth of representation of array types of glycan-binding specificities. A) The collection covers a wide variety of glycan families, array providers, and technologies. B) Motif coverage across the arrays. Nearly all motifs are represented on at least 1 array. C) Motif coverage of the glycan-binding proteins.
Fig. 4
Fig. 4
Comparison across multiple arrays of results for SNA. A) Reliability score for each of the arrays, where higher scores indicate lower dataset noise. B) CarboGrove reported motifs for each array. Within each array, the motifs are ordered from top to bottom by the relative-binding score given next to the ID and graphical representation of the motif. The monosaccharide symbols follow the SNFG (Symbol Nomenclature for Glycans) system (PMID 26543186, Glycobiology 25: 1323–1324, 2015) details at NCBI.
Fig. 5
Fig. 5
Searches for glycan-binding proteins that are specific for selected motifs. A) The motif scores for 118 motifs (listed in the columns) and 785 glycan-binding proteins were hierarchically clustered. The search motifs and top hits are labeled. B) For the top 4 glycan-binding proteins from each search, the proteins’ top motifs and their relative binding scores are indicated. Monosaccharide symbols follow the SNFG (Symbol Nomenclature for Glycans) system (PMID 26543186, Glycobiology 25: 1323–1324, 2015) details at NCBI.
Fig. 6
Fig. 6
Searches for glycan-binding proteins that are specific for selected motifs. A) Using models downloaded from CarboGrove that bind any of the comparison motifs, we predicted binding to a series of glycans generated in MotifFinder that contain the motifs, as well as negative-control glycans. B) In the reduced set of 4 lectins (GSL-II, HAA, VVL, and PHA-E), the average binding to the glycans in each group shows a different pattern for each group. The differences in top motifs for the lectins correspond to the differences in binding patterns. Monosaccharide symbols follow the SNFG (Symbol Nomenclature for Glycans) system (PMID 26543186, Glycobiology 25: 1323–1324, 2015) details at NCBI.

Similar articles

Cited by

References

    1. Blixt O, Head S, Mondala T, Scanlan C, Huflejt ME, Alvarez R, Bryan MC, Fazio F, Calarese D, Stevens J, et al. Printed covalent glycan array for ligand profiling of diverse glycan binding proteins. Proc Natl Acad Sci U S A. 2004:101(49):17033–17038. - PMC - PubMed
    1. Bojar D, Meche L, Meng G, Eng W, Smith DF, Cummings RD, Mahal LK. A useful guide to lectin binding: machine-learning directed annotation of 57 unique lectin specificities. ACS Chem Biol. 2022. doi: 10.1021/acschembio.1c00689. - DOI - PMC - PubMed
    1. Bonnardel F, Mariethoz J, Salentin S, Robin X, Schroeder M, Perez S, Lisacek F, Imberty A. UniLectin3D, a database of carbohydrate binding proteins with curated information on 3D structures and interacting ligands. Nucleic Acids Res. 2019:47(D1):D1236–D1244. - PMC - PubMed
    1. Bryan MC, Fazio F, Lee H-K, Huang C-Y, Chang A, Best MD, Calarese DA, Blixt O, Paulson JC, Burton D, et al. Covalent display of oligosaccharide arrays in microtiter plates. J Am Chem Soc. 2004:126(28):8640–8641. - PubMed
    1. Büll C, Nason R, Sun L, Coillie JV, Sørensen DM, Moons SJ, Yang Z, Arbitman S, Fernandes SM, Furukawa S, et al. Probing the binding specificities of human Siglecs by cell-based glycan arrays. Proc National Acad Sci. 2021:118(17):e2026102118. doi: 10.1073/pnas.2026102118. - DOI - PMC - PubMed

Publication types