Nucleic Acids Res. 2022 Jan 7;50(D1):D571-D577.
doi: 10.1093/nar/gkab1045.

The carbohydrate-active enzyme database: functions and literature

Elodie Drula et al. Nucleic Acids Res.

Abstract

Thirty years have elapsed since the emergence of the classification of carbohydrate-active enzymes in sequence-based families that became the CAZy database over 20 years ago, freely available for browsing and download at www.cazy.org. In the era of large-scale sequencing and high-throughput biology, it is important to examine the position of this specialist database that is deeply rooted in human curation. The three primary tasks of the CAZy curators are (i) to maintain and update the family classification of this class of enzymes, (ii) to classify sequences newly released by GenBank and the Protein Data Bank and (iii) to capture and present functional information for each family. The CAZy website is updated once a month. Here we briefly summarize the increase in novel families and the annotations conducted during the last 8 years. We present several important changes that facilitate taxonomic navigation and allow the entirety of the annotations to be downloaded. Most importantly, we highlight the considerable amount of work that accompanies the analysis and reporting of biochemical data from the literature.


Figures

Figure 1.
The new headers of CAZy (sub)family webpages. Boxes highlight the novel features: (1) direct access to the form to report functional characterization(s); (2) download link to the complete list of protein accessions and their CAZy modules; (3) for subfamilies with characterized members, only the subfamily-specific functions are now listed; (4) taxonomic tabs have been removed and a ‘Download’ tab gives access to a text file corresponding to the previous ‘ALL’ tab, with the complete list of protein accessions belonging to the (sub)family; (5) a ‘Taxonomic display’ tab links to a Krona visualization of the family members as illustrated in Figure 2; (6) links to the publications that describe the functions of the enzymes are now given (preferentially PubMed, otherwise DOI or occasionally URL).
Figure 2.
Krona chart browsing into family GH5. After navigation through the taxonomic levels, this figure shows the display of the Basidiomycota, in bold in the center. The central part also shows the results of the text search performed from the top form with the string ‘mi’. Results are highlighted within the Basidiomycota, where the two matching genomes at right have their sector highlighted and the ‘mi’ string on a yellow background, and also in the levels above: the center shows, for example, 11 results in Dikarya. All sectors are color-coded according to the number of GH5 modules in the complete genome, and one sector is selected, the genome of Ceratobasidium sp. AG-Ba JN. Once a sector is selected, various links to CAZy or NCBI appear at the top right, and multiple pie charts illustrate the representativeness of this genome in its taxonomic lineage.


