Functional classification of CATH superfamilies: a domain-based approach for protein function annotation
- PMID: 26139634
- PMCID: PMC4612221
- DOI: 10.1093/bioinformatics/btv398
Functional classification of CATH superfamilies: a domain-based approach for protein function annotation
Erratum in
-
Functional classification of CATH superfamilies: a domain-based approach for protein function annotation.Bioinformatics. 2016 Sep 15;32(18):2889. doi: 10.1093/bioinformatics/btw473. Epub 2016 Jul 31. Bioinformatics. 2016. PMID: 27477482 Free PMC article. No abstract available.
Abstract
Motivation: Computational approaches that can predict protein functions are essential to bridge the widening function annotation gap especially since <1.0% of all proteins in UniProtKB have been experimentally characterized. We present a domain-based method for protein function classification and prediction of functional sites that exploits functional sub-classification of CATH superfamilies. The superfamilies are sub-classified into functional families (FunFams) using a hierarchical clustering algorithm supervised by a new classification method, FunFHMMer.
Results: FunFHMMer generates more functionally coherent groupings of protein sequences than other domain-based protein classifications. This has been validated using known functional information. The conserved positions predicted by the FunFams are also found to be enriched in known functional residues. Moreover, the functional annotations provided by the FunFams are found to be more precise than other domain-based resources. FunFHMMer currently identifies 110,439 FunFams in 2735 superfamilies which can be used to functionally annotate>16 million domain sequences.
Availability and implementation: All FunFam annotation data are made available through the CATH webpages (http://www.cathdb.info). The FunFHMMer webserver (http://www.cathdb.info/search/by_funfhmmer) allows users to submit query sequences for assignment to a CATH FunFam.
Contact: sayoni.das.12@ucl.ac.uk
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press.
Figures





Similar articles
-
New functional families (FunFams) in CATH to improve the mapping of conserved functional sites to 3D structures.Nucleic Acids Res. 2013 Jan;41(Database issue):D490-8. doi: 10.1093/nar/gks1211. Epub 2012 Nov 29. Nucleic Acids Res. 2013. PMID: 23203873 Free PMC article.
-
CATH FunFHMMer web server: protein functional annotations using functional family assignments.Nucleic Acids Res. 2015 Jul 1;43(W1):W148-53. doi: 10.1093/nar/gkv488. Epub 2015 May 11. Nucleic Acids Res. 2015. PMID: 25964299 Free PMC article.
-
CATH: increased structural coverage of functional space.Nucleic Acids Res. 2021 Jan 8;49(D1):D266-D273. doi: 10.1093/nar/gkaa1079. Nucleic Acids Res. 2021. PMID: 33237325 Free PMC article.
-
Protein function annotation using protein domain family resources.Methods. 2016 Jan 15;93:24-34. doi: 10.1016/j.ymeth.2015.09.029. Epub 2015 Oct 3. Methods. 2016. PMID: 26434392 Review.
-
The history of the CATH structural classification of protein domains.Biochimie. 2015 Dec;119:209-17. doi: 10.1016/j.biochi.2015.08.004. Epub 2015 Aug 4. Biochimie. 2015. PMID: 26253692 Free PMC article. Review.
Cited by
-
Intelligent Protein Design and Molecular Characterization Techniques: A Comprehensive Review.Molecules. 2023 Nov 30;28(23):7865. doi: 10.3390/molecules28237865. Molecules. 2023. PMID: 38067593 Free PMC article. Review.
-
DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier.Bioinformatics. 2018 Feb 15;34(4):660-668. doi: 10.1093/bioinformatics/btx624. Bioinformatics. 2018. PMID: 29028931 Free PMC article.
-
Eusocial Transition in Blattodea: Transposable Elements and Shifts of Gene Expression.Genes (Basel). 2022 Oct 26;13(11):1948. doi: 10.3390/genes13111948. Genes (Basel). 2022. PMID: 36360186 Free PMC article.
-
Assigning protein function from domain-function associations using DomFun.BMC Bioinformatics. 2022 Jan 15;23(1):43. doi: 10.1186/s12859-022-04565-6. BMC Bioinformatics. 2022. PMID: 35033002 Free PMC article.
-
Predictive metabolomic profiling of microbial communities using amplicon or metagenomic sequences.Nat Commun. 2019 Jul 17;10(1):3136. doi: 10.1038/s41467-019-10927-1. Nat Commun. 2019. PMID: 31316056 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous