CATH--a hierarchic classification of protein domain structures
- PMID: 9309224
- DOI: 10.1016/s0969-2126(97)00260-8
CATH--a hierarchic classification of protein domain structures
Abstract
Background: Protein evolution gives rise to families of structurally related proteins, within which sequence identities can be extremely low. As a result, structure-based classifications can be effective at identifying unanticipated relationships in known structures and in optimal cases function can also be assigned. The ever increasing number of known protein structures is too large to classify all proteins manually, therefore, automatic methods are needed for fast evaluation of protein structures.
Results: We present a semi-automatic procedure for deriving a novel hierarchical classification of protein domain structures (CATH). The four main levels of our classification are protein class (C), architecture (A), topology (T) and homologous superfamily (H). Class is the simplest level, and it essentially describes the secondary structure composition of each domain. In contrast, architecture summarises the shape revealed by the orientations of the secondary structure units, such as barrels and sandwiches. At the topology level, sequential connectivity is considered, such that members of the same architecture might have quite different topologies. When structures belonging to the same T-level have suitably high similarities combined with similar functions, the proteins are assumed to be evolutionarily related and put into the same homologous superfamily.
Conclusions: Analysis of the structural families generated by CATH reveals the prominent features of protein structure space. We find that nearly a third of the homologous superfamilies (H-levels) belong to ten major T-levels, which we call superfolds, and furthermore that nearly two-thirds of these H-levels cluster into nine simple architectures. A database of well-characterised protein structure families, such as CATH, will facilitate the assignment of structure-function/evolution relationships to both known and newly determined protein structures.
Similar articles
-
Structural diversity of domain superfamilies in the CATH database.J Mol Biol. 2006 Jul 14;360(3):725-41. doi: 10.1016/j.jmb.2006.05.035. Epub 2006 Jun 2. J Mol Biol. 2006. PMID: 16780872
-
The CATH classification revisited--architectures reviewed and new ways to characterize structural divergence in superfamilies.Nucleic Acids Res. 2009 Jan;37(Database issue):D310-4. doi: 10.1093/nar/gkn877. Epub 2008 Nov 7. Nucleic Acids Res. 2009. PMID: 18996897 Free PMC article.
-
The CATH Database provides insights into protein structure/function relationships.Nucleic Acids Res. 1999 Jan 1;27(1):275-9. doi: 10.1093/nar/27.1.275. Nucleic Acids Res. 1999. PMID: 9847200 Free PMC article.
-
Protein folds, functions and evolution.J Mol Biol. 1999 Oct 22;293(2):333-42. doi: 10.1006/jmbi.1999.3054. J Mol Biol. 1999. PMID: 10529349 Review.
-
The history of the CATH structural classification of protein domains.Biochimie. 2015 Dec;119:209-17. doi: 10.1016/j.biochi.2015.08.004. Epub 2015 Aug 4. Biochimie. 2015. PMID: 26253692 Free PMC article. Review.
Cited by
-
A pharmacological organization of G protein-coupled receptors.Nat Methods. 2013 Feb;10(2):140-6. doi: 10.1038/nmeth.2324. Epub 2013 Jan 6. Nat Methods. 2013. PMID: 23291723 Free PMC article.
-
Manual classification strategies in the ECOD database.Proteins. 2015 Jul;83(7):1238-51. doi: 10.1002/prot.24818. Epub 2015 May 8. Proteins. 2015. PMID: 25917548 Free PMC article.
-
In silico Functional Annotation and Characterization of Hypothetical Proteins from Serratia marcescens FGI94.Biol Bull Russ Acad Sci. 2020;47(4):319-331. doi: 10.1134/S1062359020300019. Epub 2020 Jul 31. Biol Bull Russ Acad Sci. 2020. PMID: 32834707 Free PMC article.
-
A community proposal to integrate structural bioinformatics activities in ELIXIR (3D-Bioinfo Community).F1000Res. 2020 Apr 22;9:ELIXIR-278. doi: 10.12688/f1000research.20559.1. eCollection 2020. F1000Res. 2020. PMID: 32566135 Free PMC article.
-
Molecular modeling, molecular dynamics simulation, and essential dynamics analysis of grancalcin: An upregulated biomarker in experimental autoimmune encephalomyelitis mice.Heliyon. 2022 Oct 23;8(10):e11232. doi: 10.1016/j.heliyon.2022.e11232. eCollection 2022 Oct. Heliyon. 2022. PMID: 36340004 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous