The CATH hierarchy revisited-structural divergence in domain superfamilies and the continuity of fold space
- PMID: 19679085
- PMCID: PMC2741583
- DOI: 10.1016/j.str.2009.06.015
The CATH hierarchy revisited-structural divergence in domain superfamilies and the continuity of fold space
Abstract
This paper explores the structural continuum in CATH and the extent to which superfamilies adopt distinct folds. Although most superfamilies are structurally conserved, in some of the most highly populated superfamilies (4% of all superfamilies) there is considerable structural divergence. While relatives share a similar fold in the evolutionary conserved core, diverse elaborations to this core can result in significant differences in the global structures. Applying similar protocols to examine the extent to which structural overlaps occur between different fold groups, it appears this effect is confined to just a few architectures and is largely due to small, recurring super-secondary motifs (e.g., alphabeta-motifs, alpha-hairpins). Although 24% of superfamilies overlap with superfamilies having different folds, only 14% of nonredundant structures in CATH are involved in overlaps. Nevertheless, the existence of these overlaps suggests that, in some regions of structure space, the fold universe should be seen as more continuous.
Figures










Similar articles
-
The CATH classification revisited--architectures reviewed and new ways to characterize structural divergence in superfamilies.Nucleic Acids Res. 2009 Jan;37(Database issue):D310-4. doi: 10.1093/nar/gkn877. Epub 2008 Nov 7. Nucleic Acids Res. 2009. PMID: 18996897 Free PMC article.
-
Structural diversity of domain superfamilies in the CATH database.J Mol Biol. 2006 Jul 14;360(3):725-41. doi: 10.1016/j.jmb.2006.05.035. Epub 2006 Jun 2. J Mol Biol. 2006. PMID: 16780872
-
Extending CATH: increasing coverage of the protein structure universe and linking structure with function.Nucleic Acids Res. 2011 Jan;39(Database issue):D420-6. doi: 10.1093/nar/gkq1001. Epub 2010 Nov 19. Nucleic Acids Res. 2011. PMID: 21097779 Free PMC article.
-
Diversity in protein domain superfamilies.Curr Opin Genet Dev. 2015 Dec;35:40-9. doi: 10.1016/j.gde.2015.09.005. Epub 2015 Nov 3. Curr Opin Genet Dev. 2015. PMID: 26451979 Free PMC article. Review.
-
The structure of the protein universe and genome evolution.Nature. 2002 Nov 14;420(6912):218-23. doi: 10.1038/nature01256. Nature. 2002. PMID: 12432406 Review.
Cited by
-
From protein sequences to 3D-structures and beyond: the example of the UniProt knowledgebase.Cell Mol Life Sci. 2010 Apr;67(7):1049-64. doi: 10.1007/s00018-009-0229-6. Epub 2009 Dec 31. Cell Mol Life Sci. 2010. PMID: 20043185 Free PMC article. Review.
-
DALI and the persistence of protein shape.Protein Sci. 2020 Jan;29(1):128-140. doi: 10.1002/pro.3749. Epub 2019 Nov 5. Protein Sci. 2020. PMID: 31606894 Free PMC article.
-
Protein folds and protein folding.Protein Eng Des Sel. 2011 Jan;24(1-2):11-9. doi: 10.1093/protein/gzq096. Epub 2010 Nov 3. Protein Eng Des Sel. 2011. PMID: 21051320 Free PMC article. Review.
-
An Algebro-topological description of protein domain structure.PLoS One. 2011;6(5):e19670. doi: 10.1371/journal.pone.0019670. Epub 2011 May 24. PLoS One. 2011. PMID: 21629687 Free PMC article.
-
Longitudinal genomic surveillance of Plasmodium falciparum malaria parasites reveals complex genomic architecture of emerging artemisinin resistance.Genome Biol. 2017 Apr 28;18(1):78. doi: 10.1186/s13059-017-1204-4. Genome Biol. 2017. PMID: 28454557 Free PMC article.
References
-
- Chandonia J.M., Brenner S.E. Implications of structural genomics target selection strategies: Pfam5000, whole genome, and random approaches. Proteins. 2005;58:166–179. - PubMed
-
- Dengler U., Siddiqui A.S., Barton G.J. Protein structural domains: analysis of the 3Dee domains database. Proteins. 2001;42:332–344. - PubMed
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources