Automatic definition of recurrent local structure motifs in proteins
- PMID: 2342110
- DOI: 10.1016/S0022-2836(05)80194-9
Automatic definition of recurrent local structure motifs in proteins
Abstract
An automatic procedure for defining recurrent folding motifs in proteins of known structure is described. These motifs are formed by short polypeptide fragments of equal size containing between four and seven residues. The method applies a classical clustering algorithm that operates on distances between selected backbone atoms. In one application, we use it to cluster all protein fragments into only four structural classes. This classification is rough considering the observed diversity of local structures, but comparable in homogeneity to the four classes of secondary structure (alpha-helix, beta-strand, turn and coil). Yet, it discriminates between extended and curved coil and distinguishes beta-bulges from beta-strands. In a second application, the clustering procedure is combined with assignment of backbone dihedral angles to allowed regions in the Ramachandran map. This produces an exhaustive repertoire of highly homogeneous families of structural motifs that contains all the beta-hairpins, beta alpha- and alpha beta-loops previously defined by manual procedures, and new structural families of which two examples, a beta alpha-loop and an alpha-helix beginning, are analyzed in detail. The described automatic procedures should be useful in categorizing structure information in proteins, thereby increasing our ability to analyze relations between structure and sequence.
Similar articles
-
Recurrent alpha beta loop structures in TIM barrel motifs show a distinct pattern of conserved structural features.Proteins. 1992 Apr;12(4):299-313. doi: 10.1002/prot.340120402. Proteins. 1992. PMID: 1374562
-
Automatic classification and analysis of alpha alpha-turn motifs in proteins.J Mol Biol. 1996 Jan 12;255(1):235-53. doi: 10.1006/jmbi.1996.0020. J Mol Biol. 1996. PMID: 8568871
-
Structural classification of alphabetabeta and betabetaalpha supersecondary structure units in proteins.Proteins. 1998 Feb 1;30(2):193-212. Proteins. 1998. PMID: 9489927
-
[A turning point in the knowledge of the structure-function-activity relations of elastin].J Soc Biol. 2001;195(2):181-93. J Soc Biol. 2001. PMID: 11727705 Review. French.
-
Sparsely populated residue conformations in protein structures: revisiting "experimental" Ramachandran maps.Proteins. 2014 Jul;82(7):1101-12. doi: 10.1002/prot.24384. Epub 2013 Dec 18. Proteins. 2014. PMID: 23934782 Review.
Cited by
-
Universal Architectural Concepts Underlying Protein Folding Patterns.Front Mol Biosci. 2021 Apr 30;7:612920. doi: 10.3389/fmolb.2020.612920. eCollection 2020. Front Mol Biosci. 2021. PMID: 33996891 Free PMC article.
-
Investigation of a physical basis for conformational similarity in proteins.J Protein Chem. 1991 Jun;10(3):273-85. doi: 10.1007/BF01025626. J Protein Chem. 1991. PMID: 1910459
-
Fragment-HMM: a new approach to protein structure prediction.Protein Sci. 2008 Nov;17(11):1925-34. doi: 10.1110/ps.036442.108. Epub 2008 Aug 22. Protein Sci. 2008. PMID: 18723665 Free PMC article.
-
Linkers of secondary structures in proteins.Protein Sci. 1997 Dec;6(12):2538-47. doi: 10.1002/pro.5560061206. Protein Sci. 1997. PMID: 9416603 Free PMC article.
-
Conformational analysis and clustering of short and medium size loops connecting regular secondary structures: a database for modeling and prediction.Protein Sci. 1996 Dec;5(12):2600-16. doi: 10.1002/pro.5560051223. Protein Sci. 1996. PMID: 8976569 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources