Identifying the tertiary fold of small proteins with different topologies from sequence and secondary structure using the genetic algorithm and extended criteria specific for strand regions
- PMID: 8604145
- DOI: 10.1006/jmbi.1996.0115
Identifying the tertiary fold of small proteins with different topologies from sequence and secondary structure using the genetic algorithm and extended criteria specific for strand regions
Abstract
Grid-free protein folding simulations based on sequence and secondary structure knowledge (using mostly experimentally determined secondary structure information but also analysing results from secondary structure predictions) were investigated using the genetic algorithm, a backbone representation, and standard dihedral angular conformations. Optimal structures are selected according to basic protein building principles. Having previously applied this approach to proteins with helical topology, we have now developed additional criteria and weights for beta-strand-containing proteins, validated them on four small beta-strand-rich proteins with different topologies, and tested the general performance of the method on many further examples from known protein structures with mixed secondary structural type and less than 100 amino acid residues. Topology predictions close to the observed experimental structures were obtained in four test cases together with fitness values that correlated with the similarity of the predicted topology to the observed structures. Root-mean-square deviation values of C alpha atoms in the superposed predicted and observed structures, the latter of which had different topologies, were between 4.5 and 5.5 A(2.9 to 5.1 A without loops). Including 15 further protein examples with unique folds, root-mean-square deviation values ranged between 1.8 and 6.9 A with loop regions and averaged 5.3 A and 4.3 A, including and excluding loop regions, respectively.
Similar articles
-
Folding the main chain of small proteins with the genetic algorithm.J Mol Biol. 1994 Feb 25;236(3):844-61. doi: 10.1006/jmbi.1994.1193. J Mol Biol. 1994. PMID: 8114098
-
[A turning point in the knowledge of the structure-function-activity relations of elastin].J Soc Biol. 2001;195(2):181-93. J Soc Biol. 2001. PMID: 11727705 Review. French.
-
Fold assembly of small proteins using monte carlo simulations driven by restraints derived from multiple sequence alignments.J Mol Biol. 1998 Mar 27;277(2):419-48. doi: 10.1006/jmbi.1997.1595. J Mol Biol. 1998. PMID: 9514747
-
New efficient statistical sequence-dependent structure prediction of short to medium-sized protein loops based on an exhaustive loop classification.J Mol Biol. 1999 Jun 25;289(5):1469-90. doi: 10.1006/jmbi.1999.2826. J Mol Biol. 1999. PMID: 10373380
-
Cataloging topologies of protein folding patterns.J Mol Recognit. 2010 Mar-Apr;23(2):253-7. doi: 10.1002/jmr.1006. J Mol Recognit. 2010. PMID: 20151416 Review.
Cited by
-
Evolutionary algorithms in computer-aided molecular design.J Comput Aided Mol Des. 1996 Aug;10(4):337-58. doi: 10.1007/BF00124503. J Comput Aided Mol Des. 1996. PMID: 8877705 Review.
-
Toward predicting protein topology: an approach to identifying beta hairpins.Proc Natl Acad Sci U S A. 2002 Aug 20;99(17):11157-62. doi: 10.1073/pnas.162376199. Epub 2002 Aug 12. Proc Natl Acad Sci U S A. 2002. PMID: 12177429 Free PMC article.
-
MOLS sampling and its applications in structural biophysics.Biophys Rev. 2010 Dec;2(4):169-179. doi: 10.1007/s12551-010-0039-y. Epub 2010 Nov 16. Biophys Rev. 2010. PMID: 28510038 Free PMC article. Review.
-
Exploring conformational space using a mean field technique with MOLS sampling.J Biosci. 2007 Aug;32(5):909-20. doi: 10.1007/s12038-007-0091-3. J Biosci. 2007. PMID: 17914233 Review.
-
Low-resolution structures of proteins in solution retrieved from X-ray scattering with a genetic algorithm.Biophys J. 1998 Jun;74(6):2760-75. doi: 10.1016/S0006-3495(98)77984-6. Biophys J. 1998. PMID: 9635731 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources