Development of an ab initio protein structure prediction system ABLE
- PMID: 15706537
Development of an ab initio protein structure prediction system ABLE
Abstract
An ab initio protein structure prediction system called ABLE is described. It is based on the fragment assembly method, which consists of two steps: dividing a target sequence into overlapping subsequences (fragments) of short length and assigning a local structure to each fragment; and generating models by assembling the local structures and selecting the models with low potential energy. One of the most important problems in conventional fragment assembly methods is the difficulty of selecting native-like structures by energy minimization only. ABLE thus employs a structural clustering method to select the native-like models from among the generated models. By applying the unit-vector root mean square distance (URMS) as a measure of structure similarity, we achieve more robust, effective structural clustering. When no enough clusters of good quality are obtained, ABLE runs the energy minimization procedure again by incorporating structural restraint conditions obtained from the consensus substructures in the previously generated models. This approach is based on our observation that there is a high probability that the consensus substructures of the generated models have native-like structures. Another feature of ABLE is that in assigning local structures to fragments, it assigns mainchain dihedral angles (phi, psi) to the central residue of each fragment according to a probability distribution map built from candidate sequences similar to each fragment. This enables the system to generate appropriate local structures that may not already exist in a protein structure database. We applied our system to 25 small proteins and obtain near-native folds for more than half of them. We also demonstrate the performance of our structural clustering method, which can be applied to other protein structure prediction systems.
Similar articles
-
Protein structure prediction based on fragment assembly and parameter optimization.Biophys Chem. 2005 Apr 1;115(2-3):209-14. doi: 10.1016/j.bpc.2004.12.046. Epub 2005 Jan 6. Biophys Chem. 2005. PMID: 15752606
-
Optimizing physical energy functions for protein folding.Proteins. 2004 Jan 1;54(1):88-103. doi: 10.1002/prot.10429. Proteins. 2004. PMID: 14705026
-
How well can we predict native contacts in proteins based on decoy structures and their energies?Proteins. 2003 Sep 1;52(4):598-608. doi: 10.1002/prot.10444. Proteins. 2003. PMID: 12910459
-
Inter-residue interactions in protein folding and stability.Prog Biophys Mol Biol. 2004 Oct;86(2):235-77. doi: 10.1016/j.pbiomolbio.2003.09.003. Prog Biophys Mol Biol. 2004. PMID: 15288760 Review.
-
Ab initio protein structure prediction.Curr Opin Struct Biol. 2002 Apr;12(2):176-81. doi: 10.1016/s0959-440x(02)00306-8. Curr Opin Struct Biol. 2002. PMID: 11959494 Review.
Cited by
-
Local error estimates dramatically improve the utility of homology models for solving crystal structures by molecular replacement.Structure. 2015 Feb 3;23(2):397-406. doi: 10.1016/j.str.2014.11.020. Epub 2015 Jan 22. Structure. 2015. PMID: 25619999 Free PMC article.
-
General overview on structure prediction of twilight-zone proteins.Theor Biol Med Model. 2015 Sep 4;12:15. doi: 10.1186/s12976-015-0014-1. Theor Biol Med Model. 2015. PMID: 26338054 Free PMC article. Review.
-
High-performance computing for SARS-CoV-2 RNAs clustering: a data science‒based genomics approach.Genomics Inform. 2021 Dec;19(4):e49. doi: 10.5808/gi.21056. Epub 2021 Dec 31. Genomics Inform. 2021. PMID: 35012291 Free PMC article.
-
Acceleration of sequence clustering using longest common subsequence filtering.BMC Bioinformatics. 2013;14 Suppl 8(Suppl 8):S7. doi: 10.1186/1471-2105-14-S8-S7. Epub 2013 May 9. BMC Bioinformatics. 2013. PMID: 23815271 Free PMC article.