Learning biophysically-motivated parameters for alpha helix prediction
- PMID: 17570862
- PMCID: PMC1892091
- DOI: 10.1186/1471-2105-8-S5-S3
Learning biophysically-motivated parameters for alpha helix prediction
Abstract
Background: Our goal is to develop a state-of-the-art protein secondary structure predictor, with an intuitive and biophysically-motivated energy model. We treat structure prediction as an optimization problem, using parameterizable cost functions representing biological "pseudo-energies". Machine learning methods are applied to estimate the values of the parameters to correctly predict known protein structures.
Results: Focusing on the prediction of alpha helices in proteins, we show that a model with 302 parameters can achieve a Qalpha value of 77.6% and an SOValpha value of 73.4%. Such performance numbers are among the best for techniques that do not rely on external databases (such as multiple sequence alignments). Further, it is easier to extract biological significance from a model with so few parameters.
Conclusion: The method presented shows promise for the prediction of protein secondary structure. Biophysically-motivated elementary free-energies can be learned using SVM techniques to construct an energy cost function whose predictive performance rivals state-of-the-art. This method is general and can be extended beyond the all-alpha case described here.
Figures





References
-
- Eyrich V, et al. EVA: Continuous automatic evaluation of protein structure prediction servers. Bioinformatics. 2001;17:1242–1243. - PubMed
-
- Rost B. Review: Protein Secondary Structure Prediction Continues to Rise. Journal of Structural Biology. 2001;134:204–218. - PubMed
-
- Zemla A, Ceslovas Venclovas, Fidelis K, Rost B. A Modified Definition of Sov, a Segment-Based Measure for Protein Secondary Structure Prediction Assessment. Proteins. 1999;34:220–223. - PubMed
-
- Jones DT. Protein Secondary Structure Prediction Based on Position-specific Scoring Matrices. Journal of Molecular Biology. 1999;292:195–202. - PubMed
-
- Nguyen MN, Rajapakse JC. Prediction of protein secondary structure using bayesian method and support vector machines. ICONIP. 2002.
MeSH terms
LinkOut - more resources
Full Text Sources