Predicting protein secondary structure by a support vector machine based on a new coding scheme
- PMID: 15706504
Predicting protein secondary structure by a support vector machine based on a new coding scheme
Abstract
Protein structure prediction is one of the most important problems in modern computational biology. Protein secondary structure prediction is a key step in prediction of protein tertiary structure. There have emerged many methods based on machine learning techniques, such as neural networks (NN) and support vector machine (SVM) etc., to focus on the prediction of the secondary structures. In this paper, a new method was proposed based on SVM. Different from the existing methods, this method takes into account of the physical-chemical properties and structure properties of amino acids. When tested on the most popular dataset CB513, it achieved a Q(3) accuracy of 0.7844, which illustrates that it is one of the top range methods for protein of secondary structure prediction.
Similar articles
-
Protein backbone angle prediction with machine learning approaches.Bioinformatics. 2004 Jul 10;20(10):1612-21. doi: 10.1093/bioinformatics/bth136. Epub 2004 Feb 26. Bioinformatics. 2004. PMID: 14988121
-
HYPROSP II--a knowledge-based hybrid method for protein secondary structure prediction based on local prediction confidence.Bioinformatics. 2005 Aug 1;21(15):3227-33. doi: 10.1093/bioinformatics/bti524. Epub 2005 Jun 2. Bioinformatics. 2005. PMID: 15932901
-
Predicting disulfide connectivity from protein sequence using multiple sequence feature vectors and secondary structure.Bioinformatics. 2007 Dec 1;23(23):3147-54. doi: 10.1093/bioinformatics/btm505. Epub 2007 Oct 17. Bioinformatics. 2007. PMID: 17942444
-
Protein secondary structure prediction.Methods Mol Biol. 2010;609:327-48. doi: 10.1007/978-1-60327-241-4_19. Methods Mol Biol. 2010. PMID: 20221928 Review.
-
Recent progresses in the application of machine learning approach for predicting protein functional class independent of sequence similarity.Proteomics. 2006 Jul;6(14):4023-37. doi: 10.1002/pmic.200500938. Proteomics. 2006. PMID: 16791826 Review.
Cited by
-
Learning sparse models for a dynamic Bayesian network classifier of protein secondary structure.BMC Bioinformatics. 2011 May 13;12:154. doi: 10.1186/1471-2105-12-154. BMC Bioinformatics. 2011. PMID: 21569525 Free PMC article.
-
Prediction of carbohydrate-binding proteins from sequences using support vector machines.Adv Bioinformatics. 2010;2010:289301. doi: 10.1155/2010/289301. Epub 2010 Sep 27. Adv Bioinformatics. 2010. PMID: 20936154 Free PMC article.
-
How many 3D structures do we need to train a predictor?Genomics Proteomics Bioinformatics. 2009 Sep;7(3):128-37. doi: 10.1016/S1672-0229(08)60041-8. Genomics Proteomics Bioinformatics. 2009. PMID: 19944385 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Other Literature Sources