Two-stage support vector regression approach for predicting accessible surface areas of amino acids
- PMID: 16456847
- DOI: 10.1002/prot.20883
Two-stage support vector regression approach for predicting accessible surface areas of amino acids
Abstract
We address the problem of predicting solvent accessible surface area (ASA) of amino acid residues in protein sequences, without classifying them into buried and exposed types. A two-stage support vector regression (SVR) approach is proposed to predict real values of ASA from the position-specific scoring matrices generated from PSI-BLAST profiles. By adding SVR as the second stage to capture the influences on the ASA value of a residue by those of its neighbors, the two-stage SVR approach achieves improvements of mean absolute errors up to 3.3%, and correlation coefficients of 0.66, 0.68, and 0.67 on the Manesh dataset of 215 proteins, the Barton dataset of 502 nonhomologous proteins, and the Carugo dataset of 338 proteins, respectively, which are better than the scores published earlier on these datasets. A Web server for protein ASA prediction by using a two-stage SVR method has been developed and is available (http://birc.ntu.edu.sg/~ pas0186457/asa.html).
(c) 2006 Wiley-Liss, Inc.
Similar articles
-
Prediction of protein relative solvent accessibility with a two-stage SVM approach.Proteins. 2005 Apr 1;59(1):30-7. doi: 10.1002/prot.20404. Proteins. 2005. PMID: 15696542
-
Prediction of protein accessible surface areas by support vector regression.Proteins. 2004 Nov 15;57(3):558-64. doi: 10.1002/prot.20234. Proteins. 2004. PMID: 15382233
-
SVM-Cabins: prediction of solvent accessibility using accumulation cutoff set and support vector machine.Proteins. 2007 Jul 1;68(1):82-91. doi: 10.1002/prot.21422. Proteins. 2007. PMID: 17436325
-
Real value prediction of solvent accessibility from amino acid sequence.Proteins. 2003 Mar 1;50(4):629-35. doi: 10.1002/prot.10328. Proteins. 2003. PMID: 12577269
-
Real value prediction of solvent accessibility in proteins using multiple sequence alignment and secondary structure.Proteins. 2005 Nov 1;61(2):318-24. doi: 10.1002/prot.20630. Proteins. 2005. PMID: 16106377
Cited by
-
Prediction of protein solvent accessibility using PSO-SVR with multiple sequence-derived features and weighted sliding window scheme.BioData Min. 2015 Jan 31;8:3. doi: 10.1186/s13040-014-0031-3. eCollection 2015. BioData Min. 2015. PMID: 26478747 Free PMC article.
-
Prediction of the burial status of transmembrane residues of helical membrane proteins.BMC Bioinformatics. 2007 Aug 20;8:302. doi: 10.1186/1471-2105-8-302. BMC Bioinformatics. 2007. PMID: 17708758 Free PMC article.
-
Predicting the protein-protein interactions using primary structures with predicted protein surface.BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S3. doi: 10.1186/1471-2105-11-S1-S3. BMC Bioinformatics. 2010. PMID: 20122202 Free PMC article.
-
Sequence based residue depth prediction using evolutionary information and predicted secondary structure.BMC Bioinformatics. 2008 Sep 20;9:388. doi: 10.1186/1471-2105-9-388. BMC Bioinformatics. 2008. PMID: 18803867 Free PMC article.
-
A novel computational and structural analysis of nsSNPs in CFTR gene.Genomic Med. 2008 Jan;2(1-2):23-32. doi: 10.1007/s11568-008-9019-8. Epub 2008 May 14. Genomic Med. 2008. PMID: 18716917 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials