COMSAT: Residue contact prediction of transmembrane proteins based on support vector machines and mixed integer linear programming
- PMID: 26756402
- DOI: 10.1002/prot.24979
Abstract
In this article, we present COMSAT, a hybrid framework for residue contact prediction of transmembrane (TM) proteins that integrates a support vector machine (SVM) method and a mixed integer linear programming (MILP) method. COMSAT consists of two modules: COMSAT_SVM, which is trained mainly on position-specific scoring matrix features, and COMSAT_MILP, which is an ab initio method based on optimization models. Contacts predicted by the SVM model are ranked by SVM confidence score, and a threshold is trained to improve the reliability of the predicted contacts. For TM proteins with no contacts above the threshold, COMSAT_MILP is used instead. The proposed hybrid contact prediction scheme was tested on two independent TM protein sets, with a contact defined as a pair of residues whose Cα atoms lie within 14 Å of each other. First, using a rigorous leave-one-protein-out cross validation on the training set of 90 TM proteins, an accuracy of 66.8%, a coverage of 12.3%, a specificity of 99.3% and a Matthews' correlation coefficient (MCC) of 0.184 were obtained for residue pairs that are at least six amino acids apart. Second, when tested on a test set of 87 TM proteins, the proposed method showed a prediction accuracy of 64.5%, a coverage of 5.3%, a specificity of 99.4% and an MCC of 0.106. COMSAT shows satisfactory results when compared with 12 other state-of-the-art predictors, and its prediction accuracy is more robust as the length and complexity of TM proteins increase. COMSAT is freely accessible at http://hpcc.siat.ac.cn/COMSAT/.
Keywords: MILP; ab initio prediction; hybrid framework; machine learning; protein structure prediction.
© 2016 Wiley Periodicals, Inc.
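The contact definition, the evaluation measures, and the SVM-then-MILP fallback described in the abstract can be sketched in a few lines of Python. This is an illustrative sketch, not the authors' implementation. It assumes the definitions conventional in contact prediction (accuracy = TP/(TP+FP), coverage = TP/(TP+FN)), treats a distance of exactly 14 Å as a contact, and uses hypothetical names throughout (`true_contacts`, `evaluate`, `hybrid_predict`, `milp_fallback`); none of these come from the COMSAT software.

```python
import math

CONTACT_CUTOFF = 14.0  # Å between Cα atoms (from the abstract)
MIN_SEPARATION = 6     # minimum sequence separation (from the abstract)


def true_contacts(ca_coords, cutoff=CONTACT_CUTOFF, min_sep=MIN_SEPARATION):
    """Return residue pairs (i, j), j - i >= min_sep, whose Cα atoms are within cutoff.

    Assumption: the boundary case (distance == cutoff) counts as a contact.
    """
    n = len(ca_coords)
    return {
        (i, j)
        for i in range(n)
        for j in range(i + min_sep, n)
        if math.dist(ca_coords[i], ca_coords[j]) <= cutoff
    }


def evaluate(predicted, actual, n_residues, min_sep=MIN_SEPARATION):
    """Compute accuracy, coverage, specificity and MCC over all eligible pairs.

    Assumption: accuracy is TP/(TP+FP) and coverage is TP/(TP+FN), as is
    conventional in contact prediction; the abstract does not spell these out.
    """
    n_pairs = sum(max(0, n_residues - i - min_sep) for i in range(n_residues))
    tp = len(predicted & actual)
    fp = len(predicted - actual)
    fn = len(actual - predicted)
    tn = n_pairs - tp - fp - fn
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "accuracy": tp / (tp + fp) if tp + fp else 0.0,
        "coverage": tp / (tp + fn) if tp + fn else 0.0,
        "specificity": tn / (tn + fp) if tn + fp else 0.0,
        "mcc": (tp * tn - fp * fn) / denom if denom else 0.0,
    }


def hybrid_predict(svm_scores, threshold, milp_fallback):
    """Keep SVM contacts scoring at or above threshold, ranked by score.

    If no contact passes the threshold, call the (hypothetical) MILP
    fallback, mirroring the two-module scheme described in the abstract.
    """
    ranked = sorted(svm_scores.items(), key=lambda kv: kv[1], reverse=True)
    confident = [pair for pair, score in ranked if score >= threshold]
    return confident if confident else milp_fallback()


if __name__ == "__main__":
    # Toy coordinates on a line; only residues 0 and 6 are eligible and close.
    coords = [(x, 0.0, 0.0) for x in (0, 2, 4, 6, 8, 10, 12, 20)]
    actual = true_contacts(coords)
    predicted = set(hybrid_predict({(0, 6): 0.9, (1, 7): 0.2}, 0.5, lambda: []))
    print(evaluate(predicted, actual, len(coords)))
```

Because non-contacts vastly outnumber contacts among eligible residue pairs, specificity stays close to 100% even when coverage is low, which is why MCC is reported alongside the other measures.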
Similar articles
- COMTOP: Protein Residue-Residue Contact Prediction through Mixed Integer Linear Optimization. Membranes (Basel). 2021 Jun 30;11(7):503. doi: 10.3390/membranes11070503. PMID: 34209399. Free PMC article.
- Prediction of transmembrane regions of beta-barrel proteins using ANN- and SVM-based methods. Proteins. 2004 Jul 1;56(1):11-8. doi: 10.1002/prot.20092. PMID: 15162482.
- Improved method for predicting beta-turn using support vector machine. Bioinformatics. 2005 May 15;21(10):2370-4. doi: 10.1093/bioinformatics/bti358. Epub 2005 Mar 29. PMID: 15797917.
- Structural protein descriptors in 1-dimension and their sequence-based predictions. Curr Protein Pept Sci. 2011 Sep;12(6):470-89. doi: 10.2174/138920311796957711. PMID: 21787299. Review.
- A Treatise to Computational Approaches Towards Prediction of Membrane Protein and Its Subtypes. J Membr Biol. 2017 Feb;250(1):55-76. doi: 10.1007/s00232-016-9937-7. Epub 2016 Nov 19. PMID: 27866233. Review.
Cited by
- COMTOP: Protein Residue-Residue Contact Prediction through Mixed Integer Linear Optimization. Membranes (Basel). 2021 Jun 30;11(7):503. doi: 10.3390/membranes11070503. PMID: 34209399. Free PMC article.
- Co-evolution techniques are reshaping the way we do structural bioinformatics. F1000Res. 2017 Jul 25;6:1224. doi: 10.12688/f1000research.11543.1. eCollection 2017. PMID: 28781768. Free PMC article. Review.
- Inter-Residue Distance Prediction From Duet Deep Learning Models. Front Genet. 2022 May 16;13:887491. doi: 10.3389/fgene.2022.887491. eCollection 2022. PMID: 35651930. Free PMC article.
- Improved protein structure reconstruction using secondary structures, contacts at higher distance thresholds, and non-contacts. BMC Bioinformatics. 2017 Aug 29;18(1):380. doi: 10.1186/s12859-017-1807-5. PMID: 28851269. Free PMC article.
- Membrane protein contact and structure prediction using co-evolution in conjunction with machine learning. PLoS One. 2017 May 24;12(5):e0177866. doi: 10.1371/journal.pone.0177866. eCollection 2017. PMID: 28542325. Free PMC article.