Sorting protein decoys by machine-learning-to-rank
- PMID: 27530967
- PMCID: PMC4987638
- DOI: 10.1038/srep31571
Sorting protein decoys by machine-learning-to-rank
Abstract
Much progress has been made in Protein structure prediction during the last few decades. As the predicted models can span a broad range of accuracy spectrum, the accuracy of quality estimation becomes one of the key elements of successful protein structure prediction. Over the past years, a number of methods have been developed to address this issue, and these methods could be roughly divided into three categories: the single-model methods, clustering-based methods and quasi single-model methods. In this study, we develop a single-model method MQAPRank based on the learning-to-rank algorithm firstly, and then implement a quasi single-model method Quasi-MQAPRank. The proposed methods are benchmarked on the 3DRobot and CASP11 dataset. The five-fold cross-validation on the 3DRobot dataset shows the proposed single model method outperforms other methods whose outputs are taken as features of the proposed method, and the quasi single-model method can further enhance the performance. On the CASP11 dataset, the proposed methods also perform well compared with other leading methods in corresponding categories. In particular, the Quasi-MQAPRank method achieves a considerable performance on the CASP11 Best150 dataset.
Figures



Similar articles
-
RRCRank: a fusion method using rank strategy for residue-residue contact prediction.BMC Bioinformatics. 2017 Sep 2;18(1):390. doi: 10.1186/s12859-017-1811-9. BMC Bioinformatics. 2017. PMID: 28865433 Free PMC article.
-
MQAPRank: improved global protein model quality assessment by learning-to-rank.BMC Bioinformatics. 2017 May 25;18(1):275. doi: 10.1186/s12859-017-1691-z. BMC Bioinformatics. 2017. PMID: 28545390 Free PMC article.
-
DeepQA: improving the estimation of single protein model quality with deep belief networks.BMC Bioinformatics. 2016 Dec 5;17(1):495. doi: 10.1186/s12859-016-1405-y. BMC Bioinformatics. 2016. PMID: 27919220 Free PMC article.
-
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan. PLoS Comput Biol. 2017. PMID: 28056090 Free PMC article.
-
Methods for estimation of model accuracy in CASP12.Proteins. 2018 Mar;86 Suppl 1:361-373. doi: 10.1002/prot.25395. Epub 2017 Oct 17. Proteins. 2018. PMID: 28975666
Cited by
-
Unsupervised and Supervised Learning over theEnergy Landscape for Protein Decoy Selection.Biomolecules. 2019 Oct 14;9(10):607. doi: 10.3390/biom9100607. Biomolecules. 2019. PMID: 31615116 Free PMC article.
-
Graph-Based Community Detection for Decoy Selection in Template-Free Protein Structure Prediction.Molecules. 2019 Feb 28;24(5):854. doi: 10.3390/molecules24050854. Molecules. 2019. PMID: 30823390 Free PMC article.
-
Two New Heuristic Methods for Protein Model Quality Assessment.IEEE/ACM Trans Comput Biol Bioinform. 2020 Jul-Aug;17(4):1430-1439. doi: 10.1109/TCBB.2018.2880202. Epub 2018 Nov 9. IEEE/ACM Trans Comput Biol Bioinform. 2020. PMID: 30418914 Free PMC article.
-
RRCRank: a fusion method using rank strategy for residue-residue contact prediction.BMC Bioinformatics. 2017 Sep 2;18(1):390. doi: 10.1186/s12859-017-1811-9. BMC Bioinformatics. 2017. PMID: 28865433 Free PMC article.
-
Decoy selection for protein structure prediction via extreme gradient boosting and ranking.BMC Bioinformatics. 2020 Dec 9;21(Suppl 1):189. doi: 10.1186/s12859-020-3523-9. BMC Bioinformatics. 2020. PMID: 33297949 Free PMC article.
References
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous