. 2016 Aug 17:6:31571.

doi: 10.1038/srep31571.

Sorting protein decoys by machine-learning-to-rank

Xiaoyang Jing¹, Kai Wang², Ruqian Lu¹, Qiwen Dong³

Affiliations

¹ School of Computer Science, Fudan University, Shanghai 200433, People's Republic of China.
² College of Animal Science and Technology, Jilin Agricultural University, Changchun 130118, People's Republic of China.
³ Institute for Data Science and Engineering, East China Normal University, Shanghai 200062, People's Republic of China.

PMID: 27530967
PMCID: PMC4987638
DOI: 10.1038/srep31571

Sorting protein decoys by machine-learning-to-rank

Xiaoyang Jing et al. Sci Rep. 2016.

. 2016 Aug 17:6:31571.

doi: 10.1038/srep31571.

Authors

Xiaoyang Jing¹, Kai Wang², Ruqian Lu¹, Qiwen Dong³

Affiliations

¹ School of Computer Science, Fudan University, Shanghai 200433, People's Republic of China.
² College of Animal Science and Technology, Jilin Agricultural University, Changchun 130118, People's Republic of China.
³ Institute for Data Science and Engineering, East China Normal University, Shanghai 200062, People's Republic of China.

PMID: 27530967
PMCID: PMC4987638
DOI: 10.1038/srep31571

Abstract

Much progress has been made in Protein structure prediction during the last few decades. As the predicted models can span a broad range of accuracy spectrum, the accuracy of quality estimation becomes one of the key elements of successful protein structure prediction. Over the past years, a number of methods have been developed to address this issue, and these methods could be roughly divided into three categories: the single-model methods, clustering-based methods and quasi single-model methods. In this study, we develop a single-model method MQAPRank based on the learning-to-rank algorithm firstly, and then implement a quasi single-model method Quasi-MQAPRank. The proposed methods are benchmarked on the 3DRobot and CASP11 dataset. The five-fold cross-validation on the 3DRobot dataset shows the proposed single model method outperforms other methods whose outputs are taken as features of the proposed method, and the quasi single-model method can further enhance the performance. On the CASP11 dataset, the proposed methods also perform well compared with other leading methods in corresponding categories. In particular, the Quasi-MQAPRank method achieves a considerable performance on the CASP11 Best150 dataset.

PubMed Disclaimer

Figures

**Figure 1. The overall flowchart of the proposed methods.**

**Figure 2. The ROC curves of compared methods on the 3DRobot dataset based on GDT_TS score.**
The ModFOLDclust2 is a clustering method, other compared methods are listed in “feature extraction” section.

**Figure 3. The ROC curves of compared methods on the CASP11 dataset based on GDT_TS score.**
(a) The ROC curves for Best150 dataset and (b) the corresponding AUCs for Select20 dataset.

See this image and copyright information in PMC

Cited by

Unsupervised and Supervised Learning over theEnergy Landscape for Protein Decoy Selection.
Akhter N, Chennupati G, Kabir KL, Djidjev H, Shehu A. Akhter N, et al. Biomolecules. 2019 Oct 14;9(10):607. doi: 10.3390/biom9100607. Biomolecules. 2019. PMID: 31615116 Free PMC article.
Graph-Based Community Detection for Decoy Selection in Template-Free Protein Structure Prediction.
Kabir KL, Hassan L, Rajabi Z, Akhter N, Shehu A. Kabir KL, et al. Molecules. 2019 Feb 28;24(5):854. doi: 10.3390/molecules24050854. Molecules. 2019. PMID: 30823390 Free PMC article.
Two New Heuristic Methods for Protein Model Quality Assessment.
Wang W, Wang J, Xu D, Shang Y. Wang W, et al. IEEE/ACM Trans Comput Biol Bioinform. 2020 Jul-Aug;17(4):1430-1439. doi: 10.1109/TCBB.2018.2880202. Epub 2018 Nov 9. IEEE/ACM Trans Comput Biol Bioinform. 2020. PMID: 30418914 Free PMC article.
RRCRank: a fusion method using rank strategy for residue-residue contact prediction.
Jing X, Dong Q, Lu R. Jing X, et al. BMC Bioinformatics. 2017 Sep 2;18(1):390. doi: 10.1186/s12859-017-1811-9. BMC Bioinformatics. 2017. PMID: 28865433 Free PMC article.
Decoy selection for protein structure prediction via extreme gradient boosting and ranking.
Akhter N, Chennupati G, Djidjev H, Shehu A. Akhter N, et al. BMC Bioinformatics. 2020 Dec 9;21(Suppl 1):189. doi: 10.1186/s12859-020-3523-9. BMC Bioinformatics. 2020. PMID: 33297949 Free PMC article.

See all "Cited by" articles

References

1. Moult J., Fidelis K., Kryshtafovych A., Schwede T. & Tramontano A. Critical assessment of methods of protein structure prediction (CASP)—round x. Proteins: Structure, Function, and Bioinformatics 82, 1–6 (2014). - PMC - PubMed
1. Zhang Y. Protein structure prediction: when is it useful? Curr Opin Struct Biol 19, 145–155, 10.1016/j.sbi.2009.02.005 (2009). - DOI - PMC - PubMed
1. Cozzetto D., Kryshtafovych A., Ceriani M. & Tramontano A. Assessment of predictions in the model quality assessment category. Proteins 69 Suppl 8, 175–183, 10.1002/prot.21669 (2007). - DOI - PubMed
1. Kryshtafovych A. et al. Assessment of the assessment: Evaluation of the model quality estimates in CASP10. Proteins: Structure, Function, and Bioinformatics 82, 112–126, 10.1002/prot.24347 (2014). - DOI - PMC - PubMed
1. Kryshtafovych A. et al. Methods of model accuracy estimation can help selecting the best models from decoy sets: assessment of model accuracy estimations in CASP11. Proteins: Structure, Function, and Bioinformatics (2015). - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions

Substances

Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations
Research Materials
- NCI CPTC Antibody Characterization Program
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Sorting protein decoys by machine-learning-to-rank

Affiliations

Sorting protein decoys by machine-learning-to-rank

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Other Literature Sources

Research Materials

Miscellaneous