Prediction of MHC-binding peptides of flexible lengths from sequence-derived structural and physicochemical properties
- PMID: 16806474
- DOI: 10.1016/j.molimm.2006.04.001
Abstract
Peptide binding to MHC is critical for antigen recognition by T-cells. To facilitate vaccine design, computational methods have been developed for predicting MHC-binding peptides, achieving prediction accuracies of 70-90% for binders and 40-80% for non-binders. These methods have been developed for peptides of fixed lengths and for a limited number of alleles, trained on a small number of non-binders, and in some cases based directly on sequence. These restrictions limit prediction coverage and accuracy, particularly for non-binders. It is desirable to explore methods that predict binders of flexible lengths from sequence-derived physicochemical properties and that are trained on diverse sets of non-binders. This work explores support vector machines (SVM) as such a method, developing prediction systems for 18 MHC class I and 12 class II alleles using 4208-3252 binders and 234,333-168,793 non-binders, evaluated on an independent set of 545-476 binders and 110,564-84,430 non-binders. Binder accuracies are 86-99% for 25 alleles and 70-80% for 5 alleles; non-binder accuracies are 96-99% for all 30 alleles. Binder accuracies are comparable to, and non-binder accuracies substantially improved over, previously reported results. Our method correctly predicts 73.3% of the 15 epitopes newly published in the last 4 months of 2005. Of the 251 recently published HLA-A*0201 non-epitopes predicted as binders by other methods, 63 are predicted as binders by our method. Screening of the HIV-1 genome shows that, compared to other methods, a comparable percentage (75-100%) of its known epitopes is correctly predicted, while a lower percentage (0.01-5% for 24 alleles and 5-8% for 6 alleles) of its constituent peptides is predicted as binders. Our software can be accessed at .
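The abstract relies on two ideas that a small example may make concrete: peptides of different lengths have to be mapped to feature vectors of one fixed size before an SVM can use them, and performance is reported separately for binders and non-binders. The Python sketch below illustrates both under stated assumptions. It is not the descriptor set or the evaluation code of the paper: the features (amino acid composition plus mean Kyte-Doolittle hydropathy), the function names encode_peptide and binder_and_nonbinder_accuracy, and the example sequences and labels are all hypothetical choices made for illustration.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Kyte-Doolittle hydropathy scale, used here only as one example of a
# physicochemical property (an illustrative choice, not the paper's descriptors).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def encode_peptide(peptide):
    """Map a peptide of any length to a fixed-length vector of 21 numbers."""
    peptide = peptide.upper()
    if not peptide or set(peptide) - set(AMINO_ACIDS):
        raise ValueError(f"cannot encode peptide {peptide!r}")
    counts = Counter(peptide)
    n = len(peptide)
    # 20 composition fractions: independent of the peptide's length.
    composition = [counts[aa] / n for aa in AMINO_ACIDS]
    # One averaged physicochemical property: also length-independent.
    mean_hydropathy = sum(KYTE_DOOLITTLE[aa] for aa in peptide) / n
    return composition + [mean_hydropathy]


def binder_and_nonbinder_accuracy(y_true, y_pred):
    """Return (binder accuracy, non-binder accuracy) for labels 1 = binder, 0 = non-binder."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both binders and non-binders are needed")
    return tp / (tp + fn), tn / (tn + fp)


if __name__ == "__main__":
    # A 9-mer and a 13-mer (example sequences) yield vectors of equal length.
    for seq in ("SLYNTVATL", "PKYVKQNTLKLAT"):
        print(seq, len(seq), len(encode_peptide(seq)))
    # Hypothetical labels and predictions, for illustration only.
    print(binder_and_nonbinder_accuracy([1, 1, 1, 0, 0], [1, 1, 0, 0, 0]))
```

Reporting the two accuracies separately matters here because non-binders vastly outnumber binders in the data sets described above, so a single overall accuracy would be dominated by the non-binder class.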