iBCE-EL: A New Ensemble Learning Framework for Improved Linear B-Cell Epitope Prediction
- PMID: 30100904
- PMCID: PMC6072840
- DOI: 10.3389/fimmu.2018.01695
Abstract
Identification of B-cell epitopes (BCEs) is a fundamental step in epitope-based vaccine development, antibody production, and disease prevention and diagnosis. Given the avalanche of protein sequence data generated in the postgenomic era, an automated computational method is needed for fast and accurate identification of novel BCEs among the vast number of candidate proteins and peptides. Although several computational methods have been developed, their accuracy is unreliable, so a reliable model with significantly improved prediction is highly desirable. In this study, we first constructed a non-redundant data set of 5,550 experimentally validated BCEs and 6,893 non-BCEs from the Immune Epitope Database. We then developed a novel ensemble learning framework for improved linear BCE prediction, called iBCE-EL. It fuses two independent predictors: an extremely randomized tree (ERT) classifier, which uses a combination of physicochemical properties (PCP) and amino acid composition as input features, and a gradient boosting (GB) classifier, which uses a combination of dipeptide composition and PCP. Cross-validation analysis on a benchmarking data set showed that iBCE-EL performed better than the individual classifiers (ERT and GB), with a Matthews correlation coefficient (MCC) of 0.454. We further evaluated iBCE-EL on an independent data set, where it significantly outperformed the state-of-the-art method with an MCC of 0.463. To the best of our knowledge, iBCE-EL is the first ensemble method for linear BCE prediction. iBCE-EL is implemented as a web-based platform, available at http://thegleelab.org/iBCE-EL, and offers two prediction modes. The first classifies peptide sequences as BCEs or non-BCEs, while the second lets users mine potential BCEs from protein sequences.
Keywords: B-cell epitope; ensemble learning; extremely randomized tree; gradient boosting; immunotherapy.
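The sequence-derived features and the evaluation metric named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names are hypothetical, the PCP features are omitted because the abstract does not list them, and the weighted average in `fuse_probabilities` is only an assumed fusion rule, since the abstract does not say how the ERT and GB outputs are combined.

```python
from itertools import product
from math import sqrt

# The 20 standard amino acids, in a fixed order used for feature indexing.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(peptide):
    """Return the 20-dimensional fraction of each residue in the peptide."""
    if not peptide:
        raise ValueError("peptide must not be empty")
    n = len(peptide)
    return [peptide.count(a) / n for a in AMINO_ACIDS]


def dipeptide_composition(peptide):
    """Return the 400-dimensional fraction of each overlapping residue pair."""
    if len(peptide) < 2:
        raise ValueError("peptide must have at least two residues")
    # Overlapping pairs: 'ACAC' -> ['AC', 'CA', 'AC'].
    pairs = [peptide[i:i + 2] for i in range(len(peptide) - 1)]
    total = len(pairs)
    return [pairs.count(a + b) / total
            for a, b in product(AMINO_ACIDS, repeat=2)]


def fuse_probabilities(p_ert, p_gb, weight=0.5):
    """Combine two classifier probabilities by weighted averaging.

    ASSUMPTION: the averaging rule and the default weight are illustrative
    only; the source abstract does not specify the fusion scheme.
    """
    return weight * p_ert + (1 - weight) * p_gb


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, return 0 when any marginal total is zero.
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom
```

In such a setup, the composition vectors would be concatenated with PCP features and passed to the two classifiers, and the MCC would be computed from the resulting confusion-matrix counts.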
Similar articles
- PIP-EL: A New Ensemble Learning Method for Improved Proinflammatory Peptide Predictions. Front Immunol. 2018 Jul 31;9:1783. doi: 10.3389/fimmu.2018.01783. eCollection 2018. PMID: 30108593 Free PMC article.
- Multi-perspectives and challenges in identifying B-cell epitopes. Protein Sci. 2023 Nov;32(11):e4785. doi: 10.1002/pro.4785. PMID: 37733481 Free PMC article. Review.
- EPMLR: sequence-based linear B-cell epitope prediction method using multiple linear regression. BMC Bioinformatics. 2014 Dec 19;15(1):414. doi: 10.1186/s12859-014-0414-y. PMID: 25523327 Free PMC article.
- Shotgun Immunoproteomic Approach for the Discovery of Linear B-Cell Epitopes in Biothreat Agents Francisella tularensis and Burkholderia pseudomallei. Front Immunol. 2021 Sep 29;12:716676. doi: 10.3389/fimmu.2021.716676. eCollection 2021. PMID: 34659206 Free PMC article.
- Linear B-Cell Epitope Prediction for In Silico Vaccine Design: A Performance Review of Methods Available via Command-Line Interface. Int J Mol Sci. 2021 Mar 22;22(6):3210. doi: 10.3390/ijms22063210. PMID: 33809918 Free PMC article. Review.
Cited by
- Accelerating therapeutic protein design with computational approaches toward the clinical stage. Comput Struct Biotechnol J. 2023 Apr 29;21:2909-2926. doi: 10.1016/j.csbj.2023.04.027. eCollection 2023. PMID: 38213894 Free PMC article. Review.
- PIP-EL: A New Ensemble Learning Method for Improved Proinflammatory Peptide Predictions. Front Immunol. 2018 Jul 31;9:1783. doi: 10.3389/fimmu.2018.01783. eCollection 2018. PMID: 30108593 Free PMC article.
- A Hybrid Deep Learning Model for Predicting Protein Hydroxylation Sites. Int J Mol Sci. 2018 Sep 18;19(9):2817. doi: 10.3390/ijms19092817. PMID: 30231550 Free PMC article.
- Multi-perspectives and challenges in identifying B-cell epitopes. Protein Sci. 2023 Nov;32(11):e4785. doi: 10.1002/pro.4785. PMID: 37733481 Free PMC article. Review.
- PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method. Cells. 2020 Feb 3;9(2):353. doi: 10.3390/cells9020353. PMID: 32028709 Free PMC article.