An integrated approach to epitope analysis I: Dimensional reduction, visualization and prediction of MHC binding using amino acid principal components and regression approaches
- PMID: 21044289
- PMCID: PMC2990731
- DOI: 10.1186/1745-7580-6-7
An integrated approach to epitope analysis I: Dimensional reduction, visualization and prediction of MHC binding using amino acid principal components and regression approaches
Abstract
Background: Operation of the immune system is multivariate. Reduction of the dimensionality is essential to facilitate understanding of this complex biological system. One multi-dimensional facet of the immune system is the binding of epitopes to the MHC-I and MHC-II molecules by diverse populations of individuals. Prediction of such epitope binding is critical and several immunoinformatic strategies utilizing amino acid substitution matrices have been designed to develop predictive algorithms. Contemporaneously, computational and statistical tools have evolved to handle multivariate and megavariate analysis, but these have not been systematically deployed in prediction of MHC binding. Partial least squares analysis, principal component analysis, and associated regression techniques have become the norm in handling complex datasets in many fields. Over two decades ago Wold and colleagues showed that principal components of amino acids could be used to predict peptide binding to cellular receptors. We have applied this observation to the analysis of MHC binding, and to derivation of predictive methods applicable on a whole proteome scale.
Results: We show that amino acid principal components and partial least squares approaches can be utilized to visualize the underlying physicochemical properties of the MHC binding domain by using commercially available software. We further show the application of amino acid principal components to develop both linear partial least squares and non-linear neural network regression prediction algorithms for MHC-I and MHC-II molecules. Several visualization options for the output aid in understanding the underlying physicochemical properties, enable confirmation of earlier work on the relative importance of certain peptide residues to MHC binding, and also provide new insights into differences among MHC molecules. We compared both the linear and non-linear MHC binding prediction tools to several predictive tools currently available on the Internet.
Conclusions: As opposed to the highly constrained user-interaction paradigms of web-server approaches, local computational approaches enable interactive analysis and visualization of complex multidimensional data using robust mathematical tools. Our work shows that prediction tools such as these can be constructed on the widely available JMP® platform, can operate in a spreadsheet environment on a desktop computer, and are capable of handling proteome-scale analysis with high throughput.
Figures









Similar articles
-
Toward prediction of class II mouse major histocompatibility complex peptide binding affinity: in silico bioinformatic evaluation using partial least squares, a robust multivariate statistical technique.J Chem Inf Model. 2006 May-Jun;46(3):1491-502. doi: 10.1021/ci050380d. J Chem Inf Model. 2006. PMID: 16711768
-
Deep convolutional neural networks for pan-specific peptide-MHC class I binding prediction.BMC Bioinformatics. 2017 Dec 28;18(1):585. doi: 10.1186/s12859-017-1997-x. BMC Bioinformatics. 2017. PMID: 29281985 Free PMC article.
-
An integrated approach to epitope analysis II: A system for proteomic-scale prediction of immunological characteristics.Immunome Res. 2010 Nov 2;6:8. doi: 10.1186/1745-7580-6-8. Immunome Res. 2010. PMID: 21044290 Free PMC article.
-
A comprehensive review and performance evaluation of bioinformatics tools for HLA class I peptide-binding prediction.Brief Bioinform. 2020 Jul 15;21(4):1119-1135. doi: 10.1093/bib/bbz051. Brief Bioinform. 2020. PMID: 31204427 Free PMC article. Review.
-
Prediction of MHC-peptide binding: a systematic and comprehensive overview.Curr Pharm Des. 2009;15(28):3209-20. doi: 10.2174/138161209789105162. Curr Pharm Des. 2009. PMID: 19860671 Review.
Cited by
-
Human Cysteine Cathepsins Degrade Immunoglobulin G In Vitro in a Predictable Manner.Int J Mol Sci. 2019 Sep 29;20(19):4843. doi: 10.3390/ijms20194843. Int J Mol Sci. 2019. PMID: 31569504 Free PMC article.
-
Frequency Patterns of T-Cell Exposed Amino Acid Motifs in Immunoglobulin Heavy Chain Peptides Presented by MHCs.Front Immunol. 2014 Oct 28;5:541. doi: 10.3389/fimmu.2014.00541. eCollection 2014. Front Immunol. 2014. PMID: 25389426 Free PMC article. Review.
-
In Silico Prediction Analysis of Idiotope-Driven T-B Cell Collaboration in Multiple Sclerosis.Front Immunol. 2017 Oct 2;8:1255. doi: 10.3389/fimmu.2017.01255. eCollection 2017. Front Immunol. 2017. PMID: 29038659 Free PMC article.
-
Classification epitopes in groups based on their protein family.BMC Bioinformatics. 2015;16 Suppl 19(Suppl 19):S7. doi: 10.1186/1471-2105-16-S19-S7. Epub 2015 Dec 16. BMC Bioinformatics. 2015. PMID: 26696329 Free PMC article.
-
A large peptidome dataset improves HLA class I epitope prediction across most of the human population.Nat Biotechnol. 2020 Feb;38(2):199-209. doi: 10.1038/s41587-019-0322-9. Epub 2019 Dec 16. Nat Biotechnol. 2020. PMID: 31844290 Free PMC article.
References
-
- Wold S, Sjorstrom M, Eriksson L. PLS-regression: a basic tool of chemometrics. Chemometrics and Intelligent Laboratory Systems. 2001;58:109–130. doi: 10.1016/S0169-7439(01)00155-1. - DOI
-
- Eriksson L, Johansson E, Kettaneh-Wold N, Trygg J, Wikstrom C, Wold S. Multi and Megavariate Data Analysis. Part II: Advanced Appplications and Method Extensions. 2. Umetrics Academy, Umea, Sweden; 2006.
-
- Eriksson L, Johansson E, Kettaneh-Wold N, Trygg J, Wikstrom C, Wold S. Multi and Megavariate Data Analysis. Part I: Basic Principles and Applications. 2. Umetrics Academy, Umea, Sweden; 2006.
-
- Flower DR, McSparron H, Blythe MJ, Zygouri C, Taylor D, Guan P, Wan S, Coveney PV, Walshe V, Borrow P, Doytchinova IA. Computational vaccinology: quantitative approaches. Novartis Found Symp. 2003;254:102–120. full_text. - PubMed
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials