Transfer learning improves pMHC kinetic stability and immunogenicity predictions

Romanos Fasoulis¹, Mauricio Menegatti Rigo¹, Dinler Amaral Antunes², Georgios Paliouras³, Lydia E Kavraki¹

Affiliations

¹ Department of Computer Science, Rice University, 6100 Main St, Houston, 77005, TX, United States.
² Department of Biology and Biochemistry, University of Houston, 4800 Calhoun Rd, Houston, 77004, TX, United States.
³ Institute of Informatics and Telecommunications, NCSR Demokritos, Patr. Gregoriou E and 27 Neapoleos St, Athens, 15341, Greece.

PMID: 38577265
PMCID: PMC10994007
DOI: 10.1016/j.immuno.2023.100030

Transfer learning improves pMHC kinetic stability and immunogenicity predictions

Romanos Fasoulis et al. Immunoinformatics (Amst). 2024 Mar.

. 2024 Mar:13:100030.

doi: 10.1016/j.immuno.2023.100030. Epub 2023 Dec 21.

Authors

Romanos Fasoulis¹, Mauricio Menegatti Rigo¹, Dinler Amaral Antunes², Georgios Paliouras³, Lydia E Kavraki¹

Affiliations

¹ Department of Computer Science, Rice University, 6100 Main St, Houston, 77005, TX, United States.
² Department of Biology and Biochemistry, University of Houston, 4800 Calhoun Rd, Houston, 77004, TX, United States.
³ Institute of Informatics and Telecommunications, NCSR Demokritos, Patr. Gregoriou E and 27 Neapoleos St, Athens, 15341, Greece.

PMID: 38577265
PMCID: PMC10994007
DOI: 10.1016/j.immuno.2023.100030

Abstract

The cellular immune response comprises several processes, with the most notable ones being the binding of the peptide to the Major Histocompability Complex (MHC), the peptide-MHC (pMHC) presentation to the surface of the cell, and the recognition of the pMHC by the T-Cell Receptor. Identifying the most potent peptide targets for MHC binding, presentation and T-cell recognition is vital for developing peptide-based vaccines and T-cell-based immunotherapies. Data-driven tools that predict each of these steps have been developed, and the availability of mass spectrometry (MS) datasets has facilitated the development of accurate Machine Learning (ML) methods for class-I pMHC binding prediction. However, the accuracy of ML-based tools for pMHC kinetic stability prediction and peptide immunogenicity prediction is uncertain, as stability and immunogenicity datasets are not abundant. Here, we use transfer learning techniques to improve stability and immunogenicity predictions, by taking advantage of a large number of binding affinity and MS datasets. The resulting models, TLStab and TLImm, exhibit comparable or better performance than state-of-the-art approaches on different stability and immunogenicity test sets respectively. Our approach demonstrates the promise of learning from the task of peptide binding to improve predictions on downstream tasks. The source code of TLStab and TLImm is publicly available at https://github.com/KavrakiLab/TL-MHC.

Keywords: Machine learning; Peptide immunogenicity; Peptide kinetic stability; Peptide-MHC; Transfer learning.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

**Fig. 1.**
*TLStab & TLImm*: A BA/EL predictor similar to NetMHCpan4.1 [10] is fine-tuned to stability/immunogenicity tasks. This is achieved by refining the MLP weights through task-specific training.

**Fig. 2.**
(A) Relationship between experimental ED50 values and stability values on the Ebola virus dataset and the Pox virus dataset. The y-axis depicts the mean stability values of peptides that have a better ED50 than the threshold (x-axis). (B) Relationship between BA predictions from two state-of-the-art tools (plus our pre-trained BA/EL predictor TLBind) and stability values. The y-axis depicts the mean stability values of peptides that have a better predicted BA than the threshold (x-axis). (C) NetMHCpan4.1 (p < 0.001), MHCFlurry2.0 (p < 0.01) and TLBind (p < 0.001) affinity predictions on immunogenic peptides are significantly different when compared to non-immunogenic ones.

**Fig. 3.**
(A) Pearson’s correlation and Kendall’s tau performance of TLStab against other knowledge transfer approaches on the unbiased 10-fold nested CV experiment. On the left part of the blue dashed line, the performance of BA/EL predictors is depicted (blue bars). On the right side, we show the performance of various knowledge transfer approaches and TLStab (dark yellow bars). (B) Pearson’s correlation and Kendall’s tau performance of TLStab against other approaches on the Ebola virus Dataset. On the left part of the blue dashed line, the performance of BA predictors is depicted (blue bars). On the right side, we show the performance of state-of-the-art pMHC stability tools (teal bars) compared to TLStab (dark yellow bar). (C) Pearson’s correlation and Kendall’s tau performance on the Pox virus Dataset. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

**Fig. 4.**
(A) HLA-A*02:01 motifs for the four depicted methods. While the binding affinity predictor has visible presence of negative charge in position 4, the stability prediction methods are less enriched, although the contribution of negative charge to peptide stability has been previously studied [63]. (B) HLA-A*01:01 motifs for the four depicted methods. The NetMHCstabpan motif has substantially lower presence of D3 and E3, although there is experimental evidence of formed bonds that contribute to peptide stability.

**Fig. 5.**
(A) *AUPRC* performances of different approaches on the **convalescent** donor labels. Baseline AUPRC Is depicted In dashed gray. (B) *AUPRC* performances of different approaches on the **unexposed** donor labels. (C) *Pearson’s correlation and Kendall’s tau* performances of different approaches on the response frequencies of **convalescent** donors. (D) *Pearson’s correlation and Kendall’s tau* performances of different approaches on the response frequencies of **unexposed** donors.

See this image and copyright information in PMC

Cited by

DiscovEpi: automated whole proteome MHC-I-epitope prediction and visualization.
Mahncke C, Schmiedeke F, Simm S, Kaderali L, Bröker BM, Seifert U, Cammann C. Mahncke C, et al. BMC Bioinformatics. 2024 Sep 27;25(1):310. doi: 10.1186/s12859-024-05931-2. BMC Bioinformatics. 2024. PMID: 39333860 Free PMC article.
MHC-I-presented non-canonical antigens expand the cancer immunotherapy targets in acute myeloid leukemia.
Cai Y, Li D, Lv D, Yu J, Ma Y, Jiang T, Ding N, Liu Z, Li Y, Xu J. Cai Y, et al. Sci Data. 2024 Aug 1;11(1):831. doi: 10.1038/s41597-024-03660-y. Sci Data. 2024. PMID: 39090129 Free PMC article.
Mutations in glioblastoma proteins do not disrupt epitope presentation and recognition, maintaining a specific CD8 T cell immune response potential.
Tarabini RF, Fioravanti Vieira G, Rigo MM, de Souza APD. Tarabini RF, et al. Sci Rep. 2024 Jul 19;14(1):16721. doi: 10.1038/s41598-024-67099-2. Sci Rep. 2024. PMID: 39030304 Free PMC article.

References

1. Alberts B, Bray D, Lewis J, Raff M, Roberts K, Watson J. Molecular biology of the cell. 4th ed.. New York: Garland: 2002.
1. Wieczorek M, Abualrous ET, Sticht J, Álvaro-Benito M, Stolzenberg S, Noé F, et al. Major histocompatibility complex (MHC) class I and MHC class II proteins: Conformational plasticity In antigen presentation. Front Immunol 2017:8. - PMC - PubMed
1. Nielsen M, Andreatta M, Peters B, Buus S. Immunoinformatics: Predicting peptlde–MHC binding. Annu Rev Blomed Data Sci 2020:3(1:191–215. - PMC - PubMed
1. Vita R, Mahajan S, Overton JA, Dhanda SK, Martini S, Cantrell JR, et al. The Immune epitope database (IEDB): 2018 update. Nucleic Acids Res 2018;47(D1):D339–43. - PMC - PubMed
1. Rammensee H-G, Bachmann J, Emmerich NPN, Bachor OA, Stevanović S. SYFPEITHI: database for MHC ligands and peptide motifs. Immunogenetics 1999:50:213–9. - PubMed

Grants and funding

U01 CA258512/CA/NCI NIH HHS/United States

LinkOut - more resources

Full Text Sources
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Transfer learning improves pMHC kinetic stability and immunogenicity predictions

Affiliations

Transfer learning improves pMHC kinetic stability and immunogenicity predictions

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Grants and funding

LinkOut - more resources

Full Text Sources

Research Materials