Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar:13:100030.
doi: 10.1016/j.immuno.2023.100030. Epub 2023 Dec 21.

Transfer learning improves pMHC kinetic stability and immunogenicity predictions

Affiliations

Transfer learning improves pMHC kinetic stability and immunogenicity predictions

Romanos Fasoulis et al. Immunoinformatics (Amst). 2024 Mar.

Abstract

The cellular immune response comprises several processes, with the most notable ones being the binding of the peptide to the Major Histocompability Complex (MHC), the peptide-MHC (pMHC) presentation to the surface of the cell, and the recognition of the pMHC by the T-Cell Receptor. Identifying the most potent peptide targets for MHC binding, presentation and T-cell recognition is vital for developing peptide-based vaccines and T-cell-based immunotherapies. Data-driven tools that predict each of these steps have been developed, and the availability of mass spectrometry (MS) datasets has facilitated the development of accurate Machine Learning (ML) methods for class-I pMHC binding prediction. However, the accuracy of ML-based tools for pMHC kinetic stability prediction and peptide immunogenicity prediction is uncertain, as stability and immunogenicity datasets are not abundant. Here, we use transfer learning techniques to improve stability and immunogenicity predictions, by taking advantage of a large number of binding affinity and MS datasets. The resulting models, TLStab and TLImm, exhibit comparable or better performance than state-of-the-art approaches on different stability and immunogenicity test sets respectively. Our approach demonstrates the promise of learning from the task of peptide binding to improve predictions on downstream tasks. The source code of TLStab and TLImm is publicly available at https://github.com/KavrakiLab/TL-MHC.

Keywords: Machine learning; Peptide immunogenicity; Peptide kinetic stability; Peptide-MHC; Transfer learning.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1.
Fig. 1.
TLStab & TLImm: A BA/EL predictor similar to NetMHCpan4.1 [10] is fine-tuned to stability/immunogenicity tasks. This is achieved by refining the MLP weights through task-specific training.
Fig. 2.
Fig. 2.
(A) Relationship between experimental ED50 values and stability values on the Ebola virus dataset and the Pox virus dataset. The y-axis depicts the mean stability values of peptides that have a better ED50 than the threshold (x-axis). (B) Relationship between BA predictions from two state-of-the-art tools (plus our pre-trained BA/EL predictor TLBind) and stability values. The y-axis depicts the mean stability values of peptides that have a better predicted BA than the threshold (x-axis). (C) NetMHCpan4.1 (p < 0.001), MHCFlurry2.0 (p < 0.01) and TLBind (p < 0.001) affinity predictions on immunogenic peptides are significantly different when compared to non-immunogenic ones.
Fig. 3.
Fig. 3.
(A) Pearson’s correlation and Kendall’s tau performance of TLStab against other knowledge transfer approaches on the unbiased 10-fold nested CV experiment. On the left part of the blue dashed line, the performance of BA/EL predictors is depicted (blue bars). On the right side, we show the performance of various knowledge transfer approaches and TLStab (dark yellow bars). (B) Pearson’s correlation and Kendall’s tau performance of TLStab against other approaches on the Ebola virus Dataset. On the left part of the blue dashed line, the performance of BA predictors is depicted (blue bars). On the right side, we show the performance of state-of-the-art pMHC stability tools (teal bars) compared to TLStab (dark yellow bar). (C) Pearson’s correlation and Kendall’s tau performance on the Pox virus Dataset. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 4.
Fig. 4.
(A) HLA-A*02:01 motifs for the four depicted methods. While the binding affinity predictor has visible presence of negative charge in position 4, the stability prediction methods are less enriched, although the contribution of negative charge to peptide stability has been previously studied [63]. (B) HLA-A*01:01 motifs for the four depicted methods. The NetMHCstabpan motif has substantially lower presence of D3 and E3, although there is experimental evidence of formed bonds that contribute to peptide stability.
Fig. 5.
Fig. 5.
(A) AUPRC performances of different approaches on the convalescent donor labels. Baseline AUPRC Is depicted In dashed gray. (B) AUPRC performances of different approaches on the unexposed donor labels. (C) Pearson’s correlation and Kendall’s tau performances of different approaches on the response frequencies of convalescent donors. (D) Pearson’s correlation and Kendall’s tau performances of different approaches on the response frequencies of unexposed donors.

Similar articles

Cited by

References

    1. Alberts B, Bray D, Lewis J, Raff M, Roberts K, Watson J. Molecular biology of the cell. 4th ed.. New York: Garland: 2002.
    1. Wieczorek M, Abualrous ET, Sticht J, Álvaro-Benito M, Stolzenberg S, Noé F, et al. Major histocompatibility complex (MHC) class I and MHC class II proteins: Conformational plasticity In antigen presentation. Front Immunol 2017:8. - PMC - PubMed
    1. Nielsen M, Andreatta M, Peters B, Buus S. Immunoinformatics: Predicting peptlde–MHC binding. Annu Rev Blomed Data Sci 2020:3(1:191–215. - PMC - PubMed
    1. Vita R, Mahajan S, Overton JA, Dhanda SK, Martini S, Cantrell JR, et al. The Immune epitope database (IEDB): 2018 update. Nucleic Acids Res 2018;47(D1):D339–43. - PMC - PubMed
    1. Rammensee H-G, Bachmann J, Emmerich NPN, Bachor OA, Stevanović S. SYFPEITHI: database for MHC ligands and peptide motifs. Immunogenetics 1999:50:213–9. - PubMed