. 2024 Nov 22;26(1):bbae625.

doi: 10.1093/bib/bbae625.

Meta learning for mutant HLA class I epitope immunogenicity prediction to accelerate cancer clinical immunotherapy

Long Xu¹, Qiang Yang^{1

2}, Weihe Dong³, Xiaokun Li^{1

4

5

6}, Kuanquan Wang¹, Suyu Dong³, Xianyu Zhang⁷, Tiansong Yang⁸, Gongning Luo^{1

9}, Xingyu Liao¹⁰, Xin Gao⁹, Guohua Wang^{1

3}

Affiliations

¹ School of Computer Science and Technology, Harbin Institute of Technology, West DaZhi Street, 150001 Harbin, China.
² School of Medicine and Health, Harbin Institute of Technology, Yikuang Street, 150000 Harbin, China.
³ College of Computer and Control Engineering, Northeast Forestry University, Hexing Road, 150040 Harbin, China.
⁴ School of Computer Science and Technology, Heilongjiang University, Xuefu Road, 150080 Harbin, China.
⁵ Postdoctoral Program of Heilongjiang Hengxun Technology Co., Ltd., Xuefu Road, 150090 Harbin, China.
⁶ Shandong Hengxun Technology Co., Ltd., Miaoling Road, 266100 Qingdao, China.
⁷ Department of Breast Surgery, Harbin Medical University Cancer Hospital, Haping Road, 150081 Harbin, China.
⁸ Department of Rehabilitation, The First Affiliated Hospital of Heilongjiang University of Traditional Chinese Medicine, Xuefu Road, 150040 Harbin, China.
⁹ Computer, Electrical and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology, Thuwal 23955, 4700 KAUST Saudi, Arabia.
¹⁰ School of Computer Science, Northwestern Polytechnical University, 710072 Xian, China.

PMID: 39656887
PMCID: PMC11630330
DOI: 10.1093/bib/bbae625

Meta learning for mutant HLA class I epitope immunogenicity prediction to accelerate cancer clinical immunotherapy

Long Xu et al. Brief Bioinform. 2024.

. 2024 Nov 22;26(1):bbae625.

doi: 10.1093/bib/bbae625.

Authors

Affiliations

¹ School of Computer Science and Technology, Harbin Institute of Technology, West DaZhi Street, 150001 Harbin, China.
² School of Medicine and Health, Harbin Institute of Technology, Yikuang Street, 150000 Harbin, China.
³ College of Computer and Control Engineering, Northeast Forestry University, Hexing Road, 150040 Harbin, China.
⁴ School of Computer Science and Technology, Heilongjiang University, Xuefu Road, 150080 Harbin, China.
⁵ Postdoctoral Program of Heilongjiang Hengxun Technology Co., Ltd., Xuefu Road, 150090 Harbin, China.
⁶ Shandong Hengxun Technology Co., Ltd., Miaoling Road, 266100 Qingdao, China.
⁷ Department of Breast Surgery, Harbin Medical University Cancer Hospital, Haping Road, 150081 Harbin, China.
⁸ Department of Rehabilitation, The First Affiliated Hospital of Heilongjiang University of Traditional Chinese Medicine, Xuefu Road, 150040 Harbin, China.
⁹ Computer, Electrical and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology, Thuwal 23955, 4700 KAUST Saudi, Arabia.
¹⁰ School of Computer Science, Northwestern Polytechnical University, 710072 Xian, China.

PMID: 39656887
PMCID: PMC11630330
DOI: 10.1093/bib/bbae625

Abstract

Accurate prediction of binding between human leukocyte antigen (HLA) class I molecules and antigenic peptide segments is a challenging task and a key bottleneck in personalized immunotherapy for cancer. Although existing prediction tools have demonstrated significant results using established datasets, most can only predict the binding affinity of antigenic peptides to HLA and do not enable the immunogenic interpretation of new antigenic epitopes. This limitation results from the training data for the computational models relying heavily on a large amount of peptide-HLA (pHLA) eluting ligand data, in which most of the candidate epitopes lack immunogenicity. Here, we propose an adaptive immunogenicity prediction model, named MHLAPre, which is trained on the large-scale MS-derived HLA I eluted ligandome (mostly presented by epitopes) that are immunogenic. Allele-specific and pan-allelic prediction models are also provided for endogenous peptide presentation. Using a meta-learning strategy, MHLAPre rapidly assessed HLA class I peptide affinities across the whole pHLA pairs and accurately identified tumor-associated endogenous antigens. During the process of adaptive immune response of T-cells, pHLA-specific binding in the antigen presentation is only a pre-task for CD8+ T-cell recognition. The key factor in activating the immune response is the interaction between pHLA complexes and T-cell receptors (TCRs). Therefore, we performed transfer learning on the pHLA model using the pHLA-TCR dataset. In pHLA binding task, MHLAPre demonstrated significant improvement in identifying neoepitope immunogenicity compared with five state-of-the-art models, proving its effectiveness and robustness. After transfer learning of the pHLA-TCR data, MHLAPre also exhibited relatively superior performance in revealing the mechanism of immunotherapy. MHLAPre is a powerful tool to identify neoepitopes that can interact with TCR and induce immune responses. We believe that the proposed method will greatly contribute to clinical immunotherapy, such as anti-tumor immunity, tumor-specific T-cell engineering, and personalized tumor vaccine.

Keywords: HLA genotyping; deep learning; epitope specificity; immunoinformatics; transfer learning.

PubMed Disclaimer

Figures

**Figure 1**
Statistical information on immunogenicity data. (a) Sequence determination by mass spectrometry methods after elution of antigenic peptides; allele type determination by gene expression. (b) Amino acid site frequency plots of three HLA alleles binding antigenic peptides classified according to peptide length (HLA-A*02:01, HLA-B*07:02, HLA-C*24:02). (c) Raw data cleaning process, division ratio between training and test sets; (d) Observed peptide Length frequencies across alleles. HLA-A alleles bind longer peptides more frequently compared with HLA-B and -C alleles, which tend to bind shorter peptides. Panel a created with BioReader.com.

**Figure 2**
Workflow diagram of the model. (a) Data input structure of the MHLAPre model and model workflow. (b) Training process of MHLAPre IM against pHLA complex epitope immunogenicity. (c) Transfer learning and prediction process of MHLAPre TT in the pHLA-TCR scenario. Panels a-c created with BioReader.com.

**Figure 3**
Performance evaluation of different pHLA antigen affinity prediction algorithms. (a) The figure presents a comparative analysis of the predictive accuracy of several antigen presentation prediction algorithms. The top panel displays the AUROC for each algorithm, whereas the bottom panel shows the AUPRC. Each bar represents the average performance score of the respective algorithms—MHCflurry, NetMHCpan, MHCnuggets, MixMHCpred 2.2, and HLAthena—across all HLA types (ALL) as well as individually stratified for HLA-A, HLA-B, and HLA-C alleles. The error bars correspond to the standard deviation of the performance scores, encapsulating the variability of the algorithm’s predictive power. This comprehensive assessment underscores the varying degrees of efficacy that these computational tools exhibit when tasked with predicting the presentation of antigens by different HLA molecules. (b) The left panel shows the AUROC for each algorithm and the right panel shows the AUPRC. Each line represents the average performance score of different algorithms for different lengths of antigenic peptides.

**Figure 4**
Performance comparison of MHLAPre TT with different pHLA-TCR interaction prediction models. (a) Trend of loss function loss for the MHLAPre TT transfer learning process, where it can be observed that the MHLAPre TT model learnt new environmental knowledge and fitted quickly. (b) PPV of the top 2% for the comparison of different models. each bar or violin represents the average performance score of the respective algorithms pMTnet, PanPep, DLpTCR, ERGO2 across all HLA types (ALL) as well as individually stratified for HLA-A, HLA-B, and HLA-C alleles. (c,d) Mean AUROC and AUPRC for different model comparisons. (e, f) Mean AUROC and AUPRC plots grouped by HLA-A, -B, -C alleles.

**Figure 5**
MHLAPre TT gets excellent performance on independent dataset. (a) MHLAPre with undetected pHLA affinity scores, observed to be well predicted by the presence of high-frequency HLA alleles and alleles with sequence similarity to high-frequency genes in the training data. (b) Comparison of MHLAPre TT and pMHC-specific model performance (NetTCR2.0 , NetTCR2.0 +, MixTCRPred). We obtained a dataset based on 10X Genomics single-cell sequencing data collation to restrict the score comparison of pHLA-TCR-positive samples with pHLA of A0201_GILGFVFTL.

formula image — **Figure 5**
MHLAPre TT gets excellent performance on independent dataset. (a) MHLAPre with undetected pHLA affinity scores, observed to be well predicted by the presence of high-frequency HLA alleles and alleles with sequence similarity to high-frequency genes in the training data. (b) Comparison of MHLAPre TT and pMHC-specific model performance (NetTCR2.0 , NetTCR2.0 +, MixTCRPred). We obtained a dataset based on 10X Genomics single-cell sequencing data collation to restrict the score comparison of pHLA-TCR-positive samples with pHLA of A0201_GILGFVFTL.

See this image and copyright information in PMC

Cited by

A systematic review of T cell epitopes defined from the proteome of human immunodeficiency virus.
Ding Y, Huang L, Wu Y, Yan J. Ding Y, et al. Virus Res. 2025 Aug;358:199602. doi: 10.1016/j.virusres.2025.199602. Epub 2025 Jun 23. Virus Res. 2025. PMID: 40562176 Free PMC article. Review.
Promising future of breast cancer vaccine asking for multidisciplinary collaboration: a literature review.
Zhang Z, Li M, Zhang L, Wang M, Liu D, Tang S, Li Y, Fang X, Ren S. Zhang Z, et al. Front Cell Dev Biol. 2025 Apr 24;13:1578883. doi: 10.3389/fcell.2025.1578883. eCollection 2025. Front Cell Dev Biol. 2025. PMID: 40342927 Free PMC article. Review.

References

1. Roemer MG, Advani RH, Redd RA. et al. .. Classical Hodgkin lymphoma with reduced 2M/MHC class I expression is associated with inferior outcome independent of 9p24.1 status. Cancer Immunol Res 2016;4:910–6. 10.1158/2326-6066.CIR-16-0201. - DOI - PMC - PubMed
1. Garrido F, Aptsiauri N. Cancer immune escape: MHC expression in primary tumours versus metastases. Immunology 2019;158:255–66. 10.1111/imm.13114. - DOI - PMC - PubMed
1. Hu Y, Wang Z, Hu H. et al. .. ACME: pan-specific peptide-MHC class I binding prediction through attention-based deep neural networks. Bioinformatics 2019;35:4946–54. 10.1093/bioinformatics/btz427. - DOI - PubMed
1. Jensen KK, Andreatta M, Marcatili P. et al. .. Improved methods for predicting peptide binding affinity to MHC class II molecules. Immunology 2018;154:394–406. 10.1111/imm.12889. - DOI - PMC - PubMed
1. Bassani-Sternberg M, Chong C, Guillaume P. et al. .. Deciphering HLA-I motifs across HLA peptidomes improves neo-antigen predictions and identifies allostery regulating HLA specificity. PLoS Comput Biol 2017;13:e1005725. 10.1371/journal.pcbi.1005725. - DOI - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions
Actions
Actions

Grants and funding

5940/Center of Excellence on Generative AI

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Meta learning for mutant HLA class I epitope immunogenicity prediction to accelerate cancer clinical immunotherapy

Affiliations

Meta learning for mutant HLA class I epitope immunogenicity prediction to accelerate cancer clinical immunotherapy

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Medical

Research Materials

Abstract

Figures

Similar articles

Cited by

References

MeSH terms

Substances

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Medical

Research Materials