Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Mar 12:15:1293706.
doi: 10.3389/fimmu.2024.1293706. eCollection 2024.

MHCII-peptide presentation: an assessment of the state-of-the-art prediction methods

Affiliations
Review

MHCII-peptide presentation: an assessment of the state-of-the-art prediction methods

Yaqing Yang et al. Front Immunol. .

Abstract

Major histocompatibility complex Class II (MHCII) proteins initiate and regulate immune responses by presentation of antigenic peptides to CD4+ T-cells and self-restriction. The interactions between MHCII and peptides determine the specificity of the immune response and are crucial in immunotherapy and cancer vaccine design. With the ever-increasing amount of MHCII-peptide binding data available, many computational approaches have been developed for MHCII-peptide interaction prediction over the last decade. There is thus an urgent need to provide an up-to-date overview and assessment of these newly developed computational methods. To benchmark the prediction performance of these methods, we constructed an independent dataset containing binding and non-binding peptides to 20 human MHCII protein allotypes from the Immune Epitope Database, covering DP, DR and DQ alleles. After collecting 11 known predictors up to January 2022, we evaluated those available through a webserver or standalone packages on this independent dataset. The benchmarking results show that MixMHC2pred and NetMHCIIpan-4.1 achieve the best performance among all predictors. In general, newly developed methods perform better than older ones due to the rapid expansion of data on which they are trained and the development of deep learning algorithms. Our manuscript not only draws a full picture of the state-of-art of MHCII-peptide binding prediction, but also guides researchers in the choice among the different predictors. More importantly, it will inspire biomedical researchers in both academia and industry for the future developments in this field.

Keywords: MHCII; bioinformatics; immunology; machine learning; peptide binding prediction; webserver.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Overall workflow of the construction of the benchmark dataset: we first collected all peptides from IEDB and selected data after 2020 (2020-2022) as our starting dataset Dsetini . We removed the overlap between Dsetini and the IEDB dataset entries before 2020 to make the positive Dsetpos dataset containing 67061 binding peptides of 11 to 19 amino acids; they span 20 allotypes. We used the non-bound regions of the host proteins to generate non-binding peptides of the same length as the binding ones. After that, we removed the non-binding peptides that appear in IEDB. The remaining peptides constitute the negative Dsetneg dataset of 67163 non-binding peptides. The union of Dsetpos and Dsetneg is the final benchmark dataset Dsetbench .
Figure 2
Figure 2
Overview of the computational approaches for MHCII-peptide interaction prediction. There are three types of predictors: scoring functions, machine-learning based tools, and consensus approaches. For each type of method, there are generally five steps to build a reliable predictor: data acquisition and preprocessing, feature generation and selection, model construction and optimization, performance evaluation, and the construction of a web server or independent software.
Figure 3
Figure 3
Analysis of the amino acid positional preferences for allotype-specific peptide ligands in specific HLAII allotypes: (A) HLA-DPA1*01:03 - DPB1*04:01,(B) HLA-DPA1*02:01-DPB1*14:01, (C) HLA-DPA1*01:03-DPB1*02:01, (D) HLA-DPA1*01:03-DPB1*03:01, (E) HLA-DPA1*01:03-DPB1*04:02, (F) HLA-DPA1*02:02-DPB1*05:01. The motifs are aligned with respect to the center of the 9-mer binding core. The overall height of letters indicates the sequence conservation at that position, while the height of each amino acid represents the relative frequency of that residue.
Figure 4
Figure 4
Performance of eleven MHC class II prediction methods assessed by their ROC curves and AUC values. The curves were generated by plotting the true positive rate (y-axis) against the false positive rate (x-axis). ROC curves for peptides binding to HLAII molecules specific for (A) HLA-DPA1*01:03-DPB1*02:01 (15mer), (B) HLA-DPA1*01:03-DPB1*04:01 (15mer) and (C) HLA-DPA1*02:01-DPB1*14:01 (15mer). AUC values for each prediction method are provided between parentheses in the subfigure legends. For prediction results on other HLA classes and peptide lengths see Supplementary Figure S1 and Data S4 .
Figure 5
Figure 5
Performance of MHC class II prediction methods for HLA-DR genes assessed by their ROC curves and AUC values. ROC curves for peptides binding to HLAII molecules specific for (A) HLA-DRB1*03:01 (15mer), (B) HLA-DRB1*04:01 (15mer), (C) HLA-DRB1*15:01(15mer), (D) HLA-DRB5*01:01 (15mer). AUROC values for each prediction method are provided between parentheses in the subfigure legends. For prediction results on other HLA classes and peptide lengths see Supplementary Figure S1 and Data S4 .

Similar articles

Cited by

References

    1. Guermonprez P, Valladeau J, Zitvogel L, Théry C, Amigorena S. Antigen presentation and T cell stimulation by dendritic cells. Annu Rev Immunol. (2002) 20:621–67. doi: 10.1146/annurev.immunol.20.100301.064828 - DOI - PubMed
    1. Blum JS, Wearsch PA, Cresswell P. Pathways of antigen processing. Annu Rev Immunol. (2013) 31:443–73. doi: 10.1146/annurev-immunol-032712-095910 - DOI - PMC - PubMed
    1. Speiser DE, Chijioke O, Schaeuble K, Münz C. CD4+ T cells in cancer. Nat Cancer. (2023) 4:317–29. doi: 10.1038/s43018-023-00521-2 - DOI - PubMed
    1. Sun Z, Chen F, Meng F, Wei J, Liu B. MHC class II restricted neoantigen: a promising target in tumor immunotherapy. Cancer Lett. (2017) 392:17–25. doi: 10.1016/j.canlet.2016.12.039 - DOI - PubMed
    1. Wang R-F, Wang HY. Immune targets and neoantigens for cancer immunotherapy and precision medicine. Cell Res. (2017) 27:11–37. doi: 10.1038/cr.2016.155 - DOI - PMC - PubMed

Publication types

Substances