Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Apr 29;26(1):116.
doi: 10.1186/s12859-025-06123-2.

PPI-Graphomer: enhanced protein-protein affinity prediction using pretrained and graph transformer models

Affiliations

PPI-Graphomer: enhanced protein-protein affinity prediction using pretrained and graph transformer models

Jun Xie et al. BMC Bioinformatics. .

Abstract

Protein-protein interactions (PPIs) refer to the phenomenon of protein binding through various types of bonds to execute biological functions. These interactions are critical for understanding biological mechanisms and drug research. Among these, the protein binding interface is a critical region involved in protein-protein interactions, particularly the hotspot residues on it that play a key role in protein interactions. Current deep learning methods trained on large-scale data can characterize proteins to a certain extent, but they often struggle to adequately capture information about protein binding interfaces. To address this limitation, we propose the PPI-Graphomer module, which integrates pretrained features from large-scale language models and inverse folding models. This approach enhances the characterization of protein binding interfaces by defining edge relationships and interface masks on the basis of molecular interaction information. Our model outperforms existing methods across multiple benchmark datasets and demonstrates strong generalization capabilities.

Keywords: Bind affinity prediction; ESM; Graph transformer.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: The authors declare no Conflict of interest.

Figures

Fig. 1
Fig. 1
The framework for protein-protein affinity prediction using pre-trained models and the PPI-Graphomer module. A The workflow of this study involves: utilizing ESM2 to extract sequence representations of protein complexes and ESM-IF1 to extract structural representations, which are subsequently concatenated. The concatenated representations are then processed through the PPI-Graphomer module to obtain interface representations. These interface representations are further concatenated with the original features. Ultimately, a MLP is employed to predict affinity values quantitatively. B The architecture of PPI-Graphomer incorporates bias terms based on three different encodings into the attention matrix, which are then multiplied by distance-based weight coefficients. C Three encodings based on amino acid structural information are utilized: encoding of amino acid pair types, encoding based on intermolecular interaction forces, and a masking matrix based on interface information
Fig. 2
Fig. 2
The density plots of the ΔG for the training set and two test sets. The x-axis represents the label values, while the y-axis denotes the probability distribution of the labels. The choice of a density plot over a histogram is attributed to the former’s ability to provide a smoother distribution estimation and facilitate the observation of distributions from datasets of varying sizes within the same figure
Fig. 3
Fig. 3
The Scatter plot of predicted vs experimental binding affinities. The model performance was validated on two separate test datasets. The first test set comprised 75 samples, yielding a PCC of 0.641 and a MAE of 1.64. The second test set included 87 samples, achieving a PCC of 0.625 and an MAE of 1.51

References

    1. Lu H, Zhou Q, He J, Jiang Z, Peng C, Tong R, Shi J. Recent advances in the development of protein-protein interactions modulators: mechanisms and clinical trials. Signal Transduct Target Ther. 2020;5(1):213. - PMC - PubMed
    1. Peng X, Wang J, Peng W, Wu F-X, Pan Y. Protein-protein interactions: detection, reliability assessment and applications. Brief Bioinform. 2017;18(5):798–819. - PubMed
    1. Aloy P, Russell RB. Structural systems biology: modelling protein interactions. Nat Rev Mol Cell Biol. 2006;7(3):188–97. - PubMed
    1. Kaczor AA, Bartuzi D, Stępniewski TM, Matosiuk D, Selent J. Protein-protein docking in drug design and discovery. Comput Drug Discov Design. 2018;2018:285–305. - PubMed
    1. Jubb H, Higueruelo AP, Winter A, Blundell TL. Structural biology and drug discovery for protein-protein interactions. Trends Pharmacol Sci. 2012;33(5):241–8. - PubMed

Grants and funding

LinkOut - more resources