Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 May 12;16(1):52.
doi: 10.1186/s13321-024-00844-x.

Distance plus attention for binding affinity prediction

Affiliations

Distance plus attention for binding affinity prediction

Julia Rahman et al. J Cheminform. .

Abstract

Protein-ligand binding affinity plays a pivotal role in drug development, particularly in identifying potential ligands for target disease-related proteins. Accurate affinity predictions can significantly reduce both the time and cost involved in drug development. However, highly precise affinity prediction remains a research challenge. A key to improve affinity prediction is to capture interactions between proteins and ligands effectively. Existing deep-learning-based computational approaches use 3D grids, 4D tensors, molecular graphs, or proximity-based adjacency matrices, which are either resource-intensive or do not directly represent potential interactions. In this paper, we propose atomic-level distance features and attention mechanisms to capture better specific protein-ligand interactions based on donor-acceptor relations, hydrophobicity, and π -stacking atoms. We argue that distances encompass both short-range direct and long-range indirect interaction effects while attention mechanisms capture levels of interaction effects. On the very well-known CASF-2016 dataset, our proposed method, named Distance plus Attention for Affinity Prediction (DAAP), significantly outperforms existing methods by achieving Correlation Coefficient (R) 0.909, Root Mean Squared Error (RMSE) 0.987, Mean Absolute Error (MAE) 0.745, Standard Deviation (SD) 0.988, and Concordance Index (CI) 0.876. The proposed method also shows substantial improvement, around 2% to 37%, on five other benchmark datasets. The program and data are publicly available on the website https://gitlab.com/mahnewton/daap. Scientific Contribution StatementThis study innovatively introduces distance-based features to predict protein-ligand binding affinity, capitalizing on unique molecular interactions. Furthermore, the incorporation of protein sequence features of specific residues enhances the model's proficiency in capturing intricate binding patterns. The predictive capabilities are further strengthened through the use of a deep learning architecture with attention mechanisms, and an ensemble approach, averaging the outputs of five models, is implemented to ensure robust and reliable predictions.

Keywords: π -Stacking; Attention; Binding affinity; Deep learning; Distance matrix; Donor-acceptor; Hydrophobicity.

PubMed Disclaimer

Conflict of interest statement

No Conflict of interest is declared.

Figures

Fig. 1
Fig. 1
Training and validation loss curve of DAAP
Fig. 2
Fig. 2
The distributions of real and predicted binding affinity values by our predictor (green) and the closest state-of-the-art predictor (red) across the six test sets
Fig. 3
Fig. 3
Visualization of attention maps for concatenated features in the 1o0h protein-ligand complex of the CASF-2016.290 dataset
Fig. 4
Fig. 4
Screening Performance of the Predictive Model: Roc curve (left) and EF (right)
Fig. 5
Fig. 5
Various distance measures that potentially capture protein-ligand interactions. In the figure, dij represents the distance between a donor (D), hydrophobic (H), or π-stacking (S) atom i in the protein and the corresponding acceptor (A), hydrophobic (H), or π-stacking (S) atom j in the ligand. Empty circles represent other atom types. Different colour lines represent different types of interactions
Fig. 6
Fig. 6
The proposed model architecture

References

    1. DiMasi JA, Grabowski HG, Hansen RW. Innovation in the pharmaceutical industry: new estimates of r &d costs. J Health Econ. 2016;47:20–33. doi: 10.1016/j.jhealeco.2016.01.012. - DOI - PubMed
    1. Gilson MK, Zhou H-X. Calculation of protein-ligand binding affinities. Ann Rev Biophys Biomol Str. 2007;36(1):21–42. doi: 10.1146/annurev.biophys.36.040306.132550. - DOI - PubMed
    1. Zheng L, Fan J, Mu Y. Onionnet: a multiple-layer intermolecular-contact-based convolutional neural network for protein-ligand binding affinity prediction. ACS Omega. 2019;4(14):15956–15965. doi: 10.1021/acsomega.9b01997. - DOI - PMC - PubMed
    1. Wang H, Liu H, Ning S, Zeng C, Zhao Y. Dlssaffinity: protein-ligand binding affinity prediction via a deep learning model. Phys Chem Chem Phys. 2022;24(17):10124–10133. doi: 10.1039/D1CP05558E. - DOI - PubMed
    1. Seo S, Choi J, Park S, Ahn J. Binding affinity prediction for protein-ligand complex using deep attention mechanism based on intermolecular interactions. BMC Bioinform. 2021;22(1):1–15. doi: 10.1186/s12859-021-04466-0. - DOI - PMC - PubMed