Review

. 2024 Jan 22;25(2):bbae081.

doi: 10.1093/bib/bbae081.

Prediction of protein-ligand binding affinity via deep learning models

Huiwen Wang¹

Affiliations

PMID: 38446737
PMCID: PMC10939342
DOI: 10.1093/bib/bbae081

Review

Prediction of protein-ligand binding affinity via deep learning models

Huiwen Wang. Brief Bioinform. 2024.

. 2024 Jan 22;25(2):bbae081.

doi: 10.1093/bib/bbae081.

Author

Huiwen Wang¹

Affiliation

¹ School of Physics and Engineering, Henan University of Science and Technology, Luoyang 471023, China.

PMID: 38446737
PMCID: PMC10939342
DOI: 10.1093/bib/bbae081

Erratum in

Correction to: Prediction of protein-ligand binding affinity via deep learning models.
[No authors listed] [No authors listed] Brief Bioinform. 2024 May 23;25(4):bbae310. doi: 10.1093/bib/bbae310. Brief Bioinform. 2024. PMID: 38888458 Free PMC article. No abstract available.

Abstract

Accurately predicting the binding affinity between proteins and ligands is crucial in drug screening and optimization, but it is still a challenge in computer-aided drug design. The recent success of AlphaFold2 in predicting protein structures has brought new hope for deep learning (DL) models to accurately predict protein-ligand binding affinity. However, the current DL models still face limitations due to the low-quality database, inaccurate input representation and inappropriate model architecture. In this work, we review the computational methods, specifically DL-based models, used to predict protein-ligand binding affinity. We start with a brief introduction to protein-ligand binding affinity and the traditional computational methods used to calculate them. We then introduce the basic principles of DL models for predicting protein-ligand binding affinity. Next, we review the commonly used databases, input representations and DL models in this field. Finally, we discuss the potential challenges and future work in accurately predicting protein-ligand binding affinity via DL models.

Keywords: accurate prediction; database; deep learning model; input representation; protein–ligand binding affinity.

PubMed Disclaimer

Figures

**Figure 1**
(A) The 2D chemical structure of vemurafenib. (B) The 2D chemical structure of SJF-0628 consisting of vemurafenib, a short linker and a Von Hippel Lindau (VHL)-recruiting ligand. (C) The structure of BRAF-vemurafenib complex. BRAF kinase, ligand vemurafenib and identified pocket are colored in green, red and yellow, respectively. (D) The interaction between vemurafenib and BRAF kinase, in which the hydrogen bonds and other contacts are shown as blue and purple lines, respectively.

**Figure 2**
(A) Three overlapping subsets, including the general, refined and core sets, in the PDBbind database. (B) The number of protein–ligand complexes in the PDBbind database from 2002 to 2020. (C) The pie chart shows the distribution of protein–ligand binding affinity values in the PDBbind database. (D) The distribution of 367 human kinases in the Davis database on the human kinome tree in which the red dots represent each kinase. (E) The pie chart shows the distribution of protein–ligand binding affinity values in the Davis database. The lowest values () constitute 70% of all binding affinity values in the Davis database. (F) The distribution of 216 human kinases in the KIBA database on the human kinome tree in which the red dots represent each kinase. (G) The pie chart shows the distribution of protein–ligand binding affinity values in the KIBA database.

formula image — **Figure 2**
(A) Three overlapping subsets, including the general, refined and core sets, in the PDBbind database. (B) The number of protein–ligand complexes in the PDBbind database from 2002 to 2020. (C) The pie chart shows the distribution of protein–ligand binding affinity values in the PDBbind database. (D) The distribution of 367 human kinases in the Davis database on the human kinome tree in which the red dots represent each kinase. (E) The pie chart shows the distribution of protein–ligand binding affinity values in the Davis database. The lowest values () constitute 70% of all binding affinity values in the Davis database. (F) The distribution of 216 human kinases in the KIBA database on the human kinome tree in which the red dots represent each kinase. (G) The pie chart shows the distribution of protein–ligand binding affinity values in the KIBA database.

**Figure 3**
(A) The binding affinity values of 22 ligands in the Davis database targeting wild-type BRAF and BRAF V600E mutant. (B–D) The structure diagrams of the 4th, 7th and 8th ligands in Figure 3A, respectively.

**Figure 4**
An overall flowchart for predicting protein–ligand interactions based on DL models.

**Figure 5**
(A) Conceptual workflow of interaction-based DL models. Inputs are the pocket–ligand complex structures and their characteristics. (B) Conceptual workflow of interaction-free DL models. The structure-free models can predict protein–ligand binding affinity without protein–ligand interaction information. The inputs of interaction-free models are ligand SMILES strings/protein sequences or ligand/protein monomers 3D structures and their characteristics.

**Figure 6**
(A) The binding affinity values of 16 ligands in the Davis database targeting CDK4-CyclinD1 and CDK4-CyclinD3 complexes. (B) The structures of CDK4-CyclinD1 and CDK4-CyclinD3 complexes. (C–H) Diagrams of 2D protein–ligand interaction for the 4th, 6th and 15th ligands targeting CDK4-CyclinD1 and CDK4-CyclinD3 complexes in Figure 6A, respectively. The hydrophobic residues of protein and hydrophobic interactions between residues and ligands are colored in red. The hydrogen bonds between residues and ligands are indicated by the green lines. The hydrogen bond residues are colored in yellow and names are colored in green.

**Figure 7**
(A) The accuracies of nine interaction-based models on the PDBbind-2016 core set. (B) The accuracies of 15 interaction-based models on the CASF-2016 set.

See this image and copyright information in PMC

Cited by

PLAIG: Protein-Ligand Binding Affinity Prediction Using a Novel Interaction-Based Graph Neural Network Framework.
Samudrala MV, Dandibhotla S, Kaneriya A, Dakshanamurthy S. Samudrala MV, et al. ACS Bio Med Chem Au. 2025 Apr 29;5(3):447-463. doi: 10.1021/acsbiomedchemau.5c00053. eCollection 2025 Jun 18. ACS Bio Med Chem Au. 2025. PMID: 40556781 Free PMC article.
Deep Drug-Target Binding Affinity Prediction Base on Multiple Feature Extraction and Fusion.
Li Z, Zeng Y, Jiang M, Wei B. Li Z, et al. ACS Omega. 2025 Jan 10;10(2):2020-2032. doi: 10.1021/acsomega.4c08048. eCollection 2025 Jan 21. ACS Omega. 2025. PMID: 39866608 Free PMC article.
Edge-enhanced interaction graph network for protein-ligand binding affinity prediction.
Yang D, Kuang L, Hu A. Yang D, et al. PLoS One. 2025 Apr 8;20(4):e0320465. doi: 10.1371/journal.pone.0320465. eCollection 2025. PLoS One. 2025. PMID: 40198678 Free PMC article.
DynamicDTA: Drug-Target Binding Affinity Prediction Using Dynamic Descriptors and Graph Representation.
Luo D, Zhou J, Xu L, Yuan S, Lin X. Luo D, et al. Interdiscip Sci. 2025 Jun 6. doi: 10.1007/s12539-025-00729-z. Online ahead of print. Interdiscip Sci. 2025. PMID: 40481301
Integrated modeling of protein and RNA.
Liu H, Zhao Y. Liu H, et al. Brief Bioinform. 2024 Mar 27;25(3):bbae139. doi: 10.1093/bib/bbae139. Brief Bioinform. 2024. PMID: 38561980 Free PMC article. No abstract available.

See all "Cited by" articles

References

1. Miller DW, DILL KA. Ligand binding to proteins: the binding landscape model. Protein Sci 1997;6:2166–79. - PMC - PubMed
1. Wei J, Chen S, Zong L, et al. Protein-RNA interaction prediction with deep learning: structure matters. Brief Bioinform 2022;23:1–19. - PMC - PubMed
1. Altemose N, Maslan A, Smith OK, et al. DiMeLo-seq: a long-read, single-molecule method for mapping protein-DNA interactions genome wide. Nat Methods 2022;19:711–23. - PMC - PubMed
1. Volkamer A, Eid S, Turk S, et al. Pocketome of human kinases: prioritizing the ATP binding sites of (yet) untapped protein kinases for drug discovery. J Chem Inf Model 2015;55(3):538–49. - PubMed
1. Zarrin AA, Bao K, Lupardus P, et al. Kinase inhibition in autoimmunity and inflammation. Nat Rev Drug Discov 2021;20:39–63. - PMC - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions

Substances

Actions

Grants and funding

12204154/National Natural Science Foundation of China

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Prediction of protein-ligand binding affinity via deep learning models

Affiliation

Prediction of protein-ligand binding affinity via deep learning models

Author

Affiliation

Erratum in

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Erratum in

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

Grants and funding

LinkOut - more resources

Full Text Sources