GraphSite: Ligand Binding Site Classification with Deep Graph Learning
- PMID: 36008947
- PMCID: PMC9405584
- DOI: 10.3390/biom12081053
GraphSite: Ligand Binding Site Classification with Deep Graph Learning
Abstract
The binding of small organic molecules to protein targets is fundamental to a wide array of cellular functions. It is also routinely exploited to develop new therapeutic strategies against a variety of diseases. On that account, the ability to effectively detect and classify ligand binding sites in proteins is of paramount importance to modern structure-based drug discovery. These complex and non-trivial tasks require sophisticated algorithms from the field of artificial intelligence to achieve a high prediction accuracy. In this communication, we describe GraphSite, a deep learning-based method utilizing a graph representation of local protein structures and a state-of-the-art graph neural network to classify ligand binding sites. Using neural weighted message passing layers to effectively capture the structural, physicochemical, and evolutionary characteristics of binding pockets mitigates model overfitting and improves the classification accuracy. Indeed, comprehensive cross-validation benchmarks against a large dataset of binding pockets belonging to 14 diverse functional classes demonstrate that GraphSite yields the class-weighted F1-score of 81.7%, outperforming other approaches such as molecular docking and binding site matching. Further, it also generalizes well to unseen data with the F1-score of 70.7%, which is the expected performance in real-world applications. We also discuss new directions to improve and extend GraphSite in the future.
Keywords: deep learning; graph neural network; ligand binding sites; structure-based drug discovery.
Conflict of interest statement
The authors declare no conflict of interests.
Figures







Similar articles
-
DeepDrug3D: Classification of ligand-binding pockets in proteins with a convolutional neural network.PLoS Comput Biol. 2019 Feb 4;15(2):e1006718. doi: 10.1371/journal.pcbi.1006718. eCollection 2019 Feb. PLoS Comput Biol. 2019. PMID: 30716081 Free PMC article.
-
AlphaFold2-aware protein-DNA binding site prediction using graph transformer.Brief Bioinform. 2022 Mar 10;23(2):bbab564. doi: 10.1093/bib/bbab564. Brief Bioinform. 2022. PMID: 35039821
-
Bionoi: A Voronoi Diagram-Based Representation of Ligand-Binding Sites in Proteins for Machine Learning Applications.Methods Mol Biol. 2021;2266:299-312. doi: 10.1007/978-1-0716-1209-5_17. Methods Mol Biol. 2021. PMID: 33759134
-
Recent advances in AI-driven protein-ligand interaction predictions.Curr Opin Struct Biol. 2025 Jun;92:103020. doi: 10.1016/j.sbi.2025.103020. Epub 2025 Feb 24. Curr Opin Struct Biol. 2025. PMID: 39999605 Review.
-
Predicting Protein-Ligand Docking Structure with Graph Neural Network.J Chem Inf Model. 2022 Jun 27;62(12):2923-2932. doi: 10.1021/acs.jcim.2c00127. Epub 2022 Jun 14. J Chem Inf Model. 2022. PMID: 35699430 Free PMC article. Review.
Cited by
-
The changing scenario of drug discovery using AI to deep learning: Recent advancement, success stories, collaborations, and challenges.Mol Ther Nucleic Acids. 2024 Aug 8;35(3):102295. doi: 10.1016/j.omtn.2024.102295. eCollection 2024 Sep 10. Mol Ther Nucleic Acids. 2024. PMID: 39257717 Free PMC article. Review.
-
In silico protein function prediction: the rise of machine learning-based approaches.Med Rev (2021). 2023 Nov 29;3(6):487-510. doi: 10.1515/mr-2023-0038. eCollection 2023 Dec. Med Rev (2021). 2023. PMID: 38282798 Free PMC article. Review.
-
Twenty years of advances in prediction of nucleic acid-binding residues in protein sequences.Brief Bioinform. 2024 Nov 22;26(1):bbaf016. doi: 10.1093/bib/bbaf016. Brief Bioinform. 2024. PMID: 39833102 Free PMC article. Review.
-
OLB-AC: toward optimizing ligand bioactivities through deep graph learning and activity cliffs.Bioinformatics. 2024 Jun 3;40(6):btae365. doi: 10.1093/bioinformatics/btae365. Bioinformatics. 2024. PMID: 38889277 Free PMC article.
-
Unraveling viral drug targets: a deep learning-based approach for the identification of potential binding sites.Brief Bioinform. 2023 Nov 22;25(1):bbad459. doi: 10.1093/bib/bbad459. Brief Bioinform. 2023. PMID: 38113077 Free PMC article.
References
-
- Armstrong J.D., Hubbard R.E., Farrell T., Maiguashca B., editors. Structure-Based Drug Discovery: An Overview. The Royal Society of Chemistry; Cambridge, UK: 2006.
-
- Vos T., Lim S.S., Abbafati C., Abbas K.M., Abbasi M., Abbasifard M., Abbasi-Kangevari M., Abbastabar H., Abd-Allah F., Abdelalim A., et al. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: A systematic analysis for the Global Burden of Disease Study 2019. Lancet. 2020;396:1204–1222. doi: 10.1016/S0140-6736(20)30925-9. - DOI - PMC - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources