Clustering Protein Binding Pockets and Identifying Potential Drug Interactions: A Novel Ligand-Based Featurization Method
- PMID: 37847557
- PMCID: PMC10647021
- DOI: 10.1021/acs.jcim.3c00722
Clustering Protein Binding Pockets and Identifying Potential Drug Interactions: A Novel Ligand-Based Featurization Method
Abstract
Protein-ligand interactions are essential to drug discovery and drug development efforts. Desirable on-target or multitarget interactions are the first step in finding an effective therapeutic, while undesirable off-target interactions are the first step in assessing safety. In this work, we introduce a novel ligand-based featurization and mapping of human protein pockets to identify closely related protein targets and to project novel drugs into a hybrid protein-ligand feature space to identify their likely protein interactions. Using structure-based template matches from PDB, protein pockets are featured by the ligands that bind to their best co-complex template matches. The simplicity and interpretability of this approach provide a granular characterization of the human proteome at the protein-pocket level instead of the traditional protein-level characterization by family, function, or pathway. We demonstrate the power of this featurization method by clustering a subset of the human proteome and evaluating the predicted cluster associations of over 7000 compounds.
Conflict of interest statement
The authors declare no competing financial interest.
Figures





Similar articles
-
Graph-Based Clustering of Predicted Ligand-Binding Pockets on Protein Surfaces.J Chem Inf Model. 2015 Sep 28;55(9):1944-52. doi: 10.1021/acs.jcim.5b00045. Epub 2015 Sep 11. J Chem Inf Model. 2015. PMID: 26325445
-
SMAP-WS: a parallel web service for structural proteome-wide ligand-binding site comparison.Nucleic Acids Res. 2010 Jul;38(Web Server issue):W441-4. doi: 10.1093/nar/gkq400. Epub 2010 May 19. Nucleic Acids Res. 2010. PMID: 20484373 Free PMC article.
-
Insights into an original pocket-ligand pair classification: a promising tool for ligand profile prediction.PLoS One. 2013 Jun 20;8(6):e63730. doi: 10.1371/journal.pone.0063730. Print 2013. PLoS One. 2013. PMID: 23840299 Free PMC article.
-
Implications of the small number of distinct ligand binding pockets in proteins for drug discovery, evolution and biochemical function.Bioorg Med Chem Lett. 2015 Mar 15;25(6):1163-70. doi: 10.1016/j.bmcl.2015.01.059. Epub 2015 Feb 3. Bioorg Med Chem Lett. 2015. PMID: 25690787 Free PMC article. Review.
-
Ligand-receptor interaction platforms and their applications for drug discovery.Expert Opin Drug Discov. 2012 Oct;7(10):969-88. doi: 10.1517/17460441.2012.715631. Epub 2012 Aug 4. Expert Opin Drug Discov. 2012. PMID: 22860803 Review.
Cited by
-
The recent advances in the approach of artificial intelligence (AI) towards drug discovery.Front Chem. 2024 May 31;12:1408740. doi: 10.3389/fchem.2024.1408740. eCollection 2024. Front Chem. 2024. PMID: 38882215 Free PMC article. Review.
-
Multi-Component Synthesis of New Fluorinated-Pyrrolo[3,4-b]pyridin-5-ones Containing the 4-Amino-7-chloroquinoline Moiety and In Vitro-In Silico Studies Against Human SARS-CoV-2.Int J Mol Sci. 2025 Aug 7;26(15):7651. doi: 10.3390/ijms26157651. Int J Mol Sci. 2025. PMID: 40806777 Free PMC article.
References
-
- Stevenson G. A.; et al.High-Throughput Virtual Screening of Small Molecule Inhibitors for SARS-CoV-2 Protein Targets with Deep Fusion Models. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, New York: NY, USA, 2021.
-
- Lau E. Y.; Negrete O. A.; Bennett W. F. D.; Bennion B. J.; Borucki M.; Bourguet F.; Epstein A.; Franco M.; Harmon B.; He S.; et al. Discovery of Small-Molecule Inhibitors of SARS-CoV-2 Proteins Using a Computational and Experimental Pipeline. Front. Mol. Biosci. 2021, 8, 678701.10.3389/fmolb.2021.678701. - DOI - PMC - PubMed
-
- Tutone M.; Almerico A. M.. Targeting Enzymes for Pharmaceutical Development: Methods and Protocols; Labrou N. E., Ed.; Springer US: New York, NY, 2020; pp 29–39.