. 2024 Dec 31;22(1):301.

doi: 10.1186/s12915-024-02096-5.

PharmRL: pharmacophore elucidation with deep geometric reinforcement learning

Rishal Aggarwal^{1

2}, David R Koes³

Affiliations

¹ Joint PhD Program in Computational Biology, Carnegie Mellon University-University of Pittsburgh, Pittsburgh, PA, USA.
² Computational & Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA.
³ Computational & Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA. dkoes@pitt.edu.

PMID: 39736736
PMCID: PMC11687028
DOI: 10.1186/s12915-024-02096-5

PharmRL: pharmacophore elucidation with deep geometric reinforcement learning

Rishal Aggarwal et al. BMC Biol. 2024.

. 2024 Dec 31;22(1):301.

doi: 10.1186/s12915-024-02096-5.

Authors

Rishal Aggarwal^{1

2}, David R Koes³

Affiliations

¹ Joint PhD Program in Computational Biology, Carnegie Mellon University-University of Pittsburgh, Pittsburgh, PA, USA.
² Computational & Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA.
³ Computational & Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA. dkoes@pitt.edu.

PMID: 39736736
PMCID: PMC11687028
DOI: 10.1186/s12915-024-02096-5

Abstract

Background: Molecular interactions between proteins and their ligands are important for drug design. A pharmacophore consists of favorable molecular interactions in a protein binding site and can be utilized for virtual screening. Pharmacophores are easiest to identify from co-crystal structures of a bound protein-ligand complex. However, designing a pharmacophore in the absence of a ligand is a much harder task.

Results: In this work, we develop a deep learning method that can identify pharmacophores in the absence of a ligand. Specifically, we train a CNN model to identify potential favorable interactions in the binding site, and develop a deep geometric Q-learning algorithm that attempts to select an optimal subset of these interaction points to form a pharmacophore. With this algorithm, we show better prospective virtual screening performance, in terms of F1 scores, on the DUD-E dataset than random selection of ligand-identified features from co-crystal structures. We also conduct experiments on the LIT-PCBA dataset and show that it provides efficient solutions for identifying active molecules. Finally, we test our method by screening the COVID moonshot dataset and show that it would be effective in identifying prospective lead molecules even in the absence of fragment screening experiments.

Conclusions: PharmRL addresses the need for automated methods in pharmacophore design, particularly in cases where a cognate ligand is unavailable. Experimental results demonstrate that PharmRL generates functional pharmacophores. Additionally, we provide a Google Colab notebook to facilitate the use of this method.

Keywords: Machine learning; Pharmacophores; Protein-ligand interactions; Virtual screening.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: None to declare. Competing interests: The authors declare no competing interests.

Figures

**Fig. 1**
Pharmacophore model with several pharmacophore features that matches the caffeine molecule (caffeine molecule included for illustration). The colors of the feature points are as follows: aromatic—purple, hydrogen acceptor—blue, hydrophobic—green

**Fig. 2**
Pharmacophore prediction pipeline. CNN is used to predict pharmacophore features from gridded binding site (top). Protein—pharmacophore graph is built by sequentially adding feature and protein nodes to it using RL framework (bottom)

**Fig. 3**
CNN architecture for predicting pharmacophore feature points. The CNN takes the local grid around the query point as input and provides confidence scores on the presence of the 6 classes at that point

**Fig. 4**
Steps followed to obtain pharmacophore feature points from a CNN predictions on a binding site

**Fig. 5**
MDP process used for iterative construction of the protein-pharmacophore graph. At each time-point t, the action is to chose the next graph ( $G_{t + 1}$ ). The environment takes this and provides a F1 score for that pharmacophore, along with possible graphs to choose from ( ${G_{t + 2}}$ ) for the next iteration

**Fig. 6**
The SE(3)-equivariant neural network takes a protein-pharmacophore graph as input and predicts the Q-value

None — **Algorithm 1** Deep Q-learning algorithm to train Q function network

**Fig. 7**
F1 scores divided by the max F1 score attainable from ligand features for RL models trained and tested on ligand derived features (PharmRL_Ligand) and all CNN features (PharmRL_CNN)

**Fig. 8**
Performance of RL models on COVID screening experiments

**Fig. 9**
Performance of pharmacophore models on LIT-PCBA targets

See this image and copyright information in PMC

Update of

PharmRL: Pharmacophore elucidation with Deep Geometric Reinforcement Learning.
Aggarwal R, Koes DR. Aggarwal R, et al. Res Sq [Preprint]. 2024 Sep 24:rs.3.rs-5033986. doi: 10.21203/rs.3.rs-5033986/v1. Res Sq. 2024. Update in: BMC Biol. 2024 Dec 31;22(1):301. doi: 10.1186/s12915-024-02096-5. PMID: 39399689 Free PMC article. Updated. Preprint.

References

1. Sunseri J, Koes DR. Pharmit: interactive exploration of chemical space. Nucleic Acids Res. 2016;44(W1):W442–8. - DOI - PMC - PubMed
1. Koes DR, Camacho CJ. Pharmer: efficient and exact pharmacophore search. J Chem Inf Model. 2011;51(6):1307–14. - DOI - PMC - PubMed
1. Sato T, Honma T, Yokoyama S. Combining machine learning and pharmacophore-based interaction fingerprint for in silico screening. J Chem Inf Model. 2010;50(1):170–85. - DOI - PubMed
1. Kumar SP, Dixit NY, Patel CN, Rawal RM, Pandya HA. PharmRF: A machine-learning scoring function to identify the best protein-ligand complexes for structure-based pharmacophore screening with high enrichments. J Comput Chem. 2022;43(12):847–63. - DOI - PubMed
1. Zhu H, Zhou R, Cao D, Tang J, Li M. A pharmacophore-guided deep learning approach for bioactive molecular generation. Nat Commun. 2023;14(1):6234. - DOI - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

PharmRL: pharmacophore elucidation with deep geometric reinforcement learning

Affiliations

PharmRL: pharmacophore elucidation with deep geometric reinforcement learning

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Update of

References

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources