Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Jun 27;62(12):2923-2932.
doi: 10.1021/acs.jcim.2c00127. Epub 2022 Jun 14.

Predicting Protein-Ligand Docking Structure with Graph Neural Network

Affiliations
Review

Predicting Protein-Ligand Docking Structure with Graph Neural Network

Huaipan Jiang et al. J Chem Inf Model. .

Abstract

Modern day drug discovery is extremely expensive and time consuming. Although computational approaches help accelerate and decrease the cost of drug discovery, existing computational software packages for docking-based drug discovery suffer from both low accuracy and high latency. A few recent machine learning-based approaches have been proposed for virtual screening by improving the ability to evaluate protein-ligand binding affinity, but such methods rely heavily on conventional docking software to sample docking poses, which results in excessive execution latencies. Here, we propose and evaluate a novel graph neural network (GNN)-based framework, MedusaGraph, which includes both pose-prediction (sampling) and pose-selection (scoring) models. Unlike the previous machine learning-centric studies, MedusaGraph generates the docking poses directly and achieves from 10 to 100 times speedup compared to state-of-the-art approaches, while having a slightly better docking accuracy.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1.
Figure 1.
(a) Flow of the pose-prediction GNN model. We first run MedusaDock on a protein–ligand complex to achieve a candidate docking pose. Then, the pose-prediction model calculates the movement of each flexible atom in the complex. Flexible atoms include all the ligand atoms and the receptor atoms that are near the receptor surface. Finally, we obtain the final docking pose based on the movement prediction. The pose-prediction GNN model contains several TransformerConv layers. (b) Multistep pose-prediction. An atom moves from the initial location to the final location step-by-step. (c) Flow of the pose-selection GNN model. The final poses generated by the pose-prediction model travel through three TransformerConv layers and three fully connected layers. The final output is the Yes/No neuron indicating the docking probability.
Figure 2.
Figure 2.
Example of generating the docking pose with MedusaGraph and other ML-based approach.
Figure 3.
Figure 3.
Average RMSD of the output poses for each approach. For the normalized latency, we set the execution time of one running of MedusaDock as 1 and normalized all other approaches. We randomly split the data set into a training set and a testing set 10 times to perform the cross-validation, and the error bar indicates the min/max RMSD value of each cross.
Figure 4.
Figure 4.
Histograms of RMSDs of all poses for initial docking poses generated by MedusaDock: the final docking poses generated by the three-step pose-prediction model and the poses selected by the pose-selection model.
Figure 5.
Figure 5.
We show the average RMSD of the initial poses and final docking poses for the complexes with different properties. We also calculate the good pose (with RMSD less than 2.5 Å) rates in initial docking poses and final docking poses. (a) Results of the complexes with a different number of atoms. (b) Results of the complexes with a different number of rotatable bonds.

References

    1. Ruddigkeit L; Van Deursen R; Blum LC; Reymond J-L Enumeration of 166 billion organic small molecules in the chemical universe database GDB-17. J. Chem. Inf. Model 2012, 52, 2864–2875. - PubMed
    1. Declerck PJ; Darendeliler F; Goth M; Kolouskova S; Micle I; Noordam C; Peterkova V; Volevodz NN; Zapletalová J; Ranke MB Biosimilars: controversies as illustrated by rhGH. Curr. Med. Res. Opin 2010, 26, 1219–1229. - PubMed
    1. Hunt JP; Yang SO; Wilding KM; Bundy BC The growing impact of lyophilized cell-free protein expression systems. Bioengineered 2017, 8, 325–330. - PMC - PubMed
    1. Di Stasi A; Tey S-K; Dotti G; Fujita Y; Kennedy-Nasser A; Martinez C; Straathof K; Liu E; Durett AG; Grilley B; Liu H; Cruz CR; Savoldo B; Gee AP; Schindler J; Krance RA; Heslop HE; Spencer DM; Rooney CM; Brenner MK Inducible apoptosis as a safety switch for adoptive cell therapy. N. Engl. J. Med 2011, 365, 1673–1683. - PMC - PubMed
    1. Convertino M; Das J; Dokholyan NV Pharmacological chaperones: design and development of new therapeutic strategies for the treatment of conformational diseases. ACS Chem. Biol 2016, 11, 1471–1489. - PubMed

Publication types