Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Apr 12;14(1):22.
doi: 10.1186/s13321-022-00588-6.

Galaxy workflows for fragment-based virtual screening: a case study on the SARS-CoV-2 main protease

Affiliations

Galaxy workflows for fragment-based virtual screening: a case study on the SARS-CoV-2 main protease

Simon Bray et al. J Cheminform. .

Abstract

We present several workflows for protein-ligand docking and free energy calculation for use in the workflow management system Galaxy. The workflows are composed of several widely used open-source tools, including rDock and GROMACS, and can be executed on public infrastructure using either Galaxy's graphical interface or the command line. We demonstrate the utility of the workflows by running a high-throughput virtual screening of around 50000 compounds against the SARS-CoV-2 main protease, a system which has been the subject of intense study in the last year.

Keywords: Computational chemistry; Fragment screening; SARS-CoV-2; Workflows.

PubMed Disclaimer

Conflict of interest statement

The authors have no competing interests to declare.

Figures

Fig. 1
Fig. 1
Schematic of the docking and scoring workflow
Fig. 2
Fig. 2
Schematic of the MMGBSA workflow. A modular subworkflow for system parameterization is shared with the dcTMD workflow; see Fig. 4 for details
Fig. 3
Fig. 3
Pull groups for the TMD simulations (image depicts the x0397 structure). Group 1 (cyan) consists of the ligand non-hydrogen atoms. Group 2 (green) consists of a selection of alpha-carbons in the Mpro active site. During the course of the TMD simulation, the two groups are pulled apart by means of a constant constraint force
Fig. 4
Fig. 4
Schematic of the dcTMD workflow
Fig. 5
Fig. 5
a and b Distributions of SuCOS and TransFS scores per fragment; the mean values are marked in black. c Scatter plot of SuCOS and TransFS scores for all poses. 209 of these are filtered for further screening d All fragments superimposed on the protein structure and colored by the main subpocket to which the fragment binds (S1’ red, S1 blue, S2 pink, S3 orange)
Fig. 6
Fig. 6
Plot of MMGBSA enthalpies for poses derived from each of the 22 fragments (mean marked by the large circles)
Fig. 7
Fig. 7
Ligands (cyan) binding in pockets, overlaid on the parent fragments (green): S1’ a) (x0397; SuCOS 0.65)), S1 b (x0387; SuCOS 0.56), S2 c (x0678; SuCOS 0.53) and S3 d (x0161; SuCOS 0.60)
Fig. 8
Fig. 8
Free energy curves derived from dcTMD calculations for two of the screened compounds
Fig. 9
Fig. 9
a Friction profiles for four selected ligands; the profiles for the ligands binding in subpocket S1/S1’ (red/pink) show a rise starting at 0.2 nm, whereas for those binding in subpocket 2 (blue/cyan), this is absent, with an increase being observable instead at 0.3 nm. b Ligands exiting the subpocket S1/S1’ at 0.25 nm from the initial binding position (pink), with Asn142 highlighted, and subpocket S2 at 0.33 nm from the initial binding position (green), with Ser46 highlighted
Fig. 10
Fig. 10
The average number of interactions observed and the free energy as calculated by MMGBSA are correlated (R2 = − 0.46). The weakness of the relationship reflects the high variation in the strength and importance of interactions
Fig. 11
Fig. 11
Maximum dcTMD free energy scores for compounds which display hydrogen bonding with the peptide backbone at residues Gly143 (R2=0.69) and Cys145 (R2=0.85)

References

    1. Berthold MR, Cebron N, Dill F, Gabriel TR, Kötter T, Meinl T, Ohl P, Thiel K, Wiswedel B. KNIME—the Konstanz Information Miner: version 2.0 and beyond. AcM SIGKDD Explor Newsl. 2009;11(1):26–31. doi: 10.1145/1656274.1656280. - DOI
    1. Crusoe MR, Abeln S, Iosup A, Amstutz P, Chilton J, Tijanic N, Ménager H, Soiland-Reyes S, Goble CA (2021) Methods included: Standardizing computational reuse and portability with the Common Workflow Language. CoRR abs/2105.07028. arXiv:2105.07028
    1. Tommaso PD, Chatzou M, Floden EW, Barja PP, Palumbo E, Notredame C. Nextflow enables reproducible computational workflows. Nat Biotechnol. 2017;35(4):316–319. doi: 10.1038/nbt.3820. - DOI - PubMed
    1. Afgan E, Baker D, Batut B, van den Beek M, Bouvier D, Čech M, Chilton J, Clements D, Coraor N, Grüning BA, Guerler A, Hillman-Jackson J, Hiltemann S, Jalili V, Rasche H, Soranzo N, Goecks J, Taylor J, Nekrutenko A, Blankenberg D. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Res. 2018;46(W1):537–544. doi: 10.1093/nar/gky379. - DOI - PMC - PubMed
    1. Bray SA, Lucas X, Kumar A, Grüning BA. The ChemicalToolbox: reproducible, user-friendly cheminformatics analysis on the Galaxy platform. J Cheminform. 2020 doi: 10.1186/s13321-020-00442-7. - DOI - PMC - PubMed

LinkOut - more resources