Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022;5(1):136.
doi: 10.1038/s42004-022-00754-9. Epub 2022 Oct 27.

An open-source molecular builder and free energy preparation workflow

Affiliations

An open-source molecular builder and free energy preparation workflow

Mateusz K Bieniek et al. Commun Chem. 2022.

Abstract

Automated free energy calculations for the prediction of binding free energies of congeneric series of ligands to a protein target are growing in popularity, but building reliable initial binding poses for the ligands is challenging. Here, we introduce the open-source FEgrow workflow for building user-defined congeneric series of ligands in protein binding pockets for input to free energy calculations. For a given ligand core and receptor structure, FEgrow enumerates and optimises the bioactive conformations of the grown functional group(s), making use of hybrid machine learning/molecular mechanics potential energy functions where possible. Low energy structures are optionally scored using the gnina convolutional neural network scoring function, and output for more rigorous protein-ligand binding free energy predictions. We illustrate use of the workflow by building and scoring binding poses for ten congeneric series of ligands bound to targets from a standard, high quality dataset of protein-ligand complexes. Furthermore, we build a set of 13 inhibitors of the SARS-CoV-2 main protease from the literature, and use free energy calculations to retrospectively compute their relative binding free energies. FEgrow is freely available at https://github.com/cole-group/FEgrow, along with a tutorial.

Keywords: Computational chemistry; Structure-based drug design.

PubMed Disclaimer

Conflict of interest statement

Competing interestsThe authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Overview of the FEgrow workflow.
(left) The user specifies the receptor, ligand core, and a list of functional groups, along with their attachment points. (centre) RDKit is used to attach the selected R-group(s) and enumerate the available conformers with a rigid core. (right) Possible bioactive conformers undergo structural optimisation using a hybrid ML/MM potential energy function. The binding affinity is predicted using a convolutional neural network scoring function and molecular properties are assessed. Final structures are output for further free energy based binding affinity assessment.
Fig. 2
Fig. 2. Overlay of experimental and predicted protein-ligand benchmark dataset structures.
Crystal structures are shown in yellow and grown compounds in grey. a TYK2 (PDB: 4GIH), b Thrombin (PDB: 2ZFF), c P38 (PDB: 3FLY), d PTP1B with force field optimisation (PDB: 2QBS), e PTP1B using ML/MM optimisation, and f BACE(Hunt) (PDB: 4JPC). Root-mean-square distances (RMSD) between predicted and experimental coordinates of atoms in the built R-groups were calculated using RDKit.
Fig. 3
Fig. 3. Structures of the series of cyanophenyl- and uracil-based compounds SARS-CoV-2 main protease (Mpro) inhibitors investigated here.
a Cyanophenyl-based Mpro inhibitors. b X-ray crystal structure of 4 in complex with the protease, with discussed binding pockets labelled. c, d Uracil-based Mpro inhibitors.
Fig. 4
Fig. 4. Comparison between experimental and predicted structures of SARS-CoV-2 main protease (Mpro) inhibitors.
Overlay of a 5 and PDBID: 7L11, b 26 and 7L14, c 14 and 7L12, d 21 and 7L13. Crystal structures are coloured in yellow, and modelled binding poses in grey. Root-mean-square distances (RMSD) between predicted and experimental coordinates of atoms in the built R-groups were calculated using RDKit.
Fig. 5
Fig. 5. Comparison between free energy calculations and experiment. Binding free energies of 13 analogues of the uracil-based Mpro inhibitors, relative to compound 10.
The error bars indicate one standard error based on least square fitting.

Similar articles

Cited by

  • Exploring Scoring Function Space: Developing Computational Models for Drug Discovery.
    Bitencourt-Ferreira G, Villarreal MA, Quiroga R, Biziukova N, Poroikov V, Tarasova O, de Azevedo Junior WF. Bitencourt-Ferreira G, et al. Curr Med Chem. 2024;31(17):2361-2377. doi: 10.2174/0929867330666230321103731. Curr Med Chem. 2024. PMID: 36944627 Review.
  • Active learning driven prioritisation of compounds from on-demand libraries targeting the SARS-CoV-2 main protease.
    Cree B, Bieniek MK, Amin S, Kawamura A, Cole DJ. Cree B, et al. Digit Discov. 2025 Jan 8;4(2):438-450. doi: 10.1039/d4dd00343h. eCollection 2025 Feb 12. Digit Discov. 2025. PMID: 39816163 Free PMC article.
  • CACHE Challenge #2: Targeting the RNA Site of the SARS-CoV-2 Helicase Nsp13.
    Herasymenko O, Silva M, Abu-Saleh AAA, Ahmad A, Alvarado-Huayhuaz J, Arce OEA, Armstrong RJ, Arrowsmith C, Bachta KE, Beck H, Berta D, Bieniek MK, Blay V, Bolotokova A, Bourne PE, Breznik M, Brown PJ, Campbell ADG, Carosati E, Chau I, Cole DJ, Cree B, Dehaen W, Denzinger K, Dos Santos Machado K, Dunn I, Durai P, Edfeldt K, Edwards A, Fayne D, Felfoldi D, Friston K, Ghiabi P, Gibson E, Günther J, Gunnarsson A, Hillisch A, Houston DR, Jensen JH, Harding RJ, Harris KS, Hoffer L, Hogner A, Horton JT, Houliston S, Hultquist JF, Hutchinson A, Irwin JJ, Jukič M, Kandwal S, Karlova A, Katis VL, Kich RP, Kireev D, Koes D, Inniss NL, Lessel U, Liu S, Loppnau P, Lu W, Martino S, McGibbon M, Meiler J, Mettu A, Money-Kyrle S, Moretti R, Moroz YS, Muvva C, Newman JA, Obendorf L, Paige B, Pandit A, Park K, Perveen S, Pirie R, Poda G, Protopopov M, Pütter V, Ricci F, Roper NJ, Rosta E, Rzhetskaya M, Sabnis Y, Satchell KJF, Schmitt Kremer F, Scott T, Seitova A, Steinmann C, Talagayev V, Tarkhanova OO, Tatum NJ, Treleaven D, Velasque Werhli A, Walters WP, Wang X, Wells J, Wells G, Westermaier Y, Wolber G, Wortmann L, Zhang J, Zhao Z, Zheng S, Schapira M. Herasymenko O, et al. J Chem Inf Model. 2025 Jul 14;65(13):6884-6898. doi: 10.1021/acs.jcim.5c00535. Epub 2025 Jun 20. J Chem Inf Model. 2025. PMID: 40539604 Free PMC article.

References

    1. Bender BJ, et al. A practical guide to large-scale docking. Nat. Protoc. 2021;16:4799–4832. doi: 10.1038/s41596-021-00597-z. - DOI - PMC - PubMed
    1. Schneider G, Fechner U. Computer-based de novo design of drug-like molecules. Nat. Rev. Drug Discov. 2005;4:649–663. doi: 10.1038/nrd1799. - DOI - PubMed
    1. Chéron N, Jasty N, Shakhnovich EI. OpenGrowth: An automated and rational algorithm for finding new protein ligands. J. Med. Chem. 2016;59:4171–4188. doi: 10.1021/acs.jmedchem.5b00886. - DOI - PubMed
    1. Durrant JD, Amaro RE, McCammon JA. AutoGrow: A novel algorithm for protein inhibitor design. Chem. Biol. Drug Des. 2009;73:168–178. doi: 10.1111/j.1747-0285.2008.00761.x. - DOI - PMC - PubMed
    1. Yuan Y, Pei J, Lai L. Ligbuilder 2: A practical de novo drug design approach. J. Chem. Inf. Model. 2011;51:1083–1091. doi: 10.1021/ci100350u. - DOI - PubMed