Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Nov 10;382(6671):eabo7201.
doi: 10.1126/science.abo7201. Epub 2023 Nov 10.

Open science discovery of potent noncovalent SARS-CoV-2 main protease inhibitors

Melissa L Boby #  1   2   3 Daren Fearon #  4   5 Matteo Ferla #  6 Mihajlo Filep #  7 Lizbé Koekemoer #  8   9 Matthew C Robinson #  10 COVID Moonshot Consortium‡John D Chodera  3 Alpha A Lee  10 Nir London  7 Annette von Delft  6   8   9 Frank von Delft  4   5   8   9   11 Hagit Achdout  12 Anthony Aimon  13   14 Dominic S Alonzi  15 Robert Arbon  16 Jasmin C Aschenbrenner  13   14 Blake H Balcomb  13   14 Elad Bar-David  12 Haim Barr  17 Amir Ben-Shmuel  12 James Bennett  18   19 Vitaliy A Bilenko  20   21 Bruce Borden  22 Pascale Boulet  23 Gregory R Bowman  24 Lennart Brewitz  25 Juliane Brun  15 Sarma Bvnbs  26 Mark Calmiano  27 Anna Carbery  13   28 Daniel W Carney  29 Emma Cattermole  15 Edcon Chang  29 Eugene Chernyshenko  20 Austin Clyde  30 Joseph E Coffland  31 Galit Cohen  17 Jason C Cole  32 Alessandro Contini  33 Lisa Cox  34 Tristan Ian Croll  35   36 Milan Cvitkovic  37 Steven De Jonghe  38 Alex Dias  13   14 Kim Donckers  38 David L Dotson  39 Alice Douangamath  13   14 Shirly Duberstein  17 Tim Dudgeon  40 Louise E Dunnett  13   14 Peter Eastman  41 Noam Erez  12 Charles J Eyermann  42 Michael Fairhead  18 Gwen Fate  43 Oleg Fedorov  18   19 Rafaela S Fernandes  44 Lori Ferrins  42 Richard Foster  45 Holly Foster  45   46 Laurent Fraisse  23 Ronen Gabizon  47 Adolfo García-Sastre  48   49   50   51   52 Victor O Gawriljuk  44   53 Paul Gehrtz  47   54 Carina Gileadi  18 Charline Giroud  18   19 William G Glass  16   46 Robert C Glen  55 Itai Glinert  12 Andre S Godoy  44 Marian Gorichko  21 Tyler Gorrie-Stone  13   14 Ed J Griffen  56 Amna Haneef  57 Storm Hassell Hart  58 Jag Heer  59 Michael Henry  16 Michelle Hill  15   60 Sam Horrell  13   14 Qiu Yu Judy Huang  61 Victor D Huliak  20 Matthew F D Hurley  62 Tomer Israely  12 Andrew Jajack  37 Jitske Jansen  63 Eric Jnoff  64 Dirk Jochmans  38 Tobias John  25   65 Benjamin Kaminow  16   66 Lulu Kang  67 Anastassia L Kantsadi  15   68 Peter W Kenny  69 J L Kiappes  15   70 Serhii O Kinakh  20 Boris Kovar  71 Tobias Krojer  18   72 Van Ngoc Thuy La  57 Sophie Laghnimi-Hahn  23 Bruce A Lefker  43 Haim Levy  12 Ryan M Lithgo  13   14 Ivan G Logvinenko  20 Petra Lukacik  13   14 Hannah Bruce Macdonald  16   73 Elizabeth M MacLean  18 Laetitia L Makower  15 Tika R Malla  18 Peter G Marples  13   14 Tatiana Matviiuk  20 Willam McCorkindale  73   74 Briana L McGovern  48   49 Sharon Melamed  12 Kostiantyn P Melnykov  20   21 Oleg Michurin  20 Pascal Miesen  75 Halina Mikolajek  13   14 Bruce F Milne  76   77 David Minh  78 Aaron Morris  37 Garrett M Morris  28 Melody Jane Morwitzer  79 Demetri Moustakas  80 Charles E Mowbray  23 Aline M Nakamura  44   81 Jose Brandao Neto  13   14 Johan Neyts  38 Luong Nguyen  37 Gabriela D Noske  44 Vladas Oleinikovas  27   82 Glaucius Oliva  44 Gijs J Overheul  75 C David Owen  13   14 Ruby Pai  37 Jin Pan  37 Nir Paran  12 Alexander Matthew Payne  16   66 Benjamin Perry  23   83 Maneesh Pingle  26 Jakir Pinjari  26   84 Boaz Politi  12 Ailsa Powell  13   14 Vladimír Pšenák  71 Iván Pulido  16 Reut Puni  12 Victor L Rangel  85   86 Rambabu N Reddi  47 Paul Rees  87 St Patrick Reid  79 Lauren Reid  56 Efrat Resnick  47 Emily Grace Ripka  37 Ralph P Robinson  43 Jaime Rodriguez-Guerra  88 Romel Rosales  48   49 Dominic A Rufa  16   66 Kadi Saar  74 Kumar Singh Saikatendu  29 Eidarus Salah  25 David Schaller  88 Jenke Scheen  16 Celia A Schiffer  61 Christopher J Schofield  25 Mikhail Shafeev  20 Aarif Shaikh  26 Ala M Shaqra  61 Jiye Shi  64   89 Khriesto Shurrush  17 Sukrit Singh  16 Assa Sittner  12 Peter Sjö  23 Rachael Skyner  13   14 Adam Smalley  27 Bart Smeets  90 Mihaela D Smilova  18 Leonardo J Solmesky  17 John Spencer  58 Claire Strain-Damerell  13   14 Vishwanath Swamy  26   91 Hadas Tamir  12 Jenny C Taylor  92 Rachael E Tennant  93 Warren Thompson  13   14 Andrew Thompson  18   94 Susana Tomásio  95 Charles W E Tomlinson  13   14 Igor S Tsurupa  20 Anthony Tumber  25 Ioannis Vakonakis  15   96 Ronald P van Rij  75 Laura Vangeel  38 Finny S Varghese  75   97 Mariana Vaschetto  95 Einat B Vitner  12 Vincent Voelz  62 Andrea Volkamer  88   98 Martin A Walsh  13   14 Walter Ward  99 Charlie Weatherall  100 Shay Weiss  12 Kris M White  48   49 Conor Francis Wild  13   14 Karolina D Witt  101 Matthew Wittmann  16 Nathan Wright  18 Yfat Yahalom-Ronen  12 Nese Kurt Yilmaz  61 Daniel Zaidmann  47 Ivy Zhang  16 Hadeer Zidane  17 Nicole Zitzmann  15 Sarah N Zvornicanin  61
Affiliations

Open science discovery of potent noncovalent SARS-CoV-2 main protease inhibitors

Melissa L Boby et al. Science. .

Abstract

We report the results of the COVID Moonshot, a fully open-science, crowdsourced, and structure-enabled drug discovery campaign targeting the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) main protease. We discovered a noncovalent, nonpeptidic inhibitor scaffold with lead-like properties that is differentiated from current main protease inhibitors. Our approach leveraged crowdsourcing, machine learning, exascale molecular simulations, and high-throughput structural biology and chemistry. We generated a detailed map of the structural plasticity of the SARS-CoV-2 main protease, extensive structure-activity relationships for multiple chemotypes, and a wealth of biochemical activity data. All compound designs (>18,000 designs), crystallographic data (>490 ligand-bound x-ray structures), assay data (>10,000 measurements), and synthesized molecules (>2400 compounds) for this campaign were shared rapidly and openly, creating a rich, open, and intellectual property-free knowledge base for future anticoronavirus drug discovery.

PubMed Disclaimer

Conflict of interest statement

Competing interests: Disclosures for each author are listed individually in the consortium spreadsheet (data S7). The authors declare no additional competing interests.

Figures

Fig. 1.
Fig. 1.. Crowdsourcing rapidly identified chemotype scaffolds by merging fragment hits.
(A) A Diamond/XChem fragment screen that initiated this SARS-CoV-2 Mpro inhibitor discovery campaign generated 71 hits that completely cover the Mpro active site, with a variety of chemotypes engaging each pocket; 1638 x-ray datasets were collected, and 96 solved structures for hits were publicly posted (20). The peptidomimetic N3 ligand is shown on the left for comparison to indicate natural substrate engagement in the binding site, defining the peptide sidechain numbering scheme used throughout this work. The nucleophilic Cys145 reacts with the scissile peptide bond between P1 and P1’; His41-Cys145 form a catalytic dyad whose coupled charge states shuttle between zwitterionic and neutral states (90). (B) On 18 March 2020, the COVID Moonshot set up a crowdsourcing website to empower scientists across the globe to contribute molecule designs. The number of designs actioned for synthesis each quarter (except for the 2020 Q2, which is shown per-month in brackets) is shown, subdivided by the region of the submitter of the design idea. The total number of unique submitters that contributed actioned designs for that quarter is shown on top of the bars. (C) Many submissions, such as TRY-UNI-714a760b-6, exploited spatially overlapping fragment hits to design potent leads that are synthetically facile. (D) Experimental biochemical potency of designs broken down by submission group. Multiple submissions in 2020 from the community were more potent than the best designs from the core team, as seen for the top three chloroacetamide structures (left) and noncovalent structures (right). (E) Distribution of synthetic accessibility scores (SAScores) for designs contributed by the core team and the community. The concern that community submissions may be of poor quality is not supported by the fact that these were as synthetically accessible as those designed by the core team (median: community, 0.17; core, 0.13). Half of the outliers (SAScore = 1) were primarily natural products, which are hard to achieve through organic chemistry.
Fig. 2.
Fig. 2.. Strategies to support rapid optimization cycles.
(A) Machine learning forecasts experimental synthesis time (left) and returns efficient routes that leverage more than 10 million in-stock advanced intermediates (right). Our algorithm predicts the probability of each step being successful and predicts synthetic accessibility by taking the product of the probabilities along the whole route. We analyzed all compounds made in COVID Moonshot from 1 May 2020 to 1 July 2021 (n = 898). The right panel exemplifies the experimental execution of the predicted routes, demonstrating the ability of the algorithm to build on functionalized intermediates to shorten synthesis. (B) Applying alchemical free-energy calculations at scale enables us to estimate the potency of compounds. Retrospective assessment of our automated free-energy calculation workflow on early compounds in the 3-aminopyridine series in the first month of the COVID Moonshot campaign suggested that free-energy calculations could provide good predictive utility, which inspired confidence for large-scale deployment during this campaign. Here, the absolute free energy of binding (ΔG) is shown in the rightmost panel by adding a constant offset to the computed relative free-energy differences. (C) Alchemical free-energy predictions for all submissions elaborating on the depicted scaffold for three representative batches of prospective free-energy calculations plotted as calculated (converted using Cheng-Prusoff equation) versus experimental pIC50. Simulations were run using Mpro in dimer form, with neutral catalytic dyad and no restraints. Each batch (numbered 1 to 3 from left to right) is annotated with its scaffold, and top-scoring candidates are shown on the right-hand side (numbered 1 to 3 from top to bottom)–for these, the structure names are shown together with their predicted and experimental pIC50 (“Pred” and “Exp,” respectively). Statistical performance with 95% confidence intervals for each batch is shown as a table in each scatterplot. (D) Two examples of nanomole-scale HTC campaigns used to optimize the potency of intermediate binders, centering on the Chan-Lam reaction (fig. S7) and amide couplings (fig. S8). Direct biochemical screening of crude reactions identified candidates that were resynthesized and in both cases were able to improve the potency of the parent compound. Soaking of crude reaction mixtures of the most potent biochemical hits into Mpro crystals provided complex structures with the identified hits (Chan-Lam PDBs: 7GJJ/7GJZ, resolution: 1.75Å/1.65Å; Amide coupling PDBs: 7GNL/7GNQ, resolution: 1.68Å/1.53Å). In both cases, new interactions were discovered, explaining the improved activity. Although for the Chan-Lam reaction campaign, the extended compounds occupied the intended P4, for the amide-coupling vector, all compounds extended into the P3/5 pockets.
Fig. 3.
Fig. 3.. Analysis of 367 complex crystal structures reveals hotspots for ligand engagement and a variety of ways to engage substrate recognition subpockets.
(A) The five substrate recognition subpockets exhibit distinct preferences for intermolecular interactions. The figure highlights the locations of different types of interactions, with the shading indicating the frequency. The bottom row tallies the number of times that each interaction was seen in our structures for different residues. The interaction map was generated using PLIPify (Materials and methods) and summarizes the interactions witnessed across 367 complexes from the perspective of the protein, distinguishing between backbone (bb) and sidechain (sc) interactions (which might be more vulnerable to point mutations). (B) Representative examples of protein-ligand interactions engaging the P1’, P1, P2, and P3-5 subpockets. Hydrogen bonds and π-stacking interactions are depicted as yellow and cyan dashed lines, respectively. The rows above each set of complexes tally the number of times that each interaction was seen with the specific residues within the subpockets. See data S4 for Protein Data Bank (PDB) IDs and crystallography statistics. Single-letter abbreviations for the amino acid residues are as follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr.
Fig. 4.
Fig. 4.. Structural plasticity of the binding subpockets.
The subpockets have different degrees of plasticity, which is also dependent on the chemical series (fig. S2). The corners of the figure show the distribution of sidechain root mean square deviation (RMSD) deviations from the structure of MAT-POS-e194df51-1 (middle panel; PDB: 7GAW). The boxes exemplify ligands that significantly deform the pockets.
Fig. 5.
Fig. 5.. Iterative medicinal chemistry furnished an orally bioavailable inhibitor.
(A) Summary of key medicinal chemistry milestones in developing the initial crowdsourced lead compound into a potent antiviral. X-ray structures for each milestone compound are available via Fragalysis, and each compound can be obtained from Enamine using the corresponding catalog numbers. Retrospective alchemical free-energy calculation predictions for each transformation (ΔΔGFEP) are shown for each step between milestones, along with the corresponding experimental free-energy difference (ΔΔGexp) derived from biochemical activities. As positive control, under our assay condition, nirmatrelvir has an IC50 of 2.6 nM. (B) Antiviral activity of MAT-POS-e194df51-1 cellular antiviral assays, with an EC50 of 64 nM in A549-ACE2-TMPRSS2 cells assessing CPE (black; plotted as 100 – percent viability) and 126 nM in HeLa-ACE2 assays (blue; plotted as percentage of infected cells). Both assays were performed with P-gp inhibitors. (C) Histogram comparing antiviral efficacy of all COVID Moonshot compounds measured to date in an A549-ACE2-TMPRSS2 CPE cellular antiviral assay. (D) Detailed cellular antiviral assessment of key compounds composing the synthetic strategy (A) across different cell lines and assay techniques, with and without p-gp inhibitors, demonstrating efficacy of MAT-POS-e194df51-1 in various setups and laboratories. (E) MAT-POS-e194df51-1 shows good cross-reactivity against known circulating variants of SARS-CoV-2 in antiviral cellular assays in a CPE assay in HeLa-ACE2 cells. (F) PK profile of MAT-POS-e194df51-1 in rats with a 2 mg/kg intravenous and 10 mg/kg oral dosing with good oral availability. (G) ADME characteristics of MAT-POS-e194df51-1 demonstrate a favorable safety profile, indicating translational potential of the lead series.

Comment in

References

    1. Phillips N, The coronavirus is here to stay – Here’s what that means. Nature 590, 382–384 (2021). doi: 10.1038/d41586-021-00396-2 - DOI - PubMed
    1. Winchester N, “Covid-19 vaccinations: Is the Global South falling behind?” (UK House of Lords Library, 2021); https://lordslibrary.parliament.uk/covid-19-vaccinations-is-the-global-s....
    1. Zhao T et al., Warnings regarding the potential coronavirus disease 2019 (COVID-19) transmission risk: Vaccination is not enough. Infect. Control Hosp. Epidemiol 43, 679–680 (2022). doi: 10.1017/ice.2021.63 - DOI - PMC - PubMed
    1. Hammond J et al., Oral Nirmatrelvir for High-Risk, Nonhospitalized Adults with Covid-19. N. Engl. J. Med 386, 1397–1408 (2022). doi: 10.1056/NEJMoa2118542 - DOI - PMC - PubMed
    1. McCarthy MW, Ensitrelvir as a potential treatment for COVID-19. Expert Opin. Pharmacother. 23, 1995–1998 (2022). doi: 10.1080/14656566.2022.2146493 - DOI - PubMed

MeSH terms

Substances