Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 14;65(13):6884-6898.
doi: 10.1021/acs.jcim.5c00535. Epub 2025 Jun 20.

CACHE Challenge #2: Targeting the RNA Site of the SARS-CoV-2 Helicase Nsp13

Oleksandra Herasymenko  1 Madhushika Silva  1 Abd Al-Aziz A Abu-Saleh  1 Ayaz Ahmad  2 Jesus Alvarado-Huayhuaz  3 Oscar E A Arce  3 Roly J Armstrong  2 Cheryl Arrowsmith  1   4   5 Kelly E Bachta  6 Hartmut Beck  7 Denes Berta  8 Mateusz K Bieniek  2 Vincent Blay  9 Albina Bolotokova  1 Philip E Bourne  10 Marko Breznik  11 Peter J Brown  12 Aaron D G Campbell  2 Emanuele Carosati  13 Irene Chau  1 Daniel J Cole  2 Ben Cree  2 Wim Dehaen  14   15 Katrin Denzinger  11 Karina Dos Santos Machado  3 Ian Dunn  16 Prasannavenkatesh Durai  17 Kristina Edfeldt  18 Aled Edwards  1 Darren Fayne  19   20 Daniel Felfoldi  8 Kallie Friston  2 Pegah Ghiabi  1 Elisa Gibson  1 Judith Günther  21 Anders Gunnarsson  22 Alexander Hillisch  23 Douglas R Houston  24 Jan Halborg Jensen  25 Rachel J Harding  1   26   27 Kate S Harris  2 Laurent Hoffer  28 Anders Hogner  29 Joshua T Horton  2 Scott Houliston  5 Judd F Hultquist  6   30 Ashley Hutchinson  1 John J Irwin  31 Marko Jukič  32   33 Shubhangi Kandwal  34   19   20 Andrea Karlova  35 Vittorio L Katis  36 Ryan P Kich  6 Dmitri Kireev  37 David Koes  16 Nicole L Inniss  38 Uta Lessel  39 Sijie Liu  11 Peter Loppnau  1 Wei Lu  40 Sam Martino  8 Miles McGibbon  25 Jens Meiler  41   42 Akhila Mettu  37 Sam Money-Kyrle  9 Rocco Moretti  41   42 Yurii S Moroz  43 Charuvaka Muvva  17 Joseph A Newman  36 Leon Obendorf  11 Brooks Paige  35 Amit Pandit  11 Keunwan Park  17 Sumera Perveen  1 Rachael Pirie  2 Gennady Poda  27   28 Mykola Protopopov  43   44 Vera Pütter  45 Federico Ricci  46 Natalie J Roper  2 Edina Rosta  8 Margarita Rzhetskaya  6   30 Yogesh Sabnis  47 Karla J F Satchell  38 Frederico Schmitt Kremer  48 Thomas Scott  41   42 Almagul Seitova  1 Casper Steinmann  49 Valerij Talagayev  11 Olga O Tarkhanova  43 Natalie J Tatum  50 Dakota Treleaven  51 Adriano Velasque Werhli  3 W Patrick Walters  52 Xiaowen Wang  37 Jude Wells  35 Geoffrey Wells  53 Yvonne Westermaier  54 Gerhard Wolber  11 Lars Wortmann  39 Jixian Zhang  40 Zheng Zhao  10 Shuangjia Zheng  55 Matthieu Schapira  1   5   26
Affiliations

CACHE Challenge #2: Targeting the RNA Site of the SARS-CoV-2 Helicase Nsp13

Oleksandra Herasymenko et al. J Chem Inf Model. .

Abstract

A critical assessment of computational hit-finding experiments (CACHE) challenge was conducted to predict ligands for the SARS-CoV-2 Nsp13 helicase RNA binding site, a highly conserved COVID-19 target. Twenty-three participating teams comprised of computational chemists and data scientists used protein structure and data from fragment-screening paired with advanced computational and machine learning methods to each predict up to 100 inhibitory ligands. Across all teams, 1957 compounds were predicted and were subsequently procured from commercial catalogs for biophysical assays. Of these compounds, 0.7% were confirmed to bind to Nsp13 in a surface plasmon resonance assay. The six best-performing computational workflows used fragment growing, active learning, or conventional virtual screening with and without complementary deep-learning scoring functions. Follow-up functional assays resulted in identification of two compound scaffolds that bound Nsp13 with a Kd below 10 μM and inhibited in vitro helicase activity. Overall, CACHE #2 participants were successful in identifying hit compound scaffolds targeting Nsp13, a central component of the coronavirus replication-transcription complex. Computational design strategies recurrently successful across the first two CACHE challenges include linking or growing docked or crystallized fragments and docking small and diverse libraries to train ultrafast machine-learning models. The CACHE #2 competition reveals how crowd-sourcing ligand prediction efforts using a distinct array of approaches followed with critical biophysical assays can result in novel lead compounds to advance drug discovery efforts.

PubMed Disclaimer

Conflict of interest statement

Disclosures

The authors declare the following competing financial interest(s): U.L, Y.W and L.W are full-time employees of Boehringer Ingelheim, Y.S and A.H are full time employees of UCB and may also be stockholders.

Figures

Figure 1:
Figure 1:. Fragments occupy the SARS-CoV-2 Nsp13 RNA-binding channel.
Composite image formed by superimposing experimental structures of Nsp13 in complex with four fragments and in complex with RNA and ADP (blue and orange respectively; PDB code 7RDY). CACHE #2 participants were asked to find ligands targeting the RNA-binding site occupied by fragments. Electrostatic potential coloring of the binding site, revealing the overall polar area, and bound fragments are depicted in the inset.
Figure 2:
Figure 2:. Computational workflows used in CACHE #2
Figure 3:
Figure 3:. Drug-likeness and chemical diversity of 1957 Round 1 compounds.
(a) Number of compounds tested in Round 1 and advanced to Round 2 for each participant. (b) Chemical descriptors distribution of Round 1 compounds. (c) Pairwise Tanimoto distance matrix, using ECFP4 Morgan fingerprints from RDKit (compounds are ordered based on the selected teams). (d) closest analogs selected by different participants. MW: molecular weight; PSA: polar surface area; HBD: hydrogen-bond donors; HBA: hydrogen-bond acceptors; ROTB: rotatable bonds; TD: Tanimoto Distance.
Figure 4:
Figure 4:. Experimental evaluation of CACHE #1 Round 1 compounds.
Binding to Nsp13 measured by SPR and ATPase activity inhibition was used to advance compounds to Round 2.
Figure 5:
Figure 5:. Top six chemical series identified in Round 2.
Activity of the parent molecules and experimental data from Round 2 analogs are shown, including SPR sensorgrams and 19F-NMR spectra. Computational workflow IDs are encoded into compound names.
Figure 5:
Figure 5:. Top six chemical series identified in Round 2.
Activity of the parent molecules and experimental data from Round 2 analogs are shown, including SPR sensorgrams and 19F-NMR spectra. Computational workflow IDs are encoded into compound names.
Figure 5:
Figure 5:. Top six chemical series identified in Round 2.
Activity of the parent molecules and experimental data from Round 2 analogs are shown, including SPR sensorgrams and 19F-NMR spectra. Computational workflow IDs are encoded into compound names.
Figure 6:
Figure 6:. Two compounds inhibited RNA duplex unwinding.
Out of the 13 most potent compounds in the SPR assay (Table S7), CACHE2-HO-1431_6 and CACHE2-HO_1454_15 had measurable IC50 values in a FRET-based RNA unwinding assay (a),- and had a detectable inhibitory effect in a gel-based RNA unwinding assay when added at 1 mM (b).
Figure 7:
Figure 7:. Scores of CACHE2 participants.
For each team, the aggregated score of all Round 1 hits (a) or the score of the best Round 1 hit (b) selected from the Enamine Real library is plotted. (c) Normalized score when predicting active molecules from the 1957 Round 1 compounds (calculated as the aggregated score of all compounds predicted active divided by the number of compounds predicted active). The score of each molecule was assigned by the CACHE Hit Evaluation Committee (Table S1).
Figure 8:
Figure 8:. Best performing workflows.
(a) Group, workflow ID, and associated ranks in three evaluation schemes. (b) Schematics of the computational workflows.
Figure 9:
Figure 9:. Docked poses of top compounds.
The docked poses of some of the best scoring CACHE #2 hits (right) compared with the crystal structure of fragments found in the PDB (left). RNA from a superimposed cryo-EM structure shown in blue (PDB code: 7RDY). CACHE_1454_98 was docked to an alternate site.
Figure 10:
Figure 10:. Classification of most successful workflows.
Computational workflows are classified based on hit-prediction strategies. Computational steps using machine-learning are highlighted in blue. Software names are shown in italic.

Similar articles

References

    1. Kryshtafovych A; Schwede T; Topf M; Fidelis K; Moult J Critical Assessment of Methods of Protein Structure Prediction (CASP)-Round XV. Proteins 2023, 91 (12), 1539–1549. 10.1002/prot.26617. - DOI - PMC - PubMed
    1. Ackloo S; Al-Awar R; Amaro RE; Arrowsmith CH; Azevedo H; Batey RA; Bengio Y; Betz UAK; Bologa CG; Chodera JD; Cornell WD; Dunham I; Ecker GF; Edfeldt K; Edwards AM; Gilson MK; Gordijo CR; Hessler G; Hillisch A; Hogner A; Irwin JJ; Jansen JM; Kuhn D; Leach AR; Lee AA; Lessel U; Morgan MR; Moult J; Muegge I; Oprea TI; Perry BG; Riley P; Rousseaux SAL; Saikatendu KS; Santhakumar V; Schapira M; Scholten C; Todd MH; Vedadi M; Volkamer A; Willson TM CACHE (Critical Assessment of Computational Hit-Finding Experiments): A Public-Private Partnership Benchmarking Initiative to Enable the Development of Computational Methods for Hit-Finding. Nat Rev Chem 2022, 6 (4), 287–295. 10.1038/s41570-022-00363-z. - DOI - PMC - PubMed
    1. Li F; Ackloo S; Arrowsmith CH; Ban F; Barden CJ; Beck H; Beránek J; Berenger F; Bolotokova A; Bret G; Breznik M; Carosati E; Chau I; Chen Y; Cherkasov A; Corte DD; Denzinger K; Dong A; Draga S; Dunn I; Edfeldt K; Edwards A; Eguida M; Eisenhuth P; Friedrich L; Fuerll A; Gardiner SS; Gentile F; Ghiabi P; Gibson E; Glavatskikh M; Gorgulla C; Guenther J; Gunnarsson A; Gusev F; Gutkin E; Halabelian L; Harding RJ; Hillisch A; Hoffer L; Hogner A; Houliston S; Irwin JJ; Isayev O; Ivanova A; Jacquemard C; Jarrett AJ; Jensen JH; Kireev D; Kleber J; Koby SB; Koes D; Kumar A; Kurnikova MG; Kutlushina A; Lessel U; Liessmann F; Liu S; Lu W; Meiler J; Mettu A; Minibaeva G; Moretti R; Morris CJ; Narangoda C; Noonan T; Obendorf L; Pach S; Pandit A; Perveen S; Poda G; Polishchuk P; Puls K; Pütter V; Rognan D; Roskams-Edris D; Schindler C; Sindt F; Spiwok V; Steinmann C; Stevens RL; Talagayev V; Tingey D; Vu O; Walters WP; Wang X; Wang Z; Wolber G; Wolf CA; Wortmann L; Zeng H; Zepeda CA; Zhang KYJ; Zhang J; Zheng S; Schapira M CACHE Challenge #1: Targeting the WDR Domain of LRRK2, A Parkinson’s Disease Associated Protein. J Chem Inf Model 2024, 64 (22), 8521–8536. 10.1021/acs.jcim.4c01267. - DOI - PubMed
    1. Mullard A When Can AI Deliver the Drug Discovery Hits? Nat Rev Drug Discov 2024, 23 (3), 159–161. 10.1038/d41573-024-00036-0. - DOI - PubMed
    1. Newman JA; Douangamath A; Yadzani S; Yosaatmadja Y; Aimon A; Brandão-Neto J; Dunnett L; Gorrie-Stone T; Skyner R; Fearon D; Schapira M; von Delft F; Gileadi O Structure, Mechanism and Crystallographic Fragment Screening of the SARS-CoV-2 NSP13 Helicase. Nat Commun 2021, 12 (1), 4848. 10.1038/s41467-021-25166-6. - DOI - PMC - PubMed

LinkOut - more resources