Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Mar 30;15(4):891.
doi: 10.3390/v15040891.

Discovery of Potential Inhibitors of SARS-CoV-2 Main Protease by a Transfer Learning Method

Affiliations

Discovery of Potential Inhibitors of SARS-CoV-2 Main Protease by a Transfer Learning Method

Huijun Zhang et al. Viruses. .

Abstract

The COVID-19 pandemic caused by SARS-CoV-2 remains a global public health threat and has prompted the development of antiviral therapies. Artificial intelligence may be one of the strategies to facilitate drug development for emerging and re-emerging diseases. The main protease (Mpro) of SARS-CoV-2 is an attractive drug target due to its essential role in the virus life cycle and high conservation among SARS-CoVs. In this study, we used a data augmentation method to boost transfer learning model performance in screening for potential inhibitors of SARS-CoV-2 Mpro. This method appeared to outperform graph convolution neural network, random forest and Chemprop on an external test set. The fine-tuned model was used to screen for a natural compound library and a de novo generated compound library. By combination with other in silico analysis methods, a total of 27 compounds were selected for experimental validation of anti-Mpro activities. Among all the selected hits, two compounds (gyssypol acetic acid and hyperoside) displayed inhibitory effects against Mpro with IC50 values of 67.6 μM and 235.8 μM, respectively. The results obtained in this study may suggest an effective strategy of discovering potential therapeutic leads for SARS-CoV-2 and other coronaviruses.

Keywords: SARS-CoV-2 Mpro; deep learning; drug development; natural compound; transfer learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
t-Distributed stochastic neighbor embedding (t-SNE) analysis of (A) active molecules (magenta) and inactive molecules (cyan) of the original dataset; (B) molecules in five subsets; (C) active molecules in five subsets; (D) molecules in original dataset (cyan) and the independent test dataset (red).
Figure 2
Figure 2
Top ranked 20 natural compounds screened by fine-tuned model. Rankings were from averages of five independent model predictions.
Figure 3
Figure 3
Top ranked 20 de novo generated molecules screened by fine-tuned model. Rankings were from averages of five independent model predictions.
Figure 4
Figure 4
Inhibition of SARS-CoV-2 Mpro. (A) Inhibition percentage of selected compounds at concentrations of 200 μM. (B) Inhibition percentage of selected compounds at concentrations of 40 μM. (C) Representative curves of Boceprevir, compound T2730 and T2844. All data are from at least three independent experiments and shown as mean ± SD.

References

    1. Wu F., Zhao S., Yu B., Chen Y.M., Wang W., Song Z.G., Hu Y., Tao Z.W., Tian J.H., Pei Y.Y., et al. A new coronavirus associated with human respiratory disease in China. Nature. 2020;579:265–269. doi: 10.1038/s41586-020-2008-3. - DOI - PMC - PubMed
    1. Lu R., Zhao X., Li J., Niu P., Yang B., Wu H., Wang W., Song H., Huang B., Zhu N., et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: Implications for virus origins and receptor binding. Lancet. 2020;395:565–574. doi: 10.1016/S0140-6736(20)30251-8. - DOI - PMC - PubMed
    1. Anand K., Ziebuhr J., Wadhwani P., Mesters J.R., Hilgenfeld R. Coronavirus main proteinase (3CLpro) structure: Basis for design of anti-SARS drugs. Science. 2003;300:1763–1767. doi: 10.1126/science.1085658. - DOI - PubMed
    1. Kim D., Lee J.Y., Yang J.S., Kim J.W., Kim V.N., Chang H. The Architecture of SARS-CoV-2 Transcriptome. Cell. 2020;181:914–921.e10. doi: 10.1016/j.cell.2020.04.011. - DOI - PMC - PubMed
    1. Chen Y., Liu Q., Guo D. Emerging coronaviruses: Genome structure, replication, and pathogenesis. J. Med. Virol. 2020;92:418–423. doi: 10.1002/jmv.25681. - DOI - PMC - PubMed

Publication types