Enhancing HCV NS3 Inhibitor Classification with Optimized Molecular Fingerprints Using Random Forest
- PMID: 40141322
- PMCID: PMC11943357
- DOI: 10.3390/ijms26062680
Enhancing HCV NS3 Inhibitor Classification with Optimized Molecular Fingerprints Using Random Forest
Abstract
The classification of Hepatitis C virus (HCV) NS3 inhibitors is essential for identifying potential antiviral agents through computational methods. This study aims to develop an optimized machine learning (ML) model using random forest (RF) and molecular fingerprints to accurately classify HCV NS3 inhibitors. A dataset of 965 molecules was retrieved from the ChEMBL database, and 290 bioactive compounds were selected for model training. Twelve molecular fingerprint descriptors were tested, and the CDK graph-only fingerprint yielded the best performance. In addition to RF, performance comparisons of other classifiers such as instance-based k-nearest neighbor (IBk), logistic regression (LR), AdaBoost, and OneR were conducted using WEKA with various molecular fingerprint descriptors. The optimized RF model achieved an accuracy of 89.6552%, a mean absolute error (MAE) of 0.2114, a root mean square error (RMSE) of 0.3304, and a Matthews correlation coefficient (MCC) of 0.7950 on the test set. These results highlight the effectiveness of optimized molecular fingerprints in enhancing virtual screening (VS) for HCV inhibitors. This approach offers a data-driven method for drug discovery.
Keywords: HCV NS3 inhibitors; QSAR; computational drug design; machine learning; molecular descriptor optimization.
Conflict of interest statement
The author declares no conflicts of interest.
Figures







Similar articles
-
iDNS3IP: Identification and Characterization of HCV NS3 Protease Inhibitory Peptides.Int J Mol Sci. 2025 Jun 3;26(11):5356. doi: 10.3390/ijms26115356. Int J Mol Sci. 2025. PMID: 40508165 Free PMC article.
-
On the steroids extracted from soft corals against the NS3/4A protease of hepatitis C virus.J Mol Graph Model. 2025 May;136:108936. doi: 10.1016/j.jmgm.2024.108936. Epub 2024 Dec 27. J Mol Graph Model. 2025. PMID: 39793475
-
Pharmacophore-Assisted Covalent Docking Identifies a Potential Covalent Inhibitor for Drug-Resistant Genotype 3 Variants of Hepatitis C Viral NS3/4A Serine Protease.Viruses. 2024 Aug 3;16(8):1250. doi: 10.3390/v16081250. Viruses. 2024. PMID: 39205224 Free PMC article.
-
A review of HCV protease inhibitors.Curr Opin Investig Drugs. 2009 Aug;10(8):821-37. Curr Opin Investig Drugs. 2009. PMID: 19649927 Review.
-
Macrocyclic Hepatitis C Virus NS3/4A Protease Inhibitors: An Overview of Medicinal Chemistry.Curr Med Chem. 2016;23(29):3404-3447. doi: 10.2174/0929867323666160510122525. Curr Med Chem. 2016. PMID: 27160539 Review.
Cited by
-
Drug repurposing targeting COVID-19 3CL protease using molecular docking and machine learning regression approaches.Sci Rep. 2025 May 28;15(1):18722. doi: 10.1038/s41598-025-02773-7. Sci Rep. 2025. PMID: 40436944 Free PMC article.
References
-
- World Health Organization Hepatitis C. [(accessed on 20 December 2024)]. Available online: https://www.who.int/news-room/fact-sheets/detail/hepatitis-c.
-
- Bunally S.B., Luscombe C.N., Young R.J. Using Physicochemical Measurements to Influence Better Compound Design. SLAS Discov. Adv. Life Sci. R&D. 2019;24:791–801. - PubMed
-
- Atasever S. In Silico Drug Discovery: A Machine Learning-Driven Systematic Review. Med. Chem. Res. 2024;33:1465–1490. doi: 10.1007/s00044-024-03260-w. - DOI
MeSH terms
Substances
LinkOut - more resources
Full Text Sources