Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Nov 15:298:115620.
doi: 10.1016/j.jep.2022.115620. Epub 2022 Aug 10.

Identification of intrinsic hepatotoxic compounds in Polygonum multiflorum Thunb. using machine-learning methods

Affiliations

Identification of intrinsic hepatotoxic compounds in Polygonum multiflorum Thunb. using machine-learning methods

Xiaowen Hu et al. J Ethnopharmacol. .

Abstract

Ethnopharmacological relevance: Polygonum multiflorum Thunb. (PM) is a herb, extracts of which have been used as Chinese medicine for years. Although it is believed to be beneficial to the liver, heart, and kidneys, it causes idiosyncratic drug-induced liver injury (DILI).

Aim of the study: We propose that the intrinsic DILI caused by natural products in PM (NPPM) is an important complementary mechanism to PM-related herb-induced liver injury, and aim to identify the ingredients with high DILI potential by machine learning methods.

Materials and methods: One hundred and ninety-seven NPPM were collected from the literature to identify the intrinsic hepatotoxic compounds. Additionally, a DILI-labeled dataset consisting of 2384 compounds was collected and randomly split into training and test sets. A diparametric optimization method was developed to tune the parameters of extended-connectivity fingerprints (ECFPs), Rdkit, and atom-pair fingerprints as well as those of machine-learning (ML) algorithms. Subsequently, K means were employed to cluster the NPPM that were predicted to have a high DILI risk. An in vitro cell-viability assay was performed using HepaRG cells to validate the prediction results.

Results: ECFPs with the top 35% of features ranked by the F-value with support vector machine (SVM) yielded the best performance. The optimized SVM model achieved an accuracy of 0.761 and recall value of 0.834 on the test dataset. The silico screening for NPPM resulted in 47 ingredients with high DILI potential, which were clustered into six groups based on the elbow method. A representative subgroup that contained 21 ingredients, of which two dianthrones exhibited the lowest IC50 value (0.7-0.9 μM) and anthraquinones showed moderate toxicity (15-25 μM), was constructed.

Conclusion: Using ML methods and in vitro screening, two classes of compounds, dianthrones and anthraquinones, were predicted and validated to have a high risk of DILI. The diparametric optimization method used in this study could provide a useful and powerful tool to screen toxicants for large datasets and is available at https://github.com/dreadlesss/Hepatotoxicity_predictor.

Keywords: Dianthrones; Drug-induced liver injury; K means; Machine learning; Polygonum multiflorum Thunb..

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest The authors declare that there are no conflicts of interest.

Similar articles

Cited by

MeSH terms

LinkOut - more resources