Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 25;10(13):13502-13514.
doi: 10.1021/acsomega.5c00075. eCollection 2025 Apr 8.

pDILI_v1: A Web-Based Machine Learning Tool for Predicting Drug-Induced Liver Injury (DILI) Integrating Chemical Space Analysis and Molecular Fingerprints

Affiliations

pDILI_v1: A Web-Based Machine Learning Tool for Predicting Drug-Induced Liver Injury (DILI) Integrating Chemical Space Analysis and Molecular Fingerprints

Sk Abdul Amin et al. ACS Omega. .

Abstract

Drug-induced liver injury (DILI) represents a critical safety concern for drug development, regulatory oversight, and clinical practice, with substantial economic and public health implications. While predicting DILI risk in humans has garnered significant attention, the associated chemical space has remained insufficiently explored. This study addresses this gap through a comprehensive computational approach, leveraging machine learning (ML) to investigate structural determinants of DILI risk systematically. The study focuses on three key objectives: (i) exploring the chemical space and scaffold diversity associated with DILI; (ii) employing fragment-based approaches to identify structural alerts (SAs) that influence DILI risk; and (iii) developing supervised ML models to not only predict DILI risk but also elucidate the structural significance of molecular fingerprints. To broaden accessibility, we introduce pDILI_v1, a Python-based web application available at https://pdiliv1web.streamlit.app/. This user-friendly platform facilitates the prediction and visualization of DILI risk, enabling both experts and nonexperts to screen compounds effectively. Additional formats, including a Google Colab notebook and a graphical user interface (GUI) for Windows, ensure flexibility for diverse user needs. The proposed models demonstrate the potential for early identification of hepatotoxic risks in drug candidates, providing critical insights into drug discovery and development. By integrating ML-driven predictions with chemical space analysis, this research advances the field of drug safety evaluation, contributing to the development of safer pharmaceuticals and mitigating the risks of DILI.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
Workflow of the current study involves different approaches such as (i) chemical space analysis, (ii) the fragment-based approach, and (iii) ML-based QSAR.
Figure 2
Figure 2
Bin plots of each feature (Class, LogP, MW, nAR, HBA, HBD, nRings, nRB, and TPSA), colored by DILI Toxic (1) and DILI Nontoxic (0).
Figure 3
Figure 3
Representative drug compounds with DILI toxic (T) and DILI nontoxic (N) fingerprints/substructural features. DILI toxic fragments promote DILI risk, while DILI nontoxic fragments hinder DILI risk. These substructural features were produced by the ECFP-6 fingerprint descriptor.
Figure 4
Figure 4
68 descriptors with their importance score.
Figure 5
Figure 5
Heatmap of the correlation matrix of the selected descriptors.
Figure 6
Figure 6
(a) Optimization history plot of the hyperparameter optimization process, (b) Slice plot of specific hyperparameters (max_depth, min_samples_leaf, min_samples_split, and n_estimators) with respect to the objective value.
Figure 7
Figure 7
Partial dependence plot (PDP) of descriptors (a) SLogP, (b) GATS 2d, and (c) AATS0v. Two-variable PDP of chemical descriptors (d) SLogP vs GATS 2d and (e) SLogP vs AATS0v. The contour plot uses color gradients to represent regions corresponding to specific numerical values, as indicated by the contour levels. These values likely reflect a performance metric with the highest values appearing in the yellow-green regions and lower values in the darker blue and purple areas. The gradient transitions from darker to lighter shades, where lighter regions correspond to higher values.
Figure 8
Figure 8
Mechanistic interpretations of the descriptors (SLogP and GATS 2d) and mathematical contributions to the ML model.
Figure 9
Figure 9
Mechanistic interpretations of the descriptors (SLogP and AATS0v) and mathematical contributions to the ML model.
Figure 10
Figure 10
Applicability domain (AD) was based on the leverage approach. The outliers (those outside AD) identified by leverage are highlighted in blue circles.

Similar articles

References

    1. Vaja R.; Rana M. Drugs and the liver. Anaesthesia and Intensive Care Medicine 2020, 21 (10), 517–523. 10.1016/j.mpaic.2020.07.001. - DOI
    1. Andrade R. J.; Chalasani N.; Björnsson E. S.; Suzuki A.; Kullak-Ublick G. A.; Watkins P. B.; Devarbhavi H.; Merz M.; Lucena M. I.; Kaplowitz N.; Aithal G. P. Drug-induced liver injury. Nat. Rev. Dis. Prim. 2019, 5 (1), 58.10.1038/s41572-019-0105-0. - DOI - PubMed
    1. Licata A. Adverse drug reactions and organ damage: The liver. European Journal of Internal Medicine 2016, 28, 9–16. 10.1016/j.ejim.2015.12.017. - DOI - PubMed
    1. Onakpoya I. J.; Heneghan C. J.; Aronson J. K. Post-marketing withdrawal of 462 medicinal products because of adverse drug reactions: A systematic review of the world literature. BMC Med. 2016, 14 (1), 10.10.1186/s12916-016-0553-2. - DOI - PMC - PubMed
    1. Raschi E.; De Ponti F. Strategies for early prediction and timely recognition of drug-induced liver injury: The case of cyclin-dependent kinase 4/6 inhibitors. Front. Pharmacol. 2019, 10, 1235.10.3389/fphar.2019.01235. - DOI - PMC - PubMed

LinkOut - more resources