. 2024 Oct 14;14(1):24045.

doi: 10.1038/s41598-024-71169-w.

Explainable artificial intelligence (XAI) to find optimal in-silico biomarkers for cardiac drug toxicity evaluation

Muhammad Adnan Pramudito¹, Yunendah Nur Fuadah^{1

2}, Ali Ikhsanul Qauli^{1

3}, Aroli Marcellinus¹, Ki Moo Lim^{4

5

6}

Affiliations

¹ Computational Medicine Lab, Department of IT Convergence Engineering, Kumoh National Institute of Technology, Gumi, 39177, Republic of Korea.
² School of Electrical Engineering, Telkom University, Bandung, 40257, Indonesia.
³ Department of Engineering, Faculty of Advanced Technology and Multidiscipline, Universitas Airlangga, Surabaya, 60115, Jawa Timur, Indonesia.
⁴ Computational Medicine Lab, Department of IT Convergence Engineering, Kumoh National Institute of Technology, Gumi, 39177, Republic of Korea. kmlim@kumoh.ac.kr.
⁵ Computational Medicine Lab, Department of Medical IT Convergence Engineering, Kumoh National Institute of Technology, Gumi, 39177, Republic of Korea. kmlim@kumoh.ac.kr.
⁶ Meta Heart Co., Ltd., Gumi, 39253, Republic of Korea. kmlim@kumoh.ac.kr.

PMID: 39402077
PMCID: PMC11473646
DOI: 10.1038/s41598-024-71169-w

Explainable artificial intelligence (XAI) to find optimal in-silico biomarkers for cardiac drug toxicity evaluation

Muhammad Adnan Pramudito et al. Sci Rep. 2024.

. 2024 Oct 14;14(1):24045.

doi: 10.1038/s41598-024-71169-w.

Authors

Muhammad Adnan Pramudito¹, Yunendah Nur Fuadah^{1

2}, Ali Ikhsanul Qauli^{1

3}, Aroli Marcellinus¹, Ki Moo Lim^{4

5

6}

Affiliations

¹ Computational Medicine Lab, Department of IT Convergence Engineering, Kumoh National Institute of Technology, Gumi, 39177, Republic of Korea.
² School of Electrical Engineering, Telkom University, Bandung, 40257, Indonesia.
³ Department of Engineering, Faculty of Advanced Technology and Multidiscipline, Universitas Airlangga, Surabaya, 60115, Jawa Timur, Indonesia.
⁴ Computational Medicine Lab, Department of IT Convergence Engineering, Kumoh National Institute of Technology, Gumi, 39177, Republic of Korea. kmlim@kumoh.ac.kr.
⁵ Computational Medicine Lab, Department of Medical IT Convergence Engineering, Kumoh National Institute of Technology, Gumi, 39177, Republic of Korea. kmlim@kumoh.ac.kr.
⁶ Meta Heart Co., Ltd., Gumi, 39253, Republic of Korea. kmlim@kumoh.ac.kr.

PMID: 39402077
PMCID: PMC11473646
DOI: 10.1038/s41598-024-71169-w

Abstract

The Comprehensive In-vitro Proarrhythmia Assay (CiPA) initiative aims to refine the assessment of drug-induced torsades de pointes (TdP) risk, utilizing computational models to predict cardiac drug toxicity. Despite advancements in machine learning applications for this purpose, the specific contribution of in-silico biomarkers to toxicity risk levels has yet to be thoroughly elucidated. This study addresses this gap by implementing explainable artificial intelligence (XAI) to illuminate the impact of individual biomarkers in drug toxicity prediction. We employed the Markov chain Monte Carlo method to generate a detailed dataset for 28 drugs, from which twelve in-silico biomarkers of 12 drugs were computed to train various machine learning models, including Artificial Neural Networks (ANN), Support Vector Machines (SVM), Random Forests (RF), XGBoost, K-Nearest Neighbors (KNN), and Radial Basis Function (RBF) networks. Our study's innovation is leveraging XAI, mainly through the SHAP (SHapley Additive exPlanations) method, to dissect and quantify the contributions of biomarkers across these models. Furthermore, the model performance was evaluated using the test set from 16 drugs. We found that the ANN model coupled with the eleven most influential in-silico biomarkers namely ${\frac{dVm}{dt}}_{repol}, {\frac{dVm}{dt}}_{\max}, {APD}_{90}, {APD}_{50}, {APD}_{tri}, {CaD}_{90}, {CaD}_{50}, {Ca}_{tri}, {Ca}_{Diastole}, q I n w a r d, a n d q N e t$ showed the highest classification performance among all classifiers with Area Under the Curve (AUC) scores of 0.92 for predicting high-risk, 0.83 for intermediate-risk, and 0.98 for low-risk drugs. We also found that the optimal in silico biomarkers selected based on SHAP analysis may be different for various classification models. However, we also found that the biomarker selection only sometimes improved the performance; therefore, evaluating various classifiers is still essential to obtain the desired classification performance. Our proposed method could provide a systematic way to assess the best classifier with the optimal in-silico biomarkers for predicting the TdP risk of drugs, thereby advancing the field of cardiac safety evaluations.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

**Fig. 1**
Illustration of our proposed algorithm for evaluating proarrhythmic drug risk, identifying key biomarkers in high, intermediate, and low-risk groups. The figure outlines the process, including preprocessing in-vitro data, sample generation, feature variability simulation, and identifying significant biomarkers using XAI algorithms (ANN). System performance is evaluated through model testing based on the XAI approach.

**Fig. 2**
Illustration of in-silico biomarkers in AP profile and Ca profile, which consisted of repolarization ${\frac{dVm}{dt}}_{repol}, {\frac{dVm}{dt}}_{\max}, {Vm}_{resting}, {APD}_{90}, {APD}_{50}, {APD}_{tri}, {CaD}_{90}, {CaD}_{50}, {Ca}_{tri}, {Ca}_{Diastole}, q I n w a r d,$ and $qNet$ .

**Fig. 3**
(a) Schematic representation of the ANN classification model, which employs twelve in-silico biomarkers as inputs. The model architecture comprises three hidden layers, each consisting of six neurons. Outputs from the ANN model are categorized into three risk classes: high-risk, intermediate-risk, and low-risk for TdP. (b) Illustration of the XGBoost classifier model. This model utilizes twelve in-silico features to train an ensemble of 200 decision trees. The trees are built sequentially with each tree learning from the errors (residuals) of the previous ones, thereby refining the classification. The node splitting is guided by an objective function, and the final output is the sum of predictions from all trees. (c) Depiction of the Random Forest training process. Starting with twelve in-silico features, the method employs bootstrap sampling to create multiple training sets. Each set is used to train a decision tree, resulting in 200 trees. The classification outcome for a sample is then determined by majority voting or averaging the results from all decision trees. (d) Workflow of the KNN classifier. The process begins with twelve in-silico features and initializes using K mean. It calculates the distance between training and testing points, sorts by distance, and applies majority voting for TdP risk classification, resulting in the final output. (e) Process diagram for the SVM classifier. The model initiates with twelve in-silico features and uses kernel mean initialization. Parameters Y & C are then established, followed by the training phase, culminating in the TdP risk classification. (f) Diagram of the RBF. The network starts with twelve in-silico features leading into a hidden layer that applies the RBF for transformation. The output is divided into three risk categories: high, intermediate, and low risk for TdP.

**Fig. 4**
An evaluation algorithm was employed to assess the performance of the classification model proposed by the CiPA research group, utilizing the principles of the central limit theorem; AUC, the area under the receiver operating curve; LR, likelihood ratio.

**Fig. 5**
(a) Feature importance visualization for the ANN model. The bar chart displays the mean SHAP values of each in-silico biomarker across three classes: high-risk, intermediate-risk, and low-risk. The feature qInward appears to be the most influential for high-risk classification, while Vm Resting has the least impact. (b) Feature importance chart for the XGBoost model. The graph illustrates the average impact of each in-silico biomarker on the model’s output, with higher mean SHAP values indicating greater importance. For high-risk classification, qInward and dVmdt Repol show significant influence. (c) Summary of SHAP values by class for the RF model. This bar chart represents the mean SHAP values by class, highlighting the features that most strongly affect the model’s predictions, with qInward showing a high impact on the high-risk class. (d) SHAP value summary for the SVM model. The bar chart details the feature importance, where CaD_50 is notably influential across all risk categories, suggesting a critical role in the model’s risk stratification process. (e) Feature importance for the KNN model, depicted through mean SHAP values. CaD_50 and APD_90 stand out as key features with high importance for the high-risk and intermediate-risk classifications, respectively. (f) Visualization of feature importance for the RBF model. The chart highlights the mean SHAP values with dVmdt Max showing a prominent role in distinguishing across all of risk category.

See this image and copyright information in PMC

Cited by

Machine Learning on Toxicogenomic Data Reveals a Strong Association Between the Induction of Drug-Metabolizing Enzymes and Centrilobular Hepatocyte Hypertrophy in Rats.
Ikoma K, Hosaka T, Ooka A, Shizu R, Yoshinari K. Ikoma K, et al. Int J Mol Sci. 2025 May 20;26(10):4886. doi: 10.3390/ijms26104886. Int J Mol Sci. 2025. PMID: 40430025 Free PMC article.
Validation of new AI-based classification method for in silico cardiac safety assessment of drugs following the CiPA framework.
Hanum UL, Qauli AI, Fuadah YN, Izza RN, Lim KM. Hanum UL, et al. Arch Toxicol. 2025 Sep;99(9):3735-3749. doi: 10.1007/s00204-025-04079-z. Epub 2025 May 22. Arch Toxicol. 2025. PMID: 40405016

References

1. Li, M. & Ramos, L. G. Drug-Induced QT Prolongation And Torsades de Pointes PHARMACOVIGILANCE FORUM. P&T® vol. 42 www.crediblemeds.org (2017). - PMC - PubMed
1. Gintant, G. A. Preclinical Torsades-de-Pointes Screens: Advantages and limitations of surrogate and direct approaches in evaluating proarrhythmic risk. Pharmacol. Therap.119, 199–209. 10.1016/j.pharmthera.2008.04.010 (2008). - PubMed
1. Crumb, W. J., Vicente, J., Johannesen, L. & Strauss, D. G. An evaluation of 30 clinical drugs against the comprehensive in vitro proarrhythmia assay (CiPA) proposed ion channel panel. J. Pharmacol. Toxicol. Methods81, 251–262 (2016). - PubMed
1. Sager, P. T., Gintant, G., Turner, J. R., Pettit, S. & Stockbridge, N. Rechanneling the cardiac proarrhythmia safety paradigm: A meeting report from the Cardiac Safety Research Consortium. Am. Heart J.167, 292–300. 10.1016/j.ahj.2013.11.004 (2014). - PubMed
1. Strauss, D. G. et al. Comprehensive in vitro proarrhythmia assay (CiPA) update from a Cardiac Safety Research Consortium/Health and Environmental Sciences Institute/FDA meeting. Ther. Innov. Regul. Sci.53, 519–525 (2019). - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

LinkOut - more resources

Full Text Sources
- Nature Publishing Group
- PubMed Central
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Explainable artificial intelligence (XAI) to find optimal in-silico biomarkers for cardiac drug toxicity evaluation

Affiliations

Explainable artificial intelligence (XAI) to find optimal in-silico biomarkers for cardiac drug toxicity evaluation

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Miscellaneous

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

MeSH terms

Substances

Related information

LinkOut - more resources

Full Text Sources

Miscellaneous