Interpretable machine learning models for hospital readmission prediction: a two-step extracted regression tree approach
- PMID: 37277767
- PMCID: PMC10243084
- DOI: 10.1186/s12911-023-02193-5
Abstract
Background: Advanced machine learning models have received wide attention in assisting medical decision making because of the greater accuracy they can achieve. However, their limited interpretability is a barrier to adoption by practitioners. Recent advances in interpretable machine learning tools make it possible to look inside the black box of advanced prediction methods and extract interpretable models while maintaining similar prediction accuracy, but few studies have applied this idea to the specific problem of hospital readmission prediction.
Methods: Our goal is to develop a machine learning (ML) algorithm that can predict 30- and 90-day hospital readmissions as accurately as black box algorithms while providing medically interpretable insights into readmission risk factors. Leveraging a state-of-the-art interpretable ML model, we use a two-step Extracted Regression Tree approach to achieve this goal. In the first step, we train a black box prediction algorithm. In the second step, we extract a regression tree from the output of the black box algorithm that allows direct interpretation of medically relevant risk factors. We use data from a large teaching hospital in Asia to train the ML model and verify our two-step approach.
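The two steps described above can be sketched in a few lines of standard-library Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the k-nearest-neighbour scorer stands in for whatever black box model is trained in step one, the greedy tree builder is a generic regression tree, and the feature names and synthetic data are hypothetical.

```python
import random

def knn_black_box(train_X, train_y, k=15):
    # Step 1 stand-in (assumption): an opaque k-nearest-neighbour risk scorer.
    def predict(x):
        order = sorted(range(len(train_X)),
                       key=lambda i: sum((a - b) ** 2 for a, b in zip(train_X[i], x)))
        return sum(train_y[i] for i in order[:k]) / k
    return predict

def sse(values):
    # Sum of squared errors around the mean of `values`.
    m = sum(values) / len(values)
    return sum((v - m) ** 2 for v in values)

def fit_tree(X, scores, depth=0, max_depth=2, min_leaf=10):
    # Step 2: greedy regression tree fitted to the black box scores, not the raw labels.
    best = None
    if depth < max_depth:
        for j in range(len(X[0])):
            for thr in sorted({row[j] for row in X})[:-1]:
                left = [i for i, row in enumerate(X) if row[j] <= thr]
                right = [i for i, row in enumerate(X) if row[j] > thr]
                if min(len(left), len(right)) < min_leaf:
                    continue
                cost = sse([scores[i] for i in left]) + sse([scores[i] for i in right])
                if best is None or cost < best[0]:
                    best = (cost, j, thr, left, right)
    if best is None:
        return {"value": sum(scores) / len(scores)}  # leaf: mean black box score
    _, j, thr, left, right = best
    return {
        "feature": j, "threshold": thr,
        "left": fit_tree([X[i] for i in left], [scores[i] for i in left],
                         depth + 1, max_depth, min_leaf),
        "right": fit_tree([X[i] for i in right], [scores[i] for i in right],
                          depth + 1, max_depth, min_leaf),
    }

def predict_tree(tree, x):
    # Walk from the root to a leaf and return the leaf's risk estimate.
    while "value" not in tree:
        tree = tree["left"] if x[tree["feature"]] <= tree["threshold"] else tree["right"]
    return tree["value"]

def describe(tree, names, indent=""):
    # Print the extracted tree as nested if/else rules.
    if "value" in tree:
        print(f"{indent}predicted risk = {tree['value']:.2f}")
        return
    print(f"{indent}if {names[tree['feature']]} <= {tree['threshold']}:")
    describe(tree["left"], names, indent + "    ")
    print(f"{indent}else:")
    describe(tree["right"], names, indent + "    ")

# Hypothetical synthetic data: two made-up features and a made-up risk relationship.
random.seed(0)
FEATURES = ["prior_admissions", "length_of_stay_days"]
X = [(random.randint(0, 5), random.randint(1, 14)) for _ in range(300)]
y = [1 if random.random() < 0.1 + 0.12 * pa + 0.01 * los else 0 for pa, los in X]

black_box = knn_black_box(X, y)          # step 1: train the black box
scores = [black_box(x) for x in X]       # black box output on the training data
tree = fit_tree(X, scores)               # step 2: extract a regression tree from that output
tree_scores = [predict_tree(tree, x) for x in X]
describe(tree, FEATURES)
```

Fitting the tree to the black box scores rather than to the observed labels is what makes this an extraction: the tree approximates the black box, and its split rules can then be read as candidate risk factors.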
Results: The two-step method achieves prediction performance similar to that of the best black box models, such as neural networks, as measured by three metrics: accuracy, the Area Under the Curve (AUC) and the Area Under the Precision-Recall Curve (AUPRC), while maintaining interpretability. Further, to examine whether the prediction results match known medical insights (i.e., whether the model is truly interpretable and produces reasonable results), we show that the key readmission risk factors extracted by the two-step approach are consistent with those found in the medical literature.
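For readers unfamiliar with the three metrics named above, the following standard-library Python sketch shows one common way to compute each. The labels and scores are made up for illustration, the 0.5 decision threshold is an assumption, and AUPRC is estimated here by average precision, which is only one of several estimators and may differ from the one used in the study.

```python
def accuracy(y_true, y_score, threshold=0.5):
    # Fraction of cases whose thresholded score matches the true label.
    preds = [1 if s >= threshold else 0 for s in y_score]
    return sum(p == t for p, t in zip(preds, y_true)) / len(y_true)

def roc_auc(y_true, y_score):
    # Probability that a random positive outranks a random negative (ties count half).
    pos = [s for s, t in zip(y_score, y_true) if t == 1]
    neg = [s for s, t in zip(y_score, y_true) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def average_precision(y_true, y_score):
    # Step-wise AUPRC estimate: mean precision at the rank of each positive case.
    # Assumes no tied scores; with ties the result depends on their ordering.
    ranked = sorted(zip(y_score, y_true), key=lambda pair: -pair[0])
    hits, total = 0, 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            total += hits / rank
    return total / hits

# Made-up example: 1 = readmitted, 0 = not readmitted.
y_true = [1, 0, 1, 0, 0, 1]
y_score = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2]

acc = accuracy(y_true, y_score)
auc = roc_auc(y_true, y_score)
auprc = average_precision(y_true, y_score)
print(f"accuracy={acc:.3f}  AUC={auc:.3f}  AUPRC={auprc:.3f}")
```

AUPRC is often reported alongside AUC when the positive class is rare, because precision is sensitive to the number of false positives among flagged cases.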
Conclusions: The proposed two-step approach yields prediction results that are both accurate and interpretable. This study suggests that the approach is a viable means of improving trust in machine-learning-based models used to predict readmissions in clinical practice.
Keywords: Administrative data; Hospital readmission; Interpretable machine learning; Risk factors; Risk prediction.
© 2023. The Author(s).
Conflict of interest statement
The authors declare no competing interests.