A hospital wide predictive model for unplanned readmission using hierarchical ICD data
- PMID: 30777619
- DOI: 10.1016/j.cmpb.2019.02.007
Abstract
Background and objective: Hospitals already acquire a large amount of data, mainly for administrative, billing and registration purposes. Tapping into these already available data for additional purposes could help improve care without significant incremental effort or cost. We explore this potential of secondary patient data by modeling administrative and billing data, together with the hierarchical structure of pathology codes of the International Classification of Diseases (ICD), to predict unplanned readmissions, a clinically relevant outcome parameter that a quality improvement program can influence.
Methods: In this single-center, hospital-wide observational cohort study, we included all adult patients discharged in 2016 after applying an exclusion protocol (n = 29,702). In addition to administrative variables, such as age and length of stay, structured pathology data were taken into account in predictive models. As a first research question, we compared logistic regression against penalized logistic regression, gradient boosting and Random Forests to predict unplanned readmission. As a second research goal, we investigated the level of hierarchy within the pathology data needed to achieve the best accuracy. Finally, we investigated which prediction variables play a prominent role in predicting hospital readmission. The performance of all models was evaluated using the Area Under the ROC Curve (AUC) measure.
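As a minimal sketch of the evaluation measure named above, the AUC can be computed from predicted risk scores with the rank-sum (Mann-Whitney U) identity. The function name `roc_auc` and the toy labels and scores are illustrative assumptions; they are not the study's data or code.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via the rank-sum (Mann-Whitney U) identity.

    labels: iterable of 0/1 outcomes (1 = unplanned readmission, by assumption).
    scores: iterable of predicted risks, higher meaning more likely readmitted.
    """
    pairs = sorted(zip(scores, labels))
    n_pos = sum(1 for _, y in pairs if y == 1)
    n_neg = len(pairs) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUC needs at least one positive and one negative case")

    rank_sum_pos = 0.0
    i = 0
    while i < len(pairs):
        # Find the run of tied scores starting at position i.
        j = i
        while j < len(pairs) and pairs[j][0] == pairs[i][0]:
            j += 1
        # Tied scores share the average of the 1-based ranks i+1 .. j.
        avg_rank = (i + 1 + j) / 2.0
        rank_sum_pos += avg_rank * sum(1 for k in range(i, j) if pairs[k][1] == 1)
        i = j

    u_statistic = rank_sum_pos - n_pos * (n_pos + 1) / 2.0
    return u_statistic / (n_pos * n_neg)


# Toy example (hypothetical values, not study data).
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]
toy_auc = roc_auc(toy_labels, toy_scores)
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation of readmitted and non-readmitted patients.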
Results: Random Forests yielded the best predictive results for all models. An added value of 7% is observed compared to a baseline method such as logistic regression. The best model, based on Random Forests, achieved an AUC of 0.77, using the diagnosis category and procedure code as the lowest level of the hierarchical pathology data.
Conclusions: The most accurate model to predict hospital-wide unplanned readmission is based on Random Forests and includes the ICD hierarchy, especially diagnosis category. Such an approach lowers the number of predictor variables and yields a higher interpretability than a model based on a detailed diagnosis. The performance of the model proved high enough to be used as a decision support tool.
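To illustrate why working at the diagnosis-category level lowers the number of predictor variables, the sketch below rolls detailed ICD-10 codes up to their three-character category and counts the distinct values at each level. The helper names and the sample codes are assumptions chosen for illustration; the abstract does not specify the exact coding variant or roll-up procedure used in the study.

```python
def to_category(icd_code):
    """Truncate an ICD-10 code to its three-character category, e.g. 'I50.9' -> 'I50'."""
    return icd_code.replace(".", "").upper()[:3]


def distinct_predictors(codes, level=None):
    """Return the sorted distinct predictor values at a given hierarchy level.

    level: optional function mapping a detailed code to a coarser level;
           None keeps the detailed codes unchanged.
    """
    if level is None:
        return sorted(set(codes))
    return sorted({level(code) for code in codes})


# Hypothetical detailed diagnosis codes (illustrative, not study data).
sample_codes = ["I50.1", "I50.9", "E11.9", "E11.65", "J44.1"]

detailed = distinct_predictors(sample_codes)
categories = distinct_predictors(sample_codes, level=to_category)
```

In this toy sample, five detailed codes collapse into three categories, so a model built on categories needs fewer indicator variables, and each variable covers a broader, more readily interpretable clinical group.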
Keywords: Boosting; Decision support; ICD-10 diagnosis; Machine learning; Random Forests; Readmission.
Copyright © 2019. Published by Elsevier B.V.