Comprehensible Predictive Modeling Using Regularized Logistic Regression and Comorbidity Based Features
- PMID: 26645087
- PMCID: PMC4672891
- DOI: 10.1371/journal.pone.0144439
Abstract
Several studies have demonstrated the importance of comorbidities for understanding the origin and evolution of medical complications. This study focuses on improving the interpretability of predictive models by using simple logical features that represent comorbidities. We use group lasso based feature interaction discovery, followed by a post-processing step in which simple logic terms are added. In the final step, we reduce the feature set by applying lasso logistic regression, which yields a compact set of non-zero coefficients and therefore a more comprehensible predictive model. The effectiveness of the proposed approach was demonstrated on a pediatric hospital discharge dataset that was used to build a readmission risk estimation model. The evaluation shows a 72% reduction of the initial feature set in the regression model, with a slight improvement in the Area Under the ROC Curve (AUC) from 0.763 (95% CI: 0.755-0.771) to 0.769 (95% CI: 0.761-0.777). Additionally, our results show improved comprehensibility of the final predictive model when simple comorbidity based terms are used in logistic regression.
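The last two steps described in the abstract (adding simple logic terms, then shrinking the feature set with lasso logistic regression) can be illustrated with a minimal sketch. This is not the authors' implementation: the group lasso interaction discovery step is omitted, the data are synthetic, the comorbidity names (`asthma`, `obesity`, `unrelated`) and the penalty value `lam=0.02` are hypothetical, and the solver is a plain proximal gradient loop chosen only to keep the example dependency-free.

```python
import math
import random


def and_term(col_a, col_b):
    """Logical AND of two binary comorbidity indicator columns."""
    return [a & b for a, b in zip(col_a, col_b)]


def soft_threshold(value, threshold):
    """Proximal operator of the L1 penalty: shrink a value toward zero."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def sigmoid(z):
    z = max(-30.0, min(30.0, z))  # clamp to keep math.exp from overflowing
    return 1.0 / (1.0 + math.exp(-z))


def fit_lasso_logistic(rows, labels, lam, lr=0.5, n_iter=300):
    """L1-penalized logistic regression via proximal gradient descent.

    The intercept is left unpenalized; `lam` is the L1 penalty strength.
    """
    n, d = len(rows), len(rows[0])
    w, b = [0.0] * d, 0.0
    for _ in range(n_iter):
        grad_w, grad_b = [0.0] * d, 0.0
        for x, y in zip(rows, labels):
            err = sigmoid(b + sum(wj * xj for wj, xj in zip(w, x))) - y
            grad_b += err / n
            for j in range(d):
                grad_w[j] += err * x[j] / n
        b -= lr * grad_b
        # Gradient step on the loss, then shrinkage from the L1 penalty.
        w = [soft_threshold(wj - lr * gj, lr * lam) for wj, gj in zip(w, grad_w)]
    return w, b


# Synthetic cohort (assumption): readmission risk is elevated only when
# both hypothetical comorbidities are present.
random.seed(0)
n_patients = 400
asthma = [random.randint(0, 1) for _ in range(n_patients)]
obesity = [random.randint(0, 1) for _ in range(n_patients)]
unrelated = [random.randint(0, 1) for _ in range(n_patients)]
labels = [
    1 if random.random() < (0.8 if a and o else 0.1) else 0
    for a, o in zip(asthma, obesity)
]

# Post-processing step: add a simple logic term to the base features.
names = ["asthma", "obesity", "unrelated", "asthma AND obesity"]
columns = [asthma, obesity, unrelated, and_term(asthma, obesity)]
rows = list(zip(*columns))

# Final step: the lasso penalty drives uninformative coefficients to zero.
weights, intercept = fit_lasso_logistic(rows, labels, lam=0.02)
for name, weight in zip(names, weights):
    status = "kept" if weight != 0.0 else "dropped"
    print(f"{name:20s} {weight:+.3f} ({status})")
```

The coefficients that remain non-zero form the compact model that a reader would inspect. Which terms survive depends on the penalty strength, so in practice `lam` would be tuned, for example by cross-validation.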