. 2014 Nov 14:2014:1072-81.

eCollection 2014.

Pediatric readmission classification using stacked regularized logistic regression models

Gregor Stiglic¹, Fei Wang², Adam Davey³, Zoran Obradovic³

Affiliations

¹ University of Maribor, Maribor, Slovenia.
² IBM T.J. Watson Research Center, Yorktown Heights, NY.
³ Temple University, Philadelphia, PA.

PMID: 25954417
PMCID: PMC4419883

Pediatric readmission classification using stacked regularized logistic regression models

Gregor Stiglic et al. AMIA Annu Symp Proc. 2014.

. 2014 Nov 14:2014:1072-81.

eCollection 2014.

Authors

Gregor Stiglic¹, Fei Wang², Adam Davey³, Zoran Obradovic³

Affiliations

¹ University of Maribor, Maribor, Slovenia.
² IBM T.J. Watson Research Center, Yorktown Heights, NY.
³ Temple University, Philadelphia, PA.

PMID: 25954417
PMCID: PMC4419883

Abstract

Background: Regulations and privacy concerns often hinder exchange of healthcare data between hospitals or other healthcare providers. Sharing predictive models built on original data and averaging their results offers an alternative to more efficient prediction of outcomes on new cases. Although one can choose from many techniques to combine outputs from different predictive models, it is difficult to find studies that try to interpret the results obtained from ensemble-learning methods.

Methods: We propose a novel approach to classification based on models from different hospitals that allows a high level of performance along with comprehensibility of obtained results. Our approach is based on regularized sparse regression models in two hierarchical levels and exploits the interpretability of obtained regression coefficients to rank the contribution of hospitals in terms of outcome prediction.

Results: The proposed approach was used to predict the 30-days all-cause readmissions for pediatric patients in 54 Californian hospitals. Using repeated holdout evaluation, including more than 60,000 hospital discharge records, we compared the proposed approach to alternative approaches. The performance of two-level classification model was measured using the Area Under the ROC Curve (AUC) with an additional evaluation that uncovered the importance and contribution of each single data source (i.e. hospital) to the final result. The results for the best distributed model (AUC=0.787, 95% CI: 0.780-0.794) demonstrate no significant difference in terms of AUC performance when compared to a single elastic net model built on all available data (AUC=0.789, 95% CI: 0.781-0.796).

Conclusions: This paper presents a novel approach to improved classification with shared predictive models for environments where centralized collection of data is not possible. The significant improvements in classification performance and interpretability of results demonstrate the effectiveness of our approach.

PubMed Disclaimer

Figures

**Figure 1.**
Two-level classification framework for distributed hospital based predictive modeling.

**Figure 2.**
Distribution of AUC results on 1000 hold-out runs for averaged local models (AVG), best local model (BLM), deep learning approach (DLA), deep learning approach with two classifiers (DLA2) and single sparse logistic regression on all samples (SLRA) with mean AUC (red dotted line) and 95% CI (blue dotted line).

**Figure 3.**
Trends of Relative Hospital Influence (RHI) in relation to average total charge per hospital (TOTCHG), percentage of records with diagnosed pneumonia (Pneumonia), average number of procedure codes on the record (NPR), rate of 30-day readmissions (readmit), percentage of scheduled admissions (ASCHED) and percentage of records with gastrostomy (Gastrostomy).

See this image and copyright information in PMC

Cited by

Comprehensible Predictive Modeling Using Regularized Logistic Regression and Comorbidity Based Features.
Stiglic G, Povalej Brzan P, Fijacko N, Wang F, Delibasic B, Kalousis A, Obradovic Z. Stiglic G, et al. PLoS One. 2015 Dec 8;10(12):e0144439. doi: 10.1371/journal.pone.0144439. eCollection 2015. PLoS One. 2015. PMID: 26645087 Free PMC article.
Applicability of predictive models for 30-day unplanned hospital readmission risk in paediatrics: a systematic review.
Niehaus IM, Kansy N, Stock S, Dötsch J, Müller D. Niehaus IM, et al. BMJ Open. 2022 Mar 30;12(3):e055956. doi: 10.1136/bmjopen-2021-055956. BMJ Open. 2022. PMID: 35354615 Free PMC article.
Are Randomized Controlled Trials the (G)old Standard? From Clinical Intelligence to Prescriptive Analytics.
Van Poucke S, Thomeer M, Heath J, Vukicevic M. Van Poucke S, et al. J Med Internet Res. 2016 Jul 6;18(7):e185. doi: 10.2196/jmir.5549. J Med Internet Res. 2016. PMID: 27383622 Free PMC article.
Identifying the Prevalence and Causes of 30-Day Hospital Readmission in Children: A Case Study from a Tertiary Pediatric Hospital.
AlKhalaf H, AlHamdan W, Kinani S, AlZighaibi R, Fallata S, Al Mutrafy A, Alqanatish J. AlKhalaf H, et al. Glob J Qual Saf Healthc. 2023 Nov 24;6(4):101-110. doi: 10.36401/JQSH-23-17. eCollection 2023 Nov. Glob J Qual Saf Healthc. 2023. PMID: 38404457 Free PMC article.
A non-negative spike-and-slab lasso generalized linear stacking prediction modeling method for high-dimensional omics data.
Shen J, Wang S, Dong Y, Sun H, Wang X, Tang Z. Shen J, et al. BMC Bioinformatics. 2024 Mar 20;25(1):119. doi: 10.1186/s12859-024-05741-6. BMC Bioinformatics. 2024. PMID: 38509499 Free PMC article.

References

1. Cole TS, Frankovich J, Iyer S, LePendu P, Bauer-Mehren A, Shah NH. Profiling risk factors for chronic uveitis in juvenile idiopathic arthritis: a new model for EHR-based research. Pediatric Rheumatology. 2013;11(1):45. - PMC - PubMed
1. Sun J, Hu J, Luo D, et al. Combining knowledge and data driven insights for identifying risk factors using electronic health records. AMIA Annu Symp Proc. 2012;2012:901–910. - PMC - PubMed
1. Menachemi N, Collum TH. Benefits and drawbacks of electronic health record systems. Risk Manag Healthc Policy. 2011;4:47–55. - PMC - PubMed
1. Coloma PM, Schuemie MJ, Trifirò G, et al. Combining electronic healthcare databases in Europe to allow for large-scale drug safety monitoring: the EU-ADR Project. Pharmacoepidemiology and drug safety. 2011;20(1):1–11. - PubMed
1. Davis DA, Chawla NV, Christakis NA, Barabási AL. Time to CARE: a collaborative engine for practical disease prediction. Data Mining and Knowledge Discovery. 2010;20(3):388–415.

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Europe PubMed Central
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Pediatric readmission classification using stacked regularized logistic regression models

Affiliations

Pediatric readmission classification using stacked regularized logistic regression models

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources