Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Nov 9;7(3):166-180.
doi: 10.1080/20476965.2018.1510040. eCollection 2018.

Analysing repeated hospital readmissions using data mining techniques

Affiliations

Analysing repeated hospital readmissions using data mining techniques

Ofir Ben-Assuli et al. Health Syst (Basingstoke). .

Abstract

Few studies have examined how to identify future readmission of patients with a large number of repeat emergency department (ED) visits. We explore 30-day readmission risk prediction using Microsoft's AZURE machine learning software and compare five classification methods: Logistic Regression, Boosted Decision Trees (BDTs), Support Vector Machine (SVM), Bayes Point Machine (BPM), and Two-Class Neural Network (TCNN). We predict the last readmission visit of frequent ED patients extracted from the electronic health records of their 8455 penultimate visits. The methods show differential improvement, with the BDT indicating marginally better AUC (area under the ROC curve) than logistic regression and BPM, followed by the TCNN and SVM. A comparison of BDT and Logistic Regression results for correct and incorrect classification highlights the similarities and differences in the significant predictors identified by each method. Future research may incorporate time-varying covariates to identify other longitudinal factors that can lead to readmission risk reduction.

Keywords: Hospital readmissions; boosted decision tree; logistic regression; predicting readmissions; support vector machine; two-class neural network.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Two-class classification comparisons with AZURE ML: BDT, logistic regression, SVM, BPM and two-class neural network (TCNN).
Figure 2.
Figure 2.
Example of one BDT produced by AZURE ML (out of 479 trees).
Figure 3.
Figure 3.
Comparisons of ROC curves.

Corrected and republished from

Similar articles

Cited by

References

    1. Alexander M., Grumbach K., Remy L., Rowell R., & Massie B. M. (1999). Congestive heart failure hospitalizations and survival in California: Patterns according to race/ethnicity. American Heart Journal, 137(5), 919–927.10.1016/S0002-8703(99)70417-5 - DOI - PubMed
    1. Almagro P., Barreiro B., Ochoa de Echagüen A., Quintana S., Rodríguez Carballeira M., Heredia J. L., & Garau J. (2006). Risk factors for hospital readmission in patients with chronic obstructive pulmonary disease. Respiration, 73(3), 311–317.10.1159/000088092 - DOI - PubMed
    1. Amarasingham R., Moore B. J., Tabak Y. P., Drazner M. H., Clark C. A., Zhang S., … Halm E. A. (2010). An automated model to identify heart failure patients at risk for 30-day readmission or death using electronic medical record data. Medical Care, 48(11), 981–988.10.1097/MLR.0b013e3181ef60d9 - DOI - PubMed
    1. Artetxe A., Beristain A., Graña M., & Besga A. (2016). Predicting 30-day emergency readmission risk. In European Transnational Education (pp. 3–12). Springer.
    1. Ather S., Chung K. D., Gregory P., & Demissie K. (2004). The association between hospital readmission and insurance provider among adults with asthma. Journal of Asthma, 41(7), 709–713.10.1081/JAS-200027829 - DOI - PubMed

Publication types

LinkOut - more resources