Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Nov 18;10(1):20127.
doi: 10.1038/s41598-020-77243-3.

Machine learning to predict mortality after rehabilitation among patients with severe stroke

Affiliations

Machine learning to predict mortality after rehabilitation among patients with severe stroke

Domenico Scrutinio et al. Sci Rep. .

Abstract

Stroke is among the leading causes of death and disability worldwide. Approximately 20-25% of stroke survivors present severe disability, which is associated with increased mortality risk. Prognostication is inherent in the process of clinical decision-making. Machine learning (ML) methods have gained increasing popularity in the setting of biomedical research. The aim of this study was twofold: assessing the performance of ML tree-based algorithms for predicting three-year mortality model in 1207 stroke patients with severe disability who completed rehabilitation and comparing the performance of ML algorithms to that of a standard logistic regression. The logistic regression model achieved an area under the Receiver Operating Characteristics curve (AUC) of 0.745 and was well calibrated. At the optimal risk threshold, the model had an accuracy of 75.7%, a positive predictive value (PPV) of 33.9%, and a negative predictive value (NPV) of 91.0%. The ML algorithm outperformed the logistic regression model through the implementation of synthetic minority oversampling technique and the Random Forests, achieving an AUC of 0.928 and an accuracy of 86.3%. The PPV was 84.6% and the NPV 87.5%. This study introduced a step forward in the creation of standardisable tools for predicting health outcomes in individuals affected by stroke.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
The workflow of the study is represented: the data of 1207 patients from three facilities of Maugeri Institute in the South and in the North of Italy were collected and used to create models through a multivariate logistic regression and tree-based ML algorithms to predict three-year mortality in stroke patients after rehabilitation.
Figure 2
Figure 2
Receiver operating characteristics curves for the SMOTE RF algorithm and the logistic model.
Figure 3
Figure 3
Top 10 features according to the SMOTE RF model.

References

    1. GBD 2016 Stroke Collaborators. Global, regional, and national burden of stroke, 1990–2016: A systematic analysis for the Global Burden of Disease Study 2016. Lancet Neurol.18(5), 439–58 (2019). - PMC - PubMed
    1. Lozano R, et al. Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: A systematic analysis for the global burden of disease study 2010. Lancet. 2012;380(9859):2095–2128. doi: 10.1016/S0140-6736(12)61728-0. - DOI - PMC - PubMed
    1. Katan M, Luft A. Global burden of stroke. Semin. Neurol. 2018;38(2):208–211. doi: 10.1055/s-0038-1649503. - DOI - PubMed
    1. Chen Y, et al. Mortality and recurrent vascular events after first incident stroke: A 9-year community-based study of 0·5 million Chinese adults. Lancet Glob. Health. 2020;8(4):e580–e590. doi: 10.1016/S2214-109X(20)30069-3. - DOI - PMC - PubMed
    1. Xian Y, et al. Unexplained variation for hospitals’ use of inpatient rehabilitation and skilled nursing facilities after an acute ischemic stroke. Stroke. 2017;48(10):2836–2842. doi: 10.1161/STROKEAHA.117.016904. - DOI - PubMed