Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 May 12;11(1):10071.
doi: 10.1038/s41598-021-89434-7.

Random forest-based prediction of stroke outcome

Affiliations

Random forest-based prediction of stroke outcome

Carlos Fernandez-Lozano et al. Sci Rep. .

Abstract

We research into the clinical, biochemical and neuroimaging factors associated with the outcome of stroke patients to generate a predictive model using machine learning techniques for prediction of mortality and morbidity 3-months after admission. The dataset consisted of patients with ischemic stroke (IS) and non-traumatic intracerebral hemorrhage (ICH) admitted to Stroke Unit of a European Tertiary Hospital prospectively registered. We identified the main variables for machine learning Random Forest (RF), generating a predictive model that can estimate patient mortality/morbidity according to the following groups: (1) IS + ICH, (2) IS, and (3) ICH. A total of 6022 patients were included: 4922 (mean age 71.9 ± 13.8 years) with IS and 1100 (mean age 73.3 ± 13.1 years) with ICH. NIHSS at 24, 48 h and axillary temperature at admission were the most important variables to consider for evolution of patients at 3-months. IS + ICH group was the most stable for mortality prediction [0.904 ± 0.025 of area under the receiver operating characteristics curve (AUC)]. IS group presented similar results, although variability between experiments was slightly higher (0.909 ± 0.032 of AUC). ICH group was the one in which RF had more problems to make adequate predictions (0.9837 vs. 0.7104 of AUC). There were no major differences between IS and IS + ICH groups according to morbidity prediction (0.738 and 0.755 of AUC) but, after checking normality with a Shapiro Wilk test with the null hypothesis that the data follow a normal distribution, it was rejected with W = 0.93546 (p-value < 2.2e-16). Conditions required for a parametric test do not hold, and we performed a paired Wilcoxon Test assuming the null hypothesis that all the groups have the same performance. The null hypothesis was rejected with a value < 2.2e-16, so there are statistical differences between IS and ICH groups. In conclusion, machine learning algorithms RF can be effectively used in stroke patients for long-term outcome prediction of mortality and morbidity.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Flowchart of patient groups and functional outcome.
Figure 2
Figure 2
Mortality prediction for IS + ICH, IS and ICH groups. (A) Main variables for the machine learning model: NIHSS score at admission [NIHSS (0)]; NIHSS score at 24 h [NIHSS (24)]; NIHSS score at 48 h [NIHSS (48)]; Axillary temperature at admission [T(0)]; Early neurological deterioration [ED]; Leukocytes at admission [LEU (0)]; and Blood glucose at admission [GLU (0)]. (B) AUROC values obtained. (C) ROC curves for the Random Forest classifier.
Figure 3
Figure 3
2D-heatmap of mortality (EXT) predictions against NIHSS(48) and NIHSS(24). Model results are shown for the IS + ICH group, as it was the most stable for mortality prediction (0.904 ± 0.025 of AUC). Red areas correspond to patients who do not die (0), blue areas correspond to patients who die (1), and misclassified items are highlighted.
Figure 4
Figure 4
Morbidity prediction for IS + ICH, IS and ICH groups. (A) Main variables for the machine learning model: NIHSS score at admission [NIHSS (0)]; NIHSS score at 24 h [NIHSS (24)]; NIHSS score at 48 h [NIHSS (48)]; Axillary temperature at admission [T(0)]; Early neurological deterioration [ED]; Leukocytes at admission [LEU (0)]; and Blood glucose at admission [GLU (0)]. (B) AUROC values obtained. (C) ROC curves for the Random Forest classifier.
Figure 5
Figure 5
Comparison of ROC curves of 7 variables selected for machine learning experiments for mortality and morbidity prediction at 3 months of the different patient groups evaluated. (A,B) Morbidity and mortality of IS + ICH group. (C,D) Morbidity and mortality of IS group. (E–F) Morbidity and mortality of ICH group.

Similar articles

Cited by

References

    1. Neuhaus AA, Couch Y, Hadley G, et al. Neuroprotection in stroke: The importance of collaboration and reproducibility. Brain. 2017;140:2079–2092. doi: 10.1093/brain/awx126. - DOI - PubMed
    1. Bramlett HM, Dietrich WD. Pathophysiology of cerebral ischemia and brain trauma: Similarities and differences. J. Cereb. Blood Flow Metab. 2004;24:133–150. doi: 10.1097/01.WCB.0000111614.19196.04. - DOI - PubMed
    1. Burns JD, Fisher JL, Cervantes-Arslanian AM. Recent advances in the acute management of intracerebral hemorrhage. Neurosurg. Clin. N. Am. 2018;29:263–272. doi: 10.1016/j.nec.2017.11.005. - DOI - PubMed
    1. Béjot Y, Bailly H, Durier J, et al. Epidemiology of stroke in Europe and trends for the 21st century. Presse. Med. 2016;45:e391–e439. doi: 10.1016/j.lpm.2016.10.003. - DOI - PubMed
    1. Rodríguez-Castro E, López-Dequidt I, Santamaría-Cadavid M, et al. Trends in stroke outcome in the last ten years in a European tertiary hospital. BMC Neurol. 2018;18:164. doi: 10.1186/s12883-018-1164-7. - DOI - PMC - PubMed

Publication types

MeSH terms