. 2021 May 12;11(1):10071.

doi: 10.1038/s41598-021-89434-7.

Random forest-based prediction of stroke outcome

Affiliations

¹ Department of Computer Science and Information Technologies, Faculty of Computer Science, CITIC-Research Center of Information and Communication Technologies, Universidade da Coruña, A Coruña, Spain.
² Grupo de Redes de Neuronas Artificiales y Sistemas Adaptativos. Imagen Médica y Diagnóstico Radiológico (RNASA-IMEDIR). Instituto de Investigación Biomédica de A Coruña (INIBIC). Complexo Hospitalario Universitario de A Coruña (CHUAC), SERGAS, Universidade da Coruña, A Coruña, Spain.
³ Clinical Neurosciences Research Laboratory (LINC), Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Spain.
⁴ Software Engineering Laboratory, Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruña, Campus de Elviña, 15071, A Coruña, Spain.
⁵ Stroke Unit, Department of Neurology, Health Research Institute of Santiago de Compostela (IDIS), Hospital Clínico Universitario, Rúa Travesa da Choupana, s/n, 15706Santiago de Compostela, Spain.
⁶ Unit of Methodology of the Research, Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Spain.
⁷ Software Engineering Laboratory, Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruña, Campus de Elviña, 15071, A Coruña, Spain. santiago.rodriguez@udc.es.
⁸ Clinical Neurosciences Research Laboratory (LINC), Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Spain. ramon.iglesias.rey@sergas.es.

PMID: 33980906
PMCID: PMC8115135
DOI: 10.1038/s41598-021-89434-7

Random forest-based prediction of stroke outcome

Carlos Fernandez-Lozano et al. Sci Rep. 2021.

. 2021 May 12;11(1):10071.

doi: 10.1038/s41598-021-89434-7.

Affiliations

¹ Department of Computer Science and Information Technologies, Faculty of Computer Science, CITIC-Research Center of Information and Communication Technologies, Universidade da Coruña, A Coruña, Spain.
² Grupo de Redes de Neuronas Artificiales y Sistemas Adaptativos. Imagen Médica y Diagnóstico Radiológico (RNASA-IMEDIR). Instituto de Investigación Biomédica de A Coruña (INIBIC). Complexo Hospitalario Universitario de A Coruña (CHUAC), SERGAS, Universidade da Coruña, A Coruña, Spain.
³ Clinical Neurosciences Research Laboratory (LINC), Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Spain.
⁴ Software Engineering Laboratory, Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruña, Campus de Elviña, 15071, A Coruña, Spain.
⁵ Stroke Unit, Department of Neurology, Health Research Institute of Santiago de Compostela (IDIS), Hospital Clínico Universitario, Rúa Travesa da Choupana, s/n, 15706Santiago de Compostela, Spain.
⁶ Unit of Methodology of the Research, Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Spain.
⁷ Software Engineering Laboratory, Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruña, Campus de Elviña, 15071, A Coruña, Spain. santiago.rodriguez@udc.es.
⁸ Clinical Neurosciences Research Laboratory (LINC), Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Spain. ramon.iglesias.rey@sergas.es.

PMID: 33980906
PMCID: PMC8115135
DOI: 10.1038/s41598-021-89434-7

Abstract

We research into the clinical, biochemical and neuroimaging factors associated with the outcome of stroke patients to generate a predictive model using machine learning techniques for prediction of mortality and morbidity 3-months after admission. The dataset consisted of patients with ischemic stroke (IS) and non-traumatic intracerebral hemorrhage (ICH) admitted to Stroke Unit of a European Tertiary Hospital prospectively registered. We identified the main variables for machine learning Random Forest (RF), generating a predictive model that can estimate patient mortality/morbidity according to the following groups: (1) IS + ICH, (2) IS, and (3) ICH. A total of 6022 patients were included: 4922 (mean age 71.9 ± 13.8 years) with IS and 1100 (mean age 73.3 ± 13.1 years) with ICH. NIHSS at 24, 48 h and axillary temperature at admission were the most important variables to consider for evolution of patients at 3-months. IS + ICH group was the most stable for mortality prediction [0.904 ± 0.025 of area under the receiver operating characteristics curve (AUC)]. IS group presented similar results, although variability between experiments was slightly higher (0.909 ± 0.032 of AUC). ICH group was the one in which RF had more problems to make adequate predictions (0.9837 vs. 0.7104 of AUC). There were no major differences between IS and IS + ICH groups according to morbidity prediction (0.738 and 0.755 of AUC) but, after checking normality with a Shapiro Wilk test with the null hypothesis that the data follow a normal distribution, it was rejected with W = 0.93546 (p-value < 2.2e-16). Conditions required for a parametric test do not hold, and we performed a paired Wilcoxon Test assuming the null hypothesis that all the groups have the same performance. The null hypothesis was rejected with a value < 2.2e-16, so there are statistical differences between IS and ICH groups. In conclusion, machine learning algorithms RF can be effectively used in stroke patients for long-term outcome prediction of mortality and morbidity.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

**Figure 1**
Flowchart of patient groups and functional outcome.

**Figure 2**
Mortality prediction for IS + ICH, IS and ICH groups. (A) Main variables for the machine learning model: NIHSS score at admission [NIHSS (0)]; NIHSS score at 24 h [NIHSS (24)]; NIHSS score at 48 h [NIHSS (48)]; Axillary temperature at admission [T(0)]; Early neurological deterioration [ED]; Leukocytes at admission [LEU (0)]; and Blood glucose at admission [GLU (0)]. (B) AUROC values obtained. **(C)** ROC curves for the Random Forest classifier.

**Figure 3**
2D-heatmap of mortality (EXT) predictions against NIHSS(48) and NIHSS(24). Model results are shown for the IS + ICH group, as it was the most stable for mortality prediction (0.904 ± 0.025 of AUC). Red areas correspond to patients who do not die (0), blue areas correspond to patients who die (1), and misclassified items are highlighted.

**Figure 4**
Morbidity prediction for IS + ICH, IS and ICH groups. (A) Main variables for the machine learning model: NIHSS score at admission [NIHSS (0)]; NIHSS score at 24 h [NIHSS (24)]; NIHSS score at 48 h [NIHSS (48)]; Axillary temperature at admission [T(0)]; Early neurological deterioration [ED]; Leukocytes at admission [LEU (0)]; and Blood glucose at admission [GLU (0)]. (B) AUROC values obtained. (C) ROC curves for the Random Forest classifier.

**Figure 5**
Comparison of ROC curves of 7 variables selected for machine learning experiments for mortality and morbidity prediction at 3 months of the different patient groups evaluated. (**A,B**) Morbidity and mortality of IS + ICH group. (**C,D**) Morbidity and mortality of IS group. (**E–F**) Morbidity and mortality of ICH group.

See this image and copyright information in PMC

Cited by

Machine Learning Approaches for Stroke Risk Prediction: Findings from the Suita Study.
Vu T, Kokubo Y, Inoue M, Yamamoto M, Mohsen A, Martin-Morales A, Inoué T, Dawadi R, Araki M. Vu T, et al. J Cardiovasc Dev Dis. 2024 Jul 1;11(7):207. doi: 10.3390/jcdd11070207. J Cardiovasc Dev Dis. 2024. PMID: 39057627 Free PMC article.
Machine Learning-Based Prediction of Subsequent Vascular Events After 6 Months in Chinese Patients with Minor Ischemic Stroke.
Zhang R, Wang J. Zhang R, et al. Int J Gen Med. 2022 Apr 7;15:3797-3808. doi: 10.2147/IJGM.S356373. eCollection 2022. Int J Gen Med. 2022. PMID: 35418774 Free PMC article.
[An interpretable machine learning-based prediction model for risk of death for patients with ischemic stroke in intensive care unit].
Luo X, Cheng Y, Wu C, He J. Luo X, et al. Nan Fang Yi Ke Da Xue Xue Bao. 2023 Jul 20;43(7):1241-1247. doi: 10.12122/j.issn.1673-4254.2023.07.21. Nan Fang Yi Ke Da Xue Xue Bao. 2023. PMID: 37488807 Free PMC article. Chinese.
Predicting ischemic stroke patients' prognosis changes using machine learning in a nationwide stroke registry.
Lin CH, Chen YA, Jeng JS, Sun Y, Wei CY, Yeh PY, Chang WL, Fann YC, Hsu KC, Lee JT; Taiwan Stroke Registry Investigators. Lin CH, et al. Med Biol Eng Comput. 2024 Aug;62(8):2343-2354. doi: 10.1007/s11517-024-03073-4. Epub 2024 Apr 5. Med Biol Eng Comput. 2024. PMID: 38575823 Free PMC article.
Interpretable prediction of stroke prognosis: SHAP for SVM and nomogram for logistic regression.
Guo K, Zhu B, Zha L, Shao Y, Liu Z, Gu N, Chen K. Guo K, et al. Front Neurol. 2025 Mar 4;16:1522868. doi: 10.3389/fneur.2025.1522868. eCollection 2025. Front Neurol. 2025. PMID: 40103937 Free PMC article.

See all "Cited by" articles

References

1. Neuhaus AA, Couch Y, Hadley G, et al. Neuroprotection in stroke: The importance of collaboration and reproducibility. Brain. 2017;140:2079–2092. doi: 10.1093/brain/awx126. - DOI - PubMed
1. Bramlett HM, Dietrich WD. Pathophysiology of cerebral ischemia and brain trauma: Similarities and differences. J. Cereb. Blood Flow Metab. 2004;24:133–150. doi: 10.1097/01.WCB.0000111614.19196.04. - DOI - PubMed
1. Burns JD, Fisher JL, Cervantes-Arslanian AM. Recent advances in the acute management of intracerebral hemorrhage. Neurosurg. Clin. N. Am. 2018;29:263–272. doi: 10.1016/j.nec.2017.11.005. - DOI - PubMed
1. Béjot Y, Bailly H, Durier J, et al. Epidemiology of stroke in Europe and trends for the 21st century. Presse. Med. 2016;45:e391–e439. doi: 10.1016/j.lpm.2016.10.003. - DOI - PubMed
1. Rodríguez-Castro E, López-Dequidt I, Santamaría-Cadavid M, et al. Trends in stroke outcome in the last ten years in a European tertiary hospital. BMC Neurol. 2018;18:164. doi: 10.1186/s12883-018-1164-7. - DOI - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Random forest-based prediction of stroke outcome

Affiliations

Random forest-based prediction of stroke outcome

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Related information

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical