Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 18;15(1):9256.
doi: 10.1038/s41598-025-88826-3.

Identifying novel risk factors for aneurysmal subarachnoid haemorrhage using machine learning

Affiliations

Identifying novel risk factors for aneurysmal subarachnoid haemorrhage using machine learning

Jos P Kanning et al. Sci Rep. .

Abstract

Aneurysmal subarachnoid haemorrhage (aSAH) is a type of stroke with high mortality and morbidity. This study aimed to identify novel aSAH risk factors by combining machine learning (ML) and traditional statistical methods. Using the UK Biobank, we identified aSAH cases via hospital-based ICD codes and analysed 618 baseline variables covering demographics, lifestyle, medical history, and physical measurements. The CatBoost ML algorithm and Shapley Additive Explanations (SHAP) identified the top 25 variables most influential in predicting aSAH. Logistic regression further described these variables while adjusting for established aSAH risk factors. Among 501,847 participants, 893 aSAH cases were identified. ML identified 214 variables with non-zero SHAP values. Logistic regression of the top 25 variables revealed four potential novel aSAH risk factors. Increased aSAH risk was associated with mean sphered cell volume (OR 1.02, 95% CI 1.00-1.03) and tea intake (OR 1.03, 95% CI 1.01-1.05). Decreased aSAH risk was associated with peak expiratory flow (OR 0.80, 95% CI 0.66-0.96), and haematocrit percentage (OR 0.97, 95% CI 0.95-1.00). Future research should validate these findings and explore the potential non-linear relationships and interactions indicated by the ML models.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The 25 most important potential risk factors identified by the CatBoost algorithm. SHAP = SHapley Additive exPlanations, SHBG = Sex hormone binding globulin, IGF-1 = Insulin-like growth factor 1.
Fig. 2
Fig. 2
Flowchart of study design. Numbers in parentheses indicate the number of rows (i.e. participants) and columns (i.e. variables) respectively. aSAH = aneurysmal subarachnoid haemorrhage, SHAP = SHapley Additive exPlanations.

Similar articles

References

    1. Macdonald, R. L. & Schweizer, T. A. Spontaneous subarachnoid haemorrhage. Lancet389, 655–666 (2017). - PubMed
    1. GBD 2019 Stroke Collaborators. Global, regional, and national burden of stroke and its risk factors, 1990–2019: a systematic analysis for the global burden of Disease Study 2019. Lancet Neurol.20, 795–820 (2021). - PMC - PubMed
    1. Johnston, S. C., Selvin, S. & Gress, D. R. The burden, trends, and demographics of mortality from subarachnoid hemorrhage. Neurology50, 1413–1418 (1998). - PubMed
    1. Etminan, N. & Rinkel, G. J. Unruptured intracranial aneurysms: development, rupture and preventive management. Nat. Rev. Neurol.12, 699–713 (2016). - PubMed
    1. Feigin, V. L. et al. Risk factors for subarachnoid hemorrhage: an updated systematic review of epidemiological studies. Stroke36, 2773–2780 (2005). - PubMed

LinkOut - more resources