Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec 5;12(1):21035.
doi: 10.1038/s41598-022-25527-1.

Machine-learning-based risk stratification for probability of dying in patients with basal ganglia hemorrhage

Affiliations

Machine-learning-based risk stratification for probability of dying in patients with basal ganglia hemorrhage

Lili Guo et al. Sci Rep. .

Abstract

To confirm whether machine learning algorithms (MLA) can achieve an effective risk stratification of dying within 7 days after basal ganglia hemorrhage (BGH). We collected patients with BGH admitted to Sichuan Provincial People's Hospital between August 2005 and August 2021. We developed standard ML-supervised models and fusion models to assess the prognostic risk of patients with BGH and compared them with the classical logistic regression model. We also use the SHAP algorithm to provide clinical interpretability. 1383 patients with BGH were included and divided into the conservative treatment group (CTG) and surgical treatment group (STG). In CTG, the Stack model has the highest sensitivity (78.5%). In STG, Weight-Stack model achieves 58.6% sensitivity and 85.1% specificity, and XGBoost achieves 61.4% sensitivity and 82.4% specificity. The SHAP algorithm shows that the predicted preferred characteristics of the CTG are consciousness, hemorrhage volume, prehospital time, break into ventricles, brain herniation, intraoperative blood loss, and hsCRP were also added to the STG. XGBoost, Stack, and Weight-Stack models combined with easily available clinical data enable risk stratification of BGH patients with high performance. These ML classifiers could assist clinicians and families to identify risk states timely when emergency admission and offer medical care and nursing information.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Distribution of time to death in patients with basal ganglia hemorrhage. The upper right corner shows deaths within 30 days of study subjects.
Figure 2
Figure 2
Flow diagram of study recruitment and exclusion.
Figure 3
Figure 3
Overall workflow summarizing model algorithm.
Figure 4
Figure 4
Receiver operating characteristic curves versus precision-recall curves for the conservative treatment group and the surgical treatment group. (a) Receiver operating characteristic curves for all models in the conservative treatment group. (b) Receiver operating characteristic curves for all models in the surgical treatment group. (c) Precision-recall curves for all models in the conservative treatment group. (d) Precision-recall curves for all models in the surgical treatment group.
Figure 5
Figure 5
Overall SHAP interpretation of XGBoost and Stack model risk stratification. XGBoost uses the tree model interpretation method and the Stack model uses the Kernel interpretation method. (a, c): Ranking of all features in order of importance and showing the top 20 features. (b, d): Distribution showing the correlation of features with risk stratification results. Each point is a sample, red dots represent a high value of features, blue dots represent a low value of features, a negative SHAP value indicates a decrease in the probability of high risk and a positive one indicates an increase in the probability of high risk.
Figure 6
Figure 6
SHAP force plots based on individual patient prediction scores. (a) Low- and high-risk instances in the CTG. (b) Low- and high-risk instances in the STG. the base value is the mean Shap values predicted by the model, and the red and blue directed bars indicate the risk and safety features, respectively; the length of the bars represents the importance of the feature, which together drive the prediction from the base to the final value. The Shap values shown in the graph are the log odds of the true predicted probability.

References

    1. Muengtaweepongsa S, Seamhan B. Predicting mortality rate with ich score in thai intracerebral hemorrhage patients. Neurol. Asia. 2013;18(2):131–135.
    1. Van Asch CJ, Luitse MJ, Rinkel GJ, van der Tweel I, Algra A, Klijn CJ. Incidence, case fatality, and functional outcome of intracerebral haemorrhage over time, according to age, sex, and ethnic origin: A systematic review and meta-analysis. Lancet Neurol. 2010;9(2):167–176. doi: 10.1016/S1474-4422(09)70340-0. - DOI - PubMed
    1. An SJ, Kim TJ, Yoon B-W. Epidemiology, risk factors, and clinical features of intracerebral hemorrhage: An update. J. Stroke. 2017;19(1):3. doi: 10.5853/jos.2016.00864. - DOI - PMC - PubMed
    1. Obermeyer Z, Emanuel EJ. Predicting the future big data, machine learning, and clinical medicine. N. Engl. J. Med. 2016;375(13):1216. doi: 10.1056/NEJMp1606181. - DOI - PMC - PubMed
    1. Bhagyashree SIR, Nagaraj K, Prince M, Fall CH, Krishna M. Diagnosis of dementia by machine learning methods in epidemiological studies: a pilot exploratory study from South India. Soc. Psychiatry Psychiatr. Epidemiol. 2018;53(1):77–86. doi: 10.1007/s00127-017-1410-0. - DOI - PMC - PubMed

Publication types