Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Aug 31;24(1):306.
doi: 10.1186/s12883-024-03818-6.

Development of machine learning-based models for predicting risk factors in acute cerebral infarction patients: a clinical retrospective study

Affiliations

Development of machine learning-based models for predicting risk factors in acute cerebral infarction patients: a clinical retrospective study

Changqing Yang et al. BMC Neurol. .

Abstract

Objectives: The aim of this study was to develop machine learning-based models for predicting acute cerebral infarction (ACI) in patients.

Methods: We extracted the data of ACI patients and non-ACI patients (as control) from two hospitals. The Lasso algorithm was employed to select the most crucial features associated with ACI. Five machine learning algorithms-based models were trained, which was performed with 10-fold cross-validation. Then, the area under the receiver operating characteristic curve (AUC), accuracy, and F1-score were calculated in the training models. Accordingly, the training models with excellent performance was selected as the final predictive model. The relative importance of variables was analyzed and ranked.

Results: A total of 150 patients were diagnosed with ACI (50.00%), with a higher proportion of males (70.67% vs. 44.00%) compared to the non-ACI patients. The logistic regression model exhibited a good performance in predicting ACI in the training set, as evidenced by its highest AUC, accuracy, sensitivity, and F1-score. Furthermore, feature importance analysis showed that blood glucose, gender, smoking history, serum homocysteine, folic acid, and C-reactive protein were the top six crucial variables of the logistic regression.

Conclusions: In our work, the ACI risk prediction model developed by the logistic regression exhibited excellent performance. This could contribute to the identification of risk variables for ACI patients and enables clinicians timely and effective interventions.

Keywords: Acute cerebral infarction; Machine learning; Prediction model; Risk factors.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The flowchart of this study. ACI, acute cerebral infarction; SVM, support vector machine; MLP, multi-layer perceptron neural network; CNB, complement Naive Bayes; LR, logistic regression; EMR, electronic medical record. The ACI group (N = 150), the non-ACI group (N = 150)
Fig. 2
Fig. 2
Clinical features selection using the Lasso algorithm. 28 out of 36 clinical variables were finally selected to train machine learning models
Fig. 3
Fig. 3
The receiver operating characteristic curves (ROCs) of machine learning training models. AUC, area under the ROC; SVM, support vector machine; MLP, multi-layer perceptron neural network; CNB, complement Naive Bayes
Fig. 4
Fig. 4
The relative importance of the variables in the logistic regression model is in decreasing order. The relative importance of high-ranking variables in the logistic regression model is arranged in descending order (the top six): blood glucose, gender, smoking history, Hcy, folic acid, and CRP

References

    1. Suzuki H, Kanamaru H, Kawakita F, Asada R, Fujimoto M, Shiba M. Cerebrovascular pathophysiology of delayed cerebral ischemia after aneurysmal subarachnoid hemorrhage. Histol Histopathol. 2021;36(2):143–58. - PubMed
    1. Zhang H, Yang G, Dong A. Prediction Model between Serum Vitamin D and Neurological Deficit in Cerebral Infarction Patients Based on Machine Learning. Computational and mathematical methods in medicine. 2022;2022:2914484. - PMC - PubMed
    1. Li X, Wang Y, Xu J. Development of a machine learning-based risk prediction model for cerebral infarction and comparison with nomogram model. J Affect Disord. 2022;314:341–8. 10.1016/j.jad.2022.07.045 - DOI - PubMed
    1. Nishi H, Oishi N, Ogawa H, Natsue K, Doi K, Kawakami O, et al. Predicting cerebral infarction in patients with atrial fibrillation using machine learning: the Fushimi AF registry. J Cereb Blood flow Metabolism: Official J Int Soc Cereb Blood Flow Metabolism. 2022;42(5):746–56. 10.1177/0271678X211063802 - DOI - PMC - PubMed
    1. Zhang X, Hu Y, Hong M, Guo T, Wei W, Song S. Plasma thrombomodulin, fibrinogen, and activity of tissue factor as risk factors for acute cerebral infarction. Am J Clin Pathol. 2007;128(2):287–92. 10.1309/HB6AB1YR4DQUT5AU - DOI - PubMed

LinkOut - more resources