Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Aug 13;24(1):420.
doi: 10.1186/s12872-024-04082-9.

Machine learning-based model to predict composite thromboembolic events among Chinese elderly patients with atrial fibrillation

Affiliations

Machine learning-based model to predict composite thromboembolic events among Chinese elderly patients with atrial fibrillation

Jiefeng Ren et al. BMC Cardiovasc Disord. .

Abstract

Objective: Accurate prediction of survival prognosis is helpful to guide clinical decision-making. The aim of this study was to develop a model using machine learning techniques to predict the occurrence of composite thromboembolic events (CTEs) in elderly patients with atrial fibrillation(AF). These events encompass newly diagnosed cerebral ischemia events, cardiovascular events, pulmonary embolism, and lower extremity arterial embolism.

Methods: This retrospective study included 6,079 elderly hospitalized patients (≥ 75 years old) with AF admitted to the People's Liberation Army General Hospital in China from January 2010 to June 2022. Random forest imputation was used for handling missing data. In the descriptive statistics section, patients were divided into two groups based on the occurrence of CTEs, and differences between the two groups were analyzed using chi-square tests for categorical variables and rank-sum tests for continuous variables. In the machine learning section, the patients were randomly divided into a training dataset (n = 4,225) and a validation dataset (n = 1,824) in a 7:3 ratio. Four machine learning models (logistic regression, decision tree, random forest, XGBoost) were trained on the training dataset and validated on the validation dataset.

Results: The incidence of composite thromboembolic events was 19.53%. The Least Absolute Shrinkage and Selection Operator (LASSO) method, using 5-fold cross-validation, was applied to the training dataset and identified a total of 18 features that exhibited a significant association with the occurrence of CTEs. The random forest model outperformed other models in terms of area under the curve (ACC: 0.9144, SEN: 0.7725, SPE: 0.9489, AUC: 0.927, 95% CI: 0.9105-0.9443). The random forest model also showed good clinical validity based on the clinical decision curve. The Shapley Additive exPlanations (SHAP) showed that the top five features associated with the model were history of ischemic stroke, high triglyceride (TG), high total cholesterol (TC), high plasma D-dimer, age.

Conclusions: This study proposes an accurate model to stratify patients with a high risk of CTEs. The random forest model has good performance. History of ischemic stroke, age, high TG, high TC and high plasma D-Dimer may be correlated with CTEs.

Keywords: Atrial fibrillation; Composite thromboembolic events; Early prediction model; Elderly patients; Machine learning techniques.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig 1
Fig 1
Flow chart outlining patient’s enrollment and study design
Fig. 2
Fig. 2
A: Overall performance of each model B: Clinical decision curve analysis of C: The heatmap of SHAP value each model D: Analysis of feature importance based on Analysis of feature importance based on SHAP summary plot

References

    1. Noubiap JJ, Tang JJ, Dewland TA, Marcus GM. Contemporary trends in incident ischemic stroke, intracranial hemorrhage, and mortality in individuals with atrial fibrillation. Eur Heart J Qual Care Clin Outcomes Published Online April. 2024;9. 10.1093/ehjqcco/qcae022. - PubMed
    1. Bucci T, Romiti GF, Shantsila A, et al. Risk of Death and Cardiovascular events in Asian patients with Atrial Fibrillation and Chronic Obstructive Pulmonary Disease: a Report from the prospective APHRS Registry. J Am Heart Assoc. 2024;13(7):e032785. 10.1161/JAHA.123.032785. 10.1161/JAHA.123.032785 - DOI - PMC - PubMed
    1. Paludan-Müller C, Vad OB, Stampe NK, et al. Atrial fibrillation: age at diagnosis, incident cardiovascular events, and mortality. Eur Heart J Published Online April. 2024;9. 10.1093/eurheartj/ehae216. - PMC - PubMed
    1. Liu D, Shi S, Liu X, et al. Retrospective cohort study of new-onset atrial fibrillation in acute pulmonary embolism on prognosis. BMJ Open. 2021;11(9):e047658. 10.1136/bmjopen-2020-047658. Published 2021 Sep 22. 10.1136/bmjopen-2020-047658 - DOI - PMC - PubMed
    1. Relander A, Jaakkola S, Virri H, et al. Fibrillatory wave amplitude and thromboembolic risk in non-anticoagulated patients with atrial fibrillation. Ann Med. 2024;56(1):2317362. 10.1080/07853890.2024.2317362. 10.1080/07853890.2024.2317362 - DOI - PMC - PubMed

Publication types

MeSH terms

LinkOut - more resources