Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep:107:105280.
doi: 10.1016/j.ebiom.2024.105280. Epub 2024 Aug 16.

AI-based derivation of atrial fibrillation phenotypes in the general and critical care populations

Affiliations

AI-based derivation of atrial fibrillation phenotypes in the general and critical care populations

Ryan A A Bellfield et al. EBioMedicine. 2024 Sep.

Abstract

Background: Atrial fibrillation (AF) is the most common heart arrhythmia worldwide and is linked to a higher risk of mortality and morbidity. To predict AF and AF-related complications, clinical risk scores are commonly employed, but their predictive accuracy is generally limited, given the inherent complexity and heterogeneity of patients with AF. By classifying different presentations of AF into coherent and manageable clinical phenotypes, the development of tailored prevention and treatment strategies can be facilitated. In this study, we propose an artificial intelligence (AI)-based methodology to derive meaningful clinical phenotypes of AF in the general and critical care populations.

Methods: Our approach employs generative topographic mapping, a probabilistic machine learning method, to identify micro-clusters of patients with similar characteristics. It then identifies macro-cluster regions (clinical phenotypes) in the latent space using Ward's minimum variance method. We applied it to two large cohort databases (UK-Biobank and MIMIC-IV) representing general and critical care populations.

Findings: The proposed methodology showed its ability to derive meaningful clinical phenotypes of AF. Because of its probabilistic foundations, it can enhance the robustness of patient stratification. It also produced interpretable visualisation of complex high-dimensional data, enhancing understanding of the derived phenotypes and their key characteristics. Using our methodology, we identified and characterised clinical phenotypes of AF across diverse patient populations.

Interpretation: Our methodology is robust to noise, can uncover hidden patterns and subgroups, and can elucidate more specific patient profiles, contributing to more robust patient stratification, which could facilitate the tailoring of prevention and treatment programs specific to each phenotype. It can also be applied to other datasets to derive clinically meaningful phenotypes of other conditions.

Funding: This study was funded by the DECIPHER project (LJMU QR-PSF) and the EU project TARGET (10113624).

Keywords: Atrial fibrillation; Clinical phenotypes; Clustering; Generative topographic mapping; MIMIC-IV; Machine learning; Probabilistic modelling; Stratification; UK-Biobank.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests All authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Proposed AI-based methodology to generate reliable phenotypes. Data is modelled by the GTM algorithm, which projects the data into a 2-dimensional latent space, visualised in the membership map. The GTM also produces reference maps, which are used to indicate the influence of a variable over a micro-cluster. Hierarchical clustering is then applied to the reference vectors to group similar micro-clusters together into larger macro-clusters, which in turn are used to derive the phenotypes.
Fig. 2
Fig. 2
Reference vector visualisations demonstrating how each biological sample variable affects the cluster distribution in the latent space for both, the UK-Biobank and the MIMIC-IV AF cohorts.
Fig. 3
Fig. 3
Membership maps showing how a selection of investigative variables data are distributed within the latent space for the UK-Biobank and the MIMIC-IV cohorts. AF: Atrial Fibrillation. AKI: Acute Kidney Injury. ARDS: Acute Respiratory Distress Syndrome. GCS: Glasgow Coma Scale.
Fig. 4
Fig. 4
Derived phenotypes of AF in the general population using UK-Biobank data. a) Dendrogram produced using Ward’s minimum variance method. The graph shows the 5 clusters that are used to define the 5 AF phenotypes for the general population. b) Membership map with a uniform size for the micro-clusters to show the distribution of the macro-cluster regions. c) The size of the micro-clusters in the membership map dictated by the number of participants assigned to it. d) Main characterising features for each of the phenotypes.
Fig. 5
Fig. 5
Derived phenotypes of AF in the general population using MIMIC-IV data. a) Dendrogram produced using Ward’s minimum variance method. The graph shows the 4 clusters that are used to define the 4 AF phenotypes for ICU patients. b) Membership map with a uniform size for the micro-clusters to show the distribution of the macro-cluster regions. c) The size of the micro-clusters in the membership map dictated by the number of participants assigned to it. d) Main characterising features for each of the phenotypes.
Fig. 6
Fig. 6
Membership map with the probability distributions for different data points superimposed. Maps a) and b) show the probability distribution for two randomly selected participants from the general population taken from the UK Biobank database. Maps c) and d) show the probability distribution for two randomly selected patients from the critical care population taken from the MIMIC-IV database.

References

    1. Saito Y., Omae Y., Nagashima K., et al. Phenotyping of atrial fibrillation with cluster analysis and external validation. Heart. 2023;109:1751–1758. - PMC - PubMed
    1. Zhang J., Johnsen S.P., Guo Y., Lip G.Y.H. Epidemiology of atrial fibrillation. Card Electrophysiol Clin. 2021;13(1):1–23. - PubMed
    1. Lip G.Y.H., Genaidy A., Tran G., Marroquin P., Estes C., Sloop S. Improving stroke risk prediction in the general population: a comparative assessment of common clinical rules, a new multimorbid index, and machine-learning-based algorithms. Thromb Haemost. 2021;122(1):142–150. - PubMed
    1. Romiti G.F., Proietti M., Bonini N., et al. Clinical complexity domains, anticoagulation, and outcomes in patients with atrial fibrillation: a report from the GLORIA-AF registry phase II and III. Thromb Haemost. 2022;122(12):2030–2041. - PubMed
    1. Olier I., Ortega-Martorell S., Pieroni M., Lip G.Y.H. How machine learning is impacting research in atrial fibrillation: implications for risk prediction and future management. Cardiovasc Res. 2021;117(7):1700–1717. - PMC - PubMed