Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan 18:8:812182.
doi: 10.3389/fcvm.2021.812182. eCollection 2021.

A Risk-Stratification Machine Learning Framework for the Prediction of Coronary Artery Disease Severity: Insights From the GESS Trial

Affiliations

A Risk-Stratification Machine Learning Framework for the Prediction of Coronary Artery Disease Severity: Insights From the GESS Trial

Nikolaos Mittas et al. Front Cardiovasc Med. .

Abstract

Our study aims to develop a data-driven framework utilizing heterogenous electronic medical and clinical records and advanced Machine Learning (ML) approaches for: (i) the identification of critical risk factors affecting the complexity of Coronary Artery Disease (CAD), as assessed via the SYNTAX score; and (ii) the development of ML prediction models for accurate estimation of the expected SYNTAX score. We propose a two-part modeling technique separating the process into two distinct phases: (a) a binary classification task for predicting, whether a patient is more likely to present with a non-zero SYNTAX score; and (b) a regression task to predict the expected SYNTAX score accountable to individual patients with a non-zero SYNTAX score. The framework is based on data collected from the GESS trial (NCT03150680) comprising electronic medical and clinical records for 303 adult patients with suspected CAD, having undergone invasive coronary angiography in AHEPA University Hospital of Thessaloniki, Greece. The deployment of the proposed approach demonstrated that atherogenic index of plasma levels, diabetes mellitus and hypertension can be considered as important risk factors for discriminating patients into zero- and non-zero SYNTAX score groups, whereas diastolic and systolic arterial blood pressure, peripheral vascular disease and body mass index can be considered as significant risk factors for providing an accurate estimation of the expected SYNTAX score, given that a patient belongs to the non-zero SYNTAX score group. The experimental findings utilizing the identified set of important risk factors indicate a sufficient prediction performance for the Support Vector Machine model (classification task) with an F-measure score of ~0.71 and the Support Vector Regression model (regression task) with a median absolute error value of ~6.5. The proposed data-driven framework described herein present evidence of the prediction capacity and the potential clinical usefulness of the developed risk-stratification models. However, further experimentation in a larger clinical setting is needed to ensure the practical utility of the presented models in a way to contribute to a more personalized management and counseling of CAD patients.

Keywords: SYNTAX score; coronary artery disease; machine learning; personalized (precision) medicine; risk-stratification model.

PubMed Disclaimer

Conflict of interest statement

FC is employed by Labnet Laboratories. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Proposed data-driven framework.
Figure 2
Figure 2
The boxplots and violin plots represent the distributions of the SYNTAX score of patients (dots) for each level of categorical risk factor.
Figure 3
Figure 3
Importance of features extracted by the Boruta algorithm (zero-part) (the abbreviations of the risk-factors can be found in Table 4) [Ratio 1: MonocytetoHDLcholesterol ratio; Ratio 2: Lymphocytetomonocyte ratio; Ratio 3: Atherogenic Index of Plasma levels (log(TGHDL)].
Figure 4
Figure 4
Importance of features extracted by the Boruta algorithm (count-part) (the abbreviations of the risk-factors can be found in Table 4).

Similar articles

Cited by

References

    1. Vizirianakis IS. Nanomedicine and personalized medicine toward the application of pharmacotyping in clinical practice to improve drug-delivery outcomes. Nanomed Nanotechnol Biol Med. (2011) 7:11–7. 10.1016/j.nano.2010.11.002 - DOI - PubMed
    1. Vizirianakis IS, Fatouros DG. Personalized nanomedicine: paving the way to the practical clinical utility of genomics and nanotechnology advancements. Adv Drug Deliv Rev. (2012) 64:1359–62. 10.1016/j.addr.2012.09.034 - DOI - PubMed
    1. Vizirianakis IS, Miliotou AN, Mystridis GA, Andriotis EG, Andreadis II, Papadopoulou LC et al. Tackling pharmacological response heterogeneity by PBPK modeling to advance precision medicine productivity of nanotechnology and genomics therapeutics. Expert Rev Precis Med Drug Dev. (2019) 4:139–51. 10.1080/23808993.2019.1605828 - DOI
    1. Hulsen T, Jamuar SS, Moody AR, Karnes JH, Varga O, Hedensted S, et al. . From big data to precision medicine. Front Med. (2019) 6:34. 10.3389/fmed.2019.00034 - DOI - PMC - PubMed
    1. Ngiam KY, Khor W. Big data and machine learning algorithms for health-care delivery. Lancet Oncol. (2019) 20:262–73. 10.1016/S1470-2045(19)30149-4 - DOI - PubMed