Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jun:189:105343.
doi: 10.1016/j.cmpb.2020.105343. Epub 2020 Jan 16.

Application of data mining in a cohort of Italian subjects undergoing myocardial perfusion imaging at an academic medical center

Affiliations

Application of data mining in a cohort of Italian subjects undergoing myocardial perfusion imaging at an academic medical center

Carlo Ricciardi et al. Comput Methods Programs Biomed. 2020 Jun.

Abstract

Introduction: Coronary artery disease (CAD) is still one of the primary causes of death in the developed countries. Stress single-photon emission computed tomography is used to evaluate myocardial perfusion and ventricular function in patients with suspected or known CAD. This study sought to test data mining and machine learning tools and to compare some supervised learning algorithms in a large cohort of Italian subjects with suspected or known CAD who underwent stress myocardial perfusion imaging.

Methods: The dataset consisted of 10,265 patients with suspected or known CAD. The analysis was conducted using Knime analytics platform in order to implement Random Forests, C4.5, Gradient boosted tree, Naïve Bayes, and K nearest neighbor (KNN) after a procedure of features filtering. K-fold cross-validation was employed.

Results: Accuracy, error, precision, recall, and specificity were computed through the above-mentioned algorithms. Random Forests and gradients boosted trees obtained the highest accuracy (>95%), while it was comprised between 83% and 88%. The highest value for sensitivity and specificity was obtained by C4.5 (99.3%) and by Gradient boosted tree (96.9%). Naïve Bayes had the lowest precision (70.9%) and specificity (72.0%), KNN the lowest recall and sensitivity (79.2%).

Conclusions: The high scores obtained by the implementation of the algorithms suggests health facilities consider the idea of including services of advanced data analysis to help clinicians in decision-making. Similar applications of this kind of study in other contexts could support this idea.

Keywords: Analytics platform; Cardiology; Data mining; Decision-making; Myocardial perfusion imaging.

PubMed Disclaimer

Conflict of interest statement

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Publication types

LinkOut - more resources