Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jan 5;45(1):6.
doi: 10.1007/s10916-020-01682-8.

Data Mining for Cardiovascular Disease Prediction

Affiliations

Data Mining for Cardiovascular Disease Prediction

Bárbara Martins et al. J Med Syst. .

Abstract

Cardiovascular diseases (CVDs) aredisorders of the heart and blood vessels and are a major cause of disability and premature death worldwide. Individuals at higher risk of developing CVD must be noticed at an early stage to prevent premature deaths. Advances in the field of computational intelligence, together with the vast amount of data produced daily in clinical settings, have made it possible to create recognition systems capable of identifying hidden patterns and useful information. This paper focuses on the application of Data Mining Techniques (DMTs) to clinical data collected during the medical examination in an attempt to predict whether or not an individual has a CVD. To this end, the CRossIndustry Standard Process for Data Mining (CRISP-DM) methodology was followed, in which five classifiers were applied, namely DT, Optimized DT, RI, RF, and DL. The models were mainly developed using the RapidMiner software with the assist of the WEKA tool and were analyzed based on accuracy, precision, sensitivity, and specificity. The results obtained were considered promising on the basis of the research for effective means of diagnosing CVD, with the best model being Optimized DT, which achieved the highest values for all the evaluation metrics, 73.54%, 75.82%, 68.89%, 78.16% and 0.788 for accuracy, precision, sensitivity, specificity, and AUC, respectively.

Keywords: CRISP-DM; Cardiovascular disease; Classification; Data mining; Decision support systems; Health information systems.

PubMed Disclaimer

References

    1. Cardiovascular diseases (cvds). https://www.who.int/news-room/fact-sheets/detail/cardiovascular-diseases...
    1. Anderson K.M., Odell P.M., Wilson P.W., Kannel W.B.: Cardiovascular disease risk profiles. American Heart Journal 121(1):293–298, 1991 - DOI
    1. Brito C., Esteves M., Peixoto H., Abelha A., Machado J. (2019) A data mining approach to classify serum creatinine values in patients undergoing continuous ambulatory peritoneal dialysis. Wirel. Netw:1–9
    1. Ferreira D., Silva S., Abelha A., Machado J.: Recommendation system using autoencoders. Appl. Sci. 10(16):5510, 2020 - DOI
    1. Jothi N., Husain W., et al.: Data mining in healthcare–a review. Procedia Computer Science 72:306–313, 2015 - DOI

LinkOut - more resources