Evaluating Binary Classifiers for Cardiovascular Disease Prediction: Enhancing Early Diagnostic Capabilities
- PMID: 39728286
- PMCID: PMC11678659
- DOI: 10.3390/jcdd11120396
Abstract
Cardiovascular disease (CVD) is a significant global health concern and the leading cause of death in many countries. Early detection and diagnosis of CVD can significantly reduce the risk of complications and mortality. Machine learning methods, particularly classification algorithms, have demonstrated their potential to accurately predict CVD risk by analyzing patient data. This study evaluates the effectiveness of seven binary classification algorithms in predicting CVD: Random Forests, Logistic Regression, Naive Bayes, K-Nearest Neighbors (kNN), Support Vector Machines, Gradient Boosting, and Artificial Neural Networks. SMOTE-ENN resampling to address class imbalance and hyperparameter optimization through Grid Search Cross-Validation were applied to enhance the reliability and performance of these models. Standard evaluation metrics, including accuracy, precision, recall, F1-score, and Area Under the Receiver Operating Characteristic Curve (ROC-AUC), were used to assess predictive capabilities. The results show that kNN achieved the highest accuracy (99%) and AUC (0.99), surpassing traditional models such as Logistic Regression and Gradient Boosting. The study also examines challenges encountered when working with CVD datasets, such as class imbalance and feature selection, and demonstrates how addressing these issues enhances the reliability and applicability of predictive models. These findings emphasize the potential of kNN as a reliable tool for early CVD prediction, offering significant improvements over previous studies. This research highlights the value of advanced machine learning techniques in healthcare, addressing key challenges and laying a foundation for future studies aimed at improving predictive models for CVD prevention.
Keywords: artificial intelligence; artificial intelligence in medical diagnosis; cardiovascular diseases; machine learning.
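The evaluation metrics named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names (`confusion_counts`, `classification_metrics`, `roc_auc`), the toy labels and scores, and the 0.5 decision threshold are all assumptions made for illustration. ROC-AUC is computed here through its rank interpretation, the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as one half.

```python
from typing import Dict, Sequence


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, int]:
    """Count true/false positives and negatives for binary labels (1 = disease)."""
    counts = {"tp": 0, "fp": 0, "tn": 0, "fn": 0}
    for truth, pred in zip(y_true, y_pred):
        if truth == 1 and pred == 1:
            counts["tp"] += 1
        elif truth == 0 and pred == 1:
            counts["fp"] += 1
        elif truth == 0 and pred == 0:
            counts["tn"] += 1
        else:
            counts["fn"] += 1
    return counts


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, precision, recall, and F1 from hard 0/1 predictions."""
    c = confusion_counts(y_true, y_pred)
    total = c["tp"] + c["fp"] + c["tn"] + c["fn"]
    accuracy = (c["tp"] + c["tn"]) / total if total else 0.0
    precision = c["tp"] / (c["tp"] + c["fp"]) if (c["tp"] + c["fp"]) else 0.0
    recall = c["tp"] / (c["tp"] + c["fn"]) if (c["tp"] + c["fn"]) else 0.0
    f1 = (2 * precision * recall / (precision + recall)) if (precision + recall) else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """ROC-AUC as the fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("ROC-AUC needs at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = CVD present, 0 = CVD absent.
toy_labels = [1, 1, 1, 0, 0, 0, 0, 1]
toy_scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.1, 0.2, 0.7]
# Assumed decision threshold of 0.5 to turn scores into hard predictions.
toy_preds = [1 if s >= 0.5 else 0 for s in toy_scores]

print(classification_metrics(toy_labels, toy_preds))
print(roc_auc(toy_labels, toy_scores))
```

On imbalanced data, accuracy alone can look strong while recall for the minority class stays poor, which is one reason to report precision, recall, F1-score, and ROC-AUC side by side.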
Conflict of interest statement
The authors declare no conflicts of interest.