A biomarker discovery of acute myocardial infarction using feature selection and machine learning
- PMID: 37199891
- PMCID: PMC10191821
- DOI: 10.1007/s11517-023-02841-y
A biomarker discovery of acute myocardial infarction using feature selection and machine learning
Abstract
Acute myocardial infarction (AMI) or heart attack is a significant global health threat and one of the leading causes of death. The evolution of machine learning has greatly revamped the risk stratification and death prediction of AMI. In this study, an integrated feature selection and machine learning approach was used to identify potential biomarkers for early detection and treatment of AMI. First, feature selection was conducted and evaluated before all classification tasks with machine learning. Full classification models (using all 62 features) and reduced classification models (using various feature selection methods ranging from 5 to 30 features) were built and evaluated using six machine learning classification algorithms. The results showed that the reduced models performed generally better (mean AUPRC via random forest (RF) algorithm for recursive feature elimination (RFE) method ranges from 0.8048 to 0.8260, while for random forest importance (RFI) method, it ranges from 0.8301 to 0.8505) than the full models (mean AUPRC via RF: 0.8044). The most notable finding of this study was the identification of a five-feature model that included cardiac troponin I, HDL cholesterol, HbA1c, anion gap, and albumin, which had achieved comparable results (mean AUPRC via RF: 0.8462) as to the models that containing more features. These five features were proven by the previous studies as significant risk factors for AMI or cardiovascular disease and could be used as potential biomarkers to predict the prognosis of AMI patients. From the medical point of view, fewer features for diagnosis or prognosis could reduce the cost and time of a patient as lesser clinical and pathological tests are needed.
Keywords: Acute myocardial infarction; Biomarker; Classification; Feature selection; Heart attack; Machine learning.
© 2023. International Federation for Medical and Biological Engineering.
Conflict of interest statement
The authors declare no competing interests.
Figures


Similar articles
-
A systematic comparison of short-term and long-term mortality prediction in acute myocardial infarction using machine learning models.BMC Med Inform Decis Mak. 2025 Jun 5;25(1):208. doi: 10.1186/s12911-025-03052-1. BMC Med Inform Decis Mak. 2025. PMID: 40474184 Free PMC article.
-
Application of machine learning to predict the occurrence of arrhythmia after acute myocardial infarction.BMC Med Inform Decis Mak. 2021 Nov 2;21(1):301. doi: 10.1186/s12911-021-01667-8. BMC Med Inform Decis Mak. 2021. PMID: 34724938 Free PMC article.
-
Accurate Classification and Prediction of Acute Myocardial Infarction through an ARMD Procedure.J Proteome Res. 2023 Mar 3;22(3):758-767. doi: 10.1021/acs.jproteome.2c00488. Epub 2023 Jan 30. J Proteome Res. 2023. PMID: 36710647
-
Robust biomarker screening from gene expression data by stable machine learning-recursive feature elimination methods.Comput Biol Chem. 2022 Oct;100:107747. doi: 10.1016/j.compbiolchem.2022.107747. Epub 2022 Jul 29. Comput Biol Chem. 2022. PMID: 35932551
-
Machine Learning Applications in Acute Coronary Syndrome: Diagnosis, Outcomes and Management.Adv Ther. 2025 Feb;42(2):636-665. doi: 10.1007/s12325-024-03060-z. Epub 2024 Dec 6. Adv Ther. 2025. PMID: 39641854 Review.
Cited by
-
The association between anion gap and length of stay in patients undergoing hip fracture surgery: data from the MIMIC-IV database.BMC Musculoskelet Disord. 2024 Oct 16;25(1):819. doi: 10.1186/s12891-024-07932-x. BMC Musculoskelet Disord. 2024. PMID: 39415122 Free PMC article.
-
Advancements and challenges in high-sensitivity cardiac troponin assays: diagnostic, pathophysiological, and clinical perspectives.Clin Chem Lab Med. 2025 Feb 7;63(7):1260-1278. doi: 10.1515/cclm-2024-1090. Print 2025 Jun 26. Clin Chem Lab Med. 2025. PMID: 39915924 Review.
-
An exploratory study of high-throughput transcriptomic analysis reveals novel mRNA biomarkers for acute myocardial infarction using integrated methods.Sci Rep. 2025 Mar 11;15(1):8436. doi: 10.1038/s41598-025-92757-4. Sci Rep. 2025. PMID: 40069305 Free PMC article.
-
Machine Learning-Driven Transcriptome Analysis of Keratoconus for Predictive Biomarker Identification.Biomedicines. 2025 Apr 24;13(5):1032. doi: 10.3390/biomedicines13051032. Biomedicines. 2025. PMID: 40426861 Free PMC article.
-
Intrinsic factors behind long COVID: exploring the role of nucleocapsid protein in thrombosis.PeerJ. 2025 May 20;13:e19429. doi: 10.7717/peerj.19429. eCollection 2025. PeerJ. 2025. PMID: 40416618 Free PMC article. Review.
References
-
- Venkatason P, et al. In-hospital mortality of cardiogenic shock complicating ST-elevation myocardial infarction in Malaysia: a retrospective analysis of the Malaysian National Cardiovascular Database (NCVD) registry. BMJ Open. 2019;9(5):e025734. doi: 10.1136/bmjopen-2018-025734. - DOI - PMC - PubMed
-
- World Health Organization (WHO) (2021) Cardiovascular Disease. Available from: https://www.who.int/cardiovascular_diseases/en. Accessed 13 Oct 2021
-
- Amir M, Mappiare M, Indra P. The impact of cytochrome P450 2C19 polymorphism on cardiovascular events in indonesian patients with coronary artery disease. Clin Cardiol Cardiovasc Med. 2017;1:15–21.
-
- Ang CS, Chan KM. A review of coronary artery disease research in Malaysia. Med J Malaysia. 2016;74:67–78. - PubMed
-
- Institute for Health Metrics and Evaluation (IHME) (2020) GBD 2019 cause and risk summary: cardiovascular disease. Available from: https://www.healthdata.org/results/gbd_summaries/2019. Accessed 9 Apr 2022
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
Research Materials
Miscellaneous