Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Dec 10;19(1):127.
doi: 10.1186/s13062-024-00543-5.

An exploration into the diagnostic capabilities of microRNAs for myocardial infarction using machine learning

Affiliations

An exploration into the diagnostic capabilities of microRNAs for myocardial infarction using machine learning

Mehrdad Samadishadlou et al. Biol Direct. .

Abstract

Background: MicroRNAs (miRNAs) have shown potential as diagnostic biomarkers for myocardial infarction (MI) due to their early dysregulation and stability in circulation after MI. Moreover, they play a crucial role in regulating adaptive and maladaptive responses in cardiovascular diseases, making them attractive targets for potential biomarkers. However, their potential as novel biomarkers for diagnosing cardiovascular diseases requires systematic evaluation.

Methods: This study aimed to identify a miRNA biomarker panel for early-stage MI detection using bioinformatics and machine learning (ML) methods. miRNA expression data were obtained for early-stage MI patients and healthy controls from the Gene Expression Omnibus. Separate datasets were allocated for training and independent testing. Differential expression analysis was performed to identify dysregulated miRNAs in the training set. The least absolute shrinkage and selection operator (LASSO) was applied for feature selection to prioritize relevant miRNAs associated with MI. The selected miRNAs were used to develop ML models including support vector machine, Gradient Boosted, XGBoost, and a hard voting ensemble (HVE).

Results: Differential expression analysis discovered 99 dysregulated miRNAs in the training set. LASSO feature selection prioritized 21 miRNAs. Ten miRNAs were identified in both the LASSO subset and independent test set. The HVE model trained with the selected miRNAs achieved an accuracy of 0.86 and AUC of 0.83 on the independent test set.

Conclusions: An integrated framework for robust miRNA selection from omics data shows promise for developing accurate diagnostic models for early-stage MI detection. The HVE model demonstrated good performance despite differences between training and test datasets.

Keywords: Bioinformatics; Diagnosis; Machine learning; MicroRNA; Myocardial infarction.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: The study was approved by the research ethics committee of Tabriz University of Medical Sciences (approval ID: IR.TBZMED.VCR.REC.1399.388, date of approval: 2021/1/11). Consent for publication: All authors gave consent for the publication of the article Competing interests: The authors declare that they have no Conflict of interest.

Figures

Fig. 1
Fig. 1
Data processing and identification of DEMs. (A) Principal component analysis plots for the training data. (B) Volcano plot of DEMs
Fig. 2
Fig. 2
Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways enrichment result in DEMs. A Biological process terms (BP), Cellular component terms (CC), and Molecular function terms (MF). B KEGG analysis
Fig. 3
Fig. 3
Selection of the most important miRNAs. A LASSO coefficient profiles for DEMs and B Expression heatmap of 21 LASSO-selected miRNAs
Fig. 4
Fig. 4
A The intersection of LASSO-selected miRNAs and miRNAs was selected on test dataset. B Expression profile of selected miRNAs for training machine learning models
Fig. 5
Fig. 5
ROC curve for A individual miRNAs on the validation set. B Gradient Boosted (GB), XGBoost (XGB), and support vector machine (SVM) with preset hyper-parameters on the validation set. C Hyper-tuned GB, XGB, and SVM and a hard voting ensemble (HVE) made of the three hyper-tuned models on the validation set. D GB, XGB, SVM, and HVE on test set

References

    1. Nowbar AN, Gitto M, Howard JP, Francis DP, Al-Lamee R. Mortality from ischemic heart disease: Analysis of data from the World Health Organization and coronary artery disease risk factors From NCD Risk Factor Collaboration. Circ Cardiovasc Qual Outcomes. 2019;12(6): e005375. 10.1161/CIRCOUTCOMES.118.005375. - PMC - PubMed
    1. Thygesen K, Alpert JS, Jaffe AS, Chaitman BR, Bax JJ, Morrow DA, et al. Fourth Universal Definition of Myocardial Infarction (2018). Circulation. 2018;138(20):e618–51. 10.1161/CIR.0000000000000617. - PubMed
    1. Schulte C, Barwari T, Joshi A, Zeller T, Mayr M. Noncoding RNAs versus Protein Biomarkers in Cardiovascular Disease. Trends Mol Med. 2020;26(6):583–96. 10.1016/j.molmed.2020.02.001. - PubMed
    1. Cheng C, Wang Q, You W, Chen M, Xia J. MiRNAs as biomarkers of myocardial infarction: a meta-analysis. PloS ONE. 2014;9(2):88566. - PMC - PubMed
    1. Wang B, Li Y, Hao X, Yang J, Han X, Li H, et al. Comparison of the Clinical Value of miRNAs and Conventional Biomarkers in AMI: A Systematic Review. Front Genet. 2021. 10.3389/fgene.2021.668324. - PMC - PubMed

LinkOut - more resources