Integrative transcriptomic, proteomic, and machine learning approach to identifying feature genes of atrial fibrillation using atrial samples from patients with valvular heart disease
- PMID: 33509101
- PMCID: PMC7842070
- DOI: 10.1186/s12872-020-01819-0
Integrative transcriptomic, proteomic, and machine learning approach to identifying feature genes of atrial fibrillation using atrial samples from patients with valvular heart disease
Abstract
Background: Atrial fibrillation (AF) is the most common arrhythmia with poorly understood mechanisms. We aimed to investigate the biological mechanism of AF and to discover feature genes by analyzing multi-omics data and by applying a machine learning approach.
Methods: At the transcriptomic level, four microarray datasets (GSE41177, GSE79768, GSE115574, GSE14975) were downloaded from the Gene Expression Omnibus database, which included 130 available atrial samples from AF and sinus rhythm (SR) patients with valvular heart disease. Microarray meta-analysis was adopted to identified differentially expressed genes (DEGs). At the proteomic level, a qualitative and quantitative analysis of proteomics in the left atrial appendage of 18 patients (9 with AF and 9 with SR) who underwent cardiac valvular surgery was conducted. The machine learning correlation-based feature selection (CFS) method was introduced to selected feature genes of AF using the training set of 130 samples involved in the microarray meta-analysis. The Naive Bayes (NB) based classifier constructed using training set was evaluated on an independent validation test set GSE2240.
Results: 863 DEGs with FDR < 0.05 and 482 differentially expressed proteins (DEPs) with FDR < 0.1 and fold change > 1.2 were obtained from the transcriptomic and proteomic study, respectively. The DEGs and DEPs were then analyzed together which identified 30 biomarkers with consistent trends. Further, 10 features, including 8 upregulated genes (CD44, CHGB, FHL2, GGT5, IGFBP2, NRAP, SEPTIN6, YWHAQ) and 2 downregulated genes (TNNI1, TRDN) were selected from the 30 biomarkers through machine learning CFS method using training set. The NB based classifier constructed using the training set accurately and reliably classify AF from SR samples in the validation test set with a precision of 87.5% and AUC of 0.995.
Conclusion: Taken together, our present work might provide novel insights into the molecular mechanism and provide some promising diagnostic and therapeutic targets of AF.
Keywords: Atrial fibrillation; Feature gene; Machine learning; Proteomic; Transcriptomic.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures



Similar articles
-
Identification of potential crucial genes in atrial fibrillation: a bioinformatic analysis.BMC Med Genomics. 2020 Jul 18;13(1):104. doi: 10.1186/s12920-020-00754-5. BMC Med Genomics. 2020. PMID: 32682418 Free PMC article.
-
Analysis of potential genetic biomarkers using machine learning methods and immune infiltration regulatory mechanisms underlying atrial fibrillation.BMC Med Genomics. 2022 Mar 19;15(1):64. doi: 10.1186/s12920-022-01212-0. BMC Med Genomics. 2022. PMID: 35305619 Free PMC article.
-
Identification of key genes in atrial fibrillation using bioinformatics analysis.BMC Cardiovasc Disord. 2020 Aug 10;20(1):363. doi: 10.1186/s12872-020-01653-4. BMC Cardiovasc Disord. 2020. PMID: 32778054 Free PMC article.
-
The transcriptional landscape of atrial fibrillation: A systematic review and meta-analysis.PLoS One. 2025 May 30;20(5):e0323534. doi: 10.1371/journal.pone.0323534. eCollection 2025. PLoS One. 2025. PMID: 40446189 Free PMC article.
-
Uncovering hepatic transcriptomic and circulating proteomic signatures in MASH: A meta-analysis and machine learning-based biomarker discovery.Comput Biol Med. 2025 Jun;191:110170. doi: 10.1016/j.compbiomed.2025.110170. Epub 2025 Apr 12. Comput Biol Med. 2025. PMID: 40220593
Cited by
-
Distinct functional and molecular profiles between physiological and pathological atrial enlargement offer potential new therapeutic opportunities for atrial fibrillation.Clin Sci (Lond). 2024 Aug 7;138(15):941-962. doi: 10.1042/CS20240178. Clin Sci (Lond). 2024. PMID: 39018488 Free PMC article.
-
Construction of Prediction Model for Atrial Fibrillation with Valvular Heart Disease Based on Machine Learning.Rev Cardiovasc Med. 2022 Jun 28;23(7):247. doi: 10.31083/j.rcm2307247. eCollection 2022 Jul. Rev Cardiovasc Med. 2022. PMID: 39076905 Free PMC article.
-
Machine Learning in Prediction of Bladder Cancer on Clinical Laboratory Data.Diagnostics (Basel). 2022 Jan 14;12(1):203. doi: 10.3390/diagnostics12010203. Diagnostics (Basel). 2022. PMID: 35054370 Free PMC article.
-
Multi-Omics Analysis of Gene and Protein Candidates Possibly Related to Tetrodotoxin Accumulation in the Skin of Takifugu flavidus.Mar Drugs. 2021 Nov 15;19(11):639. doi: 10.3390/md19110639. Mar Drugs. 2021. PMID: 34822510 Free PMC article.
-
Mapping of Neuro-Cardiac Electrophysiology: Interlinking Epilepsy and Arrhythmia.J Cardiovasc Dev Dis. 2023 Oct 18;10(10):433. doi: 10.3390/jcdd10100433. J Cardiovasc Dev Dis. 2023. PMID: 37887880 Free PMC article. Review.
References
-
- Chugh SS, Havmoeller R, Narayanan K, Singh D, Rienstra M, Benjamin EJ, Gillum RF, Kim YH, McAnulty JH, Jr, Zheng ZJ, Forouzanfar MH, Naghavi M, Mensah GA, Ezzati M, Murray CJ. Worldwide epidemiology of atrial fibrillation: a Global Burden of Disease 2010 Study. Circulation. 2014;129(8):837–847. - PMC - PubMed
-
- Loris N, Sheryl B, Alessandra L. Combining multiple approaches for gene microarray classification. Bioinformatics. 2012;8:1151–1157. - PubMed
-
- Ghazalpour A, Bennett B, Petyuk VA, Orozco L, Hagopian R, Mungrue IN, Farber CR, Sinsheimer J, Kang HM, Furlotte N, Park CC, Wen PZ, Brewer H, Weitz K, Camp DG, 2nd, Pan C, Yordanova R, Neuhaus I, Tilford C, Siemers N, Gargalovic P, Eskin E, Kirchgessner T, Smith DJ, Smith RD, Lusis AJ. Comparative analysis of proteome and transcriptome variation in mouse. PLoS Genet. 2011;7(6):e1001393. doi: 10.1371/journal.pgen.1001393. - DOI - PMC - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Research Materials
Miscellaneous