Identification of Key Genes Associated with Overall Survival in Glioblastoma Multiforme Using TCGA RNA-Seq Expression Data
- PMID: 40725410
- PMCID: PMC12294400
- DOI: 10.3390/genes16070755
Identification of Key Genes Associated with Overall Survival in Glioblastoma Multiforme Using TCGA RNA-Seq Expression Data
Abstract
Background/Objectives: Glioblastoma multiforme (GBM) is an aggressive and heterogeneous brain tumor with poor prognosis, emphasizing the need for reliable molecular biomarkers to improve patient stratification and treatment planning. This study aimed to identify key genes associated with overall survival in GBM by employing and comparing machine learning (ML) and deep learning (DL) approaches using RNA-Seq gene expression data. Methods: RNA-Seq expression and clinical data for primary GBM tumors were obtained from The Cancer Genome Atlas (TCGA). A univariate Cox proportional hazards regression was used to identify survival-associated genes. For survival prediction, ML-based feature selection techniques-RF, GB, SVM-RFE, RF-RFE, and PCA-were used to construct multivariate Cox models. Separately, DeepSurv, a DL-based survival model, was trained using the significant genes from the univariate analysis. Gradient-based importance scoring was applied to determine key genes from the DeepSurv model. Results: Univariate analysis yielded 694 survival-associated genes. The best ML-based Cox model (RF-RFE with 90% training data) achieved a c-index of 0.725. In comparison, DeepSurv demonstrated superior performance with a c-index of 0.822. The top 10 genes were identified from the DeepSurv analysis, including CMTR1, GMPR, and PPY. Kaplan-Meier survival curves confirmed their prognostic significance, and network analysis highlighted their roles in processes such as purine metabolism, RNA processing, and neuroendocrine signaling. Conclusions: This study demonstrates the effectiveness of combining ML and DL models to identify prognostic gene expression biomarkers in GBM, with DeepSurv providing higher predictive accuracy. The findings offer valuable insights into GBM biology and highlight candidate biomarkers for further validation and therapeutic development.
Keywords: Cox regression; RNA-Seq; biomarkers; deep learning; gene network analysis; glioblastoma multiforme; machine learning; survival analysis.
Conflict of interest statement
The authors declare no conflicts of interest.
Figures




Similar articles
-
Immune intrinsic escape signature stratifies prognosis, characterizes the tumor immune microenvironment, and identifies tumorigenic PPP1R8 in glioblastoma multiforme patients.Front Immunol. 2025 Aug 6;16:1577920. doi: 10.3389/fimmu.2025.1577920. eCollection 2025. Front Immunol. 2025. PMID: 40842996 Free PMC article.
-
New insights for precision treatment of glioblastoma from analysis of single-cell lncRNA expression.J Cancer Res Clin Oncol. 2021 Jul;147(7):1881-1895. doi: 10.1007/s00432-021-03584-9. Epub 2021 Mar 11. J Cancer Res Clin Oncol. 2021. PMID: 33693962 Free PMC article.
-
Joint analysis of single-cell RNA sequencing and bulk transcriptome reveals the heterogeneity of the urea cycle of astrocytes in glioblastoma.Neurobiol Dis. 2025 May;208:106835. doi: 10.1016/j.nbd.2025.106835. Epub 2025 Feb 10. Neurobiol Dis. 2025. PMID: 39938577
-
A novel literature-based approach to identify genetic and molecular predictors of survival in glioblastoma multiforme: Analysis of 14,678 patients using systematic review and meta-analytical tools.J Clin Neurosci. 2015 May;22(5):785-99. doi: 10.1016/j.jocn.2014.10.029. Epub 2015 Feb 16. J Clin Neurosci. 2015. PMID: 25698544
-
Hypoxia-inducible factor 1alpha and vascular endothelial growth factor in Glioblastoma Multiforme: a systematic review going beyond pathologic implications.Oncol Res. 2024 Jul 17;32(8):1239-1256. doi: 10.32604/or.2024.052130. eCollection 2024. Oncol Res. 2024. PMID: 39055895 Free PMC article.
References
-
- Redekar S.S., Varma S.L., Bhattacharjee A. Identification of key genes associated with survival of glioblastoma multiforme using integrated analysis of TCGA datasets. Comput. Methods Programs Biomed. Update. 2022;2:100051. doi: 10.1016/j.cmpbup.2022.100051. - DOI
-
- Stoyanov G.S., Lyutfi E., Georgieva R., Georgiev R., Dzhenkov D.L., Petkova L., Ivanov B.D., Kaprelyan A., Ghenev P. Reclassification of glioblastoma multiforme according to the 2021 World Health Organization classification of central nervous system tumors: A single institution report and practical significance. Cureus. 2022;14:e21822. doi: 10.7759/cureus.21822. - DOI - PMC - PubMed
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Medical
Miscellaneous