Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Apr 23;15(1):14034.
doi: 10.1038/s41598-025-98542-7.

Gene regulatory networks analysis for the discovery of prognostic genes in gliomas

Affiliations

Gene regulatory networks analysis for the discovery of prognostic genes in gliomas

Pedro Marçal Barcelos et al. Sci Rep. .

Abstract

Gliomas are the most common and aggressive primary tumors of the central nervous system. Dysregulated transcription factors (TFs) and genes have been implicated in glioma progression, yet these tumors' overall structure of gene regulatory networks (GRNs) remains undefined. We analyzed transcriptional data from 989 primary gliomas in The Cancer Genome Atlas (TCGA) and the Chinese Glioma Genome Atlas (CGGA) to address this. GRNs were reconstructed using the RTN package which identifies regulons-sets of genes regulated by a common TF based on co-expression and mutual information. Regulon activity was evaluated through Gene Set Enrichment Analysis. Elastic net regularization and Cox regression identified 31 and 32 prognostic genes in the TCGA and CGGA datasets, respectively, with 11 genes overlapping, many of which are associated with neural development and synaptic processes. GAS2L3, HOXD13, and OTP demonstrated the strongest correlations with survival outcomes among these. Single-cell RNA-seq analysis of 201,986 cells revealed distinct expression patterns for these genes in glioma subpopulations, particularly oligoprogenitor cells. This study uncovers key GRNs and prognostic genes in gliomas, offering new insights into tumor biology and potential therapeutic targets.

Keywords: Glioma; Regulons; Survival.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Least absolute shrinkage and selection operator (LASSO) analysis of regulons from TCGA and CGGA: (a) Schematic workflow. (b) Cross-validation results demonstrating the performance of LASSO for variable coefficient selection. The plot illustrates the partial likelihood deviance across different values of the regularization parameter (lambda), aiding in the selection of the optimal lambda. (c, d) LASSO coefficients for the identified regulons in the TCGA and CGGA dataset respectively. Each dot represents the magnitude and direction of the effect of each variable on the model. (e) Hierarchical tree-and-leaf representation of the identified regulons; the size of each circle corresponds to the number of genes within each regulon. Green circles indicate regulons derived from TCGA, while orange circles represent those from CGGA. (e) Heatmaps displaying the activity levels of the identified regulons across all samples from CGGA and TCGA.
Fig. 2
Fig. 2
Characterization of genes within the regulons: (a) Functional enrichment analysis of all identified regulons from CGGA and TCGA, highlighting the biological processes significantly associated with the gene sets. (b) Venn diagram illustrating the overlap of genes within prognostically relevant regulons from CGGA and TCGA. The diagram indicates the number of unique and shared genes. (c, d) Cross-validation results for Elastic Net variable coefficient screening, demonstrating the stability and performance of selected variables across different parameter settings. The plots shows partial likelihood deviance as a function of the regularization parameters, aiding in the identification of the optimal model, significant variables coefficient values and elastic net coefficients for the identified regulons in the CGGA and TCGA dataset respectively. Each dot represents the magnitude and direction of the effect of each variable on the model.
Fig. 3
Fig. 3
Common prognostic genes. (a) Upset plot illustrating the overlap of genes identified by elastic net in the TCGA and CGGA datasets. (b) Hazard ratios for the eleven common prognostic genes, adjusted for tumor grade and patient age. The plot presents the hazard ratios with corresponding confidence intervals. (c, d) Survival curves of CGGA and TCGA respectively, stratified by the maximization of the log-rank test statistic, demonstrating the differential survival probabilities of patients with high versus low expression levels. The curves are accompanied by log-rank test p-values to assess the statistical significance of the observed differences. Benjamini–Hochberg (BH) was used to adjust for multiple hypothesis.
Fig. 4
Fig. 4
Gene expression at the single-cell level. (a) Schematic representation of the analysis workflow used to assess gene expression at the single-cell level. (b) UMAP plot displaying the distribution of all 201,989 single cells, with cells color-coded according to their respective labels. (c) Feature plots illustrating the expression levels of the eleven prognostic genes across all cells. Each plot showcases the spatial distribution of gene expression within the cell population. (d) Dotplot showcasing average expression (color gradient) and percentage of cells expressing (dot size) of genes across cell populations.
Fig. 5
Fig. 5
Glioma subcluster exploration. (a) UMAP plot illustrating glioma subclusters identified through clustering analysis. Seven distinct GCs are specified. (b) UMAP plot depicting the cell cycle phases of the cells included in the analysis. Cells are color-coded according to their respective cell cycle phases (G1, S, G2/M). (c) Heatmap displaying the top 5 marker genes for each GC. (d) UMAP plot showcasing the module scores of the four glioma cell states (AC, MES, NPC, OPC). (e) Quadrant plot illustrating the distribution of each GCs across four quadrants, each representing one of the glioma cell states.
Fig. 6
Fig. 6
Gene expression across glioma subclusters (GCs). (a) Feature plot displaying the expression levels of eleven selected genes within the glioma subset. Each cell is color-coded according to the expression intensity of the respective gene. (b) Dotplot of the mean and percent expression from the eleven genes across GCs.
Fig. 7
Fig. 7
Proposed mechanisms linking prognostic genes to glioma biology: Schematic processes that each prognostic gene is involved in the glioma biology.

References

    1. Ostrom, Q. T. et al. CBTRUS statistical report: Primary brain and other central nervous system tumors diagnosed in the United States in 2009–2013. Neuro Oncology18, v1–v75 (2016). - PMC - PubMed
    1. Mathur, R. et al. Glioblastoma evolution and heterogeneity from a 3D whole-tumor perspective. Cell187, 446–463 (2024). - PMC - PubMed
    1. Torp, S. H., Solheim, O. & Skjulsvik, A. J. The WHO 2021 Classification of Central Nervous System tumours: A practical update on what neurosurgeons need to know—A minireview. Acta Neurochir.164, 66 (2022). - PMC - PubMed
    1. Louis, D. N. et al. The 2021 WHO classification of tumors of the central nervous system: A summary. Neuro Oncology23, 1231–1251 (2021). - PMC - PubMed
    1. Antonelli, M. & Poliani, P. L. Adult type diffuse gliomas in the new 2021 WHO classification. Pathologica10.32074/1591-951X-823 (2022). - PMC - PubMed

LinkOut - more resources