Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 7;15(1):28926.
doi: 10.1038/s41598-025-13547-6.

Machine learning derived development and validation of extracellular matrix related signature for predicting prognosis in adolescents and young adults glioma

Affiliations

Machine learning derived development and validation of extracellular matrix related signature for predicting prognosis in adolescents and young adults glioma

Pancheng Wu et al. Sci Rep. .

Abstract

The mortality rates have been increasing for glioma in adolescents and young adults (AYAs, aged 15-39 years). However, current biomarkers for clinical assessment in AYAs glioma are limited, prompting the urgent need for identifying ideal prognostic signature. Extracellular matrix is involved in the development of tumors, while their prognostic significance in AYAs glioma remains unclear. By an integrated machine learning workflow and circuit training and validation procedure, we developed a machine learning-derived prognostic signature (MLDPS) based on 1,026 extracellular matrix-related genes and 3 AYAs glioma cohorts. MLDPS exhibited robust and consistent predictive performance in overall survival and could serve as an independent prognostic factor for AYAs glioma. Simultaneously, MLDPS outperformed previous 89 published prognostic signatures and traditional clinical characteristics, confirming the robust predictive capability. Besides, MLDPS had the potential to stratify prognosis in patients with other cancer types. In addition, the tumor microenvironment between high and low MLDPS groups displayed different patterns while more tumor-infiltrating immune cells were observed in high MLDPS group. Additionally, patients in low MLDPS group had significantly prolonged survival when received immunotherapy in cancers including glioblastoma, urothelial carcinoma and melanoma. Overall, our study proposes a promising signature, which can be utilized for clinicians to evaluate prognosis and might provide individualized clinical management for AYAs glioma.

Keywords: Adolescents and young adults; Glioma; Immunotherapy; Machine learning; Prognosis.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The flowchart of this study.
Fig. 2
Fig. 2
Construction of the machine learning-derived prognostic signature (MLDPS). (A) The C-index of 65 machine learning algorithms combinations in CGGA-693 training procedure. (B) The C-index of 65 machine learning algorithms combinations in CGGA-325 training procedure. (C) The C-index of 65 machine learning algorithms combinations in TCGA training procedure. (D-F) Top five average C-index in CGGA-693, CGGA-325 and TCGA training procedure, respectively. (G-I) The performance of MLDPS was compared with common clinical and molecular characteristics in CGGA-693 (G), CGGA-325 (H) and TCGA (I). *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001.
Fig. 3
Fig. 3
Survival analysis and predictive performance evaluation of machine learning-derived prognostic signature (MLDPS). (A-C) Kaplan-Meier survival analysis for overall survival between high and low MLDPS groups in CGGA-693 (A), CGGA-325 (B) and TCGA cohorts (C). (D-F) Univariate and multivariate Cox regression analyses regarding of MLDPS in CGGA-693 (D), CGGA-325 (E) and TCGA cohorts (F). (G-I) Time-dependent receiver-operator characteristic (ROC) analysis for predicting OS at 1-, 3- and 5-year in CGGA-693 (G), CGGA-325 (H) and TCGA cohorts (I).
Fig. 4
Fig. 4
The correlation between machine learning-derived prognostic signature (MLDPS) and clinical characteristics. (A-C) The correlation between age, gender, grade, IDH status, 1p/19q status and MLDPS in CGGA-693 (A), CGGA-325 (B) and TCGA cohort (C), respectively. (D) Kaplan-Meier survival analysis for overall survival between high and low MLDPS groups in different age, gender, grade, IDH status and 1p/19q status subgroups in CGGA-693 cohort.
Fig. 5
Fig. 5
Comparisons between machine learning-derived prognostic signature (MLDPS) and 89 published prognostic signatures. (A) The C-index of 89 published signatures and MLDPS in CGGA-693, CGGA-325 and TCGA cohort. (B-D) Comparisons between C-index of MLDPS and 89 published signatures in CGGA-693 (B), CGGA-325 (C) and TCGA cohorts (D). (E-G) Comparisons between the area under the curve (AUC) values of MLDPS and 89 published signatures in predicting overall survival at 1-year (E) 3-year (F) and 5-year (G) in CGGA-693 cohort, respectively. *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001.
Fig. 6
Fig. 6
Pan-cancer survival analysis and functional characteristics of the high and low machine learning-derived prognostic signature (MLDPS) groups. (A-H) Kaplan-Meier survival analysis for overall survival (OS) in TCGA-ACC (A), TCGA-BRCA (B), TCGA-COAD (C), TCGA-LAML (D), TCGA-GBMLGG (E, excluded the AYAs glioma), TCGA-MESO (F), TCGA-SARC (G) and TCGA-THCA cohort (H). (I-J) The biological processes (BP) (I) and pathways (J) enriched in high MLDPS group. (K-L) The biological processes (BP) (K) and pathways (L) enriched in low MLDPS group.
Fig. 7
Fig. 7
Immune microenvironment analyses. (A-C) The differences in ESTIMATE score, immune score, stromal score and tumor purity between high and low MLDPS groups in CGGA-693 (A), CGGA-325 (B) and TCGA cohorts (C). (D-F) The differences in immune cells between high and low MLDPS groups according to WHO II (D), WHO III (E) and WHO IV (F) in CGGA-693 cohort estimated by ssGSEA method. (G-J) Kaplan-Meier survival analysis for evaluating prognosis in patients received immunotherapy in PRJNA482620 (G), IMvigor 210 (H), GSE91061 (I) and GSE78220 cohort (J). (K-N) The stacked histogram shows the differences in immunotherapy responsiveness between high and low MLDPS groups in PRJNA482620 (K), Imvigor 210 (L), GSE91061 (M) and GSE78220 (N). *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001.

Similar articles

References

    1. Gusyatiner, O. & Hegi, M. E. Glioma epigenetics: from subclassification to novel treatment options. Semin Cancer Biol.51, 50–58. 10.1016/j.semcancer.2017.11.010 (2018). - PubMed
    1. Miller, K. D. et al. Cancer statistics for adolescents and young adults, 2020. CA Cancer J. Clin.70, 443–459. 10.3322/caac.21637 (2020). - PubMed
    1. Louis, D. N. et al. The 2021 WHO classification of tumors of the central nervous system: a summary. Neuro Oncol.23, 1231–1251. 10.1093/neuonc/noab106 (2021). - PMC - PubMed
    1. Weiser, A. et al. Bridging the age gap: a review of molecularly informed treatments for glioma in adolescents and young adults. Front. Oncol.13, 1254645. 10.3389/fonc.2023.1254645 (2023). - PMC - PubMed
    1. Close, A. G., Dreyzin, A., Miller, K. D., Seynnaeve, B. K. N. & Rapkin, L. B. Adolescent and young adult oncology-past, present, and future. CA Cancer J. Clin.69, 485–496. 10.3322/caac.21585 (2019). - PubMed

MeSH terms

Substances

LinkOut - more resources