Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep 9;14(1):20938.
doi: 10.1038/s41598-024-71317-2.

Development of a prognostic model for NSCLC based on differential genes in tumour stem cells

Affiliations

Development of a prognostic model for NSCLC based on differential genes in tumour stem cells

Yuqi Ma et al. Sci Rep. .

Abstract

Non-small cell lung cancer (NSCLC) constitutes a significant portion of lung cancers and cytotoxic drugs (e.g. cisplatin) are currently the first-line treatment. However, NSCLC has developed resistance to this drug, which limits the therapeutic effect and thus affects prognosis. NSCLC sc-RNA-seq data were downloaded from the GEO database and Ku Leuven Laboratory for Functional Epigenetics, and bulk RNA-seq data were obtained from the TCGA database. The "Seurat" package was employed for scRNA-seq data processing, and the uniform manifold approximation and projection (UMAP) were applied for downscaling and cluster identification. Use the FindAllMarkers function to find differential genes (DEGs) for tumor stem cells. Then, we performed univariate regression analyses on the DEGs to identify potential prognostic genes. We created a machine learning framework based on potential prognostic genes, which combines 10 machine learning methods and their 101 combinations to get the optimal prognostic risk model. The model was evaluated in the training set and validation set. A nomogram was developed to provide physicians with a quantitative tool for prognosis prediction. Finally, we evaluated the expression and functionality of SLC2A1. We discovered 22 cell clusters containing 218379 cells by examining single-cell RNA sequencing datasets (GSE148071, KU_lom, GSE131907, GSE136246, GSE127465). Tumour cells were isolated for subpopulation analysis and 162 differential genes from SOX2_cancer were obtained. After univariate Cox analysis, we found 23 genes with prognostic potential prognostic value and utilized them to develop 101‑combination machine learning computational framework. We eventually picked the best performing 'StepCox[both] + RSF', which includes 8 genes. The model has a relatively high prediction accuracy in both TCGA and GEO datasets. In in vitro investigations, targeted suppression of the SLC2A1 gene resulted in significant reductions in proliferation, invasion and migration in A549 cells. In addition, a significant reduction in cisplatin resistance was seen in A549/DDP cells. The outcomes demonstrated the precision and credibility of the prognostic model for NSCLC, highlighting its potential significance in the treatment and prognosis of individuals affected by this disease. SLC2A1 may become a promising prognostic marker and a potential therapeutic target, offering valuable insights to inform clinical treatment decisions.

Keywords: Machine learning; NSCLC; Prognosis; Single cell.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Flowchart.
Fig. 2
Fig. 2
(a-c) Clustering annotation and cell type identification by UMAP. (d-e) Secondary clustering and annotation of tumour cells. (f) stemness score of tumour cells.
Fig. 3
Fig. 3
C-index of each model on all validation datasets. (a) A total of 101 predictive models were built by the tenfold cross-validation framework and the C-index of each model was further calculated on all validation datasets. (b) Kaplan-Meier and ROC curves for TCGA-LUAD. (c-d) Kaplan-Meier and ROC curves for GSE29016 and GSE13213. (e) Nomograms constructed on the basis of clinical characteristics including age, graders and stages.
Fig. 4
Fig. 4
(a-b) Waterfall plot of gene mutations. (c) Tumour Mutation Burden (TMB) risk.
Fig. 5
Fig. 5
(a-b) Heat map of immune cell infiltration. (c) Distribution and correlation of tumour infiltrating immune cells in LUAD and LUSC.
Fig. 6
Fig. 6
GSEA.
Fig. 7
Fig. 7
Seven key cellular interactions within tumour cells. (a) Efferent and afferent signalling patterns of tumour cells. (b) Signals from various types of tumour cells. (c) The number and strength of 7 types of cell interactions.
Fig. 8
Fig. 8
Drug sensitivity analysis.
Fig. 9
Fig. 9
(a-b) qPCR and WB showing SLC2A1 expression in A549/DDP cell line compared to A549 cell line.
Fig. 10
Fig. 10
(a) Proliferation of CCK-8. (b) Transwell assay. (c) Expression of SLC2A1 in Transwell assay.
Fig. 11
Fig. 11
(a) Apoptosis assay. (b) Cytotoxicity of CCK-8.

Similar articles

Cited by

References

    1. Han, J. et al. MEK inhibitors for the treatment of non-small cell lung cancer. J. Hematol. Oncol.14(1), 1 (2021). 10.1186/s13045-020-01025-7 - DOI - PMC - PubMed
    1. Molina, J. R., Yang, P., Cassivi, S. D., Schild, S. E. & Adjei, A. A. Non-small cell lung cancer: Epidemiology, risk factors, treatment, and survivorship. Mayo Clin. Proc.83(5), 584–594 (2008). 10.1016/S0025-6196(11)60735-0 - DOI - PMC - PubMed
    1. Fennell, D. A. et al. Cisplatin in the modern era: The backbone of first-line chemotherapy for non-small cell lung cancer. Cancer Treat Rev.44, 42–50 (2016). 10.1016/j.ctrv.2016.01.003 - DOI - PubMed
    1. Wu, Y. L. et al. Osimertinib in resected EGFR-mutated non-small-cell lung cancer. N. Engl. J. Med.383(18), 1711–1723 (2020). 10.1056/NEJMoa2027071 - DOI - PubMed
    1. Sung, H. et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin.71(3), 209–49 (2021). 10.3322/caac.21660 - DOI - PubMed

MeSH terms

Substances