Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Oct 26;8(1):15834.
doi: 10.1038/s41598-018-34160-w.

Transcriptomic and functional network features of lung squamous cell carcinoma through integrative analysis of GEO and TCGA data

Affiliations

Transcriptomic and functional network features of lung squamous cell carcinoma through integrative analysis of GEO and TCGA data

Yin Li et al. Sci Rep. .

Abstract

Lung squamous cell carcinoma (LUSC) is associated with poor clinical prognosis and lacks available targeted therapy. Novel molecules are urgently required for the diagnosis and prognosis of LUSC. Here, we conducted our data mining analysis for LUSC by integrating the differentially expressed genes acquired from Gene Expression Omnibus (GEO) database by comparing tumor tissues versus normal tissues (GSE8569, GSE21933, GSE33479, GSE33532, GSE40275, GSE62113, GSE74706) into The Cancer Genome Atlas (TCGA) database which includes 502 tumors and 49 adjacent non-tumor lung tissues. We identified intersections of 129 genes (91 up-regulated and 38 down-regulated) between GEO data and TCGA data. Based on these genes, we conducted our downstream analysis including functional enrichment analysis, protein-protein interaction, competing endogenous RNA (ceRNA) network and survival analysis. This study may provide more insight into the transcriptomic and functional features of LUSC through integrative analysis of GEO and TCGA data and suggests therapeutic targets and biomarkers for LUSC.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Flowchart for bioinformatics analysis of publicly available data from both GEO and TCGA databases.
Figure 2
Figure 2
Results from the principal component analysis for microarray studies downloaded from the GEO database. (A) Bar plots showing the proportion of variance evaluated for each of the five microarray datasets. (B) Two-dimensional plots of normal and tumor groups with the top two principal components. Horizontal and vertical axes represent the distribution of each sample within PCA1 and PCA2 respectively. PCA1: principle component 1; PCA2: principal component 2.
Figure 3
Figure 3
Convergence of gene expression signatures across different studies of LUSC. (A) Volcano plots showed the number of differentially expressed genes identified from each of the seven GEO datasets and after batch correction. (B) Volcano plot showed the number of differentially expressed genes in TCGA. (C) Venn diagram demonstrates the intersections of genes between GEO data and TCGA data. (D) Chromosome mapping of consensus genes.
Figure 4
Figure 4
The expression changes of these genes in GEO and TCGA data. (A) Heatmap of differentially expressed genes in GEO dataset coloring the samples-groups. (B) Heatmap of differentially expressed genes in TCGA dataset coloring the groups.
Figure 5
Figure 5
GO annotations, KEGG pathways, functional enrichment analysis and protein-protein interaction of up-regulated gene and down-regulated genes in LUSC. (A) The bubble plots showing GO and KEGG pathway enrichment data for genes that were up-regulated. (B) The bubble plots showing GO and KEGG pathway enrichment data for genes that were down-regulated. (C) Functional enrichment analysis plot. A negative z-score indicates that the activity is decreased. A positive z-score indicates that the activity is increased. (D) Protein-protein interaction network.
Figure 6
Figure 6
CeRNA network. LncRNA–miRNA–mRNA interactions in LUSC. Red indicates up-regulated lncRNAs, purple indicates up-regulated mRNAs, yellow stands for over-expressed miRNAs, and green means down-regulated miRNAs.
Figure 7
Figure 7
Survival analysis for differentially expressed genes in LUSC. Survival curves showing 20 examples of genes which were related to overall patient survival rate. P-value set for this analysis is less than 0.05.

Similar articles

Cited by

References

    1. Torre LA, et al. Global cancer statistics, 2012. CA Cancer J Clin. 2015;65:87–108. doi: 10.3322/caac.21262. - DOI - PubMed
    1. Ferlay J, et al. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. Int J Cancer. 2015;136:E359–386. doi: 10.1002/ijc.29210. - DOI - PubMed
    1. Lambert AA, Dransfield MT. COPD Overlap Syndromes: Asthma and Beyond. Chronic obstructive pulmonary diseases. 2016;3:459–465. doi: 10.15326/jcopdf.3.1.2015.0176. - DOI - PMC - PubMed
    1. Hirsch FR, et al. Lung cancer: current therapies and new targeted treatments. Lancet. 2017;389:299–311. doi: 10.1016/S0140-6736(16)30958-8. - DOI - PubMed
    1. Kulasingam V, Diamandis EP. Strategies for discovering novel cancer biomarkers through utilization of emerging technologies. Nat Clin Pract Oncol. 2008;5:588–599. doi: 10.1038/ncponc1187. - DOI - PubMed

Publication types

MeSH terms