Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Aug 9:10:730.
doi: 10.3389/fgene.2019.00730. eCollection 2019.

Differentially Expressed Genes Associated With Prognosis in Locally Advanced Lymph Node-Negative Prostate Cancer

Affiliations

Differentially Expressed Genes Associated With Prognosis in Locally Advanced Lymph Node-Negative Prostate Cancer

Elena A Pudova et al. Front Genet. .

Abstract

Older age is one of the main risk factors for cancer development. The incidence of prostate cancer, as a multifactorial disease, also depends upon demographic factors, race, and genetic predisposition. Prostate cancer most frequently occurs in men over 60 years of age, indicating a clear association between older age and disease onset. Carcinogenesis is followed by the deregulation of many genes, and some of these changes could serve as biomarkers for diagnosis, prognosis, prediction of drug therapy efficacy, as well as possible therapeutic targets. We have performed a bioinformatic analysis of a The Cancer Genome Atlas (TCGA) data and RNA-Seq profiling of a Russian patient cohort to reveal prognostic markers of locally advanced lymph node-negative prostate cancer (lymph node-negative LAPC). We also aimed to identify markers of the most common molecular subtype of prostate cancer carrying a fusion transcript TMPRSS2-ERG. We have found several genes that were differently expressed between the favorable and unfavorable prognosis groups and involved in the enriched KEGG pathways based on the TCGA (B4GALNT4, PTK6, and CHAT) and Russian patient cohort data (AKR1C1 and AKR1C3). Additionally, we revealed such genes for the TMPRSS2-ERG prostate cancer molecular subtype (B4GALNT4, ASRGL1, MYBPC1, RGS11, SLC6A14, GALNT13, and ST6GALNAC1). Obtained results contribute to a better understanding of the molecular mechanisms behind prostate cancer progression and could be used for further development of the LAPC prognosis marker panel.

Keywords: RNA-Seq; TCGA; TMPRSS2-ERG; locally advanced prostate cancer; prognostic markers.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Heatmap demonstrating log relative expression level of top differentially expressed genes between the groups of adverse and favorable prognosis (TCGA data). Cell colors (blue-white-orange gradient) correspond the binary logarithm of the ratio of the expression level in a current sample to the average level across all the samples (per each gene). Blue - expression level is below the average, orange - above the average. Spearman_r—Spearman’s rank correlation coefficient. -Log10(p) is the negative logarithm of DESeq’s p-value. LogCPM—average binary logarithm of read counts per million (CPM). LogFC—binary logarithm of expression level fold change (samples with adverse prognosis compared to favorable prognosis).
Figure 2
Figure 2
Dotplot showing the results of KEGG pathways enrichment analyses performed for top-40, 80, 250, 500, and 1,000 differentially expressed genes (either overexpressed or downregulated) between the groups of adverse and favorable prognosis (TCGA data). All the genes have passed the following thresholds: DESeq p-value < 0.01, Mann–Whitney p-value < 0.05, average expression-level fold change > 25%. Dot size indicates k/n ratio (“gene ratio”), where k is the number of genes participating in the current KEGG pathway (within top-40, top-80, or other top lists), and n is the number of genes (also within top-40, top-80, or other top lists) annotated as participants of any KEGG pathway (these numbers are provided in the brackets in the bottom of the figure). Dot color indicates the enrichment test FDR (Fisher’s exact test).
Figure 3
Figure 3
Potential markers of unfavorable prognosis in lymph node-negative locally advanced prostate cancer (LAPC) involved in the enriched KEGG pathways (TCGA data). LogFC—trimmed binary logarithm of average expression level fold change (between the groups of favorable and unfavorable prognosis); LogCPM—binary logarithm of read counts per million (CPM).
Figure 4
Figure 4
Heatmap demonstrating log relative expression level of top differentially expressed genes between the groups of adverse and favorable prognosis (TCGA data; TMPRSS2-ERG-positive samples). Cell colors (blue-white-orange gradient) correspond the binary logarithm of the ratio of the expression level in a current sample to the average level across all the samples (per each gene). Blue - expression level is below the average, orange - above the average. Spearman_r—Spearman’s rank correlation coefficient. -Log10(p) is the negative logarithm of DESeq’s p-value. LogCPM—average binary logarithm of read counts per million (CPM). LogFC—binary logarithm of expression-level fold change (sample with adverse prognosis compared to favorable prognosis).
Figure 5
Figure 5
Dotplot showing the results of KEGG pathways enrichment analyses performed for top-40, 80, 250, 500, and 1,000 differentially expressed genes (either overexpressed or downregulated) between the groups of adverse and favorable prognosis (TCGA data; TMPRSS2-ERG-positive samples). All the genes have passed the following thresholds: DESeq p-value < 0.01, Mann–Whitney p-value < 0.05, average expression-level fold change > 25%. Dot size indicates k/n ratio (“gene ratio”), where k is the number of genes participating in the current KEGG pathway (within top-40, top-80, or other top lists), and n is the number of genes (also within top-40, top-80, or other top lists) annotated as participants of any KEGG pathway (these numbers are provided in the brackets in the bottom of the figure). Dot color indicates the enrichment test FDR (Fisher’s exact test).
Figure 6
Figure 6
Potential markers of unfavorable prognosis in lymph node-negative LAPC involved in the enriched KEGG pathways (TCGA data, TMPRSS2-ERG-positive samples). LogFC—trimmed binary logarithm of average expression-level fold change (between the groups of favorable and unfavorable prognosis); LogCPM—binary logarithm of CPM.
Figure 7
Figure 7
Heatmap demonstrating log relative expression level of top differentially expressed genes between the groups of adverse and favorable prognosis (Russian patient cohort). Cell colors (blue-white-orange gradient) correspond the binary logarithm of the ratio of the expression level in a current sample to the average level across all the samples (per each gene). Blue - expression level is below the average, orange - above the average. Spearman_r—Spearman’s rank correlation coefficient. -Log10(p) is the negative logarithm of DESeq’s p-value. LogCPM—average binary logarithm of read counts per million (CPM). LogFC—binary logarithm of expression-level fold change (sample with adverse prognosis compared to favorable prognosis).
Figure 8
Figure 8
Dotplot showing the results of KEGG pathways enrichment analyses performed for top-40, 80, 250, 500, and 1,000 differentially expressed genes (either overexpressed or downregulated) between the groups of adverse and favorable prognosis (Russian patient cohort). All the genes have passed the following thresholds: DESeq p-value < 0.01, Mann–Whitney p-value < 0.05, average expression-level fold change > 25%. Dot size indicates k/n ratio (“gene ratio”), where k is the number of genes participating in the current KEGG pathway (within top-40, top-80 or other top lists), and n is the number of genes (also within top-40, top-80, or other top lists) annotated as participants of any KEGG pathway (these numbers are provided in the brackets in the bottom of the figure). Dot color indicates the enrichment test FDR (Fisher’s exact test).
Figure 9
Figure 9
Potential markers of unfavorable prognosis in lymph node-negative LAPC involved in the enriched KEGG pathways (Russian patient cohort). LogFC—trimmed binary logarithm of average expression-level fold change (between the groups of favorable and unfavorable prognosis); LogCPM—binary logarithm of read counts per million (CPM).

References

    1. Adeniji A. O., Chen M., Penning T. M. (2013). AKR1C3 as a target in castrate resistant prostate cancer. J. Steroid. Biochem. Mol. Biol. 137, 136–149. 10.1016/j.jsbmb.2013.05.012 - DOI - PMC - PubMed
    1. Belova A. A., Sosnovtseva A. O., Lipatova A. V., Njushko K. M., Volchenko N. N., Belyakov M. M., et al. (2017). Biomarkers of Prostate Cancer Sensitivity to the Sendai Virus. Mol. Biol. 51 (1), 80–88. 10.1134/S0026893317010046 - DOI - PubMed
    1. Bray N. L., Pimentel H., Melsted P., Pachter L. (2016). Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34 (5), 525–527. 10.1038/nbt.3519 - DOI - PubMed
    1. Byrd J. C., Bresalier R. S. (2004). Mucins and mucin binding proteins in colorectal cancer. Cancer. Metastasis. Rev. 23 (1-2), 77–99. 10.1023/A:1025815113599 - DOI - PubMed
    1. Cantor J. R., Stone E. M., Chantranupong L., Georgiou G. (2009). The human asparaginase-like protein 1 hASRGL1 is an Ntn hydrolase with beta-aspartyl peptidase activity. Biochemistry 48 (46), 11026–11031. 10.1021/bi901397h - DOI - PMC - PubMed