Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015;16 Suppl 8(Suppl 8):S5.
doi: 10.1186/1471-2164-16-S8-S5. Epub 2015 Jun 18.

Bioinformatics analysis of thousands of TCGA tumors to determine the involvement of epigenetic regulators in human cancer

Bioinformatics analysis of thousands of TCGA tumors to determine the involvement of epigenetic regulators in human cancer

Florian Gnad et al. BMC Genomics. 2015.

Abstract

Background: Many cancer cells show distorted epigenetic landscapes. The Cancer Genome Atlas (TCGA) project profiles thousands of tumors, allowing the discovery of somatic alterations in the epigenetic machinery and the identification of potential cancer drivers among members of epigenetic protein families.

Methods: We integrated mutation, expression, and copy number data from 5943 tumors from 13 cancer types to train a classification model that predicts the likelihood of being an oncogene (OG), tumor suppressor (TSG) or neutral gene (NG). We applied this predictor to epigenetic regulator genes (ERGs), and used differential expression and correlation network analysis to identify dysregulated ERGs along with co-expressed cancer genes. Furthermore, we quantified global proteomic changes by mass spectrometry after EZH2 inhibition.

Results: Mutation-based classifiers uncovered the OG-like profile of DNMT3A and TSG-like profiles for several ERGs. Differential gene expression and correlation network analyses revealed that EZH2 is the most significantly over-expressed ERG in cancer and is co-regulated with a cell cycle network. Proteomic analysis showed that EZH2 inhibition induced down-regulation of cell cycle regulators in lymphoma cells.

Conclusions: Using classical driver genes to train an OG/TSG predictor, we determined the most predictive features at the gene level. Our predictor uncovered one OG and several TSGs among ERGs. Expression analyses elucidated multiple dysregulated ERGs including EZH2 as member of a co-expressed cell cycle network.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Visualizing genomic alterations of bromo domain containing genes. The core of the plot reflects the phylogenetic relationships between bromo domain containing proteins estimated by the sequence similarity of their associated domains. The inner circle displays the expression fold-changes between tumors and healthy tissues. High expression in tumors is indicated in red, while low expression in tumors is shown in blue. The outer circle illustrates the proportion of tumors with 'deep loss' (blue) or 'high-level gain' (red) changes. Mutation rates are reflected by the outer stacked bar plots.
Figure 2
Figure 2
Selecting features for OG and TSG prediction. A) Box plots illustrate feature differences between OGs (red), TSGs (blue) and NGs (gray). Associated p-values on the top of each box plot are based on one-tailed Mann-Whitney U tests and reflect the differences between OGs and NGs, and TSGs and NGs. B) Dots reflect the frequencies of protein altering mutations in the combined set of tumors from seven cancer types. OGs (red), TSGs (blue) and NGs (gray) are sorted alphabetically on the x-axis. C) Proportions of loss of function (LOF) to benign mutations are plotted against the entropy based mutation selection scores for all human genes. Blue indicates high fractions of LOF mutations, while red indicates high mutation selection. D) Stacked bar plots present the relative frequencies of mutation classes in the combined tumor panel for OGs, TSGs and NGs.
Figure 3
Figure 3
Predicted TSGs and most recurrent mutations in the ERG family. A) Predicted TSGs are listed along with the proportions of mutated samples in each indication and overall frequencies of LOF to benign mutations. B) Most recurrent mutations within the ERG family (del: deletion, *: nonsense mutation, fs: frameshift). C) Mutation profiles of ARID1A and PBRM1. Non-synonymous mutations are represented as solid circles, with color distinguishing different cancer types. Circle sizes are proportional to the mutation frequencies.
Figure 4
Figure 4
Epigenetic regulators with significant gene expression profiles in cancer. Significantly (A) over- and (B) under-expressed ERGs are ranked according to the combined p-values (based on Fisher's probability test) over all cancer types. Numbers reflect log2-fold changes, while colors reflect associated p-values. ERGs with consistently over-expression in tumors included EZH2 (pF = 3.2 × 10−112), ATAD2 (pF = 1.9 × 10−76), PRDM13 (pF = 2.7 × 10−27), DPF1 (pF = 1.0 × 10−19), DNMT1 (pF = 8.3 × 10−19), SUV420H2 (pF = 1.7 × 10−15), WHSC1 (pF = 3.3 × 10−15), TRIM28 (pF = 1.2 × 10−8), BAZ1A (pF = 2.2 × 10−7), PRMT1 (pF = 9.6 × 10−6), and HDAC10 (pF = 8.1 × 10−5). ERGs with consistently lower expression in tumors included KAT2B (pm = 1.0 × 10−74), EZH1 (pm = 2.3 × 10−42), SMARCA2 (pm = 2.0 × 10−25), NCOA1 (pm = 1.2 × 10−10), ZMYND11 (pm = 3.6 × 10−9), PRDM2 (pm = 9.5 × 10−7), BAZ2B (pm = 3.5 × 10−6) and SIRT1 (pm = 8.1 × 10−6).
Figure 5
Figure 5
Expression plots of significantly expressed ERGs. Gene expressions of (A) EZH2, ATAD2, EZH1, and (B) BRDT, PRDM13 and PRDM9 are shown in RPKM units (black: healthy tissues, red: tumors). Gene expression levels are reflected by RPKM values.
Figure 6
Figure 6
Co-expression network analyses. A) Using Cytoscape co-expressed genes are visualized as networks with nodes representing genes and edges reflecting pairwise co-expression relationships in healthy tissues. B) Numbers of co-regulated cancer genes in healthy tissues (right panel of the plot) are shown along with the mutation frequencies (left panel of the plot). Mutation frequencies are presented as stacked bars with cancer type dependent coloration. ERGs are sorted on the y-axis by the overall mutation frequencies. C) A subnetwork within the main co-expression network contains 24 co-expressed ERGs. Colors indicate the corresponding ERG families. D) EZH2 (green) and 99 co-expressed genes form one co-regulated network that is significantly enriched for cell cycle regulators. Genes that are directly connected with EZH2, because they show a very high degree of co-expression (R > 0.85), are highlighted in orange. Genes that are present in the network, but not directly connected with EZH2 are shown in blue. F) Examples of positive correlations between EZH2 and co-expressed cell cycle regulators. Each dot reflects the gene expression level (represented by variance stabilized RNAseq count data) of EZH2 (x axis) and the co-expressed gene (y axis). Dots are colored according to tissue type.
Figure 7
Figure 7
Quantitative mass spectrometry based proteomic analysis after EZH2 inhibition. A) Protein expression differences of selected cell cycle regulators in EPZ-6438 (Epizyme®, Cambridge, MA) versus DMSO treated lymphoma cells (WSU-DLCL2) are represented as log2 ratios. B) Global protein expression changes after 8 days of EPZ-6438 treatment. Down-regulated cell cycle regulators are highlighted in dark blue.

References

    1. Bird A. DNA methylation patterns and epigenetic memory. Genes & Development. 2002;16(1):6–21. doi: 10.1101/gad.947102. - DOI - PubMed
    1. Kouzarides T. Chromatin modifications and their function. Cell. 2007;128(4):693–705. doi: 10.1016/j.cell.2007.02.005. - DOI - PubMed
    1. Strahl BD, Allis CD. The language of covalent histone modifications. Nature. 2000;403(6765):41–45. doi: 10.1038/47412. - DOI - PubMed
    1. Wilson BG, Roberts CW. SWI/SNF nucleosome remodellers and cancer. Nature Reviews Cancer. 2011;11(7):481–492. doi: 10.1038/nrc3068. - DOI - PubMed
    1. Jones PA, Baylin SB. The epigenomics of cancer. Cell. 2007;128(4):683–692. doi: 10.1016/j.cell.2007.01.029. - DOI - PMC - PubMed

LinkOut - more resources