Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Aug 5;8(8):e70498.
doi: 10.1371/journal.pone.0070498. Print 2013.

Voting-based cancer module identification by combining topological and data-driven properties

Affiliations

Voting-based cancer module identification by combining topological and data-driven properties

A K M Azad et al. PLoS One. .

Erratum in

  • PLoS One. 2014;9(4):e96883

Abstract

Recently, computational approaches integrating copy number aberrations (CNAs) and gene expression (GE) have been extensively studied to identify cancer-related genes and pathways. In this work, we integrate these two data sets with protein-protein interaction (PPI) information to find cancer-related functional modules. To integrate CNA and GE data, we first built a gene-gene relationship network from a set of seed genes by enumerating all types of pairwise correlations, e.g. GE-GE, CNA-GE, and CNA-CNA, over multiple patients. Next, we propose a voting-based cancer module identification algorithm by combining topological and data-driven properties (VToD algorithm) by using the gene-gene relationship network as a source of data-driven information, and the PPI data as topological information. We applied the VToD algorithm to 266 glioblastoma multiforme (GBM) and 96 ovarian carcinoma (OVC) samples that have both expression and copy number measurements, and identified 22 GBM modules and 23 OVC modules. Among 22 GBM modules, 15, 12, and 20 modules were significantly enriched with cancer-related KEGG, BioCarta pathways, and GO terms, respectively. Among 23 OVC modules, 19, 18, and 23 modules were significantly enriched with cancer-related KEGG, BioCarta pathways, and GO terms, respectively. Similarly, we also observed that 9 and 2 GBM modules and 15 and 18 OVC modules were enriched with cancer gene census (CGC) and specific cancer driver genes, respectively. Our proposed module-detection algorithm significantly outperformed other existing methods in terms of both functional and cancer gene set enrichments. Most of the cancer-related pathways from both cancer data sets found in our algorithm contained more than two types of gene-gene relationships, showing strong positive correlations between the number of different types of relationship and CGC enrichment [Formula: see text]-values (0.64 for GBM and 0.49 for OVC). This study suggests that identified modules containing both expression changes and CNAs can explain cancer-related activities with greater insights.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. A schematic of our approach.
(A) Gene expressions and their paired CNA data are collected. (B) A gene-gene relationship network, GGR, is constructed using direct and indirect relationships of GE-GE, CNA-GE, and CNA-CNA. (C) A novel algorithm, VToD, finds overlapping modules combining the GGR network and PPI information. (D) Functional and cancer gene set enrichments are tested for identified modules.
Figure 2
Figure 2. Comparative functional enrichments of pre-modules generated using different vote thresholds.
(A) is for GBM and (B) is for OVC. Bars represent fractions of modules enriched with KEGG, BioCarta, GO biological process, cancer-related KEGG, cancer-related BioCarta, cancer-related GO biological process, and cancer gene census (CGC) for three different vote thresholds. Additionally, in each case, vote-values were computed using only topological properties, using only data-driven properties, and by combining them to compare their individual effects on performance. The numbers of genes (nGS) in each pre-module set are shown correspondingly.
Figure 3
Figure 3. Analysis of GBM Module 2.
(A) A network view of GBM Module 2 using only direct relationships, drawn by Cytoscape . Genes were grouped together based on the overlap with BioCarta pathways, and the percentages of samples with CNAs and GE changes are shown. CGC genes are colored in olive and GBM genes are in purple. Cytoband and Amp/Del (or Alteration-Expression Changes) information for CNA-CNA (or CNA-GE) pairs are shown in the inset table. (B) Pathway enrichment tests with KEGG and BioCarta pathways for this module are shown. Blue bars indicate the enrichment formula image-values of pathways and red bars indicate the overlap formula image-values between the pathway and GBM driver genes. Black vertical bars show formula image-value threshold, 0.05, and the width of the horizontal bars depends on formula image(formula image-value). (C) Red bars show the overlapping formula image-value with CGC and GBM driver genes.
Figure 4
Figure 4. Analysis of OVC Module 8, with a description similar to that of Figure 3.
(A) A network view of OVC Module 8 using only direct relationships. CGC genes are colored in olive and OVC-related genes are in purple. (B) Pathway enrichment tests tests were similar to those in Figure 3(B), but here, red bars indicate the overlapping formula image-values between the pathway and OVC-related genes. (C) Red bars show the formula image-values that overlap with those of the CGC- and OVC-related genes.

References

    1. Hahn WC, Weinberg RA (2002) Modelling the molecular circuitry of cancer. Nat Rev Cancer 2: 331–341. - PubMed
    1. Vogelstein B, Kinzler KW (2004) Cancer genes and the pathways they control. Nat Med 10: 789–799. - PubMed
    1. Davies H, Bignell GR, Cox C, Stephens P, Edkins S, et al. (2002) Mutations of the BRAF gene in human cancer. Nature 417: 949–954. - PubMed
    1. Wan PTC, Garnett MJ, Roe SM, Lee S, Niculescu-Duvaz D, et al. (2004) Mechanism of Activation of the RAF-ERK Signaling Pathway by Oncogenic Mutations of B-RAF. Cell 116: 855–867. - PubMed
    1. Santarosa M, Ashworth A (2004) Haploinsufficiency for tumour suppressor genes: when you don't need to go all the way. Biochimica et Biophysica Acta (BBA) - Reviews on Cancer 1654: 105–122. - PubMed

Publication types

LinkOut - more resources