Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007:3:140.
doi: 10.1038/msb4100180. Epub 2007 Oct 16.

Network-based classification of breast cancer metastasis

Affiliations

Network-based classification of breast cancer metastasis

Han-Yu Chuang et al. Mol Syst Biol. 2007.

Abstract

Mapping the pathways that give rise to metastasis is one of the key challenges of breast cancer research. Recently, several large-scale studies have shed light on this problem through analysis of gene expression profiles to identify markers correlated with metastasis. Here, we apply a protein-network-based approach that identifies markers not as individual genes but as subnetworks extracted from protein interaction databases. The resulting subnetworks provide novel hypotheses for pathways involved in tumor progression. Although genes with known breast cancer mutations are typically not detected through analysis of differential expression, they play a central role in the protein network by interconnecting many differentially expressed genes. We find that the subnetwork markers are more reproducible than individual marker genes selected without network information, and that they achieve higher accuracy in the classification of metastatic versus non-metastatic tumors.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Subnetworks enriched for the hallmarks of cancer. Example discriminative subnetworks from van de Vijver et al (2002) are shown in (AE), whereas those from Wang et al (2005) are shown in (FK). Nodes and links represent human proteins and protein interactions, respectively. The color of each node scales with the change in expression of the corresponding gene for metastatic versus non-metastatic cancer. The shape of each node indicates whether its gene is significantly differentially expressed (diamond; P<0.05 from a two-tailed t-test) or not (circle). The predominant cellular functions from Supplementary Figure 1 are indicated next to each module. Known breast cancer susceptibility genes are marked by a blue asterisk.
Figure 2
Figure 2
Marker reproducibility and metastasis prediction performance. (A) Agreement in markers selected from the van de Vijver et al (2002) data set versus those selected from Wang et al (2005). Blue bars chart the magnitude of overlap on the left axis; the red line charts the hypergeometric P-values of overlap on the right axis. The first ‘single-gene' analysis was performed by using the same number of top discriminative genes as the number of genes covered by subnetwork markers. The second ‘single-gene' analysis was performed by using the same number of top discriminative genes as those in the gene signatures published in van de Vijver et al (2002) and Wang et al (2005). (B) AUC classification performance of subnetworks, individual genes, or modules from GO or MSigDB. The blue line charts the performance of markers selected based on the Wang et al (2005) data set and tested on the van de Vijver et al (2002) data set; the pink line represents the reciprocal test. The performance of the 1000 random subnetworks is denoted by its mean±s.d. (C and D) Erk1 (MAPK3) subnetworks in van de Vijver et al (2002) and Wang et al (2005). (E and F) Example network motifs shared between subnetworks selected from the two cohorts. The left-hand side motif is from van de Vijver et al (2002) and the right-hand side is from Wang et al (2005). (G and H) Examples of highly predictive subnetwork markers from Wang et al (2005). (I and J) Examples of highly predictive subnetwork markers from van de Vijver et al (2002).
Figure 3
Figure 3
Detection of 60 known disease genes in breast cancer. The enrichment of disease genes is shown for subnetworks or individual genes selected from van de Vijver et al (2002) (A) or Wang et al (2005) (B). Blue bars chart the percentage of disease genes among all genes covered in the markers on the left axis; the red line charts the hypergeometric P-values of enrichment on the right axis. Numbers above the bars are the recovery rates of the known susceptibility genes in each marker set. (CE) Example discriminative subnetworks containing genes with breast cancer mutations listed in Sjoblom et al. Mutation genes are marked by a plus sign.

Comment in

Similar articles

Cited by

References

    1. Agresti A (1990) Categorical Data Analysis. New York: Wiley
    1. Alfarano C, Andrade CE, Anthony K, Bahroos N, Bajec M, Bantoft K, Betel D, Bobechko B, Boutilier K, Burgess E, Buzadzija K, Cavero R, D'Abreo C, Donaldson I, Dorairajoo D, Dumontier MJ, Dumontier MR, Earles V, Farrall R, Feldman H et al.. (2005) The biomolecular interaction network database and related tools 2005 update. Nucleic Acids Res 33: D418–D424 - PMC - PubMed
    1. Alizadeh AA, Eisen MB, Davis RE, Ma C, Lossos IS, Rosenwald A, Boldrick JC, Sabet H, Tran T, Yu X, Powell JI, Yang L, Marti GE, Moore T, Hudson J Jr, Lu L, Lewis DB, Tibshirani R, Sherlock G, Chan WC et al.. (2000) Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature 403: 503–511 - PubMed
    1. Bachman KE, Argani P, Samuels Y, Silliman N, Ptak J, Szabo S, Konishi H, Karakas B, Blair BG, Lin C, Peters BA, Velculescu VE, Park BH (2004) The PIK3CA gene is mutated with high frequency in human breast cancers. Cancer Biol Ther 3: 772–775 - PubMed
    1. Ben-Dor A, Bruhn L, Friedman N, Nachman I, Schummer M, Yakhini Z (2000) Tissue classification with gene expression profiles. J Comput Biol 7: 559–583 - PubMed

Publication types

MeSH terms