Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Jun 15;19(12):3114-20.
doi: 10.1158/1078-0432.CCR-12-2093. Epub 2013 Feb 21.

Molecular pathways: extracting medical knowledge from high-throughput genomic data

Affiliations

Molecular pathways: extracting medical knowledge from high-throughput genomic data

Theodore C Goldstein et al. Clin Cancer Res. .

Abstract

High-throughput genomic data that measures RNA expression, DNA copy number, mutation status, and protein levels provide us with insights into the molecular pathway structure of cancer. Genomic lesions (amplifications, deletions, mutations) and epigenetic modifications disrupt biochemical cellular pathways. Although the number of possible lesions is vast, different genomic alterations may result in concordant expression and pathway activities, producing common tumor subtypes that share similar phenotypic outcomes. How can these data be translated into medical knowledge that provides prognostic and predictive information? First-generation mRNA expression signatures such as Genomic Health's Oncotype DX already provide prognostic information, but do not provide therapeutic guidance beyond the current standard of care, which is often inadequate in high-risk patients. Rather than building molecular signatures based on gene expression levels, evidence is growing that signatures based on higher-level quantities such as from genetic pathways may provide important prognostic and diagnostic cues. We provide examples of how activities for molecular entities can be predicted from pathway analysis and how the composite of all such activities, referred to here as the "activitome," helps connect genomic events to clinical factors to predict the drivers of poor outcome.

PubMed Disclaimer

Figures

Figure 1
Figure 1. PARADIGM model for integrative data analysis
A. Factor graph model oriented around a single gene Hidden states in a tumor sample (open ellipses) for genomic copies (G), epigenetic promoter state (E), mRNA transcripts (T), peptide (P), and active protein (A). Regulation gene expression (open ellipses) include transcriptional (RT), translational (RP), and post-translational (RA) control. Sample data (filled circles, constrain gene states through factors (boxes). B. Toy example of a MYC/MAX-associated pathway. Two transcription factors (MYC and MAX) form a complex (MYC/MAX) that is inhibited by PAK2, a protein kinase. MYC/MAX activates two target genes (CCNB1, ENO1) and inactivates a third (WNT5A). C. Single patient data converted to inferred activities for toy pathway. Measurements and inferred levels are either higher (red), lower (blue), or comparable (purple) to levels in matched normal. Belief propagation infers the kinase is inactive based on inferred higher activity of MYC/MAX.
Figure 2
Figure 2. HotLink Result for TCGA Breast Cancer
Rings of data depict different measurements about genes or proteins as higher activity (red) or lower activity (blue) compared to normal controls. Pathway illustrates part of the HotLink solution inferred from the TCGA basal and luminal breast tumors with various data available through TCGA including (inner to outer): pathway levels inferred by PARADIGM, Copy number alterations, RNA-Seq RSEM levels, and RPPA data. The outermost ring depicts the patient subtypes. Segments display aggregated levels for samples within each grouping defined by the breast cancer subtype (indicated in the outermost ring).
Figure 3
Figure 3
DiPSC(Dipstick) depicts correlations comparing mutations, and biomarkers along the Luminal-A/Luminal B dichotomy.

References

    1. Heiser LM, Sadanandam A, Kuo WL, Benz SC, Goldstein TC, Ng S, et al. Subtype and pathway specific responses to anticancer compounds in breast cancer. Proceedings of the National Academy of Sciences of the United States of America; 2011. - PMC - PubMed
    1. Vaske CJ, Benz SC, Sanborn JZ, Earl D, Szeto C, Zhu J, et al. Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM. Bioinformatics. 2010;26:i237–245. - PMC - PubMed
    1. Kschischang, Frey, Loeliger Factor graphs and the sum-product algorithm. IEEE Transactions on Information Theory. 2001:47.
    1. Vandin F, Upfal E, Raphael BJ. Algorithms for detecting significantly mutated pathways in cancer. Journal of computational biology: a journal of computational molecular cell biology. 2011;18:507–522. - PubMed
    1. Ciriello G, Cerami E, Sander C, Schultz N. Mutual exclusivity analysis identifies oncogenic network modules. Genome research. 2012;22:398–406. - PMC - PubMed

Publication types

LinkOut - more resources