Cell Syst. 2018 Nov 28;7(5):526-536.e6.

doi: 10.1016/j.cels.2018.10.001. Epub 2018 Nov 7.

Integration of Tumor Genomic Data with Cell Lines Using Multi-dimensional Network Modules Improves Cancer Pharmacogenomics

James T Webber¹, Swati Kaushik¹, Sourav Bandyopadhyay²

Affiliations

¹ Department of Bioengineering and Therapeutic Sciences, Institute for Computational Health Sciences, Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA.
² Department of Bioengineering and Therapeutic Sciences, Institute for Computational Health Sciences, Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA. Electronic address: sourav.bandyopadhyay@ucsf.edu.

PMID: 30414925
PMCID: PMC6265063
DOI: 10.1016/j.cels.2018.10.001

James T Webber et al. Cell Syst. 2018.

Abstract

Leveraging insights from genomic studies of patient tumors is limited by the discordance between these tumors and the cell line models used for functional studies. We integrate omics datasets using functional networks to identify gene modules reflecting variation between tumors and show that the structure of these modules can be evaluated in cell lines to discover clinically relevant biomarkers of therapeutic responses. Applied to breast cancer, we identify 219 gene modules that capture recurrent alterations, subtype patients, and quantitate various cell types within the tumor microenvironment. Comparison of modules between tumors and cell lines reveals that many modules composed primarily of gene expression and methylation are poorly preserved. In contrast, preserved modules are highly predictive of drug responses in a manner that is robust and clinically relevant. This work addresses a fundamental challenge in pharmacogenomics that can only be overcome by the joint analysis of patient and cell line data.

Keywords: biomarkers; breast cancer; data integration; networks; pharmacogenomics; therapeutics.

Figures

**Figure 1. Data integration and module discovery using MAGNETIC.**
**(a)** Interaction network of ubiquitin specific peptidases USP6 and USP32 from the STRING database and Pearson correlation between molecular features of USP6 and USP32 across TCGA breast cancers. P-values of association after Bonferroni correction for multiple testing are in parentheses. **(b)** Scatter plot of normalized USP32 copy-number and USP6 expression across TCGA. **(c)** The interaction network of the kinase LCK and its substrate LAT and relationships between their molecular profiles across platforms. **(d)** Scatter plot of LCK expression and LAT methylation. **(e)** MAGNETIC uses as input the normalized DNA copy-number, methylation, somatic mutation, mRNA expression and protein abundance data from a collection of tumor samples. We compute a multi-layer pairwise gene similarity network by computing the correlation between all pairs of gene features both within and between profiling platforms. Each linkage in this correlation network is normalized through comparison against a benchmark of pathways reflected in protein-protein interaction databases. Scored edges are then merged into a multigraph in which nodes represent genes and the edges between nodes represent co-incidence of different types of linkages. Clustering of this network using a random walk algorithm reveals gene modules whose components are closely related in multiple data types. **(f)** Circos plot representation of the module network containing HER2. Colors represent different data sources selected in the final integrated network for each gene, and edge thickness is proportional to edge score. Top central genes are labeled. **(g)** TCGA samples sorted by HER2 module score. PAM50 subtype and molecular receptor status as determined by IHC are shown. **(h)** The module network containing the estrogen receptor, ESR1. Direct transcriptional targets of ER as assessed through ChIP analysis are marked with a star. **(i)** TCGA samples sorted by ESR1 module score. See also Figures S1-S3.
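
The network construction step in panel (e) can be pictured with a minimal Python sketch. It is not the MAGNETIC implementation: the feature names and values are toy data, the function names are hypothetical, same-gene pairs are skipped by assumption, and a plain cutoff on |r| (`min_abs_r`) stands in for the benchmark-based edge normalization described in the caption.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a constant feature carries no correlation signal
    return sxy / sqrt(sxx * syy)


def build_multigraph(features, min_abs_r=0.8):
    """Correlate every pair of (gene, platform) features across samples.

    `features` maps (gene, platform) -> per-sample values. Returns edges as
    (gene_a, gene_b, platform_a, platform_b, r). Two genes may be linked by
    several edges, one per platform pair, which is what makes it a multigraph.
    The |r| cutoff is an assumption made for this sketch only.
    """
    edges = []
    for (key_a, vals_a), (key_b, vals_b) in combinations(sorted(features.items()), 2):
        gene_a, plat_a = key_a
        gene_b, plat_b = key_b
        if gene_a == gene_b:
            continue  # assumption: only edges between distinct genes are kept
        r = pearson(vals_a, vals_b)
        if abs(r) >= min_abs_r:
            edges.append((gene_a, gene_b, plat_a, plat_b, r))
    return edges


# Toy data: five samples, two hypothetical genes, three platforms.
toy = {
    ("GENE_A", "expression"): [1.0, 2.0, 3.0, 4.0, 5.0],
    ("GENE_B", "copy_number"): [2.1, 3.9, 6.2, 8.0, 9.9],
    ("GENE_B", "methylation"): [0.9, 0.1, 0.8, 0.2, 0.5],
}
edges = build_multigraph(toy)
```

In this toy input only the expression/copy-number pair passes the cutoff, so the resulting edge links the two genes across platforms, analogous in spirit to the USP32 copy-number and USP6 expression relationship in panel (b).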

**Figure 2. Many patient-derived modules are not preserved in cell lines and are associated with specific data types.**
**(a)** Overview of approach to score module preservation in cell lines. MAGNETIC takes molecular correlations present across tumor samples and determines if they remain significantly correlated across a cell line panel. Solid edges are above random background; dotted edges are below random background (see STAR Methods). Different colors represent edges derived from comparison between different molecular profiling platforms. **(b)** Histogram and kernel density estimation of the distribution of module preservation scores. The vertical dotted line shows the cutoff of 5 chosen for further evaluation. **(c)** Correlation of module scores with pathologic assessments of necrosis and normal cell infiltration for lowly (L) and highly (H) preserved modules. **(d)** Comparison of module types with computational assessment of tumor purity. **(e)** Sorted preservation scores for 219 breast cancer modules evaluated in cell lines. Lowly preserved modules have a score less than 5 (dotted line). **(f)** For each module in (e), the percent of the LLR>1 network that corresponds to each edge type is shown. **(g)** Percent of each edge type for lowly and highly preserved modules in the LLR>1 network. P-values based on Mann-Whitney U-test are in parentheses. See also Figure S4.
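
The idea in panel (a), checking whether tumor-derived correlations persist across a cell line panel, can be sketched as follows. This is a simplified proxy and not the preservation score defined in the paper's STAR Methods: the function `fraction_preserved`, the cutoff `min_abs_r`, and all values are hypothetical, and a fixed cutoff replaces the comparison against a random background.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)


def fraction_preserved(tumor_edges, cell_line_features, min_abs_r=0.5):
    """Fraction of a module's tumor edges that persist in cell lines.

    `tumor_edges` is a list of (feature_a, feature_b, tumor_r). An edge counts
    as preserved when the cell line correlation has the same sign as in tumors
    and |r| >= min_abs_r (an arbitrary cutoff chosen for this sketch).
    """
    if not tumor_edges:
        return 0.0
    kept = 0
    for feat_a, feat_b, tumor_r in tumor_edges:
        r = pearson(cell_line_features[feat_a], cell_line_features[feat_b])
        same_sign = (r > 0) == (tumor_r > 0)
        if same_sign and abs(r) >= min_abs_r:
            kept += 1
    return kept / len(tumor_edges)


# Toy cell line panel: four lines, three hypothetical features.
cell_lines = {
    "G1_expr": [1.0, 2.0, 3.0, 4.0],
    "G2_cnv": [1.1, 1.9, 3.2, 3.8],
    "G3_meth": [0.5, 0.1, 0.4, 0.2],
}
module_edges = [("G1_expr", "G2_cnv", 0.9), ("G1_expr", "G3_meth", -0.8)]
preserved = fraction_preserved(module_edges, cell_lines)
```

Here the expression/copy-number edge persists while the expression/methylation edge weakens below the cutoff, so half of the module's edges count as preserved in this toy panel.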

**Figure 3. Modules reflect specific aspects of the tumor microenvironment.**
**(a)** Heatmap of molecular features associated with the overall activity of the immune module (r²>0.1). For clarity, the CNV of one gene is not shown. **(b)** Enrichment for high expression of module genes from normalized RNA-seq data in 227 purified immune cell type datasets. Cell types are categorized into 15 groups and enrichment is based on a t-test. **(c)** Comparison of module scores with annotated lymphocytic infiltration values in TCGA and METABRIC datasets. **(d)** Heatmap of molecular features associated with module 12, associated with stromal cells. **(e)** Comparison of module scores with pathologic assessment of stromal cells in TCGA samples. P-values based on t-test. **(f)** Examples of samples from TCGA with low and high scores for module 12, showing the difference in stromal content. **(g)** Heatmap of molecular features associated with module 16, associated with endothelial cells. **(h)** Comparison of module scores with annotations of necrosis. **(i)** Examples of samples with low and high scores for module 16.

**Figure 4. A module-drug network identifies high-performance biomarkers that are preserved between patients and cell lines.**
**(a)** Network of 97 module-drug associations based on breast cancer cell line modeling. Modules significantly associated with drug response are shown (FDR≤5%). Drugs are limited to those that are not associated with PAM50 subtype based on an FDR threshold of 5%. The size of each module is proportional to the number of genes within it, and the thickness of the border depicts the strength of a module or drug’s association with PAM50 subtype. Edges are colored red when a module correlates with sensitivity to a drug, and blue when it correlates with resistance. Thicker edges have a lower FDR. As an example, gain of chr1q is associated with resistance to etoposide at an FDR of <0.1%. **(b)** Scatter plot of cell line association of lapatinib response with module #92 (HER2) and **(c)** oxaliplatin with module #139 (chr11q14#1). Cell lines are colored by PAM50 subtype. **(d)** Comparison of median absolute error of cross-validated predictions of drug sensitivity using single gene features or modules as input to elastic net, random forest or SVM based predictors. P-values based on Mann-Whitney U-test. **(e)** Cross-correlation for all pairs of molecular features that are the most predictive of response to imatinib in cell lines at an FDR of 1% and cross-correlation of the same features in TCGA. **(f)** The average cross-correlation (r²) of features selected by various statistical methods (FDR, elastic net) using single genes or modules in cell lines and evaluation of cross-correlation of the same features in TCGA. Each point represents a model for a single drug. P-values based on Mann-Whitney U-test. See also Figure S4.
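
The metric in panel (d), median absolute error of cross-validated predictions, can be illustrated with a small Python sketch. It makes several assumptions: a one-feature least-squares line stands in for the elastic net, random forest and SVM predictors named in the caption, leave-one-out is used as the cross-validation scheme, and the values are constructed toy numbers, not data or results from the study.

```python
from statistics import median


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    if sxx == 0:
        return 0.0, my  # constant feature: predict the mean response
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return slope, my - slope * mx


def loo_median_abs_error(xs, ys):
    """Median absolute error of leave-one-out predictions from one feature."""
    errors = []
    for i in range(len(xs)):
        # Fit on every sample except i, then predict the held-out sample.
        slope, intercept = fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        errors.append(abs(ys[i] - (slope * xs[i] + intercept)))
    return median(errors)


# Toy values: feature_a is built to track the response exactly (y = 2x + 1),
# feature_b is a shuffled, weakly related feature.
drug_response = [1.0, 3.0, 5.0, 7.0, 9.0]
feature_a = [0.0, 1.0, 2.0, 3.0, 4.0]
feature_b = [2.0, 0.0, 3.0, 1.0, 4.0]
mae_a = loo_median_abs_error(feature_a, drug_response)
mae_b = loo_median_abs_error(feature_b, drug_response)
```

A lower value means held-out responses are predicted more closely. In the figure, this quantity is compared between models that use single gene features and models that use modules as input.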

Cited by

Network-based machine learning in colorectal and bladder organoid models predicts anti-cancer drug efficacy in patients.
Kong J, Lee H, Kim D, Han SK, Ha D, Shin K, Kim S. Nat Commun. 2020 Oct 30;11(1):5485. doi: 10.1038/s41467-020-19313-8. PMID: 33127883 Free PMC article.
A Bayesian framework for pathway-guided identification of cancer subgroups by integrating multiple types of genomic data.
Sun Z, Chung D, Neelon B, Millar-Wilson A, Ethier SP, Xiao F, Zheng Y, Wallace K, Hardiman G. Stat Med. 2023 Dec 10;42(28):5266-5284. doi: 10.1002/sim.9911. Epub 2023 Sep 15. PMID: 37715500 Free PMC article.
Information about immune cell proportions and tumor stage improves the prediction of recurrence in patients with colorectal cancer.
Kong J, Kim J, Kim D, Lee K, Lee J, Han SK, Kim I, Lim S, Park M, Shin S, Lee WY, Yun SH, Kim HC, Hong HK, Cho YB, Park D, Kim S. Patterns (N Y). 2023 Apr 20;4(6):100736. doi: 10.1016/j.patter.2023.100736. eCollection 2023 Jun 9. PMID: 37409049 Free PMC article.
Predicting drug sensitivity of cancer cells based on DNA methylation levels.
Miranda SP, Baião FA, Fleck JL, Piccolo SR. PLoS One. 2021 Sep 10;16(9):e0238757. doi: 10.1371/journal.pone.0238757. eCollection 2021. PMID: 34506489 Free PMC article.
Synthetic lethal interactions of DEAD/H-box helicases as targets for cancer therapy.
Arna AB, Patel H, Singh RS, Vizeacoumar FS, Kusalik A, Freywald A, Vizeacoumar FJ, Wu Y. Front Oncol. 2023 Jan 26;12:1087989. doi: 10.3389/fonc.2022.1087989. eCollection 2022. PMID: 36761420 Free PMC article. Review.
