Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Aug;28(8):1207-1216.
doi: 10.1101/gr.227066.117. Epub 2018 Jun 13.

Principled multi-omic analysis reveals gene regulatory mechanisms of phenotype variation

Affiliations

Principled multi-omic analysis reveals gene regulatory mechanisms of phenotype variation

Casey Hanson et al. Genome Res. 2018 Aug.

Abstract

Recent studies have analyzed large-scale data sets of gene expression to identify genes associated with interindividual variation in phenotypes ranging from cancer subtypes to drug sensitivity, promising new avenues of research in personalized medicine. However, gene expression data alone is limited in its ability to reveal cis-regulatory mechanisms underlying phenotypic differences. In this study, we develop a new probabilistic model, called pGENMi, that integrates multi-omic data to investigate the transcriptional regulatory mechanisms underlying interindividual variation of a specific phenotype-that of cell line response to cytotoxic treatment. In particular, pGENMi simultaneously analyzes genotype, DNA methylation, gene expression, and transcription factor (TF)-DNA binding data, along with phenotypic measurements, to identify TFs regulating the phenotype. It does so by combining statistical information about expression quantitative trait loci (eQTLs) and expression-correlated methylation marks (eQTMs) located within TF binding sites, as well as observed correlations between gene expression and phenotype variation. Application of pGENMi to data from a panel of lymphoblastoid cell lines treated with 24 drugs, in conjunction with ENCODE TF ChIP data, yielded a number of known as well as novel (TF, Drug) associations. Experimental validations by TF knockdown confirmed 41% of the predicted and tested associations, compared to a 12% confirmation rate of tested nonassociations (controls). An extensive literature survey also corroborated 62% of the predicted associations above a stringent threshold. Moreover, associations predicted only when combining eQTL and eQTM data showed higher precision compared to an eQTL-only or eQTM-only analysis using pGENMi, further demonstrating the value of multi-omic integrative analysis.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Plate diagram of pGENMi model. A latent variable zg represents whether a gene g mediates the influence of a TF on phenotype, and its enclosing rectangle denotes G such genes. pg denotes the TWAS P-value between the gene's expression and phenotype variation for each of the G genes. If zg = 1, we expect an enrichment for significant TWAS P-values, and pg is modeled by a Beta distribution parameterized by α; otherwise, pg is modeled as being distributed uniformly in [0,1]. We also observe one or more lines of evidence supporting the TF's influence on the expression of each gene g, such as the existence of a cis-eQTL within a ChIP peak of the TF near that gene. These “regulatory evidences” are denoted by the binary variables rgm, and there exist M such types of evidence (m = 1 … M), with relative weights wm. These evidences combine in a logistic function to determine Pr(zg = 1). The weights wm are learned over all genes, and as such, are shown outside of the rectangle enclosing G. The H variable indicates whether wm is free or restricted to zero (null model) for hypothesis testing (see “Probabilistic graphical model” section in Methods).
Figure 2.
Figure 2.
The 90 (TF, Drug) associations predicted by eQTL + M with LLR ≥ 3, colored by LLR. Labels indicate whether eQTL-only or eQTM-only analysis corroborated the prediction at LLR ≥ 1.74 (labels “S” and “M,” respectively). An “X” indicates both eQTL and eQTM analysis supported the prediction, while “–” indicates the prediction is unique to eQTL + M.
Figure 3.
Figure 3.
All 25 experimentally tested (TF, Drug) dosage-response curves. Red outlines indicate significant shifts in cytotoxicity between siRNA negative and siRNA TF conditions. Curves with a gray background are eQTL + M predictions, while those with a black background are negative controls. We confirmed seven out of 17 predictions and only one out of eight negative controls.

Similar articles

Cited by

References

    1. Barretina J, Caponigro G, Stransky N, Venkatesan K, Margolin AA, Kim S, Wilson CJ, Lehar J, Kryukov GV, Sonkin D, et al. 2012. The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 483: 603–607. - PMC - PubMed
    1. Basu A, Bodycombe NE, Cheah JH, Price EV, Liu K, Schaefer GI, Ebright RY, Stewart ML, Ito D, Wang S, et al. 2013. An interactive resource to identify cancer genetic and lineage dependencies targeted by small molecules. Cell 154: 1151–1161. - PMC - PubMed
    1. Battle A, Mostafavi S, Zhu X, Potash JB, Weissman MM, McCormick C, Haudenschild CD, Beckman KB, Shi J, Mei R, et al. 2014. Characterizing the genetic basis of transcriptome diversity through RNA-sequencing of 922 individuals. Genome Res 24: 14–24. - PMC - PubMed
    1. Bavelloni A, Faenza I, Aluigi M, Ferri A, Toker A, Maraldi NM, Marmiroli S. 2000. Inhibition of phosphoinositide 3-kinase impairs pre-commitment cell cycle traverse and prevents differentiation in erythroleukaemia cells. Cell Death Differ 7: 112–117. - PubMed
    1. Bernstein BE, Stamatoyannopoulos JA, Costello JF, Ren B, Milosavljevic A, Meissner A, Kellis M, Marra MA, Beaudet AL, Ecker JR, et al. 2010. The NIH Roadmap Epigenomics Mapping Consortium. Nat Biotechnol 28: 1045–1048. - PMC - PubMed

Publication types

LinkOut - more resources