Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Nov;27(11):1859-1871.
doi: 10.1101/gr.216754.116. Epub 2017 Oct 11.

Identifying cis-mediators for trans-eQTLs across many human tissues using genomic mediation analysis

Collaborators, Affiliations

Identifying cis-mediators for trans-eQTLs across many human tissues using genomic mediation analysis

Fan Yang et al. Genome Res. 2017 Nov.

Abstract

The impact of inherited genetic variation on gene expression in humans is well-established. The majority of known expression quantitative trait loci (eQTLs) impact expression of local genes (cis-eQTLs). More research is needed to identify effects of genetic variation on distant genes (trans-eQTLs) and understand their biological mechanisms. One common trans-eQTLs mechanism is "mediation" by a local (cis) transcript. Thus, mediation analysis can be applied to genome-wide SNP and expression data in order to identify transcripts that are "cis-mediators" of trans-eQTLs, including those "cis-hubs" involved in regulation of many trans-genes. Identifying such mediators helps us understand regulatory networks and suggests biological mechanisms underlying trans-eQTLs, both of which are relevant for understanding susceptibility to complex diseases. The multitissue expression data from the Genotype-Tissue Expression (GTEx) program provides a unique opportunity to study cis-mediation across human tissue types. However, the presence of complex hidden confounding effects in biological systems can make mediation analyses challenging and prone to confounding bias, particularly when conducted among diverse samples. To address this problem, we propose a new method: Genomic Mediation analysis with Adaptive Confounding adjustment (GMAC). It enables the search of a very large pool of variables, and adaptively selects potential confounding variables for each mediation test. Analyses of simulated data and GTEx data demonstrate that the adaptive selection of confounders by GMAC improves the power and precision of mediation analysis. Application of GMAC to GTEx data provides new insights into the observed patterns of cis-hubs and trans-eQTL regulation across tissue types.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Causal diagram demonstrating mediation and “mediator-outcome confounding.” Here, the variable set “U” represents a set of unmeasured or unknown variables that may show confounding effects in the mediation analysis. Mediation analysis can detect mediation of the effect of the eSNP on the trans-gene by the cis-gene, assuming mediator-outcome confounding is absent or adjusted for in the analysis. Mediation will not be detected if the effect of the eSNP on the trans-gene is through some alternative pathway that does not involve the cis-gene.
Figure 2.
Figure 2.
Graphical illustrations of GMAC and its main ideas. (A) A summary of the GMAC algorithm; (B) a mediation relationship among an eQTL, Li, its cis-gene transcript, Ci, and a trans-gene transcript, Tj, with confounders, Xij, allowing Li to affect Tj via a pathway independent of Ci; (C) a mediation trio where Ci and Tj have common child variable(s), Zij; (D) a mediation trio where Ci affects Tj through intermediate variable(s), Wij. (E) The adaptive confounder selection procedure: Based on the P-value matrix for the association of each potential confounder variable to at least one of the cis- or the trans-gene transcripts, we apply a stratified FDR approach by considering the P-values for each potential confounder (each column) as a stratum, with the significant ones indicated by a check mark (√). When conducting the mediation test for each trio, we only adjust for the significant confounding variables (the ones with √ in each row). (F) A mediation trio LiCiTj (left) and a trio under the null with both cis-linkage and trans-linkage but no mediation (right). Within-genotype permutation of the cis-gene expression levels maintains the cis- and trans-linkage (different mean levels) while breaking the potential correlation between the cis- and trans-expression levels within each genotype group. Note that Xij, Zij, Wij may vary by trios and are all subsets of H. We assume that either Xij or a combination of variables in Xij would capture the variation of the unmeasured confounder U in Figure 1.
Figure 3.
Figure 3.
Plots of negative log base 10 of mediation P-values versus the percentages of reduction in trans-effects after accounting for cis-mediation. The P-values are calculated based on (A) mediation tests without adjusting for hidden confounders, and (B) mediation tests by GMAC considering all PCs as potential confounders. P-values are truncated at 10−16. The plots are based on the results from the adipose subcutaneous tissue. The percentage of reduction in trans-effects is calculated by (β2m − β2)/β2m × 100%, where β2m is the marginal trans-effect of the eQTL on the trans-gene expression levels, and β2 is the trans-effect after adjusting for a potential cis-mediator and other covariates. For trios with true cis-mediations, the marginal trans-effects are nonzero, and after adjusting for the true cis-mediators, we expect the adjusted trans-effects β2 to be substantially reduced; that is, we expect the trios with very significant mediation P-values to have positive percent reduction in trans-effects. For results based on no adjustment of hidden confounders (A), we observed many trios with significant mediation P-values but the percentages of reduction in trans-effects are negative. At the 0.05 P-value threshold, 712 (21.4%) and 188 (5.6%) out of 3332 trios have P-values below the threshold and percent reduction in trans-effects being negative in A and B, respectively.
Figure 4.
Figure 4.
A biologically interpretable example of a cis-eGene (IFI44L) that appears to mediate the effects of trans-eSNPs in multiple tissues. The gene IFI44L (A) resides <5 kb away from IFI44 (B), and expression of these genes is associated with a common cis-eQTL that also impacts the expression of multiple genes in trans in multiple tissues. Both IFI44 and IFI44L show statistical evidence of mediation for a similar set of interferon-related genes. Thus, based on this evidence, we infer that at least one of these genes is a cis-mediator, although we cannot know which is (or if both are) the true mediator.

Comment in

References

    1. The 1000 Genomes Project Consortium. 2015. A global reference for human genetic variation. Nature 526: 68–74. - PMC - PubMed
    1. Ardlie KG, DeLuca DS, Segre AV, Sullivan TJ, Young TR, Gelfand ET, Trowbridge CA, Maller JB, Tukiainen T, Lek M, et al. 2015. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348: 648–660. - PMC - PubMed
    1. Battle A, Mostafavi S, Zhu XW, Potash JB, Weissman MM, McCormick C, Haudenschild CD, Beckman KB, Shi JX, Mei R, et al. 2014. Characterizing the genetic basis of transcriptome diversity through RNA-sequencing of 922 individuals. Genome Res 24: 14–24. - PMC - PubMed
    1. Chen LS, Emmert-Streib F, Storey JD. 2007. Harnessing naturally randomized transcription to infer regulatory relationships among genes. Genome Biol 8: R219. - PMC - PubMed
    1. Chen DT, Jiang X, Akula N, Shugart YY, Wendland JR, Steele CJ, Kassem L, Park JH, Chatterjee N, Jamain S, et al. 2013. Genome-wide association study meta-analysis of European and Asian-ancestry samples identifies three novel loci associated with bipolar disorder. Mol Psychiatry 18: 195–205. - PubMed

Publication types