Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 Jul 8:2023.11.14.567054.
doi: 10.1101/2023.11.14.567054.

Trans-eQTL hotspots shape complex traits by modulating cellular states

Affiliations

Trans-eQTL hotspots shape complex traits by modulating cellular states

Kaushik Renganaath et al. bioRxiv. .

Update in

Abstract

Regulatory genetic variation shapes gene expression, providing an important mechanism connecting DNA variation and complex traits. The causal relationships between gene expression and complex traits remain poorly understood. Here, we integrated transcriptomes and 46 genetically complex growth traits in a large cross between two strains of the yeast Saccharomyces cerevisiae. We discovered thousands of genetic correlations between gene expression and growth, suggesting potential functional connections. Local regulatory variation was a minor source of these genetic correlations. Instead, genetic correlations tended to arise from multiple independent trans-acting regulatory loci. Trans-acting hotspots that affect the expression of numerous genes accounted for particularly large fractions of genetic growth variation and of genetic correlations between gene expression and growth. Genes with genetic correlations were enriched for similar biological processes across traits, but with heterogeneous direction of effect. Our results reveal how trans-acting regulatory hotspots shape complex traits by altering cellular states.

Keywords: Genetic variation; IRA2; QTL; eQTL; mediation; pleiotropy; quantitative genetics.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
Genetic correlations. (A) Cartoon summarizing the datasets used to compute genetic correlations. (B) Number of genes with significant genetic correlation in each of the 46 growth conditions at 5% FDR (pale colors) and Bonferroni significance (saturated colors). Positive correlations are in blue and above the zero line; negative correlations are in red and below. For each condition, the number of segregants with both growth and expression measurements in Bloom et al. (2013) and Albert et al. (2018) used to compute the genetic correlations are indicated in parentheses.
Figure 2:
Figure 2:
Local eQTL analyses. (A) Illustration of causal variants at local eQTLs that overlap a gQTL. QTL locations are shown as hollow boxes; gene position and transcriptional direction is indicated as solid purple box and arrow. A significant p-value on the colocalization test indicates that a single QTL caused by shared causal variants is rejected in favor of two separate QTLs. (B) Histogram of the number of gQTLs with a given number of colocalized local eQTLs. (C) The number of gQTLs with zero, one, and more than one colocalized local eQTL. (D) Profile LOD curves for growth in tunicamycin (blue) and the expression of indicated genes (purple) at a gQTL on chromosome X. “Profile LOD” scores are computed under the two-QTLs model as a LOD trace for one trait while keeping the QTL for the second trait fixed at its maximum-likelihood position, followed by subtracting the maximum LOD score of the single-QTL model in this region. A maximum profile LOD value of zero (as for CHS6) indicates that the two-QTLs model fits no better than the single-QTL model. Positive maximum profile LOD values (as for TRL1 and IME2) indicate that the two-QTLs model fits better than the single-QTL model, with the two profile LOD curves peaking at the best positions for the two QTLs. QTL and gene positions shown as in A.
Figure 3:
Figure 3:
QTL effect correlations. (A) As an example, the plot shows the locations of eQTLs affecting the gene HSP12 (x-axis) and their effects (y-axis; coefficient of correlation between trait and genotype) on HSP12 expression (red) and growth in the presence of copper (blue). 95% effect size confidence intervals are shown. HSP12 and copper were chosen as the example due to the large number of eQTLs and strong effect size correlation. (B) Scatterplot of the effects from panel A. The line represents the weighted regression. Weights are calculated as the inverse product of the widths of the 95% effect size confidence intervals and is reflected in the size of the point. (C) Scatterplot between QTL effect correlation coefficients and genetic correlation coefficients across all genes and all growth conditions.
Figure 4:
Figure 4:
Trans-eQTL hotspot and gQTL locations. (A) The top panel shows the number of genes affected by the 102 trans-eQTL hotspots (vertical lines). The bottom panel shows the locations of the 591 gQTLs as points with size scaled by effect size (coefficient of correlation between trait and genotype at the gQTL). Positive (RM allele increases growth compared to BY allele) and negative (RM allele decreases growth) gQTL effects are red and blue, respectively. (B) Histogram showing the distribution of the number of gQTLs that overlap a trans-eQTL hotspot in 1,000 sets of randomly placed gQTLs, compared to the actual gQTLs (red line). (C) Histogram showing the number of trans-eQTL hotspots that overlap at least one gQTL in 1,000 sets of randomly placed gQTLs, compared to the actual data (red line). (D) Scatter plot comparing the average growth variance explained by each hotspot and the number of genes whose expression they influence. The top two hotspots influencing the expression of most number of genes (chrXIV:466588_T/G and chrXIV:372376_G/A) are excluded from the analysis as it is not possible to estimate the average proportion of variance explained by just two markers under the linear mixed model scheme we use to estimate the proportion of variance explained by hotspots.
Figure 5:
Figure 5:
Phenotypic variance and genetic correlations linked to trans-eQTL hotspots. (A) Proportion of phenotypic variance explained by all 11,530 genetic markers (narrow-sense heritability; red dots), the 102 trans-eQTL hotspots (blue dots), and all gQTLs (yellow dots), in 46 growth conditions. Box plots show the distribution of variance explained by 1,000 sets of 102 random genetic markers. These distributions were used to calculate p-values (shown at top) for the proportion of variance explained by the hotspots. *: p < 0.05. (B) Scatterplot between hotspot effect correlation coefficients and genetic correlation coefficients across all genes and growth conditions. (C) Boxplots showing distributions of magnitudes of genetic correlation coefficients before (“uncorrected”) and after removing the effects of local eQTLs (one per gene), trans-eQTLs (range 1 to 21 per gene, median of 6), and the 102 trans-eQTL hotspots (the number of markers regressed out for each category are indicated along the x-axis). This analysis only included genes with at least one local eQTL and one trans-eQTL, and only uncorrected genetic correlations that were significant at 5% FDR. For each category, a random expectation was computed by calculating the decrease in the magnitude of genetic correlations after regressing out the effects of 1,000 sets of random markers of size equal to the actual set. Wilcoxon p-values for the difference in medians are indicated above the boxplots.
Figure 6:
Figure 6:
Biological processes enriched in genetic correlations. The top heatmap shows log2-fold enrichments for 100 GO-BP Slim terms in the set of genes with significant (5% FDR) genetic correlations. Enrichments for genes with positive and negative genetic correlations are shown in red and blue, respectively. Enrichments are displayed for terms significant at uncorrected p < 0.001. The stacked bar plot above the heatmap shows the number of genes with significant genetic correlation (5% FDR) for each condition, with positive genetic correlation in red and negative correlations in blue. The dendrogram clusters traits based on GO-enrichment, with the three trait groups discussed in the text indicated. The heatmaps at the bottom show average genetic correlation coefficients for genes in the indicated gene groups from Brauer et al. and Gasch et al.
Figure 7:
Figure 7:
Mediation of growth variation at a trans-eQTL hotspot. (A) Conceptual model showing how the effect of the IRA2 hotspot on growth in hydrogen peroxide may be mediated by the expression of hotspot target genes (Genei). The coefficients used in the mediation analysis (Methods) are indicated. (B) Mediated proportion of the IRA2 hotspot effect on growth in hydrogen peroxide for 380 genes with significant mediation (5% FDR). Colored circles indicate whether a gene is annotated as an Msn2 target in the Yeastract database. Genes described in the text are highlighted. Positive mediation indicates genes for which increased expression increases growth, while negative mediation indicates genes for which increased expression decreases growth. (C) Conceptual model of how trans-eQTL hotspots affect growth by altering cellular states, as reflected in the hotspot’s effect on multiple genes. The two example states roughly correspond to the ‘green’ and ‘blue’ trait clusters in Figure 6, with high translation and low respiration versus low translation and high respiration. Growth in different conditions is affected differently by these states. Most local eQTLs (shown below the dashed grey line) affect gene expression but do not go on to affect key cellular states underlying growth variation.

References

    1. Maurano M.T., Humbert R., Rynes E., Thurman R.E., Haugen E., Wang H., Reynolds A.P., Sandstrom R., Qu H., Brody J., et al. (2012). Systematic Localization of Common Disease-Associated Variation in Regulatory DNA. Science 337, 1190–1195. 10.1126/science.1222794. - DOI - PMC - PubMed
    1. Albert F.W., and Kruglyak L. (2015). The role of regulatory variation in complex traits and disease. Nat. Rev. Genet. 16, 197–212. 10.1038/nrg3891. - DOI - PubMed
    1. Aguirre-Gamboa R., de Klein N., di Tommaso J., Claringbould A., van der Wijst M.G., de Vries D., Brugge H., Oelen R., Võsa U., Zorro M.M., et al. (2020). Deconvolution of bulk blood eQTL effects into immune cell subpopulations. BMC Bioinformatics 21, 243. 10.1186/s12859-020-03576-5. - DOI - PMC - PubMed
    1. Brown A.A., Viñuela A., Delaneau O., Spector T.D., Small K.S., and Dermitzakis E.T. (2017). Predicting causal variants affecting expression by using whole-genome sequencing and RNA-seq from multiple human tissues. Nat. Genet. 49, 1747–1751. 10.1038/ng.3979. - DOI - PubMed
    1. Cusanovich D.A., Billstrand C., Zhou X., Chavarria C., De Leon S., Michelini K., Pai A.A., Ober C., and Gilad Y. (2012). The combination of a genome-wide association study of lymphocyte count and analysis of gene expression data reveals novel asthma candidate genes. Hum. Mol. Genet. 21, 2111–2123. 10.1093/hmg/dds021. - DOI - PMC - PubMed

Publication types