Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2023 Apr 7:2023.04.06.535816.
doi: 10.1101/2023.04.06.535816.

Systematic visualisation of molecular QTLs reveals variant mechanisms at GWAS loci

Affiliations

Systematic visualisation of molecular QTLs reveals variant mechanisms at GWAS loci

Nurlan Kerimov et al. bioRxiv. .

Update in

Abstract

Splicing quantitative trait loci (QTLs) have been implicated as a common mechanism underlying complex trait associations. However, utilising splicing QTLs in target discovery and prioritisation has been challenging due to extensive data normalisation which often renders the direction of the genetic effect as well as its magnitude difficult to interpret. This is further complicated by the fact that strong expression QTLs often manifest as weak splicing QTLs and vice versa, making it difficult to uniquely identify the underlying molecular mechanism at each locus. We find that these ambiguities can be mitigated by visualising the association between the genotype and average RNA sequencing read coverage in the region. Here, we generate these QTL coverage plots for 1.7 million molecular QTL associations in the eQTL Catalogue identified with five quantification methods. We illustrate the utility of these QTL coverage plots by performing colocalisation between vitamin D levels in the UK Biobank and all molecular QTLs in the eQTL Catalogue. We find that while visually confirmed splicing QTLs explain just 6/53 of the colocalising signals, they are significantly less pleiotropic than eQTLs and identify a prioritised causal gene in 4/6 cases. All our association summary statistics and QTL coverage plots are freely available at https://www.ebi.ac.uk/eqtl/.

PubMed Disclaimer

Conflict of interest statement

Competing interests J.C.U. is an employee of Illumina. N.K. is an employee of Nightingale Health. M.W. is an employee of Genomics Plc. S.G. is an employee of Verge Genomics.

Figures

Figure 1.
Figure 1.
Uniform re-processing of all datasets. (A), The number of studies, datasets, donors, and samples in the previous publication (R3) and current version of the eQTL Catalogue (R6). (B), Number of genes with at least one significant eQTL (‘eGenes’) on the X chromosome as a function of dataset sample size. Red points indicate datasets for which the X chromosome genotypes were unavailable. (C), The number of eGenes identified in each dataset for the five molecular traits (gene expression, exon expression, transcript usage, txrevise event usage, and Leafcutter splice-junction usage). Datasets newly added since release 3 have been highlighted.
Figure 2.
Figure 2.
Visualisation of a splicing QTL detected in the CYP2R1 gene. (A) RNA-seq read coverage across the CYP2R1 gene in GTEx transverse colon tissue stratified by the genotype of the lead sQTL variant (chr11_14855172_G_A). All introns have been shortened to 50 nt with wiggleplotr (Alasoo, 2017) to make variation in exonic read coverage easier to see. (B) Effect sizes and 95% confidence intervals of the lead sQTL variant on the expression level of individual exons (or exonic parts) of CYP2R1. Associations significant at FDR <= 1% are shown in dark blue. (C) The top two rows show the MANE Select (Morales et al., 2022) reference transcript and all annotated exons of CYP2R1, respectively. The remaining rows show the txrevise (Alasoo et al., 2019) event annotations used for sQTL mapping. The short version of exon 4 (between dashed lines) is only present in annotated nonsense-mediated decay (NMD) transcripts.
Figure 3.
Figure 3.
Sharing of significantly colocalised signals with vitamin D (A) Number of colocalised signals detected by the different molecular QTL quantification methods and sharing between them. (B) Number of colocalised signals assigned to empirical functional consequence (eQTL, sQTL, puQTL, apaQTL or ambiguous) and sharing structure between them. (C) Number of independent colocalised signals associated with either a single target gene or multiple target genes in each functional consequences group. eQTL - expression QTL, sQTL - splicing QTL, puQTL - promoter usage QTL, apaQTL - alternative polyadenylation QTL.

References

    1. Abugessaisa I, Noguchi S, Hasegawa A, Harshbarger J, Kondo A, Lizio M, Severin J, Carninci P, Kawaji H, Kasukawa T. 2017. FANTOM5 CAGE profiles of human and mouse reprocessed for GRCh38 and GRCm38 genome assemblies. Sci Data 4:170107. - PMC - PubMed
    1. Alasoo K. 2017. wiggleplotr: Make read coverage plots from BigWig files. Bioconductor. doi:10.18129/B9.bioc.wiggleplotr - DOI
    1. Alasoo K, Rodrigues J, Danesh J, Freitag DF, Paul DS, Gaffney DJ. 2019. Genetic effects on promoter usage are highly context-specific and contribute to complex traits. Elife 8. doi:10.7554/eLife.41673 - DOI - PMC - PubMed
    1. Alasoo K, Rodrigues J, Mukhopadhyay S, Knights AJ, Mann AL, Kundu K, HIPSCI Consortium, Hale C, Dougan G, Gaffney DJ. 2018. Shared genetic effects on chromatin and gene expression indicate a role for enhancer priming in immune response. Nat Genet 50:424–431. - PMC - PubMed
    1. Anders S, Reyes A, Huber W. 2012. Detecting differential usage of exons from RNA-seq data. Genome Res 22:2008–2017. - PMC - PubMed

Publication types