Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jun 1;17(6):e1009596.
doi: 10.1371/journal.pgen.1009596. eCollection 2021 Jun.

Rare variants regulate expression of nearby individual genes in multiple tissues

Affiliations

Rare variants regulate expression of nearby individual genes in multiple tissues

Jiajin Li et al. PLoS Genet. .

Abstract

The rapid decrease in sequencing cost has enabled genetic studies to discover rare variants associated with complex diseases and traits. Once this association is identified, the next step is to understand the genetic mechanism of rare variants on how the variants influence diseases. Similar to the hypothesis of common variants, rare variants may affect diseases by regulating gene expression, and recently, several studies have identified the effects of rare variants on gene expression using heritability and expression outlier analyses. However, identifying individual genes whose expression is regulated by rare variants has been challenging due to the relatively small sample size of expression quantitative trait loci studies and statistical approaches not optimized to detect the effects of rare variants. In this study, we analyze whole-genome sequencing and RNA-seq data of 681 European individuals collected for the Genotype-Tissue Expression (GTEx) project (v8) to identify individual genes in 49 human tissues whose expression is regulated by rare variants. To improve statistical power, we develop an approach based on a likelihood ratio test that combines effects of multiple rare variants in a nonlinear manner and has higher power than previous approaches. Using GTEx data, we identify many genes regulated by rare variants, and some of them are only regulated by rare variants and not by common variants. We also find that genes regulated by rare variants are enriched for expression outliers and disease-causing genes. These results suggest the regulatory effects of rare variants, which would be important in interpreting associations of rare variants with complex traits.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Power comparison between LRT-q and seven existing methods on simulated data.
A. for different effect sizes and fixed causal ratio (10%), and for fixed effect sizes (B. ≤ 0.99, C. ≤ 2.97, D. ≤ 4.95) and various causal ratios. Significance level α = 0.05.
Fig 2
Fig 2. RV eGenes detected from 49 tissues in the GTEx v8 dataset.
A. The relationship between the number of total RV eGenes detected by LRT-q in each tissue and the sample size of each tissue. The colors of the data points are randomly assigned. Each tissue has its own color. B. The number of total RV eGenes detected by each method. In panel B, only tissues with more than one RV eGene detected by any methods are included.
Fig 3
Fig 3. Tissue-sharing patterns of RV eGenes in the GTEx v8 dataset.
A. Pairwise tissue-sharing matrix of RV eGenes. It shows the fraction of shared RV eGenes in each pair of tissues. Here we use FDR < 10% to increase the number of RV eGenes. Tissues are sorted by clustering. Only tissues with more than one RV eGenes are included. B. The proportion of RV eGenes and CV eGenes shared among different numbers of tissues. Only tissues with more than 200 RV eGenes are selected. It shows the proportion of eGenes that are only detected in one tissue, in 2–4 tissues, and in more than 4 tissues.
Fig 4
Fig 4. Outlier analysis of RV eGenes detected by LRT-q in GTEx v8.
A. Enrichment of proximal rare variants in outliers compared to non-outliers for RV eGenes in each tissue. Tissues without RV eGenes are excluded. B. Enrichment of RV eGenes in disease-associated genes and genes related to common traits (BMI and Height) from public databases. The numbers represent p-values. In both panels, we show the mean values as dots and 95% confidence intervals as error bars.

Similar articles

Cited by

References

    1. Buniello A, MacArthur JAL, Cerezo M, Harris LW, Hayhurst J, Malangone C, et al.. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 2019;47: D1005–D1012. doi: 10.1093/nar/gky1120 - DOI - PMC - PubMed
    1. Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, Collins FS, et al.. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci U S A. 2009;106: 9362–9367. doi: 10.1073/pnas.0903103106 - DOI - PMC - PubMed
    1. Maher B. Personal genomes: The case of the missing heritability. Nature. 2008;456: 18–21. doi: 10.1038/456018a - DOI - PubMed
    1. Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, et al.. Finding the missing heritability of complex diseases. Nature. 2009;461: 747–753. doi: 10.1038/nature08494 - DOI - PMC - PubMed
    1. Zuk O, Schaffner SF, Samocha K, Do R, Hechter E, Kathiresan S, et al.. Searching for missing heritability: designing rare variant association studies. Proc Natl Acad Sci U S A. 2014;111: E455–64. doi: 10.1073/pnas.1322563111 - DOI - PMC - PubMed

Publication types