Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Dec;31(12):2249-2257.
doi: 10.1101/gr.275488.121. Epub 2021 Sep 20.

Structural variants are a major source of gene expression differences in humans and often affect multiple nearby genes

Affiliations

Structural variants are a major source of gene expression differences in humans and often affect multiple nearby genes

Alexandra J Scott et al. Genome Res. 2021 Dec.

Abstract

Structural variants (SVs) are an important source of human genome diversity, but their functional effects are poorly understood. We mapped 61,668 SVs in 613 individuals from the GTEx project and measured their effects on gene expression. We estimate that common SVs are causal at 2.66% of eQTLs, a 10.5-fold enrichment relative to their abundance in the genome. Duplications and deletions were the most impactful variant types, whereas the contribution of mobile element insertions was small (0.12% of eQTLs, 1.9-fold enriched). Multitissue analysis of eQTLs revealed that gene-altering SVs show more constitutive effects than other variant types, with 62.09% of coding SV-eQTLs active in all tissues with eQTL activity compared with 23.08% of coding SNV- and indel-eQTLs. Noncoding SVs, SNVs and indels show broadly similar patterns. We also identified 539 rare SVs associated with nearby gene expression outliers. Of these, 62.34% are noncoding SVs that affect gene expression but have modest enrichment at regulatory elements, showing that rare noncoding SVs are a major source of gene expression differences but remain difficult to predict from current annotations. Both common and rare SVs often affect the expression of multiple genes: SV-eQTLs affect an average of 1.82 nearby genes, whereas SNV- and indel-eQTLs affect an average of 1.09 genes, and 21.34% of rare expression-altering SVs show effects on two to nine different genes. We also observe significant effects on rare gene expression changes extending 1 Mb from the SV. This provides a mechanism by which individual SVs may have strong or pleiotropic effects on phenotypic variation.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Features of SV-eQTLs. (A) Size distribution of eSVs compared with all common SVs. (B) Distribution of the number of eGenes per eVariant for SVs compared with SNVs and indels. “Coding” eGenes refer to eGenes whose exons are intersected by the associated eVariant, and “noncoding” eGenes are not intersected by the associated eVariant. Counts are shown for every eVariant; thus, eVariants with zero coding or zero noncoding eGenes are included in the distributions. (C,D) The number of eVariants, as shown by dot size and color, with the indicated combination of coding and noncoding eGenes, as defined above. Shown for SVs (C) and SNV/indels (D), with histograms showing the total number of eVariants, with the indicated number of associated coding or noncoding eGenes above the y- and x-axes, respectively. (E) Distribution of tissue specificity of eQTLs across tissues as evaluated by METASOFT, separated into the lowest quartile, middle two quartiles, and top quartile, for eQTLs in which the activity status is known in at least 43 of 48 evaluated tissues. The points indicate the fraction of SV-eQTLs or SNV- and indel-eQTLs that are active (m > 0.9) in the proportion of tissues indicated on the x-axis.
Figure 2.
Figure 2.
Features of outlier-associated SVs. (A) Location of outlier-associated SVs relative to their associated outlier gene and the number of SV/outlier gene associations identified in each category. Percentages indicate the fraction of outlier/SV pairs found at each relative location compared with the total number of SV/outlier gene associations. Note that this definition allows one SV to be associated with multiple outlier genes, and thus, the SV is counted in multiple categories. Gene diagrams provide examples of possible SV location, shown in red, relative to the outlier gene. (B,C) Odds ratio (OR) of being outlier-associated by SV type (B) and SV size (C) for the SV category of interest compared with all other SVs. Note that BNDs were excluded from the size OR calculations owing to their ambiguous nature and thus size. (D) Distribution of SV sizes for singleton SVs <1 Mb identified in European individuals that were used in outlier analyses. Panels depict size distributions for all European-cohort singletons, control-associated singletons, multitissue outlier-associated singletons, and tissue-restricted outlier-associated singletons.
Figure 3.
Figure 3.
Mechanistic insights into outlier-associated SVs. (A) Enrichment of outlier-associated SVs in functional genomic annotations compared with control-associated SVs. Asterisks indicate statistical significance based on a Fisher's exact test with Bonferroni correction for multiple testing. (B,C) The distribution of the number of noncoding primary (B) and secondary (C) outliers found within 1 Mb of the region surrounding tissue-restricted outlier-associated SVs, control-associated SVs, and a shuffled null.

References

    1. Abel HJ, Larson DE, Rieger AA, Chiang C, Das I, Kanchi KL, Layer RM, Neale BM, Salerno WJ, Reeves C, et al. 2020. Mapping and characterization of structural variation in 17,795 human genomes. Nature 583: 83–89. 10.1038/s41586-020-2371-0 - DOI - PMC - PubMed
    1. Ashoor H, Kleftogiannis D, Radovanovic A, Bajic VB. 2015. DENdb: database of integrated human enhancers. Database 2015: bav085. 10.1093/database/bav085 - DOI - PMC - PubMed
    1. Brandler WM, Antaki D, Gujral M, Kleiber ML, Whitney J, Maile MS, Hong O, Chapman TR, Tan S, Tandon P, et al. 2018. Paternally inherited cis-regulatory structural variants are associated with autism. Science 360: 327–331. 10.1126/science.aan2261 - DOI - PMC - PubMed
    1. Chiang C, Layer RM, Faust GG, Lindberg MR, Rose DB, Garrison EP, Marth GT, Quinlan AR, Hall IM. 2015. SpeedSeq: ultra-fast personal genome analysis and interpretation. Nat Methods 12: 966–968. 10.1038/nmeth.3505 - DOI - PMC - PubMed
    1. Chiang C, Scott AJ, Davis JR, Tsang EK, Li X, Kim Y, Hadzic T, Damani FN, Ganel L, GTEx Consortium, et al. 2017. The impact of structural variation on human gene expression. Nat Genet 49: 692–699. 10.1038/ng.3834 - DOI - PMC - PubMed

LinkOut - more resources