Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Mar;26(3):649-58.
doi: 10.1093/molbev/msn289. Epub 2008 Dec 17.

Gene expression levels are a target of recent natural selection in the human genome

Affiliations

Gene expression levels are a target of recent natural selection in the human genome

Sridhar Kudaravalli et al. Mol Biol Evol. 2009 Mar.

Abstract

Changes in gene expression may represent an important mode of human adaptation. However, to date, there are relatively few known examples in which selection has been shown to act directly on levels or patterns of gene expression. In order to test whether single nucleotide polymorphisms (SNPs) that affect gene expression in cis are frequently targets of positive natural selection in humans, we analyzed genome-wide SNP and expression data from cell lines associated with the International HapMap Project. Using a haplotype-based test for selection that was designed to detect incomplete selective sweeps, we found that SNPs showing signals of selection are more likely than random SNPs to be associated with gene expression levels in cis. This signal is significant in the Yoruba (which is the population that shows the strongest signals of selection overall) and shows a trend in the same direction in the other HapMap populations. Our results argue that selection on gene expression levels is an important type of human adaptation. Finally, our work provides an analytical framework for tackling a more general problem that will become increasingly important: namely, testing whether selection signals overlap significantly with SNPs that are associated with phenotypes of interest.

PubMed Disclaimer

Figures

F<sc>IG</sc>. 1.—
FIG. 1.—
Two examples in which an eQTL is centered on a strong signal of selection. The upper half of each plot (green and red points) shows the strength of association between SNPs and gene expression levels (plotted as −log10(P values) of the indicated gene). The lower half of each plot (blue and red points) indicates −|iHS| scores for the same set of SNPs. Red points indicate SNPs that are both strongly associated with expression (P < 10−4) and have |iHS| > 2. The positions of the genes of interest are indicated by the red bars at the center of each plot. (A) Data from SCL25A16 (YRI). (B) Data from SPATA20 (ASN). According to the sliding-window analysis, the clusters of high |iHS| signals are in the 2.5% and 1% tails of the empirical Yoruba (A) and Asian (B) distributions, respectively. The favored haplotypes are at 60% and 89% frequency, respectively.
F<sc>IG</sc>. 2.—
FIG. 2.—
The abundance of eQTL signals in SNPs with and without evidence for selection (YRI data). In each plot, the red data correspond to SNPs with strong evidence for selection (|iHS| > 2 and surrounded by an unusual cluster of other high |iHS| SNPs); blue data are for SNPs with |iHS| > 2; and black data are for all SNPs. (A) Quantile–quantile plots of the distributions of −log10(P values) obtained from testing the expression levels at each gene for association with nearby SNPs. The dashed line indicates the expected distribution of P values if there were no true associations between SNPs and gene expression levels. Notice that SNPs with high |iHS| (red and blue data) show a higher rate of significant P values compared with SNPs without a signal of selection. (B) SNPs with high |iHS| show an enrichment for eQTLs at various distances from the transcription start site. (C) SNPs with high iHS tend to be enriched for eQTLs after controlling for allele frequency. The enrichment may be highest in the frequency ranges where iHS has the greatest power (roughly 50–80%; Voight et al. 2006). (D) SNPs with high iHS show generally higher rates of eQTLs after controlling for LD levels, as measured by the number of SNPs in high LD with the SNP in question (r2 > 0.8). For analogous plots of the other two populations, see the Supplementary Material online.

References

    1. Altshuler D, Hirschhorn J, Klannemark M, et al. (16 co-authors) The common PPARgamma Pro12Ala polymorphism is associated with decreased risk of type 2 diabetes. Nat Genet. 2000;26:76–80. - PubMed
    1. Bersaglieri T, Sabeti P, Patterson N, Vanderploeg T, Schaffner S, Drake J, Rhodes M, Reich D, Hirschhorn J. Genetic signatures of strong recent positive selection at the lactase gene. Am J Hum Genet. 2004;74:1111–1120. - PMC - PubMed
    1. Dixon A, Liang L, Moffatt M, et al. (13 co-authors) A genome-wide association study of global gene expression. Nat Genet. 2007;39:1202–1207. - PubMed
    1. Enattah N, Forsblom C, Rasinper H, Tuomi T, Groop P, Jrvel I. The genetic variant of lactase persistence C (-13910) T as a risk factor for type I and II diabetes in the Finnish population. Eur J Clin Nutr. 2004;58:1319–1322. - PubMed
    1. Fellay J, Shianna K, Ge D, et al. (27 co-authors) A whole-genome association study of major determinants for host control of HIV-1. Science. 2007;317:944–947. - PMC - PubMed

Publication types