Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Aug;185(4):1411-23.
doi: 10.1534/genetics.110.114819. Epub 2010 Jun 1.

Using environmental correlations to identify loci underlying local adaptation

Affiliations

Using environmental correlations to identify loci underlying local adaptation

Graham Coop et al. Genetics. 2010 Aug.

Abstract

Loci involved in local adaptation can potentially be identified by an unusual correlation between allele frequencies and important ecological variables or by extreme allele frequency differences between geographic regions. However, such comparisons are complicated by differences in sample sizes and the neutral correlation of allele frequencies across populations due to shared history and gene flow. To overcome these difficulties, we have developed a Bayesian method that estimates the empirical pattern of covariance in allele frequencies between populations from a set of markers and then uses this as a null model for a test at individual SNPs. In our model the sample frequencies of an allele across populations are drawn from a set of underlying population frequencies; a transform of these population frequencies is assumed to follow a multivariate normal distribution. We first estimate the covariance matrix of this multivariate normal across loci using a Monte Carlo Markov chain. At each SNP, we then provide a measure of the support, a Bayes factor, for a model where an environmental variable has a linear effect on the transformed allele frequencies compared to a model given by the covariance matrix alone. This test is shown through power simulations to outperform existing correlation tests. We also demonstrate that our method can be used to identify SNPs with unusually large allele frequency differentiation and offers a powerful alternative to tests based on pairwise or global F(ST). Software is available at http://www.eve.ucdavis.edu/gmcoop/.

PubMed Disclaimer

Figures

F<sc>igure</sc> 1.—
Figure 1.—
The distance from the equator for each of 52 human populations, plotted against sample allele frequencies for the SNP AGT M235T in each population. The points are colored according to the geographic region each population belongs to, following region definitions of Rosenberg et al. (2002). The data were generated using HGDP samples by Thompson et al. (2004) and are replotted on the basis of a figure in that article.
F<sc>igure</sc> 2.—
Figure 2.—
(A) A single draw from the posterior of the covariance matrix estimated for the HGDP SNPs of Conrad et al. (2006). (B) The correlation matrix calculated from the covariance matrix shown in A. The matrices are displayed as heat maps with lighter colors corresponding to higher values. The rows and columns of these matrices have been arranged by broad geographic label.
F<sc>igure</sc> 3.—
Figure 3.—
The power of various methods to detect a correlation between latitude and allele frequency.
F<sc>igure</sc> 4.—
Figure 4.—
The power of various methods to detect a correlation between summer precipitation and allele frequency.
F<sc>igure</sc> 5.—
Figure 5.—
The power of various methods to detect a “European effect” on allele frequency.
F<sc>igure</sc> 6.—
Figure 6.—
A plot of the log10 Bayes factor for each SNP along the human genome for (A) a European effect and (B) a western Eurasian effect. Bayes factors <1 are not plotted. The numbers on the x-axis indicate chromosome number, with SNPs on different chromosomes colored alternately in red and black. We list the name of the gene that is nearest to each of the six highest-ranking SNPs in each plot (considering only the peak SNP in each cluster of high Bayes factors).

Similar articles

Cited by

References

    1. Allen, J., 1877. The influence of physical conditions in the genesis of species. Radic. Rev. 1 108–140.
    1. Auton, A., K. Bryc, A. R. Boyko, K. E. Lohmueller, J. Novembre et al., 2009. Global distribution of genomic diversity underscores rich complex history of continental human populations. Genome Res. 19 795–803. - PMC - PubMed
    1. Balding, D., 2003. Likelihood-based inference for genetic correlation coefficients. Theor. Popul. Biol. 63 221–230. - PubMed
    1. Balding, D. J., and R. A. Nichols, 1995. A method for quantifying differentiation between populations at multi-allelic loci and its implications for investigating identity and paternity. Genetica 96 3–12. - PubMed
    1. Balding, D. J., A. D. Carothers, J. L. Marchini, L. R. Cardon, A. Vetta et al., 2002. Discussion on the meeting on ‘Statistical modelling and analysis of genetic data’. J. R. Stat. Soc. B 64 737–775.

Publication types