Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan 7;39(1):msab294.
doi: 10.1093/molbev/msab294.

Detecting Selection in Multiple Populations by Modeling Ancestral Admixture Components

Affiliations

Detecting Selection in Multiple Populations by Modeling Ancestral Admixture Components

Jade Yu Cheng et al. Mol Biol Evol. .

Abstract

One of the most powerful and commonly used approaches for detecting local adaptation in the genome is the identification of extreme allele frequency differences between populations. In this article, we present a new maximum likelihood method for finding regions under positive selection. It is based on a Gaussian approximation to allele frequency changes and it incorporates admixture between populations. The method can analyze multiple populations simultaneously and retains power to detect selection signatures specific to ancestry components that are not representative of any extant populations. Using simulated data, we compare our method to related approaches, and show that it is orders of magnitude faster than the state-of-the-art, while retaining similar or higher power for most simulation scenarios. We also apply it to human genomic data and identify loci with extreme genetic differentiation between major geographic groups. Many of the genes identified are previously known selected loci relating to hair pigmentation and morphology, skin, and eye pigmentation. We also identify new candidate regions, including various selected loci in the Native American component of admixed Mexican-Americans. These involve diverse biological functions, such as immunity, fat distribution, food intake, vision, and hair development.

Keywords: admixture; human evolution; population structure; positive selection; selective sweeps.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Simulation tests of Ohana performance and efficiency in detecting and mapping selected sites. (a–c) Power to detect selection relative to two comparable methods, BayPass and pcadapt. Error bars are 95% CIs; (d–f) efficacy of Ohana to fine-map the causal site; (g) computational efficiency compared with that of BayPass. Error bars are 5th to 95th percentiles.
Fig. 2.
Fig. 2.
Illustration of simulation models. (a) Model 1, a basic model of four-population split with no admixture. (b) Model 2, a four-population split with subsequent admixture. (c) Model 3, a four-population model mimicking human demographic models. Population size changes in model 3 are omitted from the visualization for simplicity. Selection is simulated to operate on the branch that has a larger width.
Fig. 3.
Fig. 3.
ROC curves for Ohana versus state-of-the-art methods, assessed using simulations with various values of the initial allele frequency at the beginning of selection (f) and different selection coefficients (s). Here the demographic model used was our human model (with selection in the Native American lineage), versus other demographic models considered (i.e., basic tree without and with admixture, supplementary figs. S1 and S2, Supplementary Material online, respectively).
Fig. 4.
Fig. 4.
Inferred unrooted tree of latent ancestry components for the analysis including the CHB, YRI, MXL, and GBR genomic panels. We label each component by the population in which it is maximized, but emphasize that the components and the populations are not equivalent entities.
Fig. 5.
Fig. 5.
Top 5 annotated peaks in each of the ancestry-specific selection studies. MXL-specific = scan for selection in Native American ancestry of MXL. GBR-specific = scan for selection in European ancestry of GBR. CHB-specific = scan for selection in CHB ancestry of CHB. YRI-specific = scan for selection in Yoruba African ancestry or ancestral non-African ancestry. We analyzed 5,601,710 variable sites across the autosomal genomes. We inferred genome-wide allele frequencies and covariances as described in Materials and Methods section. We applied a likelihood model for each SNP by rescaling all variances and covariances by a scalar multiplier α. Descriptions of each candidate region are in table 1. LLR, log-likelihood ratio score.
Fig. 6.
Fig. 6.
Selection hypotheses and their encodings as covariance matrices. In this example, the ancestry component E is assumed to be the potential target of selection. The entry E: E in the covariance matrix is therefore allowed to deviate from the globally estimated value.

Similar articles

Cited by

References

    1. Adzhubei I, Jordan DM, Sunyaev SR.. 2013. Predicting functional effect of human missense mutations using polyphen-2. Curr Protocols Hum Genet. 76(1):7–20. - PMC - PubMed
    1. Akey JM, Zhang G, Zhang K, Jin L, Shriver MD.. 2002. Interrogating a high-density SNP map for signatures of natural selection. Genome Res. 12(12):1805–1814. - PMC - PubMed
    1. Barban N, Jansen R, De Vlaming R, Vaez A, Mandemakers JJ, Tropf FC, Shen X, Wilson JF, Chasman DI, Nolte IM, et al. ; LifeLines Cohort Study. 2016. Genome-wide analysis identifies 12 loci influencing human reproductive behavior. Nat Genet. 48(12):1462–1472. - PMC - PubMed
    1. Barreiro LB, Ben-Ali M, Quach H, Laval G, Patin E, Pickrell JK, Bouchier C, Tichit M, Neyrolles O, Gicquel B, et al. 2009. Evolutionary dynamics of human toll-like receptors and their different contributions to host defense. PLoS Genet. 5(7):e1000562. - PMC - PubMed
    1. Beaumont MA, Balding DJ.. 2004. Identifying adaptive genetic divergence among populations from genome scans. Mol Ecol. 13(4):969–980. - PubMed

Publication types