Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Feb;69(2):431-46.
doi: 10.1111/evo.12567. Epub 2015 Jan 16.

Footprints of ancient-balanced polymorphisms in genetic variation data from closely related species

Affiliations

Footprints of ancient-balanced polymorphisms in genetic variation data from closely related species

Ziyue Gao et al. Evolution. 2015 Feb.

Abstract

When long-lasting, balancing selection can lead to "trans-species" polymorphisms that are shared by two or more species identical by descent. In such cases, the gene genealogy at the selected site clusters by allele instead of by species, and nearby neutral sites also have unusual genealogies because of linkage. While this scenario is expected to leave discernible footprints in genetic variation data, the specific patterns remain poorly characterized. Motivated by recent findings in primates, we focus on the case of a biallelic polymorphism under ancient balancing selection and derive approximations for summaries of the polymorphism data from two species. Specifically, we characterize the length of the segment that carries most of the footprints, the expected number of shared neutral single nucleotide polymorphisms (SNPs), and the patterns of allelic associations among them. We confirm the accuracy of our approximations by coalescent simulations. We further show that for humans and chimpanzees-more generally, for pairs of species with low genetic diversity levels-these patterns are highly unlikely to be generated by neutral recurrent mutations. We discuss the implications for the design and interpretation of genome scans for ancient balanced polymorphisms in primates and other taxa.

Keywords: Ancient genetic variation; balancing selection; genome scan for selection; trans-species polymorphism.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Sites linked to a balanced trans-species polymorphism have unusual genealogies. (A) The trees are ordered by the distance from the selected site. Blue and red lines represent lineages from the two allelic classes, respectively. (B) The state of each segment in ten simulation replicates. Each bar represents a 10 kb region centered on a trans-species balanced polymorphism (red dotted line). The color of the bar indicates the genealogical state of each segment (same as in A). Parameters were chosen to be plausible for humans and chimpanzees: N = 10,000, Na = 50,000, T = 160,000, p = 0.5, and r = 1.25 cM/Mb (see Supporting Information). (C) Summary of the coalescent time for a single realization of a segment carrying a balanced trans-species polymorphism. The sample consists of 20 lineages in total, five from each allelic class in each species. Each plot shows the time to the MRCA for a specific subset of the 20 lineages indicated on the top and left. The selected site is located in the center of the segment.
Figure 2
Figure 2
The genealogy of the contiguous ancestral segment. The proportion of each stage has been distorted for illustration purpose (e.g., stage I should be much shorter compared to stage II). See main text and Table1 for the meaning of symbols.
Figure 3
Figure 3
Expected coalescent time between two lineages carrying different alleles. Parameters are chosen to be plausible for the ancestral population for humans and chimpanzees: Na = 50,000, p = 0.5, and TS = 600,000 generations (see Supporting Information). The mean and standard deviation of the simulation results were obtained from 10,000 replicates.
Figure 4
Figure 4
Expected number of shared SNPs in the contiguous ancestral segment. The upper panel is a schematic diagram of the demographic history of humans and chimpanzees, which merged into an ancestral species five million years ago (Mya). The star indicates the age of the balanced polymorphism. The lower panel shows the expected number of shared SNPs (including the selected one) in the two-sided contiguous ancestral segment. Note that the age of the balanced polymorphism here is measured from present, which is the sum of TS and the lengths of stages I and II.
Figure 5
Figure 5
LD between shared SNPs generated by balancing selection or by recurrent mutations. Fifty chromosomes were sampled from each species in both scenarios. We assumed an allele frequency of 0.5 for the scenario of balancing selection and sampled an equal number of chromosomes from each allelic class (see Supporting Information for details of the simulations). Panels (A) and (B) show the distribution of r2 in the two species, respectively.

References

    1. Asthana S, Schmidt S. Sunyaev S. A limited role for balancing selection. Trends Genet. 2005;21:30–32. - PubMed
    1. Bachtrog D. Andolfatto P. Selection, recombination and demographic history in Drosophila miranda. Genetics. 2006;174:2045–2059. - PMC - PubMed
    1. Bergland AOB, Behrman EL, O'Brien KR, Schmidt PS. Petrov DA. 2013. :1303.5044 [q-bio.PE] and Genomic evidence of rapid and stable adaptive oscillations over seasonal time scales in Drosophila., arXiv.
    1. Bubb KL, Bovee D, Buckley D, Haugen E, Kibukawa M, Paddock M, Palmieri A, Subramanian S, Zhou Y, Kaul R, et al. Scan of human genome reveals no new Loci under ancient balancing selection. Genetics. 2006;173:2165–2177. - PMC - PubMed
    1. Charlesworth D. Balancing selection and its effects on sequences in nearby genome regions. PLoS Genet. 2006;2:e64. - PMC - PubMed