Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2023 May 26:2023.05.25.542345.
doi: 10.1101/2023.05.25.542345.

The temporal and genomic scale of selection following hybridization

Affiliations

The temporal and genomic scale of selection following hybridization

Jeffrey Groh et al. bioRxiv. .

Update in

Abstract

Genomic evidence supports an important role for selection in shaping patterns of introgression along the genome, but frameworks for understanding the dynamics underlying these patterns within hybrid populations have been lacking. Here, we develop methods based on the Wavelet Transform to understand the spatial genomic scale of local ancestry variation and its association with recombination rates. We present theory and use simulations to show how wavelet-based decompositions of ancestry variance along the genome and the correlation between ancestry and recombination reflect the joint effects of recombination, genetic drift, and genome-wide selection against introgressed alleles. Due to the clock-like effect of recombination in hybrids breaking up parental haplotypes, drift and selection produce predictable patterns of local ancestry variation at varying spatial genomic scales through time. Using wavelet approaches to identify the genomic scale of variance in ancestry and its correlates, we show that these methods can detect temporally localized effects of drift and selection. We apply these methods to previously published datasets from hybrid populations of swordtail fish (Xiphophorus) and baboons (Papio), and to inferred Neanderthal introgression in modern humans. Across systems, we find that upwards of 20% of the variation in local ancestry at the broadest genomic scales can be attributed to systematic selection against introgressed alleles, consistent with strong selection acting on early-generation hybrids. We also see signals of selection at fine genomic scales and much longer time scales. However, we show that our ability to confidently infer selection at fine scales is likely limited by inherent biases in current methods for estimating local ancestry from genomic similarity. Wavelet approaches will become widely applicable as genomic data from systems with introgression become increasingly available, and can help shed light on generalities of the genomic consequences of interspecific hybridization.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
(Left) Ancestry states x, coded arbitrarily as x() = {1, −1} along three separate chromosomes, with examples of Haar wavelets with different scales and positions overlaid in black. From top to bottom, ancestry tracts are shorter, representing different histories of recombination. Shaded intervals highlight the portion of the ancestry signal contributing to the resulting wavelet coefficient. (Top left) Positive covariance between a ψλ=6 wavelet and x within the shaded interval yields a positive wavelet coefficient corresponding to a change in average ancestry state over the two halves of the chromosome. (Middle left) Negative covariance between a ψλ=4 wavelet and x within the shaded interval yields a negative wavelet coefficient. (Bottom left) Positive covariance between a ψλ=2 wavelet and x within the shaded interval gives a positive wavelet coefficient. (Right) The complete set of squared wavelet coefficients determines the power spectrum for the three ancestry signals, correspondence indicated in color.
Figure 2:
Figure 2:
Wavelet variance decomposition through time of ancestry proportion in a 50/50 population mixture undergoing genetic drift with recombination and no selection. We simulated (using SLiM, Haller and Messer, 2019) a population of constant size 2N=20000 (blue) and a population that undergoes a bottleneck to 2N=200 for just the first 10 generations of recombination in hybrids, then expands to 2N=20000 (maroon). (Top) Ancestry proportion along human chromosome 1 from a single simulation run. From left to right, shown after 10, 100, and 1000 generations of recombination in hybrids. (Bottom) Wavelet variance decomposition showing the spatial scale of variance in ancestry proportion. Points and error bars show means and 95% confidence intervals across 20 replicate simulations. Solid grey lines show theoretical expectations. Vertical dotted grey lines indicate the expected distance between recombination breakpoints that have accrued along a single chromosome since the hybridization pulse. Note that since results are shown on the genetic map, recombination rate variation does not influence these patterns, other than through interpolation error.
Figure 3:
Figure 3:
Simulations of selection following a pulse of hybridization starting from a 50/50 population mixture.(a) Genome-wide selection in conjunction with broad-scale variation in recombination rates leads to differential removal of introgressed ancestry at broad scales, thereby biasing the power spectrum towards greater variance at these scales compared to the neutral expectation (viewed after 100 generations). (b) Selection against many alleles on one ancestry background rapidly establishes broad-scale correlation between recombination and minor parent ancestry in early generation hybrids. (c) The overall correlation is dominated by broad-scales, but finer scales contribute increasingly more through time. (d) Selection acting only on the first 10 generations of recombinant hybrids generates significant positive wavelet correlations only at broad scales (brown) (viewed in generation 1000), whereas continuous selection over 1000 generations continues to generate correlations on finer scales (red). When selection acts continuously but reverses direction after 100 generations to favor the alternate ancestry, positive broad-scale correlations persist as negative correlations establish at finer scales. Only significant correlations are shown, error bars represent 95% confidence intervals across 20 replicate simulations.
Figure 4:
Figure 4:
Wavelet analysis of hybrid genomes in a population of swordtail fish from Acuapa River in Hidalgo, Mexico. (a) Power spectrum of the proportion of malinche-like haplotypes on the genetic map for five time points points between 2006 and 2018. Wavelet variances represent a weighted averaged across chromosomes, and error bars are 95% confidence intervals from a weighted jackknife of chromosomes. (b) Correlations at each spatial genomic scale (on the physical map) between wavelet coefficients for the proportion of malinche-like haplotypes and wavelet coefficients for recombination rate. Squared values give an estimate of the proportion of variance in ancestry state explained by systematic selection against malinche-like alleles. Data shown only for 2006 sample, patterns similar across years. (c) Contribution of each scale to the overall correlation. The overall positive correlation is dominated by patterns at broad genomic scales.
Figure 5:
Figure 5:
Wavelet analysis of hybrid genomes between yellow and anubis baboons in Amboseli, Kenya. (a) Power spectrum of the proportion of anubis-like haplotypes within diploids on the genetic map, stratified by quintile of genome-wide average anubis-like ancestry. Error bars are 95% confidence intervals using the standard error of the wavelet variance across individuals within each quintile. (b) Correlations at each spatial genomic scale (on the physical map) between wavelet coefficients for sample proportion of anubis-like haplotypes and wavelet coefficients for recombination rate. Squared values give an estimate of the proportion of variance in ancestry state explained by systematic selection against anubis-like alleles. Weighted-jacknife 95% confidence intervals shown for all but the largest scale which is present only on a single chromosome. (c) Contribution of each scale to the overall correlation. Although there is a strong negative correlation at scale 102400 kb, this scale is only present on chromosome 1 and does not contribute substantially to the overall positive correlation.
Figure 6:
Figure 6:
Wavelet analysis of inferred Neanderthal ancestry in modern humans. (a) Normalized power spectra of the proportion of Neanderthal-like tracts for three different studies (colored points) compared to a neutral expectation for 2,000 generations of admixture in a population size of 10,000 diploids (grey line). Error bars represent 95% confidence intervals from a weighted jackknife across chromosomes. (b) Correlations across scales between different similarity calls from different data sets. (c) Contribution of each scale to the overall observed correlation between the proportion of Neanderthal-like haplotypes and log-transformed recombination rates. Shapes correspond to different studies as indicated in panel (a).

References

    1. Aeschbacher S, Selby JP, Willis JH, Coop G. 2017. Population-genomic inference of the strength and timing of selection against gene flow. Proceedings of the National Academy of Sciences. 114:7061–7066. - PMC - PubMed
    1. Barton N, Bengtsson BO. 1986. The barrier to genetic exchange between hybridising populations. Heredity. 57:357–376. - PubMed
    1. Barton NH. 1983. Multilocus clines. Evolution. 37:454–471. - PubMed
    1. Berner D, Roesti M. 2017. Genomics of adaptive divergence with chromosome-scale heterogeneity in crossover rate. Molecular ecology. 26:6351–6369. - PubMed
    1. Calfee E, Gates D, Lorant A, Perkins MT, Coop G, Ross-Ibarra J. 2021. Selective sorting of ancestral introgression in maize and teosinte along an elevational cline. PLoS genetics. 17:e1009810. - PMC - PubMed

Publication types