Genealogies of rapidly adapting populations
- PMID: 23269838
- PMCID: PMC3545819
- DOI: 10.1073/pnas.1213113110
Genealogies of rapidly adapting populations
Abstract
The genetic diversity of a species is shaped by its recent evolutionary history and can be used to infer demographic events or selective sweeps. Most inference methods are based on the null hypothesis that natural selection is a weak or infrequent evolutionary force. However, many species, particularly pathogens, are under continuous pressure to adapt in response to changing environments. A statistical framework for inference from diversity data of such populations is currently lacking. Towards this goal, we explore the properties of genealogies in a model of continual adaptation in asexual populations. We show that lineages trace back to a small pool of highly fit ancestors, in which almost simultaneous coalescence of more than two lineages frequently occurs. Whereas such multiple mergers are unlikely under the neutral coalescent, they create a unique genetic footprint in adapting populations. The site frequency spectrum of derived neutral alleles, for example, is nonmonotonic and has a peak at high frequencies, whereas Tajima's D becomes more and more negative with increasing sample size. Because multiple merger coalescents emerge in many models of rapid adaptation, we argue that they should be considered as a null model for adapting populations.
Conflict of interest statement
The authors declare no conflict of interest.
Figures
, at two time points. Randomly sampled individuals (green, blue, and violet dots in the later population) tend to come from the center of the distribution, whereas ancestors tend to be among the fittest in the population. The ancestral lineages wiggle because of mutations that randomly perturb their fitness. Simultaneously, lineages move to the high fitness edge, where they are likely to meet and coalesce. The fittest individuals are typically at
above the mean fitness.
, curves for different N and s collapse onto a single master curve. This collapse shows that
is the timescale of coalescence. After a delay,
,
is exponentially distributed, which is apparent from the Inset showing the cumulative distribution
. An exponential
is indicated as a black dashed line. Different line styles correspond to
(solid),
(dashed), and
(dotted), whereas the mutation rate is
. For each parameter combination, random pairs are sampled at 10,000 time points
generations apart.
averaged over 10,000 runs. Line styles and parameters are as in Fig. 3.
, of log-fitness in the population (described in the text). These distributions were measured in forward simulations with
, and
. (B) SFS of derived neutral alleles in a background selection scenario with deleterious mutations of effect s. As the ratio
is varied while keeping
constant, the SFS interpolates between the expectation for the Kingman’s coalescent and the BSC (
).References
-
- Kingman J. On the genealogy of large populations. J Appl Probab. 1982;19A:27–43.
-
- Derrida B, Peliti L. Evolution in a flat fitness landscape. Bull Math Biol. 1991;53(3):355–382.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
