Inference Under a Wright-Fisher Model Using an Accurate Beta Approximation
- PMID: 26311474
- PMCID: PMC4649640
- DOI: 10.1534/genetics.115.179606
Inference Under a Wright-Fisher Model Using an Accurate Beta Approximation
Abstract
The large amount and high quality of genomic data available today enable, in principle, accurate inference of evolutionary histories of observed populations. The Wright-Fisher model is one of the most widely used models for this purpose. It describes the stochastic behavior in time of allele frequencies and the influence of evolutionary pressures, such as mutation and selection. Despite its simple mathematical formulation, exact results for the distribution of allele frequency (DAF) as a function of time are not available in closed analytical form. Existing approximations build on the computationally intensive diffusion limit or rely on matching moments of the DAF. One of the moment-based approximations relies on the beta distribution, which can accurately describe the DAF when the allele frequency is not close to the boundaries (0 and 1). Nonetheless, under a Wright-Fisher model, the probability of being on the boundary can be positive, corresponding to the allele being either lost or fixed. Here we introduce the beta with spikes, an extension of the beta approximation that explicitly models the loss and fixation probabilities as two spikes at the boundaries. We show that the addition of spikes greatly improves the quality of the approximation. We additionally illustrate, using both simulated and real data, how the beta with spikes can be used for inference of divergence times between populations with comparable performance to an existing state-of-the-art method.
Keywords: Wright-Fisher; beta; divergence times; linear evolutionary pressures; pure genetic drift.
Copyright © 2015 by the Genetics Society of America.
Figures
References
-
- Abramowitz M., Stegun I. A., 1964. Handbook of Mathematical Functions: With Formulas, Graphs, and Mathematical Tables. Dover Publications, Mineola, NY.
-
- Balding D. J., Nichols R. A., 1995. A method for quantifying differentiation between populations at multiallelic loci and its implications for investigating identity and paternity. Genetica 96: 3–12. - PubMed
-
- Balding D. J., Nichols R. A., 1997. Significant genetic correlations among Caucasians at forensic DNA loci. Heredity 78: 583–589. - PubMed
-
- Bank C., Ewing G. B., Ferrer-Admettla A., Foll M., Jensen J. D., 2014. Thinking too positive? Revisiting current methods of population genetic selection inference. Trends Genet. 30: 540–546. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous
