Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Apr 1;12(4):443-455.
doi: 10.1093/gbe/evaa056.

The Limits to Estimating Population-Genetic Parameters with Temporal Data

Affiliations

The Limits to Estimating Population-Genetic Parameters with Temporal Data

Michael Lynch et al. Genome Biol Evol. .

Abstract

The ability to obtain genome-wide sequences of very large numbers of individuals from natural populations raises questions about optimal sampling designs and the limits to extracting information on key population-genetic parameters from temporal-survey data. Methods are introduced for evaluating whether observed temporal fluctuations in allele frequencies are consistent with the hypothesis of random genetic drift, and expressions for the expected sampling variances for the relevant statistics are given in terms of sample sizes and numbers. Estimation methods and aspects of statistical reliability are also presented for the mean and temporal variance of selection coefficients. For nucleotide sites that pass the test of neutrality, the current effective population size can be estimated by a method of moments, and expressions for its sampling variance provide insight into the degree to which such methodology can yield meaningful results under alternative sampling schemes. Finally, some caveats are raised regarding the use of the temporal covariance of allele-frequency change to infer selection. Taken together, these results provide a statistical view of the limits to population-genetic inference in even the simplest case of a closed population.

Keywords: effective population size; fluctuating selection; population genomics; selection coefficient.

PubMed Disclaimer

Figures

<sc>Fig</sc>. 1.
Fig. 1.
—Sampling scheme for allele frequencies, with pi denoting the parametric frequency in the population, p^i denoting the estimated frequency after sampling ni individuals, and Δ^ij denoting the interval-specific estimated change in frequency.
<sc>Fig</sc>. 2.
Fig. 2.
—(Left) Mean estimates of the selection coefficient s obtained from the least-squares regression approach. Each point is the average of the results from 107 simulations based on Wright–Fisher allele-frequency dynamics incorporating selection and drift, followed by random sampling of n =100 diploid individuals at each sampling point. Black symbols are for effective population size Ne=104, and red for Ne=106, and results are reported for a range of starting allele frequencies, p0. The horizontal dashed lines denote the expectations for four evaluated selection coefficients (with temporal variance, σs2, equal to zero), and the different symbols denote experiments of different durations (T). (Right) Sampling standard deviations for estimates of s for the case of σs2=0, from simulations as noted above for three values of Ne, four of s, and a sample size of 100, compared with the theoretical expectation, equation (10). The diagonal dashed line denotes points of perfect agreement, and many symbols cannot be seen as they overlie each other on this line.
<sc>Fig</sc>. 3.
Fig. 3.
—Mean and CV of estimates of σs2 for series of samples taken at T +1 consecutive time points, each involving sample sizes of n =100 or 1,000 diploid genomes. Results are given for a range of initial allele frequencies, each based on 106 simulations with an effective population size of 108 individuals, ensuring essentially no genetic drift on the time scale of the analyses, and mean selection coefficient μs=0.0. Closed points refer to situations in which σs2=103, whereas open points are for σs2=104. Data points are excluded for some cases at low allele frequencies where the mean estimates of σs2 were negative.

References

    1. Bollback JP, York TL, Nielsen R. 2008. Estimation of 2Nes from temporal allele frequency data. Genetics 179(1):497–502. - PMC - PubMed
    1. Buffalo V, Coop G. 2019. The linked selection signature of rapid adaptation in temporal genomic data. Genetics 213(3):1007–1045. - PMC - PubMed
    1. Charlesworth B. 2009. Fundamental concepts in genetics: effective population size and patterns of molecular evolution and variation. Nat Rev Genet. 10(3):195–205. - PubMed
    1. Crow JF, Kimura M. 1970. An introduction to population genetics theory. New York: Harper and Row.
    1. Ferrer-Admetlla A, Leuenberger C, Jensen JD, Wegmann D. 2016. An approximate Markov model for the Wright–Fisher diffusion and its application to time series data. Genetics 203(2):831–846. - PMC - PubMed

Publication types