Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Mar;199(3):841-56.
doi: 10.1534/genetics.114.173807. Epub 2015 Jan 9.

Can the site-frequency spectrum distinguish exponential population growth from multiple-merger coalescents?

Affiliations

Can the site-frequency spectrum distinguish exponential population growth from multiple-merger coalescents?

Bjarki Eldon et al. Genetics. 2015 Mar.

Abstract

The ability of the site-frequency spectrum (SFS) to reflect the particularities of gene genealogies exhibiting multiple mergers of ancestral lines as opposed to those obtained in the presence of population growth is our focus. An excess of singletons is a well-known characteristic of both population growth and multiple mergers. Other aspects of the SFS, in particular, the weight of the right tail, are, however, affected in specific ways by the two model classes. Using an approximate likelihood method and minimum-distance statistics, our estimates of statistical power indicate that exponential and algebraic growth can indeed be distinguished from multiple-merger coalescents, even for moderate sample sizes, if the number of segregating sites is high enough. A normalized version of the SFS (nSFS) is also used as a summary statistic in an approximate Bayesian computation (ABC) approach. The results give further positive evidence as to the general eligibility of the SFS to distinguish between the different histories.

Keywords: approximate Bayesian computation; approximate maximum likelihood test; coalescent; multiple mergers; population growth; site-frequency spectrum.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Matching ϕ1(n) [see Equation 2] for the different coalescent processes Π ∈ {A, B, D, E} with number of leaves n as shown. Expected values were computed exactly. The processes and their associated parameters are algebraic growth (A, γ), beta(2 − α, α)-coalescent (B, α), Dirac coalescent (D, ψ), and exponential growth (E, β). The values with label 5+ represent the collapsed tail i>5ϕi(n).
Figure 2
Figure 2
(A) Estimate of G˜(E,B;s) from Equation 10 based on the approximate likelihood from Equation 12 as a function of α (no lumping) with number of leaves n as shown and s = 50. (B) Estimate of G˜(B,E;s) from Equation 10 based on the approximate likelihood from Equation 12 as a function of β (no lumping) with number of leaves n as shown and s = 50. The symbols denote the size of the test, as shown in the legend. The interval hypotheses are discretized to ΘsE={β:β{0,1,2,…,10,20,…,1000}} and. ΘsB={α:α{1,1.025,…,2}}. In A, the beta(2 − α, α)-coalescent is the alternative; in B, exponential growth is the alternative.
Figure 3
Figure 3
The 2 distance d2(n)(E,X) for X{B,D} of the normalized expected spectra ϕi(n,E) [see Equation 2] and ϕi(n) as a function of α (X = B) [resp. ψ (X = D)] and β (E) for number of leaves n = 100. Expected values were computed exactly. The grid points are α ∈ {1,1.025,…,2} and ψ{0.01,0.02,,0.1,0.15,0.2,,0.95}; for β as shown.
Figure 4
Figure 4
The 2 distance d2(n)(A,X) for X ∈ {B, D} of the normalized expected spectra ϕi(n,A) [see Equation 2] and ϕi(n,X) as a function of α (X = B) [resp. ψ (X = D)] and γ (A) for number of leaves n = 100. Expected values were computed exactly. The grid points are ψ{0.01,0.02,…,0.1,0.15,0.2,…,0.95} and α ∈ {1, 1.025, …, 2}; for γ as shown.

References

    1. Achaz G., 2009. Frequency spectrum neutrality tests: one for all and all for one. Genetics 183: 249–258. - PMC - PubMed
    1. Árnason E., 2004. Mitochondrial cytochrome b variation in the high-fecundity Atlantic cod: trans-Atlantic clines and shallow gene genealogy. Genetics 166: 1871–1885. - PMC - PubMed
    1. Baragatti M., Pudlo P., 2014. An overview on approximate Bayesian computation. ESAIM Proc. 44: 291–299.
    1. Beaumont M. A., 2010. Approximate Bayesian computation in evolution and ecology. Annu. Rev. Ecol. Evol. Syst. 41: 379–406.
    1. Beckenbach A. T., 1994. Mitochondrial haplotype frequencies in oysters: neutral alternatives to selection models, pp. 188–198 in Non-Neutral Evolution, edited by Golding B. Chapman & Hall, New York.

Publication types

LinkOut - more resources