Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jul 1;37(7):2124-2136.
doi: 10.1093/molbev/msaa042.

Inferring the Demographic History of Inbred Species from Genome-Wide SNP Frequency Data

Affiliations

Inferring the Demographic History of Inbred Species from Genome-Wide SNP Frequency Data

Paul D Blischak et al. Mol Biol Evol. .

Abstract

Demographic inference using the site frequency spectrum (SFS) is a common way to understand historical events affecting genetic variation. However, most methods for estimating demography from the SFS assume random mating within populations, precluding these types of analyses in inbred populations. To address this issue, we developed a model for the expected SFS that includes inbreeding by parameterizing individual genotypes using beta-binomial distributions. We then take the convolution of these genotype probabilities to calculate the expected frequency of biallelic variants in the population. Using simulations, we evaluated the model's ability to coestimate demography and inbreeding using one- and two-population models across a range of inbreeding levels. We also applied our method to two empirical examples, American pumas (Puma concolor) and domesticated cabbage (Brassica oleracea var. capitata), inferring models both with and without inbreeding to compare parameter estimates and model fit. Our simulations showed that we are able to accurately coestimate demographic parameters and inbreeding even for highly inbred populations (F = 0.9). In contrast, failing to include inbreeding generally resulted in inaccurate parameter estimates in simulated data and led to poor model fit in our empirical analyses. These results show that inbreeding can have a strong effect on demographic inference, a pattern that was especially noticeable for parameters involving changes in population size. Given the importance of these estimates for informing practices in conservation, agriculture, and elsewhere, our method provides an important advancement for accurately estimating the demographic histories of these species.

Keywords: conservation; demography; domestication; inbreeding; site frequency spectrum.

PubMed Disclaimer

Figures

<sc>Fig</sc>. 1.
Fig. 1.
Comparison of expected spectra for F =0.5, 0.75, and 0.9 between ai (dark gray) and SLiM (light gray) for the equilibrium and bottleneck+growth models.
<sc>Fig</sc>. 2.
Fig. 2.
(a) Estimates of F from data generated with SLiM for the bottleneck and growth model (lower) plus an illustration of the model (upper). In this model, NA is the ancestral population size, ν0 is the size of the bottleneck (proportion of NA remaining after population reduction), and T is the amount of time for the population to recover back to a size of NA. (b) Estimates of F from data generated with SLiM for the divergence with one-way migration model (lower) plus an illustration of the model (upper). NA in this model is the same as the bottleneck model, ν2 is the size of the diverging population (again a proportion of NA), T is the divergence time between populations, and M21 is the one-way migration rate of individuals from population one into population two.
<sc>Fig</sc>. 3.
Fig. 3.
The observed joint site frequency spectrum for Puma concolor in Texas and Florida, along with the model fit and residuals, for models with inbreeding (middle) and without inbreeding (right). Residuals for each model are plotted below their expected spectra and a cartoon representation of the proposed demographic model is given in the bottom left.
<sc>Fig</sc>. 4.
Fig. 4.
The observed site frequency spectrum for Brassica oleracea var. capitata, along with the model fit (light gray) and residuals (bottom panels), for models with inbreeding (middle) and without inbreeding (right). On the left is a cartoon of the proposed demographic model with parameters labeled.

References

    1. Adams AM, Hudson RR.. 2004. Maximum-likelihood estimation of demographic parameters using the frequency spectrum of unlinked single-nucleotide polymorphisms. Genetics 168(3):1699–1712. - PMC - PubMed
    1. Balding DJ, Nichols RA.. 1995. A method for quantifying differentiation between populations at multi-allelic loci and its implications for investigating identity and paternity. Genetica 96(1–2):3–12. - PubMed
    1. Balding DJ, Nichols RA.. 1997. Significant genetic correlations among Caucasians at forensic DNA loci. Heredity 108:583–589. - PubMed
    1. Beissinger TM, Wang L, Crosby K, Durvasula A, Hufford MB, Ross-Ibarra J.. 2016. Recent demography drives changes in linked selection across the maize genome. Nat Plants. 2(7):16084. - PubMed
    1. Belser C, Istace B, Denis E, Dubarry M, Baurens F-C, Falentin C, Genete M, Berrabah W, Chèvre A-M, Delourme R, et al. 2018. Chromosome-scale assemblies of plant genomes using nanopore long reads and optical maps. Nat Plants. 4(11):879–887. - PubMed

Publication types