J Math Biol. 2019 May;78(6):1727-1769.
doi: 10.1007/s00285-018-01325-0. Epub 2019 Jan 28.

A general framework for moment-based analysis of genetic data

Maria Simonsen Speed et al. J Math Biol. 2019 May.

Abstract

In population genetics, the Dirichlet (also called the Balding-Nichols) model has for 20 years been considered the key model to approximate the distribution of allele fractions within populations in a multi-allelic setting. It has often been noted that the Dirichlet assumption is approximate because positive correlations among alleles cannot be accommodated under the Dirichlet model. However, the validity of the Dirichlet distribution has never been systematically investigated in a general framework. This paper attempts to address this problem by providing a general overview of how allele fraction data under the most common multi-allelic mutational structures should be modeled. The Dirichlet and alternative models are investigated by simulating allele fractions from a diffusion approximation of the multi-allelic Wright-Fisher process with mutation, and applying a moment-based analysis method. The study shows that the optimal modeling strategy for the distribution of allele fractions depends on the specific mutation process. The Dirichlet model is an exceptionally good approximation only for the pure drift, Jukes-Cantor and parent-independent mutation processes with small mutation rates. Alternative models are required and proposed for the other mutation processes, such as a Beta-Dirichlet model for the infinite alleles mutation process, and a Hierarchical Beta model for the Kimura, Hasegawa-Kishino-Yano and Tamura-Nei processes. Finally, a novel Hierarchical Beta approximation is developed, a Pyramidal Hierarchical Beta model, for the generalized time-reversible and single-step mutation processes.
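
The abstract's remark that the Dirichlet model cannot accommodate positive correlations among alleles can be checked numerically. The following is a minimal sketch, not the authors' diffusion-based method: it assumes the standard Balding-Nichols parameterization, in which allele fractions with mean p and a differentiation parameter F (a symbol introduced here, not in the abstract) follow a Dirichlet distribution with parameters p_i(1 - F)/F, so that Var(x_i) = F p_i(1 - p_i) and Cov(x_i, x_j) = -F p_i p_j. All function names and the example values of p and F are hypothetical.

```python
import random

def sample_dirichlet(alpha, rng):
    """Draw one Dirichlet(alpha) vector by normalizing independent Gamma draws."""
    g = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(g)
    return [v / total for v in g]

def balding_nichols_alpha(p, F):
    """Dirichlet parameters with mean p and Var(x_i) = F * p_i * (1 - p_i)."""
    scale = (1.0 - F) / F
    return [scale * pi for pi in p]

def empirical_moments(samples):
    """Return the mean vector and covariance matrix of sampled allele fractions."""
    n, k = len(samples), len(samples[0])
    mean = [sum(s[i] for s in samples) / n for i in range(k)]
    cov = [[sum((s[i] - mean[i]) * (s[j] - mean[j]) for s in samples) / (n - 1)
            for j in range(k)] for i in range(k)]
    return mean, cov

def estimate_F(mean, cov):
    """Moment estimate of F: summed variances over summed mean * (1 - mean)."""
    num = sum(cov[i][i] for i in range(len(mean)))
    den = sum(m * (1.0 - m) for m in mean)
    return num / den

rng = random.Random(1)          # fixed seed so the run is reproducible
p = [0.1, 0.2, 0.3, 0.4]        # assumed ancestral allele fractions
F = 0.05                        # assumed differentiation parameter
alpha = balding_nichols_alpha(p, F)
samples = [sample_dirichlet(alpha, rng) for _ in range(20000)]
mean, cov = empirical_moments(samples)
F_hat = estimate_F(mean, cov)
# Under a Dirichlet model every pair of distinct alleles is negatively correlated.
off_diagonal = [cov[i][j] for i in range(4) for j in range(4) if i != j]
print("estimated F:", F_hat)
print("largest off-diagonal covariance:", max(off_diagonal))
```

In this sketch the second moments recover F, and every off-diagonal covariance is negative. This is why a mutation process that induces positive correlations between some alleles would need a different family, such as the hierarchical models the abstract proposes.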

Keywords: Allele fraction; Beta–Dirichlet; Diffusion; Dirichlet; Distribution of allele fractions; Evolutionary history; Hierarchical Beta; Moments; Multi-allelic Wright–Fisher; Mutation processes; Pyramid.
