A general framework for moment-based analysis of genetic data
- PMID: 30734077
- DOI: 10.1007/s00285-018-01325-0
Abstract
In population genetics, the Dirichlet (also called the Balding-Nichols) model has for 20 years been considered the key model for approximating the distribution of allele fractions within populations in a multi-allelic setting. It has often been noted that the Dirichlet assumption is only approximate, because the Dirichlet model cannot accommodate positive correlations among alleles. However, the validity of the Dirichlet distribution has never been systematically investigated in a general framework. This paper attempts to address this problem by providing a general overview of how allele fraction data should be modeled under the most common multi-allelic mutational structures. The Dirichlet and alternative models are investigated by simulating allele fractions from a diffusion approximation of the multi-allelic Wright-Fisher process with mutation and applying a moment-based analysis method. The study shows that the optimal modeling strategy for the distribution of allele fractions depends on the specific mutation process. The Dirichlet model is an exceptionally good approximation only for the pure drift, Jukes-Cantor and parent-independent mutation processes with small mutation rates. Alternative models are required and proposed for the other mutation processes, such as a Beta-Dirichlet model for the infinite alleles mutation process, and a Hierarchical Beta model for the Kimura, Hasegawa-Kishino-Yano and Tamura-Nei processes. Finally, a novel Hierarchical Beta approximation, the Pyramidal Hierarchical Beta model, is developed for the generalized time-reversible and single-step mutation processes.
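The remark that the Dirichlet model cannot accommodate positive correlations among alleles can be checked with a small moment calculation. The sketch below is illustrative only and is not the paper's method: it does not simulate the Wright-Fisher diffusion. It assumes the standard Balding-Nichols parameterization alpha_i = p_i (1 - F) / F, under which the covariance between two allele fractions is -F p_i p_j and is therefore always negative. The function names, the mean fractions `p`, and the value of `F` are hypothetical choices made for this example.

```python
import random

def balding_nichols_alphas(p, F):
    """Dirichlet parameters alpha_i = p_i * (1 - F) / F (assumed parameterization)."""
    scale = (1.0 - F) / F
    return [pi * scale for pi in p]

def dirichlet_moments(alphas):
    """Exact means and covariance matrix of a Dirichlet(alphas) vector."""
    a0 = sum(alphas)
    means = [a / a0 for a in alphas]
    k = len(alphas)
    cov = [[0.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(k):
            if i == j:
                cov[i][j] = means[i] * (1.0 - means[i]) / (a0 + 1.0)
            else:
                # Off-diagonal entries are negative for any valid alphas.
                cov[i][j] = -means[i] * means[j] / (a0 + 1.0)
    return means, cov

def sample_dirichlet(alphas, rng):
    """One Dirichlet draw obtained by normalizing independent Gamma variates."""
    draws = [rng.gammavariate(a, 1.0) for a in alphas]
    total = sum(draws)
    return [d / total for d in draws]

def empirical_cov(samples, i, j):
    """Unbiased sample covariance between components i and j."""
    n = len(samples)
    mi = sum(s[i] for s in samples) / n
    mj = sum(s[j] for s in samples) / n
    return sum((s[i] - mi) * (s[j] - mj) for s in samples) / (n - 1)

# Hypothetical example values: four alleles and F = 0.05.
p = [0.4, 0.3, 0.2, 0.1]
F = 0.05
alphas = balding_nichols_alphas(p, F)
means, cov = dirichlet_moments(alphas)

rng = random.Random(0)  # fixed seed so the run is reproducible
samples = [sample_dirichlet(alphas, rng) for _ in range(20000)]
print("exact cov(0,1):    ", cov[0][1])
print("empirical cov(0,1):", empirical_cov(samples, 0, 1))
```

Comparing exact and empirical moments in this way is the basic idea behind a moment-based check: if simulated allele fractions show a positive covariance between two alleles, no choice of Dirichlet parameters can reproduce it.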
Keywords: Allele fraction; Beta–Dirichlet; Diffusion; Dirichlet; Distribution of allele fractions; Evolutionary history; Hierarchical Beta; Moments; Multi-allelic Wright–Fisher; Mutation processes; Pyramid.
Similar articles
- The multivariate Wright-Fisher process with mutation: Moment-based analysis and inference using a hierarchical Beta model. Theor Popul Biol. 2016 Apr;108:36-50. doi: 10.1016/j.tpb.2015.11.001. Epub 2015 Nov 29. PMID: 26612605
- An approximate stationary solution for multi-allele neutral diffusion with low mutation rates. Theor Popul Biol. 2016 Dec;112:22-32. doi: 10.1016/j.tpb.2016.07.005. Epub 2016 Aug 2. PMID: 27495379
- Inference from the stationary distribution of allele frequencies in a family of Wright-Fisher models with two levels of genetic variability. Theor Popul Biol. 2018 Jul;122:78-87. doi: 10.1016/j.tpb.2018.03.004. Epub 2018 Mar 21. PMID: 29574050 Free PMC article.
- Statistical Inference in the Wright-Fisher Model Using Allele Frequency Data. Syst Biol. 2017 Jan 1;66(1):e30-e46. doi: 10.1093/sysbio/syw056. PMID: 28173553 Free PMC article. Review.
- A review on Monte Carlo simulation methods as they apply to mutation and selection as formulated in Wright-Fisher models of evolutionary genetics. Math Biosci. 2008 Feb;211(2):205-25. doi: 10.1016/j.mbs.2007.05.015. Epub 2007 Nov 28. PMID: 18190932 Review.
Cited by
- Extinction scenarios in evolutionary processes: a multinomial Wright-Fisher approach. J Math Biol. 2023 Sep 26;87(4):63. doi: 10.1007/s00285-023-01993-7. PMID: 37751048 Free PMC article.
- Stability of motor cortex network states during learning-associated neural reorganizations. J Neurophysiol. 2020 Nov 1;124(5):1327-1342. doi: 10.1152/jn.00061.2020. Epub 2020 Sep 16. PMID: 32937084 Free PMC article.