Adaptive MCMC in Bayesian phylogenetics: an application to analyzing partitioned data in BEAST
- PMID: 28200071
- PMCID: PMC6044345
- DOI: 10.1093/bioinformatics/btx088
Abstract
Motivation: Advances in sequencing technology continue to deliver increasingly large molecular sequence datasets that are often heavily partitioned in order to accurately model the underlying evolutionary processes. In phylogenetic analyses, partitioning strategies involve estimating conditionally independent models of molecular evolution for different genes and for different positions within those genes. This requires estimating a large number of evolutionary parameters and increases the computational burden of such analyses. The past two decades have also seen the rise of multi-core processors in both the central processing unit (CPU) and graphics processing unit (GPU) markets, enabling massively parallel computations that many software packages for multipartite analyses do not yet fully exploit.
Results: We propose a Markov chain Monte Carlo (MCMC) approach that uses an adaptive multivariate transition kernel to estimate, in parallel, a large number of parameters split across partitioned data by exploiting multi-core processing. Across several real-world examples, we demonstrate that our approach estimates these multipartite parameters more efficiently than standard approaches, which typically use a mixture of univariate transition kernels. In one case, when estimating the relative rate parameter of the non-coding partition in a heterochronous dataset, MCMC integration efficiency improves more than 14-fold.
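To illustrate the general idea of an adaptive multivariate transition kernel, the following is a minimal sketch of a textbook adaptive Metropolis sampler, in which a multivariate normal proposal takes its covariance from the chain's own history. It is not the BEAST implementation. The function names, the toy correlated Gaussian target, the initial proposal variance of 0.1, the `adapt_start` threshold and the 2.38²/d scaling are all assumptions chosen for illustration.

```python
import math
import random


def cholesky(a):
    """Lower-triangular Cholesky factor of a small positive-definite matrix."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                low[i][j] = math.sqrt(a[i][i] - s)
            else:
                low[i][j] = (a[i][j] - s) / low[j][j]
    return low


def adaptive_metropolis(log_target, x0, n_iter, adapt_start=500, eps=1e-6, seed=1):
    """Random-walk Metropolis with a multivariate normal proposal whose
    covariance is learned from the chain's history (illustrative only)."""
    rng = random.Random(seed)
    d = len(x0)
    scale = 2.38 ** 2 / d                # common scaling heuristic (assumption)
    x, logp = list(x0), log_target(x0)
    n = 0                                # states folded into the running moments
    mean = [0.0] * d
    m2 = [[0.0] * d for _ in range(d)]   # running sum of deviation outer products
    samples, accepted = [], 0
    for _ in range(n_iter):
        if n < adapt_start:
            # Fixed isotropic proposal until enough history has accumulated.
            cov = [[0.1 if i == j else 0.0 for j in range(d)] for i in range(d)]
        else:
            # Scaled empirical covariance plus a small ridge for positive definiteness.
            cov = [[scale * (m2[i][j] / (n - 1) + (eps if i == j else 0.0))
                    for j in range(d)] for i in range(d)]
        low = cholesky(cov)
        z = [rng.gauss(0.0, 1.0) for _ in range(d)]
        prop = [x[i] + sum(low[i][k] * z[k] for k in range(i + 1)) for i in range(d)]
        logp_prop = log_target(prop)
        # The proposal is symmetric, so the acceptance ratio is the target ratio.
        if rng.random() < math.exp(min(0.0, logp_prop - logp)):
            x, logp = prop, logp_prop
            accepted += 1
        # Welford-style update of the running mean and covariance sums.
        n += 1
        delta = [x[i] - mean[i] for i in range(d)]
        mean = [mean[i] + delta[i] / n for i in range(d)]
        for i in range(d):
            for j in range(d):
                m2[i][j] += delta[i] * (x[j] - mean[j])
        samples.append(list(x))
    return samples, accepted / n_iter


def log_correlated_gaussian(x, rho=0.9):
    """Unnormalized log density of a 2-D Gaussian with correlation rho (toy target)."""
    return -0.5 * (x[0] ** 2 - 2.0 * rho * x[0] * x[1] + x[1] ** 2) / (1.0 - rho ** 2)


if __name__ == "__main__":
    draws, rate = adaptive_metropolis(log_correlated_gaussian, [1.0, -1.0], 20000)
    print(f"acceptance rate: {rate:.2f}")
```

Because the proposal covariance tracks the correlation between parameters, all coordinates can be updated jointly in one step. A mixture of univariate kernels instead moves one coordinate at a time and tends to mix slowly when parameters are strongly correlated.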
Availability and implementation: Our implementation is part of the BEAST code base, a widely used open source software package to perform Bayesian phylogenetic inference.
Contact: guy.baele@kuleuven.be.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com