Searching for convergence in phylogenetic Markov chain Monte Carlo

Robert G Beiko¹, Jonathan M Keith, Timothy J Harlow, Mark A Ragan

Affiliations

PMID: 16857650
DOI: 10.1080/10635150600812544

Comparative Study

Searching for convergence in phylogenetic Markov chain Monte Carlo

Robert G Beiko et al. Syst Biol. 2006 Aug.

. 2006 Aug;55(4):553-65.

doi: 10.1080/10635150600812544.

Authors

Robert G Beiko¹, Jonathan M Keith, Timothy J Harlow, Mark A Ragan

Affiliation

¹ ARC Centre in Bioinformatics and Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia. r.beiko@gmail.com

PMID: 16857650
DOI: 10.1080/10635150600812544

Abstract

Markov chain Monte Carlo (MCMC) is a methodology that is gaining widespread use in the phylogenetics community and is central to phylogenetic software packages such as MrBayes. An important issue for users of MCMC methods is how to select appropriate values for adjustable parameters such as the length of the Markov chain or chains, the sampling density, the proposal mechanism, and, if Metropolis-coupled MCMC is being used, the number of heated chains and their temperatures. Although some parameter settings have been examined in detail in the literature, others are frequently chosen with more regard to computational time or personal experience with other data sets. Such choices may lead to inadequate sampling of tree space or an inefficient use of computational resources. We performed a detailed study of convergence and mixing for 70 randomly selected, putatively orthologous protein sets with different sizes and taxonomic compositions. Replicated runs from multiple random starting points permit a more rigorous assessment of convergence, and we developed two novel statistics, delta and epsilon, for this purpose. Although likelihood values invariably stabilized quickly, adequate sampling of the posterior distribution of tree topologies took considerably longer. Our results suggest that multimodality is common for data sets with 30 or more taxa and that this results in slow convergence and mixing. However, we also found that the pragmatic approach of combining data from several short, replicated runs into a "metachain" to estimate bipartition posterior probabilities provided good approximations, and that such estimates were no worse in approximating a reference posterior distribution than those obtained using a single long run of the same length as the metachain. Precision appears to be best when heated Markov chains have low temperatures, whereas chains with high temperatures appear to sample trees with high posterior probabilities only rarely.

PubMed Disclaimer

Cited by

Quantifying MCMC exploration of phylogenetic tree space.
Whidden C, Matsen FA 4th. Whidden C, et al. Syst Biol. 2015 May;64(3):472-91. doi: 10.1093/sysbio/syv006. Epub 2015 Jan 27. Syst Biol. 2015. PMID: 25631175 Free PMC article.
Protein evolution by molecular tinkering: diversification of the nuclear receptor superfamily from a ligand-dependent ancestor.
Bridgham JT, Eick GN, Larroux C, Deshpande K, Harms MJ, Gauthier ME, Ortlund EA, Degnan BM, Thornton JW. Bridgham JT, et al. PLoS Biol. 2010 Oct 5;8(10):e1000497. doi: 10.1371/journal.pbio.1000497. PLoS Biol. 2010. PMID: 20957188 Free PMC article.
Evolution of the parasitic wasp subfamily Rogadinae (Braconidae): phylogeny and evolution of lepidopteran host ranges and mummy characteristics.
Zaldívar-Riverón A, Shaw MR, Sáez AG, Mori M, Belokoblylskij SA, Shaw SR, Quicke DL. Zaldívar-Riverón A, et al. BMC Evol Biol. 2008 Dec 4;8:329. doi: 10.1186/1471-2148-8-329. BMC Evol Biol. 2008. PMID: 19055825 Free PMC article.
Evolution of general transcription factors.
Gunbin KV, Ruvinsky A. Gunbin KV, et al. J Mol Evol. 2013 Feb;76(1-2):28-47. doi: 10.1007/s00239-012-9535-y. Epub 2012 Dec 11. J Mol Evol. 2013. PMID: 23229069
Bipartite Network Analysis of Gene Sharings in the Microbial World.
Corel E, Méheust R, Watson AK, McInerney JO, Lopez P, Bapteste E. Corel E, et al. Mol Biol Evol. 2018 Apr 1;35(4):899-913. doi: 10.1093/molbev/msy001. Mol Biol Evol. 2018. PMID: 29346651 Free PMC article.

See all "Cited by" articles

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Silverchair Information Systems

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Searching for convergence in phylogenetic Markov chain Monte Carlo

Affiliation

Searching for convergence in phylogenetic Markov chain Monte Carlo

Authors

Affiliation

Abstract

Similar articles

Cited by

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources