Properties of consensus methods for inferring species trees from gene trees
- PMID: 20525567
- PMCID: PMC2909780
- DOI: 10.1093/sysbio/syp008
Properties of consensus methods for inferring species trees from gene trees
Abstract
Consensus methods provide a useful strategy for summarizing information from a collection of gene trees. An important application of consensus methods is to combine gene trees to estimate a species tree. To investigate the theoretical properties of consensus trees that would be obtained from large numbers of loci evolving according to a basic evolutionary model, we construct consensus trees from rooted gene trees that occur in proportion to gene-tree probabilities derived from coalescent theory. We consider majority-rule, rooted triple (R(*)), and greedy consensus trees obtained from known, rooted gene trees, both in the asymptotic case as numbers of gene trees approach infinity and for finite numbers of genes. Our results show that for some combinations of species-tree branch lengths, increasing the number of independent loci can make the rooted majority-rule consensus tree more likely to be at least partially unresolved. However, the probability that the R(*) consensus tree has the species-tree topology approaches 1 as the number of gene trees approaches infinity. Although the greedy consensus algorithm can be the quickest to converge on the correct species-tree topology when increasing the number of gene trees, it can also be positively misleading. The majority-rule consensus tree is not a misleading estimator of the species-tree topology, and the R(*) consensus tree is a statistically consistent estimator of the species-tree topology. Our results therefore suggest a method for using multiple loci to infer the species-tree topology, even when it is discordant with the most likely gene tree.
Figures








Similar articles
-
Determining species tree topologies from clade probabilities under the coalescent.J Theor Biol. 2011 Nov 21;289:96-106. doi: 10.1016/j.jtbi.2011.08.006. Epub 2011 Aug 19. J Theor Biol. 2011. PMID: 21867714
-
Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent.J Math Biol. 2011 Jun;62(6):833-62. doi: 10.1007/s00285-010-0355-7. Epub 2010 Jul 23. J Math Biol. 2011. PMID: 20652704
-
Rooted triple consensus and anomalous gene trees.BMC Evol Biol. 2008 Apr 25;8:118. doi: 10.1186/1471-2148-8-118. BMC Evol Biol. 2008. PMID: 18439266 Free PMC article.
-
Coalescent methods for estimating phylogenetic trees.Mol Phylogenet Evol. 2009 Oct;53(1):320-8. doi: 10.1016/j.ympev.2009.05.033. Epub 2009 Jun 6. Mol Phylogenet Evol. 2009. PMID: 19501178 Review.
-
Computational approaches to species phylogeny inference and gene tree reconciliation.Trends Ecol Evol. 2013 Dec;28(12):719-28. doi: 10.1016/j.tree.2013.09.004. Epub 2013 Oct 1. Trends Ecol Evol. 2013. PMID: 24094331 Free PMC article. Review.
Cited by
-
A Guide to Phylogenomic Inference.Methods Mol Biol. 2024;2802:267-345. doi: 10.1007/978-1-0716-3838-5_11. Methods Mol Biol. 2024. PMID: 38819564
-
wQFM-TREE: highly accurate and scalable quartet-based species tree inference from gene trees.Bioinform Adv. 2025 Mar 13;5(1):vbaf053. doi: 10.1093/bioadv/vbaf053. eCollection 2025. Bioinform Adv. 2025. PMID: 40134580 Free PMC article.
-
Maximum Likelihood Estimation of Species Trees from Gene Trees in the Presence of Ancestral Population Structure.Genome Biol Evol. 2020 Feb 1;12(2):3977-3995. doi: 10.1093/gbe/evaa022. Genome Biol Evol. 2020. PMID: 32022857 Free PMC article.
-
Algorithms for MDC-based multi-locus phylogeny inference: beyond rooted binary gene trees on single alleles.J Comput Biol. 2011 Nov;18(11):1543-59. doi: 10.1089/cmb.2011.0174. Epub 2011 Oct 28. J Comput Biol. 2011. PMID: 22035329 Free PMC article.
-
On the unranked topology of maximally probable ranked gene tree topologies.J Math Biol. 2019 Sep;79(4):1205-1225. doi: 10.1007/s00285-019-01392-x. Epub 2019 Jun 21. J Math Biol. 2019. PMID: 31222377
References
-
- Ané C, Larget B, Baum DA, Smith SD, Rokas A. Bayesian estimation of concordance among gene trees. Mol. Biol. Evol. 2007;24:412–426. - PubMed
-
- Baum BR. Combining trees as a way of combining data sets for phylogenetic inference, and the desirability of combining trees. Taxon. 1992;41:3–10.
-
- Baum DA. Concordance trees, concordance factors, and the exploration of reticulate genealogy. Taxon. 2007;56:417–426.
-
- Bremer K. Combinable component consensus. Cladistics. 1990;6:369–372. - PubMed
-
- Bryant D. A classification of consensus methods for phylogenies. In: Janowitz M, Lapointe F-J, McMorris FR, Mirkin B, Roberts FS, editors. BioConsensus. Providence (RI): Center for Discrete Mathematics and Theoretical Computer Science, American Mathematical Society; 2003. pp. 163–183.