Robinson-Foulds supertrees
- PMID: 20181274
- PMCID: PMC2846952
- DOI: 10.1186/1748-7188-5-18
Robinson-Foulds supertrees
Abstract
Background: Supertree methods synthesize collections of small phylogenetic trees with incomplete taxon overlap into comprehensive trees, or supertrees, that include all taxa found in the input trees. Supertree methods based on the well established Robinson-Foulds (RF) distance have the potential to build supertrees that retain much information from the input trees. Specifically, the RF supertree problem seeks a binary supertree that minimizes the sum of the RF distances from the supertree to the input trees. Thus, an RF supertree is a supertree that is consistent with the largest number of clusters (or clades) from the input trees.
Results: We introduce efficient, local search based, hill-climbing heuristics for the intrinsically hard RF supertree problem on rooted trees. These heuristics use novel non-trivial algorithms for the SPR and TBR local search problems which improve on the time complexity of the best known (naïve) solutions by a factor of Theta(n) and Theta(n2) respectively (where n is the number of taxa, or leaves, in the supertree). We use an implementation of our new algorithms to examine the performance of the RF supertree method and compare it to matrix representation with parsimony (MRP) and the triplet supertree method using four supertree data sets. Not only did our RF heuristic provide fast estimates of RF supertrees in all data sets, but the RF supertrees also retained more of the information from the input trees (based on the RF distance) than the other supertree methods.
Conclusions: Our heuristics for the RF supertree problem, based on our new local search algorithms, make it possible for the first time to estimate large supertrees by directly optimizing the RF distance from rooted input trees to the supertrees. This provides a new and fast method to build accurate supertrees. RF supertrees may also be useful for estimating majority-rule(-) supertrees, which are a generalization of majority-rule consensus trees.
Figures
Similar articles
-
Fast local search for unrooted Robinson-Foulds supertrees.IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):1004-13. doi: 10.1109/TCBB.2012.47. IEEE/ACM Trans Comput Biol Bioinform. 2012. PMID: 22431553
-
Problems with supertrees based on the subtree prune-and-regraft distance, with comments on majority rule supertrees.Cladistics. 2016 Feb;32(1):82-89. doi: 10.1111/cla.12111. Epub 2015 Jan 30. Cladistics. 2016. PMID: 34732022
-
Performance of flip supertree construction with a heuristic algorithm.Syst Biol. 2004 Apr;53(2):299-308. doi: 10.1080/10635150490423719. Syst Biol. 2004. PMID: 15205054
-
Comparison of phylogenetic trees defined on different but mutually overlapping sets of taxa: A review.Ecol Evol. 2024 Aug 8;14(8):e70054. doi: 10.1002/ece3.70054. eCollection 2024 Aug. Ecol Evol. 2024. PMID: 39119174 Free PMC article. Review.
-
Genus-level supertree of Cyprinidae (Actinopterygii: Cypriniformes), partitioned qualitative clade support and test of macro-evolutionary scenarios.Biol Rev Camb Philos Soc. 2009 Nov;84(4):653-89. doi: 10.1111/j.1469-185X.2009.00091.x. Biol Rev Camb Philos Soc. 2009. PMID: 19857213 Review.
Cited by
-
Cubic time algorithms of amalgamating gene trees and building evolutionary scenarios.Biol Direct. 2012 Dec 22;7:48. doi: 10.1186/1745-6150-7-48. Biol Direct. 2012. PMID: 23259766 Free PMC article.
-
Bad Clade Deletion Supertrees: A Fast and Accurate Supertree Algorithm.Mol Biol Evol. 2017 Sep 1;34(9):2408-2421. doi: 10.1093/molbev/msx191. Mol Biol Evol. 2017. PMID: 28873954 Free PMC article.
-
A Bayesian Supertree Model for Genome-Wide Species Tree Reconstruction.Syst Biol. 2016 May;65(3):397-416. doi: 10.1093/sysbio/syu082. Epub 2014 Oct 3. Syst Biol. 2016. PMID: 25281847 Free PMC article.
-
Split-based computation of majority-rule supertrees.BMC Evol Biol. 2011 Jul 13;11:205. doi: 10.1186/1471-2148-11-205. BMC Evol Biol. 2011. PMID: 21752249 Free PMC article.
-
Linear-time algorithms for phylogenetic tree completion under Robinson-Foulds distance.Algorithms Mol Biol. 2020 Apr 13;15:6. doi: 10.1186/s13015-020-00166-1. eCollection 2020. Algorithms Mol Biol. 2020. PMID: 32313549 Free PMC article.
References
-
- Davies TJ, Barraclough TG, Chase MW, Soltis PS, Soltis DE, Savolainen V. Darwin's abominable mystery: Insights from a supertree of the angiosperms. Proceedings of the National Academy of Sciences of the United States of America. 2004;101(7):1904–1909. doi: 10.1073/pnas.0308127100. - DOI - PMC - PubMed
LinkOut - more resources
Full Text Sources