Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Jul 13:11:205.
doi: 10.1186/1471-2148-11-205.

Split-based computation of majority-rule supertrees

Affiliations

Split-based computation of majority-rule supertrees

Anne Kupczok. BMC Evol Biol. .

Abstract

Background: Supertree methods combine overlapping input trees into a larger supertree. Here, I consider split-based supertree methods that first extract the split information of the input trees and subsequently combine this split information into a phylogeny. Well known split-based supertree methods are matrix representation with parsimony and matrix representation with compatibility. Combining input trees on the same taxon set, as in the consensus setting, is a well-studied task and it is thus desirable to generalize consensus methods to supertree methods.

Results: Here, three variants of majority-rule (MR) supertrees that generalize majority-rule consensus trees are investigated. I provide simple formulas for computing the respective score for bifurcating input- and supertrees. These score computations, together with a heuristic tree search minmizing the scores, were implemented in the python program PluMiST (Plus- and Minus SuperTrees) available from http://www.cibiv.at/software/plumist. The different MR methods were tested by simulation and on real data sets. The search heuristic was successful in combining compatible input trees. When combining incompatible input trees, especially one variant, MR(-) supertrees, performed well.

Conclusions: The presented framework allows for an efficient score computation of three majority-rule supertree variants and input trees. I combined the score computation with a heuristic search over the supertree space. The implementation was tested by simulation and on real data sets and showed promising results. Especially the MR(-) variant seems to be a reasonable score for supertree reconstruction. Generalizing these computations to multifurcating trees is an open problem, which may be tackled using this framework.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Example input- and supertree. Example supertree (a) and input tree (b) for distance computations. Both trees are identified by its split sets: formula image = {AB|CDEFGH, ABC|DEFGH, ABCD|EFGH, FGH|ABCDE, GH|ABCDEF} and formula image = {DF|CGH, GH|CDF} The supertree pruned to the taxa in formula image is shown in (c). Thus d- = 2. (d) shows a tree that results in an optimal distance of d+ = 4 and (e) shows a tree that results in an optimal distance of formula image.
Figure 2
Figure 2
Distribution of inner splits with compatible input trees and 50% of the taxa deleted.
Figure 3
Figure 3
Results for incompatible input trees and n = 32. The distances are normalized by dividing by the maximal distances, that is, 2n - 6 for the RF distance and n - 3 for the other two distances. "Missing splits" is the proportion of splits that are in the model tree but not in the supertree and "Incorrect splits" are in the supertree but not in the model tree. The mean of the data is displayed by "+". Note the different scaling of the y-axes.
Figure 4
Figure 4
Results with sequence simulation. The mean of the data is displayed by "+".
Figure 5
Figure 5
Prokaryote trees. Reference tree for the bacteria is taken from Wu et al. [41]. Chlorobium tepidum is not present in [41] but is displayed at the position of the other Chlorobi. The Archaea are marked in turquoise. Full taxon names can be found in [40].
Figure 6
Figure 6
Modification of the supertree. The taxon set {A, B} is replaced by A'.

Similar articles

Cited by

References

    1. Bininda-Emonds ORP, (Ed) Phylogenetic Supertrees: Combining Information to Reveal the Tree of Life. Dordrecht: Kluwer Academic; 2004.
    1. Fitzpatrick D, Logue M, Stajich J, Butler G. A fungal phylogeny based on 42 complete genomes derived from supertree and combined gene analysis. BMC Evol Biol. 2006;6:99. doi: 10.1186/1471-2148-6-99. - DOI - PMC - PubMed
    1. Pisani D, Cotton JA, McInerney JO. Supertrees disentangle the chimerical origin of eukaryotic genomes. Mol Biol Evol. 2007;24(8):1752–1760. doi: 10.1093/molbev/msm095. - DOI - PubMed
    1. Bininda-Emonds ORP, Cardillo M, Jones KE, MacPhee RDE, Beck RMD, Grenyer R, Price SA, Vos RA, Gittleman JL, Purvis A. The delayed rise of present-day mammals. Nature. 2007;446:507–512. doi: 10.1038/nature05634. - DOI - PubMed
    1. Baker WJ, Savolainen V, Asmussen-Lange CB, Chase MW, Dransfield J, Forest F, Harley MM, Uhl NW, Wilkinson M. Complete Generic-Level Phylogenetic Analyses of Palms (Arecaceae) with Comparisons of Supertree and Supermatrix Approaches. Syst Biol. 2009;58(2):240–256. doi: 10.1093/sysbio/syp021. - DOI - PubMed

Publication types

LinkOut - more resources