Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Mar;60(2):117-25.
doi: 10.1093/sysbio/syq072. Epub 2010 Dec 24.

Genome-scale phylogenetics: inferring the plant tree of life from 18,896 gene trees

Affiliations

Genome-scale phylogenetics: inferring the plant tree of life from 18,896 gene trees

J Gordon Burleigh et al. Syst Biol. 2011 Mar.

Abstract

Phylogenetic analyses using genome-scale data sets must confront incongruence among gene trees, which in plants is exacerbated by frequent gene duplications and losses. Gene tree parsimony (GTP) is a phylogenetic optimization criterion in which a species tree that minimizes the number of gene duplications induced among a set of gene trees is selected. The run time performance of previous implementations has limited its use on large-scale data sets. We used new software that incorporates recent algorithmic advances to examine the performance of GTP on a plant data set consisting of 18,896 gene trees containing 510,922 protein sequences from 136 plant taxa (giving a combined alignment length of >2.9 million characters). The relationships inferred from the GTP analysis were largely consistent with previous large-scale studies of backbone plant phylogeny and resolved some controversial nodes. The placement of taxa that were present in few gene trees generally varied the most among GTP bootstrap replicates. Excluding these taxa either before or after the GTP analysis revealed high levels of phylogenetic support across plants. The analyses supported magnoliids sister to a eudicot + monocot clade and did not support the eurosid I and II clades. This study presents a nuclear genomic perspective on the broad-scale phylogenic relationships among plants, and it demonstrates that nuclear genes with a history of duplication and loss can be phylogenetically informative for resolving the plant tree of life.

PubMed Disclaimer

Figures

F<sc>IGURE</sc> 1.
FIGURE 1.
Consensus of the 136 taxon NJ/PP GTP supertree bootstrap trees. Numbers represent supertree bootstrap percentages. Asterisk represents branches with at least 50% supertree bootstrap support that are not present in the species tree inferred using ML gene trees. 1
F<sc>IGURE</sc> 2.
FIGURE 2.
Average quartet similarity for each taxon among bootstrap trees. Each point in the graph represents a single taxon. The x-axis shows the number of gene families trees that have data from the taxon. The y-axis shows the average percentage frequency of quartets (four taxon statements) containing the taxon that are identical between two bootstrap trees. The shaded area in the graph contains all taxa that are present in less than 1300 gene trees.2
F<sc>IGURE</sc> 3.
FIGURE 3.
Reduced consensus of the GTP supertree bootstrap trees. To build the reduced consensus, 54 taxa that were present in less than 1300 of the gene trees were pruned from the bootstrap tree, and then a consensus was made from the pruned trees. Numbers above the branches are the supertree bootstrap percentages. Asterisk represents branches that are not present in the species tree inferred using ML gene trees.3

References

    1. Adams KL, Wendel JF. Polyploidy and genome evolution in plants. Curr. Opin. Plant Biol. 2005;8:135–141. - PubMed
    1. Åkerborg Ö, Sennblad B, Arvestad L, Lagergren J. Simultaneous Bayesian gene tree reconstruction and reconciliation analysis. Proc. Natl. Acad. Sci. U.S.A. 2009;106:5714–5719. - PMC - PubMed
    1. Angiosperm Phylogeny Group. An update of the angiosperm phylogeny group classification for the orders and families of flowering plants: APG II. Bot. J. Linn. Soc. 2003;141:399–436.
    1. Arvestad L, Berglund A-C, Lagergren J, Sennblad B. 2004. Gene tree reconstruction and orthology analysis based on an integrated model for duplications and sequence evolution. RECOMB. 2004:326–335.
    1. Bansal MS, Burleigh JG, Eulenstein O. Efficient genome-scale phylogenetic analysis under the duplication-loss and deep coalescence cost models. BMC Bioinformatics. 2010;11:S42. - PMC - PubMed

Publication types