Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2002 Jul;12(7):1080-90.
doi: 10.1101/gr.187002.

A phylogenomic approach to bacterial phylogeny: evidence of a core of genes sharing a common history

Affiliations
Comparative Study

A phylogenomic approach to bacterial phylogeny: evidence of a core of genes sharing a common history

Vincent Daubin et al. Genome Res. 2002 Jul.

Abstract

It has been claimed that complete genome sequences would clarify phylogenetic relationships between organisms, but up to now, no satisfying approach has been proposed to use efficiently these data. For instance, if the coding of presence or absence of genes in complete genomes gives interesting results, it does not take into account the phylogenetic information contained in sequences and ignores hidden paralogies by using a BLAST reciprocal best hit definition of orthology. In addition, concatenation of sequences of different genes as well as building of consensus trees only consider the few genes that are shared among all organisms. Here we present an attempt to use a supertree method to build the phylogenetic tree of 45 organisms, with special focus on bacterial phylogeny. This led us to perform a phylogenetic study of congruence of tree topologies, which allows the identification of a core of genes supporting similar species phylogeny. We then used this core of genes to infer a tree. This phylogeny presents several differences with the rRNA phylogeny, notably for the position of hyperthermophilic bacteria.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Construction of supertrees by MRP with bootstrap weighting. Each tree obtained for a set of species from a single orthologous gene family was coded into a binary matrix of informative sites. Only branches having a bootstrap value (or RELL-BP value for ML trees) over 50% were coded. The matrices obtained were concatenated into a supermatrix in which species absent from a gene family are encoded as unknown state (“?”). The supertree was calculated on the supermatrix using DNAPARS with all default options, and 500 replicates of bootstrap were made using SEQBOOT.
Figure 2
Figure 2
Supertrees of 45 species constructed with 730 trees. (A) Supertree based on trees made by BIONJ and a gamma distribution estimation of evolutionary rate heterogeneity. (B) Supertree made with ML trees. Only bootstrap values over 50% are shown.
Figure 2
Figure 2
Supertrees of 45 species constructed with 730 trees. (A) Supertree based on trees made by BIONJ and a gamma distribution estimation of evolutionary rate heterogeneity. (B) Supertree made with ML trees. Only bootstrap values over 50% are shown.
Figure 3
Figure 3
Plot of the two first axes of the PCO made from 310 BIONJ trees compared with RF distance. The trees chosen contained at least 10 bacterial species. The same experiment with ML-trees gave very similar results. Black dots correspond to informational genes, and gray dots correspond to operational genes. The ellipse contains the 121 trees retained for supertree reconstruction (see Table 1).
Figure 4
Figure 4
Supertrees of 45 species built with the trees selected using the PCO results. (A) Supertree based on 121 trees made by BIONJ and a gamma distribution estimation of evolutionary rate heterogeneity. (B) Supertree based on 118 ML trees.
Figure 4
Figure 4
Supertrees of 45 species built with the trees selected using the PCO results. (A) Supertree based on 121 trees made by BIONJ and a gamma distribution estimation of evolutionary rate heterogeneity. (B) Supertree based on 118 ML trees.

Similar articles

Cited by

References

    1. Baldauf SL, Roger AJ, Wenk-Siefert I, Doolittle WF. A kingdom-level phylogeny of eukaryotes based on combined protein data. Science. 2000;290:972–977. - PubMed
    1. Barns SM, Delwiche CF, Palmer JD, Pace NR. Perspectives on archaeal diversity, thermophily and monophyly from environmental rRNA sequences. Proc Natl Acad Sci. 1996;93:9188–9193. - PMC - PubMed
    1. Baum BR. Combining trees as a way of combining data sets for phylogenetic inference, and the desirability of combining gene trees. Taxon. 1992;41:3–10.
    1. Bellgard MI, Itoh T, Watanabe H, Imanishi T, Gojobori T. Dynamic evolution of genomes and the concept of genome space. Ann NY Acad Sci. 1999;870:293–300. - PubMed
    1. Bocchetta M, Gribaldo S, Sanangelantoni A, Cammarano P. Phylogenetic depth of the bacterial genera Aquifex and Thermotoga inferred from analysis of ribosomal protein, elongation factor, and RNA polymerase subunit sequences. J Mol Evol. 2000;50:366–380. - PubMed