Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Sep;186(1):411-26.
doi: 10.1534/genetics.110.118448. Epub 2010 Jun 15.

Inferring bacterial genome flux while considering truncated genes

Affiliations

Inferring bacterial genome flux while considering truncated genes

Weilong Hao et al. Genetics. 2010 Sep.

Abstract

Bacterial gene content variation during the course of evolution has been widely acknowledged and its pattern has been actively modeled in recent years. Gene truncation or gene pseudogenization also plays an important role in shaping bacterial genome content. Truncated genes could also arise from small-scale lateral gene transfer events. Unfortunately, the information of truncated genes has not been considered in any existing mathematical models on gene content variation. In this study, we developed a model to incorporate truncated genes. Maximum-likelihood estimates (MLEs) of the new model reveal fast rates of gene insertions/deletions on recent branches, suggesting a fast turnover of many recently transferred genes. The estimates also suggest that many truncated genes are in the process of being eliminated from the genome. Furthermore, we demonstrate that the ignorance of truncated genes in the estimation does not lead to a systematic bias but rather has a more complicated effect. Analysis using the new model not only provides more accurate estimates on gene gains/losses (or insertions/deletions), but also reduces any concern of a systematic bias from applying simplified models to bacterial genome evolution. Although not a primary purpose, the model incorporating truncated genes could be potentially used for phylogeny reconstruction using gene family content.

PubMed Disclaimer

Figures

F<sc>igure</sc> 1.—
Figure 1.—
(A–D) Phylogenies with varied levels of divergence. Clade names and strain abbreviations are as in Table 1. Horizontal scale bar indicates genome size.
F<sc>igure</sc> 2.—
Figure 2.—
Association between estimated parameters and phylogenetic divergence in each group. Three maximum-likelihood estimates (MLEs), μ, πa, and αΓ, are estimated under the M0 + π model using different cutoff thresholds in BLASTP/TBLASTN searches. Solid symbols represent MLEs using a criterion of E-value ≤10−20 and match length ≥85%, while open symbols represent MLEs using a criterion of E-value ≤10−10 and match length ≥70%. The four groups are sorted from the least diverse group (Escherichia) on the left to the most diverse group (Bacillaceae) on the right. Although shown along with three estimates, tree length is not an estimate from the gene insertion/deletion model. Indeed, it is the sum of branch lengths based on nucleotide substitution and used as an indicator for the degree of divergence in the clade.
F<sc>igure</sc> 3.—
Figure 3.—
Estimated parameters of the instantaneous rate matrix in each clade. As described in methods, α was fixed to be 1, and β and γ were estimated under the M0 + π model.
F<sc>igure</sc> 4.—
Figure 4.—
Rate parameters estimated on a five-taxon phylogeny. Rates on external branches are μ1, and rates on internal branches are μ2.
F<sc>igure</sc> 5.—
Figure 5.—
Comparison of MLEs with (solid symbols) or without (open symbols) considering truncated genes in the model. MLEs were estimated under the M0 + π model. When not considering truncated genes, truncated genes were entirely classified as absent (top half) or present (bottom half).

Similar articles

Cited by

References

    1. Altschul, S. F., T. L. Madden, A. A. Schffer, J. Zhang, Z. Zhang et al., 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25 3389–3402. - PMC - PubMed
    1. Archibald, J. M., and A. J. Roger, 2002. Gene duplication and gene conversion shape the evolution of archaeal chaperonins. J. Mol. Biol. 316 1041–1050. - PubMed
    1. Berg, O. G., and C. G. Kurland, 2002. Evolution of microbial genomes: sequence acquisition and loss. Mol. Biol. Evol. 19 2265–2276. - PubMed
    1. Brochier, C., H. Philippe and D. Moreira, 2000. The evolutionary history of ribosomal protein RpS14: horizontal gene transfer at the heart of the ribosome. Trends Genet. 16 529–533. - PubMed
    1. Chain, P. S., E. Carniel, F. W. Larimer, J. Lamerdin, P. O. Stoutland et al., 2004. Insights into the evolution of Yersinia pestis through whole-genome comparison with Yersinia pseudotuberculosis. Proc. Natl. Acad. Sci. USA 101 13826–13831. - PMC - PubMed

Publication types