Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Dec 19:8:32.
doi: 10.1186/1745-6150-8-32.

Models of gene gain and gene loss for probabilistic reconstruction of gene content in the last universal common ancestor of life

Affiliations

Models of gene gain and gene loss for probabilistic reconstruction of gene content in the last universal common ancestor of life

Lavanya Kannan et al. Biol Direct. .

Abstract

Background: The problem of probabilistic inference of gene content in the last common ancestor of several extant species with completely sequenced genomes is: for each gene that is conserved in all or some of the genomes, assign the probability that its ancestral gene was present in the genome of their last common ancestor.

Results: We have developed a family of models of gene gain and gene loss in evolution, and applied the maximum-likelihood approach that uses phylogenetic tree of prokaryotes and the record of orthologous relationships between their genes to infer the gene content of LUCA, the Last Universal Common Ancestor of all currently living cellular organisms. The crucial parameter, the ratio of gene losses and gene gains, was estimated from the data and was higher in models that take account of the number of in-paralogs in genomes than in models that treat gene presences and absences as a binary trait.

Conclusion: While the numbers of genes that are placed confidently into LUCA are similar in the ML methods and in previously published methods that use various parsimony-based approaches, the identities of genes themselves are different. Most of the models of either kind treat the genes found in many existing genomes in a similar way, assigning to them high probabilities of being ancestral ("high ancestrality"). The ML models are more likely than others to assign high ancestrality to the genes that are relatively rare in the present-day genomes.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Distribution of all COGs under models B2 and M2, as well as high-ancestrality COGs (LUCA-ML and LUCA1.0), by the number of genomes in which they are present.
Figure 2
Figure 2
Probability distribution of the COG ancestrality under various models. The first panel shows the frequency of COGs with the different probability of occurrence at LUCA, and the second panel shows the number of COGs above the different probability thresholds.
Figure 3
Figure 3
The reference species tree. The reference species tree was constructed using the sequences of 11 universal genes COG00081, COG00093, COG00096, COG00097, COG00099, COG00102, COG00103, COG00197, COG00244, COG00256, COG00533.
Figure 4
Figure 4
A toy example showing the parameters for calculation of the likelihood function. The elements of the phyletic vector of a gene are shown at the leaves. Branch lengths t1, t2, ⋯, t8 are known.
Figure 5
Figure 5
The strategy of editing phyletic vectors on the basis of information about HGT. A horizontal gene transfer event (red arrow) from ancestral species A to species B is inferred on the basis of comparison of the topologies of the species tree and the sequence family tree (not shown in this figure). The bottom vector is obtained by replacing all the presences by absences in all descendants of B, and the inference of ancestral state can be updated on this edited vector.

References

    1. Mushegian A. Gene content of LUCA, the last universal common ancestor. Front Biosci. 2008;8:4657–66. - PubMed
    1. Mirkin BG, Fenner TI, Galperin MY, Koonin EV. Algorithms for computing parsimonious evolutionary scenarios for genome evolution, the last universal common ancestor and dominance of horizontal gene transfer in the evolution of prokaryotes. BMC Evol Biol. 2003;8:2. doi: 10.1186/1471-2148-3-2. - DOI - PMC - PubMed
    1. Ouzounis CA. Ancestral state reconstructions for genomes. Curr Opin Genet Dev. 2005;8(6):595–600. doi: 10.1016/j.gde.2005.09.011. - DOI - PubMed
    1. Dagan T, Martin W. Ancestral genome sizes specify the minimum rate of lateral gene transfer during prokaryote evolution. Proc Natl Acad Sci USA. 2007;8(3):870–5. doi: 10.1073/pnas.0606318104. - DOI - PMC - PubMed
    1. Kristensen DM, Wolf YI, Mushegian AR, Koonin EV. Computational methods for Gene Orthology inference. Brief Bioinform. 2011;8(5):379–391. doi: 10.1093/bib/bbr030. - DOI - PMC - PubMed

Publication types

LinkOut - more resources