Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Jun;62(6):833-62.
doi: 10.1007/s00285-010-0355-7. Epub 2010 Jul 23.

Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent

Affiliations

Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent

Elizabeth S Allman et al. J Math Biol. 2011 Jun.

Abstract

Gene trees are evolutionary trees representing the ancestry of genes sampled from multiple populations. Species trees represent populations of individuals-each with many genes-splitting into new populations or species. The coalescent process, which models ancestry of gene copies within populations, is often used to model the probability distribution of gene trees given a fixed species tree. This multispecies coalescent model provides a framework for phylogeneticists to infer species trees from gene trees using maximum likelihood or Bayesian approaches. Because the coalescent models a branching process over time, all trees are typically assumed to be rooted in this setting. Often, however, gene trees inferred by traditional phylogenetic methods are unrooted. We investigate probabilities of unrooted gene trees under the multispecies coalescent model. We show that when there are four species with one gene sampled per species, the distribution of unrooted gene tree topologies identifies the unrooted species tree topology and some, but not all, information in the species tree edges (branch lengths). The location of the root on the species tree is not identifiable in this situation. However, for 5 or more species with one gene sampled per species, we show that the distribution of unrooted gene tree topologies identifies the rooted species tree topology and all its internal branch lengths. The length of any pendant branch leading to a leaf of the species tree is also identifiable for any species from which more than one gene is sampled.

PubMed Disclaimer

References

    1. Nature. 2003 Oct 23;425(6960):798-804 - PubMed
    1. J Theor Biol. 2010 Mar 7;263(1):108-19 - PubMed
    1. Mol Biol Evol. 1987 Mar;4(2):167-91 - PubMed
    1. Syst Biol. 2009 Oct;58(5):489-500 - PubMed
    1. Genetics. 1989 Aug;122(4):957-66 - PubMed

Publication types

LinkOut - more resources