Coalescent-based species tree inference from gene tree topologies under incomplete lineage sorting by maximum likelihood
- PMID: 22380439
- DOI: 10.1111/j.1558-5646.2011.01476.x
Abstract
Incomplete lineage sorting can cause incongruence between the phylogenetic history of genes (the gene tree) and that of the species (the species tree), which can complicate the inference of phylogenies. In this article, I present a new coalescent-based algorithm for species tree inference with maximum likelihood. I first describe an improved method for computing the probability of a gene tree topology given a species tree, which is much faster than an existing algorithm by Degnan and Salter (2005). Based on this method, I develop a practical algorithm that takes a set of gene tree topologies and infers species trees with maximum likelihood. This algorithm searches for the best species tree by starting from initial species trees and performing heuristic search to obtain better trees with higher likelihood. This algorithm, called STELLS (which stands for Species Tree InfErence with Likelihood for Lineage Sorting), has been implemented in a program that is downloadable from the author's web page. The simulation results show that the STELLS algorithm is more accurate than an existing maximum likelihood method for many datasets, especially when there is noise in gene trees. I also show that the STELLS algorithm is efficient and can be applied to real biological datasets.
© 2011 The Author. Evolution © 2011 The Society for the Study of Evolution.
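To make the likelihood idea in the abstract concrete, here is a minimal sketch for the smallest non-trivial case, three species. It is not the STELLS algorithm: it relies on the standard closed-form result that, for a rooted three-taxon species tree whose internal branch has length t in coalescent units, the matching gene tree topology has probability 1 - (2/3)e^(-t) and each of the two discordant topologies has probability (1/3)e^(-t). All names here (`TOPOLOGIES`, `MAX_T`, `infer_species_tree`, and the example counts) are hypothetical and chosen only for illustration. An exhaustive scan over the three candidate topologies stands in for the heuristic tree search the abstract describes.

```python
import math

# The three possible rooted topologies on species A, B, C (illustrative labels).
TOPOLOGIES = ("((A,B),C)", "((A,C),B)", "((B,C),A)")

# Hypothetical cap on the internal branch length (coalescent units), used when
# no discordant gene trees are observed and the unconstrained MLE is infinite.
MAX_T = 10.0


def topology_probs(species_topology, t):
    """Gene tree topology probabilities given a 3-taxon species tree."""
    x = math.exp(-t)  # probability that no coalescence occurs on the internal branch
    return {
        g: (1.0 - 2.0 * x / 3.0) if g == species_topology else x / 3.0
        for g in TOPOLOGIES
    }


def log_likelihood(species_topology, t, counts):
    """Multinomial log-likelihood of observed gene tree topology counts."""
    probs = topology_probs(species_topology, t)
    return sum(
        counts.get(g, 0) * math.log(probs[g])
        for g in TOPOLOGIES
        if counts.get(g, 0) > 0
    )


def best_branch_length(species_topology, counts):
    """Closed-form MLE of t for a fixed species topology, clamped to [0, MAX_T]."""
    total = sum(counts.get(g, 0) for g in TOPOLOGIES)
    discordant = total - counts.get(species_topology, 0)
    x = min(1.0, 3.0 * discordant / (2.0 * total))  # MLE of exp(-t)
    if x <= 0.0:
        return MAX_T
    return min(MAX_T, -math.log(x))


def infer_species_tree(counts):
    """Return (topology, branch length, log-likelihood) with the highest likelihood."""
    best = None
    for s in TOPOLOGIES:
        t = best_branch_length(s, counts)
        ll = log_likelihood(s, t, counts)
        if best is None or ll > best[2]:
            best = (s, t, ll)
    return best


if __name__ == "__main__":
    # Made-up counts of gene tree topologies, for illustration only.
    example_counts = {"((A,B),C)": 70, "((A,C),B)": 16, "((B,C),A)": 14}
    print(infer_species_tree(example_counts))
```

With more than three species, neither the closed-form probabilities nor the exhaustive scan carries over, which is the gap that a faster gene tree probability computation combined with heuristic search is meant to fill.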