Estimating optimal species trees from incomplete gene trees under deep coalescence
- PMID: 22697236
- DOI: 10.1089/cmb.2012.0037
Estimating optimal species trees from incomplete gene trees under deep coalescence
Abstract
The estimation of species trees typically involves the estimation of trees and alignments on many different genes, so that the species tree can be based on many different parts of the genome. This kind of phylogenomic approach to species tree estimation has the potential to produce more accurate species tree estimates, especially when gene trees can differ from the species tree due to processes such as incomplete lineage sorting (ILS), gene duplication and loss, and horizontal gene transfer. Because ILS (also called "deep coalescence") is a frequent problem in systematics, many methods have been developed to estimate species trees from gene trees or alignments that specifically take ILS into consideration. In this paper we consider the problem of estimating species trees from gene trees and alignments for the general case where the gene trees and alignments can be incomplete, which means that not all the genes contain sequences for all the species. We formalize optimization problems for this context and prove theoretical results for these problems. We also present the results of a simulation study evaluating existing methods for estimating species trees from incomplete gene trees. Our simulation study shows that *BEAST, a statistical method for estimating species trees from gene sequence alignments, produces by far the most accurate species trees. However, *BEAST can only be run on small datasets. The second most accurate method, MRP (a standard supertree method), can analyze very large datasets and produces very good trees, making MRP a potentially acceptable alternative to *BEAST for large datasets.
Similar articles
-
From gene trees to species trees II: species tree inference by minimizing deep coalescence events.IEEE/ACM Trans Comput Biol Bioinform. 2011 Nov-Dec;8(6):1685-91. doi: 10.1109/TCBB.2011.83. IEEE/ACM Trans Comput Biol Bioinform. 2011. PMID: 21576759
-
Estimating species phylogeny from gene-tree probabilities despite incomplete lineage sorting: an example from Melanoplus grasshoppers.Syst Biol. 2007 Jun;56(3):400-11. doi: 10.1080/10635150701405560. Syst Biol. 2007. PMID: 17520504
-
Maximum likelihood estimates of species trees: how accuracy of phylogenetic inference depends upon the divergence history and sampling design.Syst Biol. 2009 Oct;58(5):501-8. doi: 10.1093/sysbio/syp045. Epub 2009 Aug 20. Syst Biol. 2009. PMID: 20525604
-
Coalescent methods for estimating phylogenetic trees.Mol Phylogenet Evol. 2009 Oct;53(1):320-8. doi: 10.1016/j.ympev.2009.05.033. Epub 2009 Jun 6. Mol Phylogenet Evol. 2009. PMID: 19501178 Review.
-
The inference of gene trees with species trees.Syst Biol. 2015 Jan;64(1):e42-62. doi: 10.1093/sysbio/syu048. Epub 2014 Jul 28. Syst Biol. 2015. PMID: 25070970 Free PMC article. Review.
Cited by
-
Multilocus inference of species trees and DNA barcoding.Philos Trans R Soc Lond B Biol Sci. 2016 Sep 5;371(1702):20150335. doi: 10.1098/rstb.2015.0335. Philos Trans R Soc Lond B Biol Sci. 2016. PMID: 27481787 Free PMC article. Review.
-
Phylogenomics and Diversification of the Schistosomatidae Based on Targeted Sequence Capture of Ultra-Conserved Elements.Pathogens. 2022 Jul 5;11(7):769. doi: 10.3390/pathogens11070769. Pathogens. 2022. PMID: 35890014 Free PMC article.
-
STELAR: a statistically consistent coalescent-based species tree estimation method by maximizing triplet consistency.BMC Genomics. 2020 Feb 10;21(1):136. doi: 10.1186/s12864-020-6519-y. BMC Genomics. 2020. PMID: 32039704 Free PMC article.
-
A Phylogenomic Perspective on the Radiation of Ray-Finned Fishes Based upon Targeted Sequencing of Ultraconserved Elements (UCEs).PLoS One. 2013 Jun 18;8(6):e65923. doi: 10.1371/journal.pone.0065923. Print 2013. PLoS One. 2013. PMID: 23824177 Free PMC article.
-
Rooting phylogenetic trees under the coalescent model using site pattern probabilities.BMC Evol Biol. 2017 Dec 19;17(1):263. doi: 10.1186/s12862-017-1108-7. BMC Evol Biol. 2017. PMID: 29258427 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources