Efficient error correction algorithms for gene tree reconciliation based on duplication, duplication and loss, and deep coalescence
- PMID: 22759416
- PMCID: PMC3382437
- DOI: 10.1186/1471-2105-13-S10-S11
Efficient error correction algorithms for gene tree reconciliation based on duplication, duplication and loss, and deep coalescence
Abstract
Background: Gene tree - species tree reconciliation problems infer the patterns and processes of gene evolution within a species tree. Gene tree parsimony approaches seek the evolutionary scenario that implies the fewest gene duplications, duplications and losses, or deep coalescence (incomplete lineage sorting) events needed to reconcile a gene tree and a species tree. While a gene tree parsimony approach can be informative about genome evolution and phylogenetics, error in gene trees can profoundly bias the results.
Results: We introduce efficient algorithms that rapidly search local Subtree Prune and Regraft (SPR) or Tree Bisection and Reconnection (TBR) neighborhoods of a given gene tree to identify a topology that implies the fewest duplications, duplication and losses, or deep coalescence events. These algorithms improve on the current solutions by a factor of n for searching SPR neighborhoods and n2 for searching TBR neighborhoods, where n is the number of taxa in the given gene tree. They provide a fast error correction protocol for ameliorating the effects of gene tree error by allowing small rearrangements in the topology to improve the reconciliation cost. We also demonstrate a simple protocol to use the gene rearrangement algorithm to improve gene tree parsimony phylogenetic analyses.
Conclusions: The new gene tree rearrangement algorithms provide a fast method to address gene tree error. They do not make assumptions about the underlying processes of genome evolution, and they are amenable to analyses of large-scale genomic data sets. These algorithms are also easily incorporated into gene tree parsimony phylogenetic analyses, potentially producing more credible estimates of reconciliation cost.
Figures


Similar articles
-
Algorithms for genome-scale phylogenetics using gene tree parsimony.IEEE/ACM Trans Comput Biol Bioinform. 2013 Jul-Aug;10(4):939-56. doi: 10.1109/TCBB.2013.103. IEEE/ACM Trans Comput Biol Bioinform. 2013. PMID: 24334388
-
Efficient genome-scale phylogenetic analysis under the duplication-loss and deep coalescence cost models.BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S42. doi: 10.1186/1471-2105-11-S1-S42. BMC Bioinformatics. 2010. PMID: 20122216 Free PMC article.
-
Algorithms: simultaneous error-correction and rooting for gene tree reconciliation and the gene duplication problem.BMC Bioinformatics. 2012 Jun 25;13 Suppl 10(Suppl 10):S14. doi: 10.1186/1471-2105-13-S10-S14. BMC Bioinformatics. 2012. PMID: 22759419 Free PMC article.
-
Models, algorithms and programs for phylogeny reconciliation.Brief Bioinform. 2011 Sep;12(5):392-400. doi: 10.1093/bib/bbr045. Brief Bioinform. 2011. PMID: 21949266 Review.
-
Phylogenetic reconciliation: making the most of genomes to understand microbial ecology and evolution.ISME J. 2024 Jan 8;18(1):wrae129. doi: 10.1093/ismejo/wrae129. ISME J. 2024. PMID: 39001714 Free PMC article. Review.
Cited by
-
Gene tree correction guided by orthology.BMC Bioinformatics. 2013;14 Suppl 15(Suppl 15):S5. doi: 10.1186/1471-2105-14-S15-S5. Epub 2013 Oct 15. BMC Bioinformatics. 2013. PMID: 24564227 Free PMC article.
-
Polytomy refinement for the correction of dubious duplications in gene trees.Bioinformatics. 2014 Sep 1;30(17):i519-26. doi: 10.1093/bioinformatics/btu463. Bioinformatics. 2014. PMID: 25161242 Free PMC article.
-
DrML: probabilistic modeling of gene duplications.J Comput Biol. 2014 Jan;21(1):89-98. doi: 10.1089/cmb.2013.0078. Epub 2013 Sep 27. J Comput Biol. 2014. PMID: 24073895 Free PMC article.
-
The inference of gene trees with species trees.Syst Biol. 2015 Jan;64(1):e42-62. doi: 10.1093/sysbio/syu048. Epub 2014 Jul 28. Syst Biol. 2015. PMID: 25070970 Free PMC article. Review.
-
Non-parametric correction of estimated gene trees using TRACTION.Algorithms Mol Biol. 2020 Jan 4;15:1. doi: 10.1186/s13015-019-0161-8. eCollection 2020. Algorithms Mol Biol. 2020. PMID: 31911812 Free PMC article.
References
-
- Maddison WP. Gene Trees in Species Trees. Systematic Biology. 1997;46:523–536. doi: 10.1093/sysbio/46.3.523. - DOI
-
- Goodman M, Czelusniak J, Moore GW, Romero-Herrera AE, Matsuda G. Fitting the gene lineage into its species lineage. A parsimony strategy illustrated by cladograms constructed from globin sequences. Systematic Zoology. 1979;28:132–163. doi: 10.2307/2412519. - DOI
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources