Bayesian gene/species tree reconciliation and orthology analysis using MCMC
- PMID: 12855432
- DOI: 10.1093/bioinformatics/btg1000
Bayesian gene/species tree reconciliation and orthology analysis using MCMC
Abstract
Motivation: Comparative genomics in general and orthology analysis in particular are becoming increasingly important parts of gene function prediction. Previously, orthology analysis and reconciliation has been performed only with respect to the parsimony model. This discards many plausible solutions and sometimes precludes finding the correct one. In many other areas in bioinformatics probabilistic models have proven to be both more realistic and powerful than parsimony models. For instance, they allow for assessing solution reliability and consideration of alternative solutions in a uniform way. There is also an added benefit in making model assumptions explicit and therefore making model comparisons possible. For orthology analysis, uncertainty has recently been addressed using parsimonious reconciliation combined with bootstrap techniques. However, until now no probabilistic methods have been available.
Results: We introduce a probabilistic gene evolution model based on a birth-death process in which a gene tree evolves 'inside' a species tree. Based on this model, we develop a tool with the capacity to perform practical orthology analysis, based on Fitch's original definition, and more generally for reconciling pairs of gene and species trees. Our gene evolution model is biologically sound (Nei et al., 1997) and intuitively attractive. We develop a Bayesian analysis based on MCMC which facilitates approximation of an a posteriori distribution for reconciliations. That is, we can find the most probable reconciliations and estimate the probability of any reconciliation, given the observed gene tree. This also gives a way to estimate the probability that a pair of genes are orthologs. The main algorithmic contribution presented here consists of an algorithm for computing the likelihood of a given reconciliation. To the best of our knowledge, this is the first successful introduction of this type of probabilistic methods, which flourish in phylogeny analysis, into reconciliation and orthology analysis. The MCMC algorithm has been implemented and, although not yet being in its final form, tests show that it performs very well on synthetic as well as biological data. Using standard correspondences, our results carry over to allele trees as well as biogeography.
Similar articles
-
Integrating Sequence Evolution into Probabilistic Orthology Analysis.Syst Biol. 2015 Nov;64(6):969-82. doi: 10.1093/sysbio/syv044. Epub 2015 Jun 30. Syst Biol. 2015. PMID: 26130236
-
Probabilistic orthology analysis.Syst Biol. 2009 Aug;58(4):411-24. doi: 10.1093/sysbio/syp046. Epub 2009 Aug 18. Syst Biol. 2009. PMID: 20525594
-
An efficient method for exploring the space of gene tree/species tree reconciliations in a probabilistic framework.IEEE/ACM Trans Comput Biol Bioinform. 2012 Jan-Feb;9(1):26-39. doi: 10.1109/TCBB.2011.64. Epub 2011 Mar 30. IEEE/ACM Trans Comput Biol Bioinform. 2012. PMID: 21464510
-
Coalescent methods for estimating phylogenetic trees.Mol Phylogenet Evol. 2009 Oct;53(1):320-8. doi: 10.1016/j.ympev.2009.05.033. Epub 2009 Jun 6. Mol Phylogenet Evol. 2009. PMID: 19501178 Review.
-
Phylogeny estimation: traditional and Bayesian approaches.Nat Rev Genet. 2003 Apr;4(4):275-84. doi: 10.1038/nrg1044. Nat Rev Genet. 2003. PMID: 12671658 Review.
Cited by
-
Parsimonious reconstruction of network evolution.Algorithms Mol Biol. 2012 Sep 19;7(1):25. doi: 10.1186/1748-7188-7-25. Algorithms Mol Biol. 2012. PMID: 22992218 Free PMC article.
-
Inferring angiosperm phylogeny from EST data with widespread gene duplication.BMC Evol Biol. 2007 Feb 8;7 Suppl 1(Suppl 1):S3. doi: 10.1186/1471-2148-7-S1-S3. BMC Evol Biol. 2007. PMID: 17288576 Free PMC article.
-
Tools for simulating evolution of aligned genomic regions with integrated parameter estimation.Genome Biol. 2008 Oct 8;9(10):R147. doi: 10.1186/gb-2008-9-10-r147. Genome Biol. 2008. PMID: 18840304 Free PMC article.
-
Inferring gene family histories in yeast identifies lineage specific expansions.PLoS One. 2014 Jun 12;9(6):e99480. doi: 10.1371/journal.pone.0099480. eCollection 2014. PLoS One. 2014. PMID: 24921666 Free PMC article.
-
Probabilistic models of eukaryotic evolution: time for integration.Philos Trans R Soc Lond B Biol Sci. 2015 Sep 26;370(1678):20140338. doi: 10.1098/rstb.2014.0338. Philos Trans R Soc Lond B Biol Sci. 2015. PMID: 26323768 Free PMC article. Review.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources