Distance measures in terms of substitution processes
- PMID: 10329516
- DOI: 10.1006/tpbi.1998.1395
Distance measures in terms of substitution processes
Abstract
Phylogenetic reconstruction from DNA or amino acid sequences relies heavily on suitable distance measures. A number of new distance measures (asynchronous, LogDet, and paralinear distances) which possess the desired property of tree additivity under fairly general models of sequence evolution have been proposed recently, but they are not well understood from a mechanistic point of view. We review them here in a unifying framework, which is the substitution process in continuous time. The emerging interpretation will also clarify the relationship among these distance measures. We also tackle situations with site-to-site variation of substitution rates which is well known to cause non-additive distances and inconsistent branch lengths. For homogeneous, stationary, time-reversible models, this may be repaired provided that the distribution of rates is known. In contrast, we will show that, for non-stationary models, different tree topologies may produce identical joint distributions of letters in pairs of sequences, given the same distribution of rates. This precludes the existence of any tree-additive pairwise distance measure.
Copyright 1999 Academic Press.
Similar articles
-
Pitfalls of heterogeneous processes for phylogenetic reconstruction.Syst Biol. 2007 Feb;56(1):113-24. doi: 10.1080/10635150701245388. Syst Biol. 2007. PMID: 17366141
-
A basic limitation on inferring phylogenies by pairwise sequence comparisons.J Theor Biol. 2009 Feb 7;256(3):467-72. doi: 10.1016/j.jtbi.2008.10.010. Epub 2008 Oct 22. J Theor Biol. 2009. PMID: 19000697
-
On the quality of tree-based protein classification.Bioinformatics. 2005 May 1;21(9):1876-90. doi: 10.1093/bioinformatics/bti244. Epub 2005 Jan 12. Bioinformatics. 2005. PMID: 15647305
-
Testing substitution models within a phylogenetic tree.Mol Biol Evol. 2003 Apr;20(4):572-8. doi: 10.1093/molbev/msg073. Epub 2003 Apr 2. Mol Biol Evol. 2003. PMID: 12679552 Review.
-
Coalescent methods for estimating phylogenetic trees.Mol Phylogenet Evol. 2009 Oct;53(1):320-8. doi: 10.1016/j.ympev.2009.05.033. Epub 2009 Jun 6. Mol Phylogenet Evol. 2009. PMID: 19501178 Review.
Cited by
-
A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity.NPJ Biofilms Microbiomes. 2016 Apr 20;2:16004. doi: 10.1038/npjbiofilms.2016.4. eCollection 2016. NPJ Biofilms Microbiomes. 2016. PMID: 28721243 Free PMC article.
-
Lineage specificity of gene expression patterns.Proc Natl Acad Sci U S A. 2004 Apr 27;101(17):6508-13. doi: 10.1073/pnas.0401136101. Epub 2004 Apr 19. Proc Natl Acad Sci U S A. 2004. PMID: 15096607 Free PMC article.
-
Statistical measures of uncertainty for branches in phylogenetic trees inferred from molecular sequences by using model-based methods.J Appl Genet. 2008;49(1):49-67. doi: 10.1007/BF03195249. J Appl Genet. 2008. PMID: 18263970 Review.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources