Of Traits and Trees: Probabilistic Distances under Continuous Trait Models for Dissecting the Interplay among Phylogeny, Model, and Data
- PMID: 33587145
- PMCID: PMC8208806
- DOI: 10.1093/sysbio/syab009
Of Traits and Trees: Probabilistic Distances under Continuous Trait Models for Dissecting the Interplay among Phylogeny, Model, and Data
Abstract
Stochastic models of character trait evolution have become a cornerstone of evolutionary biology in an array of contexts. While probabilistic models have been used extensively for statistical inference, they have largely been ignored for the purpose of measuring distances between phylogeny-aware models. Recent contributions to the problem of phylogenetic distance computation have highlighted the importance of explicitly considering evolutionary model parameters and their impacts on molecular sequence data when quantifying dissimilarity between trees. By comparing two phylogenies in terms of their induced probability distributions that are functions of many model parameters, these distances can be more informative than traditional approaches that rely strictly on differences in topology or branch lengths alone. Currently, however, these approaches are designed for comparing models of nucleotide substitution and gene tree distributions, and thus, are unable to address other classes of traits and associated models that may be of interest to evolutionary biologists. Here, we expand the principles of probabilistic phylogenetic distances to compute tree distances under models of continuous trait evolution along a phylogeny. By explicitly considering both the degree of relatedness among species and the evolutionary processes that collectively give rise to character traits, these distances provide a foundation for comparing models and their predictions, and for quantifying the impacts of assuming one phylogenetic background over another while studying the evolution of a particular trait. We demonstrate the properties of these approaches using theory, simulations, and several empirical data sets that highlight potential uses of probabilistic distances in many scenarios. We also introduce an open-source R package named PRDATR for easy application by the scientific community for computing phylogenetic distances under models of character trait evolution.[Brownian motion; comparative methods; phylogeny; quantitative traits.].
© The Author(s) 2021. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
Figures









Similar articles
-
Probabilistic Species Tree Distances: Implementing the Multispecies Coalescent to Compare Species Trees Within the Same Model-Based Framework Used to Estimate Them.Syst Biol. 2020 Jan 1;69(1):194-207. doi: 10.1093/sysbio/syz031. Syst Biol. 2020. PMID: 31086978
-
The Cauchy Process on Phylogenies: A Tractable Model for Pulsed Evolution.Syst Biol. 2023 Dec 30;72(6):1296-1315. doi: 10.1093/sysbio/syad053. Syst Biol. 2023. PMID: 37603537
-
Fast likelihood calculation for multivariate Gaussian phylogenetic models with shifts.Theor Popul Biol. 2020 Feb;131:66-78. doi: 10.1016/j.tpb.2019.11.005. Epub 2019 Dec 2. Theor Popul Biol. 2020. PMID: 31805292
-
How traits shape trees: new approaches for detecting character state-dependent lineage diversification.J Evol Biol. 2014 Oct;27(10):2035-45. doi: 10.1111/jeb.12460. Epub 2014 Jul 25. J Evol Biol. 2014. PMID: 25066512 Review.
-
Measuring biodiversity to explain community assembly: a unified approach.Biol Rev Camb Philos Soc. 2011 Nov;86(4):792-812. doi: 10.1111/j.1469-185X.2010.00171.x. Epub 2010 Dec 14. Biol Rev Camb Philos Soc. 2011. PMID: 21155964 Review.
Cited by
-
A Tale of Too Many Trees: A Conundrum for Phylogenetic Regression.Mol Biol Evol. 2025 Mar 5;42(3):msaf032. doi: 10.1093/molbev/msaf032. Mol Biol Evol. 2025. PMID: 39930867 Free PMC article.
-
TraitTrainR: accelerating large-scale simulation under models of continuous trait evolution.Bioinform Adv. 2024 Dec 9;5(1):vbae196. doi: 10.1093/bioadv/vbae196. eCollection 2025. Bioinform Adv. 2024. PMID: 39758830 Free PMC article.
-
Piikun: an information theoretic toolkit for analysis and visualization of species delimitation metric space.BMC Bioinformatics. 2024 Dec 18;25(1):385. doi: 10.1186/s12859-024-05997-y. BMC Bioinformatics. 2024. PMID: 39695946 Free PMC article.
-
New generalized metric based on branch length distance to compare B cell lineage trees.Algorithms Mol Biol. 2024 Oct 5;19(1):22. doi: 10.1186/s13015-024-00267-1. Algorithms Mol Biol. 2024. PMID: 39369262 Free PMC article.
-
Discriminating models of trait evolution.bioRxiv [Preprint]. 2025 Jun 13:2025.06.12.659377. doi: 10.1101/2025.06.12.659377. bioRxiv. 2025. PMID: 40661575 Free PMC article. Preprint.
References
-
- Abou-Moustafa K.T., Ferrie F.P.. 2012. A note on metric properties for some divergence measures: the Gaussian case. J. Mach. Learn. Res. 15:1–15.
-
- Adams R.H., Castoe T.A.. 2019a. Statistical binning leads to profound model violation due to gene tree error incurred by trying to avoid gene tree error. Mol. Phylogenet. Evol. 134:164–171. - PubMed
-
- Adams R.H., Castoe T.A.. 2019b. Probabilistic species tree distances: implementing the multispecies coalescent to compare species trees within the same model-based framework used to estimate them. Syst. Biol. 61:194–207. - PubMed
-
- Akaike H. 1973. Information theory and an extension of the maximum likelihood principle. 2nd International Symposium on Information Theory. Budapest: Akademiai Kiado. p. 267–281.
-
- Aldous D.J. 1995. Probability distributions on cladograms. In: Aldous D.J., Pemantle R., editors. Random discrete structures. Berlin: Springer. p. 1–18.
Publication types
MeSH terms
Associated data
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources