Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 May 1;67(3):384-399.
doi: 10.1093/sysbio/syx066.

Phylogenetic Factor Analysis

Affiliations

Phylogenetic Factor Analysis

Max R Tolkoff et al. Syst Biol. .

Abstract

Phylogenetic comparative methods explore the relationships between quantitative traits adjusting for shared evolutionary history. This adjustment often occurs through a Brownian diffusion process along the branches of the phylogeny that generates model residuals or the traits themselves. For high-dimensional traits, inferring all pair-wise correlations within the multivariate diffusion is limiting. To circumvent this problem, we propose phylogenetic factor analysis (PFA) that assumes a small unknown number of independent evolutionary factors arise along the phylogeny and these factors generate clusters of dependent traits. Set in a Bayesian framework, PFA provides measures of uncertainty on the factor number and groupings, combines both continuous and discrete traits, integrates over missing measurements and incorporates phylogenetic uncertainty with the help of molecular sequences. We develop Gibbs samplers based on dynamic programming to estimate the PFA posterior distribution, over 3-fold faster than for multivariate diffusion and a further order-of-magnitude more efficiently in the presence of latent traits. We further propose a novel marginal likelihood estimator for previously impractical models with discrete data and find that PFA also provides a better fit than multivariate diffusion in evolutionary questions in columbine flower development, placental reproduction transitions and triggerfish fin morphometry.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Processes driving columbine flower evolution inferred through PFA or LMBD. a) Loadings formula image estimates from a formula image factor PFA model. Purple circles represent traits positively associated with traits represented by other purple circles within a loading, and negatively associated with traits represented by green circles within a loading. Similarly, traits represented by green circles are positively associated with traits represented by green circles within a loading. Size represents the magnitude of the value of the loadings. Opacity represents the posterior probability that the sign of the given element is equal to the sign of the posterior mean. The greyed out cell represents a structural 0 introduced for identifiability reasons. The magnitude for anthocyanins and pollinator type is less relevant since those measurements are discrete. b) Correlation matrix estimate from a LMBD model. Red represents positive correlation, blue represents anticorrelation, and opacity represents the absolute difference in posterior probability of being greater than 0 and less than 0. Size of the circle represents the magnitude of the correlation. The PFA captures well two independent processes, while the LMBD groups these processes together.
Figure 2.
Figure 2.
Processes driving transitions to placental reproduction inferred through PFAs. Loading formula image estimates from the a) formula image, b) formula image and c) formula image factor models. Loadings size, coloring and density follow those of Figure 1. Note that the magnitude for dichromatism, courtship behavior, ornamental display traits, sexual selection index, and superfetation is less relevant since those data are discrete. We include the two factor model for direct comparison to the results of Pollux et al. (2014). Loadings in the more probable formula image and formula image factor models do not support an association between matrotrophy index and gonopodium length nor body weights and lengths.
Figure 3.
Figure 3.
Expected triggerfish fin shape given a range of a) first factor values formula image and b) third factor values formula image, holding all others constant. Purple dots estimate semilandmark locations. Green lines are interpolated to present a clearer outline of the fin shape. For the relation represented by formula image the dorsal and anal fins go from more pointed to less pointed. For the relation represented by formula image we see a rotation in the pectoral fin.
Figure 4.
Figure 4.
Evolution of independent factors formula image driving triggerfish fin morphology along inferred phylogeny. The colorings display contemporary and ancestral first formula image and third formula image factor values under a formula image factor PFA model. For formula image, green represents positive values and purple represents negative values. For formula image, the scale is orange to blue. The Supplementary material available on Dryad contains plots for formula image, formula image and formula image. Balistes polylepis and Balistes vetula have negative factor values for the first factor formula image whereas the clade containing genus Rhinecanthus has positive factor values. In the third factor formula image, the Balistes genus and the species Pseudobalistes fuscus have positive factor values whereas the genus Rhinecanthus has near 0 factor values. Conversely, the genus Xanthichthys has a negative factor value for formula image, and has a near 0 value for formula image. We display the posterior clade probabilities for probabilities formula image99%.
Figure 5.
Figure 5.
Inferred ancestral fin shapes at the MRCA and formula image, formula image and formula image of the expected substitution distance between the MRCA and two contemporaneous triggerfish species. In a), Xanthichthys mento has a flat dorsal and anal fin with a point, and a clockwise rotated pectoral fin relative to its ancestors. The dorsal and anal fins become rounder and the pectoral fin rotates counterclockwise moving backwards in time. In contrast, in b), Balistes capriscus has a broad pointed dorsal and anal fin, and a counterclockwise anal fin. The dorsal and anal fins become more pointed and then round out, while the pectoral fin rotates clockwise.

Similar articles

Cited by

References

    1. Adams D. 2014. A method for assessing phylogenetic least squares models for shape and other high-dimensional multivariate data. Evolution 68(9): 2675–2688. - PubMed
    1. Aguilar O.,, West M. 2000. Bayesian dynamic factor models and portfolio allocation. J. Bus. Econ. Stat. 18:338–357.
    1. Baele G.,, Lemey P.,, Bedford T.,, Rambaut A.,, Suchard M.A.,, Alekseyenko A.V. 2012. Improving the accuracy of demographic and molecular clock model comparison while accommodating phylogenetic uncertainty. Mol. Biol. Evol. 29:2157–2167. - PMC - PubMed
    1. Baele G.,, Lemey P.,, Suchard, M.A. 2016. Genealogical working distributions for Bayesian model testing with phylogenetic uncertainty. Syst. Biol. 65:250–264. - PMC - PubMed
    1. Beguin A.,, Glas C. 2001. MCMC estimation and some model-fit analysis of multidimensional IRT models. Psychometrika 66:541–562.

Publication types