Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan 20;24(1):11.
doi: 10.1186/s12862-024-02201-w.

Evolutionary shift detection with ensemble variable selection

Affiliations

Evolutionary shift detection with ensemble variable selection

Wensha Zhang et al. BMC Ecol Evol. .

Abstract

Abrupt environmental changes can lead to evolutionary shifts in trait evolution. Identifying these shifts is an important step in understanding the evolutionary history of phenotypes. The detection performances of different methods are influenced by many factors, including different numbers of shifts, shift sizes, where a shift occurs on a tree, and the types of phylogenetic structure. Furthermore, the model assumptions are oversimplified, so are likely to be violated in real data, which could cause the methods to fail. We perform simulations to assess the effect of these factors on the performance of shift detection methods. To make the comparisons more complete, we also propose an ensemble variable selection method (R package ELPASO) and compare it with existing methods (R packages [Formula: see text]1ou and PhylogeneticEM). The performances of methods are highly dependent on the selection criterion. [Formula: see text]1ou+pBIC is usually the most conservative method and it performs well when signal sizes are large. [Formula: see text]1ou+BIC is the least conservative method and it performs well when signal sizes are small. The ensemble method provides more balanced choices between those two methods. Moreover, the performances of all methods are heavily impacted by measurement error, tree reconstruction error and shifts in variance.

Keywords: ELPASO; Ensemble method; Evolutionary shift detection; LASSO; Ornstein-Uhlenbeck model; Phylogenetic comparative methods; Trait evolution.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The model structure of ensemble method for shift detection
Fig. 2
Fig. 2
Tree used in simulations to compare the precision and recall of different methods. The shifts positions are indicated by asterisks. Different colours indicate different optimal values for the trait
Fig. 3
Fig. 3
True positive numbers versus False positive numbers with 7 shifts
Fig. 4
Fig. 4
The mean log likelihood on 1000 test datasets
Fig. 5
Fig. 5
ARI with 7 true shifts
Fig. 6
Fig. 6
Shifts in different positions of tree
Fig. 7
Fig. 7
True positive versus false positive with different shift positions
Fig. 8
Fig. 8
Four different types of tree
Fig. 9
Fig. 9
True positive versus false positive with different types of trees
Fig. 10
Fig. 10
Average test log likelihood with parameters estimated from training data without measurement error using the shifts selected from training data with measurement error
Fig. 11
Fig. 11
Regenerated tree with different beta
Fig. 12
Fig. 12
Number of non-converging cases when analyzing on a misspecified tree
Fig. 13
Fig. 13
True positive versus false positive with applying methods on misspecified trees (larger triangles show larger difference with the original tree)
Fig. 14
Fig. 14
Average test log likelihood with applying methods on misspecified trees
Fig. 15
Fig. 15
Average test log likelihood with changing estimated alpha
Fig. 16
Fig. 16
Diffusion variance parameter changes on branch 195
Fig. 17
Fig. 17
Diffusion variance parameter changes on branch 195

References

    1. Felsenstein J. Phylogenies and the Comparative Method. Am Nat. 1985;125(1):1–15. doi: 10.1086/284325. - DOI
    1. Davis CC, Latvis M, Nickrent DL, Wurdack KJ, Baum DA. Floral gigantism in Rafflesiaceae. Science. 2007;315(5820):1812. doi: 10.1126/science.1135260. - DOI - PubMed
    1. Beaulieu JM, Jhwueng DC, Boettiger C, O’Meara BC. Modeling stabilizing selection: expanding the Ornstein-Uhlenbeck model of adaptive evolution. Evol Int J Org Evol. 2012;66(8):2369–2383. doi: 10.1111/j.1558-5646.2012.01619.x. - DOI - PubMed
    1. Gill MS, Ho LST, Baele G, Lemey P, Suchard MA. A relaxed directional random walk model for phylogenetic trait evolution. Syst Biol. 2016 doi: 10.1093/sysbio/syw093. - DOI - PMC - PubMed
    1. Hassler G, Tolkoff MR, Allen WL, Ho LST, Lemey P, Suchard MA. Inferring phenotypic trait evolution on large trees with many incomplete measurements. J Am Stat Assoc. 2020:1–15. 10.1080/01621459.2020.1799812. - PMC - PubMed

LinkOut - more resources