Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Nov 1;39(11):btad660.
doi: 10.1093/bioinformatics/btad660.

An evolution strategy approach for the balanced minimum evolution problem

Affiliations

An evolution strategy approach for the balanced minimum evolution problem

Andrea Gasparin et al. Bioinformatics. .

Abstract

Motivation: The Balanced Minimum Evolution (BME) is a powerful distance based phylogenetic estimation model introduced by Desper and Gascuel and nowadays implemented in popular tools for phylogenetic analyses. It was proven to be computationally less demanding than more sophisticated estimation methods, e.g. maximum likelihood or Bayesian inference while preserving the statistical consistency and the ability to run with almost any kind of data for which a dissimilarity measure is available. BME can be stated in terms of a nonlinear non-convex combinatorial optimization problem, usually referred to as the Balanced Minimum Evolution Problem (BMEP). Currently, the state-of-the-art among approximate methods for the BMEP is represented by FastME (version 2.0), a software which implements several deterministic phylogenetic construction heuristics combined with a local search on specific neighbourhoods derived by classical topological tree rearrangements. These combinations, however, may not guarantee convergence to close-to-optimal solutions to the problem due to the lack of solution space exploration, a phenomenon which is exacerbated when tackling molecular datasets characterized by a large number of taxa.

Results: To overcome such convergence issues, in this article, we propose a novel metaheuristic, named PhyloES, which exploits the combination of an exploration phase based on Evolution Strategies, a special type of evolutionary algorithm, with a refinement phase based on two local search algorithms. Extensive computational experiments show that PhyloES consistently outperforms FastME, especially when tackling larger datasets, providing solutions characterized by a shorter tree length but also significantly different from the topological perspective.

Availability and implementation: The software and the data are available at https://github.com/andygaspar/PHYLOES.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1.
Figure 1.
(Left) Representation of an NNI move. When edge e is selected, two new trees are created by swapping subtrees A–D and A–C, respectively. (Right) Representation of an SPR move. When edge e is selected, the corresponding subtree is pruned from the initial phylogeny and then regrafted onto edge e.
Figure 2.
Figure 2.
Two examples of tree encoding. (Top) The ordered edge list at the initial step is E={(t1,i6),(t2,i6),(t3,i6)}. The edge selected for the insertion of t4 is (t3,i6), which is the 3-rd in the list. At the next step E becomes {(t1,i6),(t2,i6),(t3,i7),(t4,i7),(i6,i7)}; t5 is then inserted in the edge t5 is (i6,i7), 5-th in the list, so the resulting code is T1=(3,5). (Bottom) The initial edge list is E={(t1,i6),(t2,i6),(t3,i6)}. t4 is inserted in (t1,i6), 1-st in the list. At the next step E becomes {(t1,i7),(t2,i6),(t3,i6),(t4,i7),(i6,i7)}; the edge selected is (t3,i6), 3-rd in the list, providing the resulting code is T2=(1,3)
Figure 3.
Figure 3.
Number of BNNI (red line) and BSPR (blue line) iterations per generation in PhyloES.

References

    1. Azouri D, Abadi S, Mansour Y. et al. Harnessing machine learning to guide phylogenetic-tree search algorithms. Nat Commun 2021;12:1983. - PMC - PubMed
    1. Bäck T. Evolutionary Algorithms in Theory and Practice: Evolution Strategies, Evolutionary Programming, Genetic Algorithms. Oxford, UK: Oxford University Press, 1996.
    1. Bartoli A, De Lorenzo A, Medvet E. et al. Multi-level diversity promotion strategies for grammar-guided genetic programming. Applied Soft Computing 2019;83:105599.
    1. Bordewich M, Gascuel O, Huber KT. et al. Consistency of topological moves based on the balanced minimum evolution principle of phylogenetic inference. IEEE/ACM Trans Comput Biol Bioinform 2009;6:110–7. - PubMed
    1. Brauer MJ, Holder MT, Dries LA. et al. Genetic algorithms and parallel processing in maximum-likelihood phylogeny inference. Mol Biol Evol 2002;19:1717–26. - PubMed