Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan 31;86(3):24.
doi: 10.1007/s11538-023-01244-2.

Ranked Subtree Prune and Regraft

Affiliations

Ranked Subtree Prune and Regraft

Lena Collienne et al. Bull Math Biol. .

Abstract

Phylogenetic trees are a mathematical formalisation of evolutionary histories between organisms, species, genes, cancer cells, etc. For many applications, e.g. when analysing virus transmission trees or cancer evolution, (phylogenetic) time trees are of interest, where branch lengths represent times. Computational methods for reconstructing time trees from (typically molecular) sequence data, for example Bayesian phylogenetic inference using Markov Chain Monte Carlo (MCMC) methods, rely on algorithms that sample the treespace. They employ tree rearrangement operations such as [Formula: see text] (Subtree Prune and Regraft) and [Formula: see text] (Nearest Neighbour Interchange) or, in the case of time tree inference, versions of these that take times of internal nodes into account. While the classic [Formula: see text] tree rearrangement is well-studied, its variants for time trees are less understood, limiting comparative analysis for time tree methods. In this paper we consider a modification of the classical [Formula: see text] rearrangement on the space of ranked phylogenetic trees, which are trees equipped with a ranking of all internal nodes. This modification results in two novel treespaces, which we propose to study. We begin this study by discussing algorithmic properties of these treespaces, focusing on those relating to the complexity of computing distances under the ranked [Formula: see text] operations as well as similarities and differences to known tree rearrangement based treespaces. Surprisingly, we show the counterintuitive result that adding leaves to trees can actually decrease their ranked [Formula: see text] distance, which may have an impact on the results of time tree sampling algorithms given uncertain "rogue taxa".

Keywords: Phylogenetics; Ranked trees; Time trees; Tree Rearrangements; Tree distance; Treespace.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Two rooted SPR moves. The cross marks the edge that is cut to prune the subtree T|v of T with leaf set {l1,l2}. In the tree on the left this subtree is reattached on the edge highlighted by a circle, connecting l5 with its parent, and in the tree on the right the pruned subtree is reattached as child of a newly introduced root ρ
Fig. 2
Fig. 2
A rooted tree Tu on the left and a ranked tree T=(Tu,rank) on the right
Fig. 3
Fig. 3
Rank move on the left, swapping the ranks of the nodes highlighted the leftmost tree. On the right an HSPR move at rank 2 is illustrated, moving the subtree induced by {l1,l2} by cutting the edge highlighted by a cross and re-attaching it at the edge highlighted by a circle
Fig. 4
Fig. 4
HSPR move pruning the subtree T|v, moving it to the edge (i+,i-). Dotted edges represent parts of the tree that potentially contain further nodes
Fig. 5
Fig. 5
Path between T and T2 as described in the proof of Theorem 4. Only subtrees involved in the moves given in the theorem are shown in this illustration
Fig. 6
Fig. 6
Trees T and R of the counterexample to the weak cluster property in RSPR and HSPR in Theorem 7. The labels of the arrows indicate leaves that are pruned in the corresponding HSPR moves
Fig. 7
Fig. 7
Shortest HSPR path between trees T and R. The subtree consisting of only the leaf a5 is moved by the first and the last HSPR move. The path displayed here does not preserve the shared cluster {a1,a2,a3} at rank three. The nodes inducing this cluster are highlighted in T and R. On this path only subtrees consisting of single leaves move
Fig. 8
Fig. 8
Trees T and R with cherry {x,y} at rank one and all moves on a path p from T to R that does not preserve the cherry as described in the proof of Theorem 9. The labels of the arrows indicate the leaves that are pruned and reattached by the corresponding HSPR moves
Fig. 9
Fig. 9
Trees T, T, and R on p, and alternative path p from T to R via T at the bottom if T and T are connected by an HSPR move at rank k and the nodes of rank k and k+1 in T are connected by an edge, as explained in case 2.1. The dotted parts of the trees might contain further nodes and leaves
Fig. 10
Fig. 10
Trees T, T, and R on p at the top, and alternative path p from T to R via T at the bottom if T and T are connected by an HSPR move at rank k and the nodes of rank k and k+1 in T are not connected by an edge. The dotted parts of the trees might contain further nodes and leaves
Fig. 11
Fig. 11
Trees T, T, and R on p, and alternative path p from T to R via T at the bottom if T and T are connected by an HSPR move at rank k+1 and the nodes of rank k and k+1 in T are connected by an edge, The dotted parts of the trees might contain further nodes and leaves
Fig. 12
Fig. 12
Trees T, T, and R on p at the top, and alternative path p from T to R via T at the bottom if T and T are connected by an HSPR move at rank k and the nodes of rank k and k+1 in T are not connected by an edge. The dotted parts of the trees might contain further nodes and leaves
Fig. 13
Fig. 13
Top: trees T and R with HSPR distance greater than or equal to n-12 (Lemma 6). Bottom: Adding leaf ln+1 to both T and R results in trees T and R that are connected by one HSPR move that moves l1

Similar articles

Cited by

References

    1. Aberer AJ, Krompass D, Stamatakis A. Pruning rogue taxa improves phylogenetic accuracy: an efficient algorithm and webservice. Syst Biol. 2013;62(1):162–166. doi: 10.1093/sysbio/sys078. - DOI - PMC - PubMed
    1. Allen BL, Steel M. Subtree transfer operations and their induced metrics on evolutionary trees. Ann Comb. 2001;5(1):1–15. doi: 10.1007/s00026-001-8006-8. - DOI
    1. Alves JM, Prado-López S, Cameselle-Teijeiro JM, Posada D. Rapid evolution and bio-geographic spread in a colorectal cancer. Nat Commun. 2019;10(11):5139. doi: 10.1038/s41467-019-12926-8. - DOI - PMC - PubMed
    1. Atkins R, McDiarmid C. Extremal distances for subtree transfer operations in binary trees. Ann Comb. 2019;23(1):1–26. doi: 10.1007/s00026-018-0410-4. - DOI
    1. Bordewich M, Semple C. On the computational complexity of the rooted subtree prune and regraft distance. Ann Comb. 2005;8(4):409–423. doi: 10.1007/s00026-004-0229-z. - DOI

Publication types

LinkOut - more resources