Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Aug 22;86(10):121.
doi: 10.1007/s11538-024-01350-9.

Bounding the Softwired Parsimony Score of a Phylogenetic Network

Affiliations

Bounding the Softwired Parsimony Score of a Phylogenetic Network

Janosch Döcker et al. Bull Math Biol. .

Abstract

In comparison to phylogenetic trees, phylogenetic networks are more suitable to represent complex evolutionary histories of species whose past includes reticulation such as hybridisation or lateral gene transfer. However, the reconstruction of phylogenetic networks remains challenging and computationally expensive due to their intricate structural properties. For example, the small parsimony problem that is solvable in polynomial time for phylogenetic trees, becomes NP-hard on phylogenetic networks under softwired and parental parsimony, even for a single binary character and structurally constrained networks. To calculate the parsimony score of a phylogenetic network N, these two parsimony notions consider different exponential-size sets of phylogenetic trees that can be extracted from N and infer the minimum parsimony score over all trees in the set. In this paper, we ask: What is the maximum difference between the parsimony score of any phylogenetic tree that is contained in the set of considered trees and a phylogenetic tree whose parsimony score equates to the parsimony score of N? Given a gap-free sequence alignment of multi-state characters and a rooted binary level-k phylogenetic network, we use the novel concept of an informative blob to show that this difference is bounded by k + 1 times the softwired parsimony score of N. In particular, the difference is independent of the alignment length and the number of character states. We show that an analogous bound can be obtained for the softwired parsimony score of semi-directed networks, while under parental parsimony on the other hand, such a bound does not hold.

Keywords: Level-k; Parental parsimony; Phylogenetic networks; Softwired parsimony.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
A phylogenetic network N (left), an embedding S of a phylogenetic X-tree displayed by N together with a binary character f and an extension F (middle; also indicated by the dashed lines in the left panel), and the phylogenetic X-tree T obtained from S by suppressing vertices of in-degree one and out-degree one (right)
Fig. 2
Fig. 2
An example of a blob reduction applied to a phylogenetic network N that replaces the maximal blob B with source s and cl(s)={x1,x2,x3} with a single new leaf y to obtain N(Y¯)
Fig. 3
Fig. 3
An embedding S of a phylogenetic X-tree that is displayed by the phylogenetic network N on X as shown in Fig. 2, the cluster tree pair (S(Y),S(Y¯)) of S relative to the blob B that is also shown in Fig. 2, and a pair of cluster extensions FY and FY¯ with respect to the character f on X. For reasons of clarity, the extensions FY and FY¯ to the vertices of S(Y) and S(Y¯), respectively, are only shown for some vertices. Observe that FY¯(y)=FY(ρY) and that all leaves of S(Y) and S(Y¯), except for y, are assigned to the same character state as in S
Fig. 4
Fig. 4
A level-k network N on X={x0}{xi,xi,xi:1ik} (left) and two phylogenetic X-trees T and T displayed by N together with a binary character f and extensions F and F to V(T) and V(T), respectively (right)
Fig. 5
Fig. 5
A phylogenetic network N on X={x1,x2,,x4}, a phylogenetic X-tree T such that TP(N) and TD(N), and the multilabelled tree U(N). The dashed lines in N indicate how T can be drawn inside N
Fig. 6
Fig. 6
A level-1 network N on X={x1,x2,,xn+1} with n being even and two parental trees T and T of N together with a binary character f on X. Here, PSpa(N,f)=PS(f,T)=1, whereas PS(f,T)=n/2
Fig. 7
Fig. 7
An unrooted level-1 network U (left) with its display set D(U) (middle), and two orientations N and N of U (right) with their respective display sets indicated by a dashed rectangle. More precisely, D(N) and D(N) can be obtained by orienting the enclosed unrooted phylogenetic X-trees

References

    1. Allman ES, Baños H, Rhodes JA (2019) NANUQ: a method for inferring species networks from gene trees under the coalescent model. Algorithms Mol Biol 14:1–25 - PMC - PubMed
    1. Cardona G, Rossello F, Valiente G (2009) Comparison of tree-child phylogenetic networks. IEEE/ACM Trans Comput Biol Bioinf 6(4):552–569 - PubMed
    1. Döcker J, Linz S, Semple C (2024) Hypercubes and Hamilton cycles of display sets of rooted phylogenetic networks. Adv Appl Math 152:102595
    1. Felsenstein J (2004) Inferring phylogenies. Sinauer Associates, New York
    1. Fitch WM (1971) Toward defining the course of evolution: minimum change for a specific tree topology. Syst Biol 20(4):406–416

Grants and funding

LinkOut - more resources