Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 May 15:7:13.
doi: 10.1186/1748-7188-7-13. eCollection 2012.

Tree-average distances on certain phylogenetic networks have their weights uniquely determined

Affiliations

Tree-average distances on certain phylogenetic networks have their weights uniquely determined

Stephen J Willson. Algorithms Mol Biol. .

Abstract

A phylogenetic network N has vertices corresponding to species and arcs corresponding to direct genetic inheritance from the species at the tail to the species at the head. Measurements of DNA are often made on species in the leaf set, and one seeks to infer properties of the network, possibly including the graph itself. In the case of phylogenetic trees, distances between extant species are frequently used to infer the phylogenetic trees by methods such as neighbor-joining. This paper proposes a tree-average distance for networks more general than trees. The notion requires a weight on each arc measuring the genetic change along the arc. For each displayed tree the distance between two leaves is the sum of the weights along the path joining them. At a hybrid vertex, each character is inherited from one of its parents. We will assume that for each hybrid there is a probability that the inheritance of a character is from a specified parent. Assume that the inheritance events at different hybrids are independent. Then for each displayed tree there will be a probability that the inheritance of a given character follows the tree; this probability may be interpreted as the probability of the tree. The tree-average distance between the leaves is defined to be the expected value of their distance in the displayed trees. For a class of rooted networks that includes rooted trees, it is shown that the weights and the probabilities at each hybrid vertex can be calculated given the network and the tree-average distances between the leaves. Hence these weights and probabilities are uniquely determined. The hypotheses on the networks include that hybrid vertices have indegree exactly 2 and that vertices that are not leaves have a tree-child.

Keywords: digraph; distance; hybrid; metric; network; normal network; phylogeny; tree-child.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A network N with root 1, and the two trees Np and Np' that it displays. If N is equiprobable, then 7 inherits approximately half its characters from 6 and the other characters from 8.
Figure 2
Figure 2
A minimal configuration needed to be able to find the probability α1 = α(q1, h0) = 1 - α2 that a character state in h0 is inherited from q1.
Figure 3
Figure 3
This tree-child network has X = {1, 2, 3, 4}. There are 7 arcs not leading to a hybrid vertex but only 6 distances, and the weights are not uniquely determined. The network is not normal because arc (5, 10) is redundant.
Figure 4
Figure 4
A normal phylogenetic X-network with X = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11} and root 1. The root corresponds to an outgroup species. Measurements on DNA are assumed possible on members of X.
Figure 5
Figure 5
The situation of Lemma 4.2 (a), the situation of Lemmas 4.4 and 4.7 (b). If p is a parent map with p(a) = q1, the figure shows part of Np together with the arc (q2, a).
Figure 6
Figure 6
Case A1B3.
Figure 7
Figure 7
Case A2B3(a), Case A3B3(b).

References

    1. Bandelt H-J, Dress A. Split decomposition: a new and useful approach to phylogenetic analysis of distance data. Molecular Phylogenetics and Evolution. 1992;1:242–252. doi: 10.1016/1055-7903(92)90021-8. - DOI - PubMed
    1. Baroni M, Semple C, Steel M. A framework for representing reticulate evolution. Annals of Combinatorics. 2004;8:391–408.
    1. Moret BME, Nakhleh L, Warnow T, Linder CR, Tholse A, Padolina A, Sun J, Timme R. Phylogenetic networks: modeling, reconstructibility, and accuracy. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 2004;1:13–23. doi: 10.1109/TCBB.2004.10. - DOI - PubMed
    1. Nakhleh L, Warnow T, Linder CR. In: Proceedings of the Eighth Annual International Conference on Computational Molecular Biology (RECOMB '04 March 27-31, 2004. Bourne PE, Gusfield D, editor. San Diego, California), ACM, New York; 2004. Reconstructing reticulate evolution in species-theory and practice; pp. 337–346.
    1. Huson D, Rupp R, Scornavacca C. Phylogenetic Networks: Concepts, Algoriithms and Applications. Cambridge, Cambridge University Press; 2010.

LinkOut - more resources