Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jul 19;113(29):8002-9.
doi: 10.1073/pnas.1601074113. Epub 2016 Jul 18.

The probability of monophyly of a sample of gene lineages on a species tree

Affiliations

The probability of monophyly of a sample of gene lineages on a species tree

Rohan S Mehta et al. Proc Natl Acad Sci U S A. .

Abstract

Monophyletic groups-groups that consist of all of the descendants of a most recent common ancestor-arise naturally as a consequence of descent processes that result in meaningful distinctions between organisms. Aspects of monophyly are therefore central to fields that examine and use genealogical descent. In particular, studies in conservation genetics, phylogeography, population genetics, species delimitation, and systematics can all make use of mathematical predictions under evolutionary models about features of monophyly. One important calculation, the probability that a set of gene lineages is monophyletic under a two-species neutral coalescent model, has been used in many studies. Here, we extend this calculation for a species tree model that contains arbitrarily many species. We study the effects of species tree topology and branch lengths on the monophyly probability. These analyses reveal new behavior, including the maintenance of nontrivial monophyly probabilities for gene lineage samples that span multiple species and even for lineages that do not derive from a monophyletic species group. We illustrate the mathematical results using an example application to data from maize and teosinte.

Keywords: coalescent theory; monophyly; phylogeography.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig. 1.
Fig. 1.
Notation for computing monophyly probabilities above a species tree node x. Nodes xLL, xLR, and xR are leaves. S lineages appear in blue, C lineages in orange, and M lineages in green. The figure illustrates reciprocal monophyly. Sequentially listing the numbers of S, C, and M lineages as a vector, the outputs of branch x are nxO=(0,0,1). Inputs are nxL=(1,1,0) and nxR=(2,1,0). Farther down the tree, branch xL has inputs nxLL=(1,0,0) and nxLR=(0,1,0). Adopting the convention that leaf inputs enter from the left, branch xR has inputs nxRL=(4,1,0) and nxRR=(0,0,0). Descending one more level—which is only possible for xL—the inputs for branch xLL are nxLLL=(1,0,0) and nxLLR=(0,0,0), and for branch xLR, they are nxLRL=(0,2,0) and nxLRR=(0,0,0). Branch widths represent constant population sizes but do not indicate relative magnitudes of these sizes.
Fig. 2.
Fig. 2.
All cases required for computing combinatorial terms KS and KSC in monophyly probabilities. (AG) Cases for monophyly of S (Eq. 4). (H) A case for reciprocal monophyly (Eq. 5). In each panel, lineages coalesce from bottom to top, with the width of a shape corresponding to the number of lineages present. A single lineage is represented by a line, and multiple freely coalescing lineages are represented by shaded polygons with horizontal cross-section proportional to the number of extant lineages. Lineages represented in the same shape or in touching shapes can coalesce with each other. Lineage colors follow Fig. 1.
Fig. 3.
Fig. 3.
The effect on monophyly probabilities of changing two branch lengths in relation to each other. (A) Model species tree. If the branch length coefficient r is 0, then the tree has a polytomy, and if r=1, then the tree reduces to a two-species tree. (B–E) The probabilities of ES (Eq. 7) for monophyly of S for the tree in A under different scenarios: (B) (S1,S2,S3)=(2,0,2), (C1,C2,C3)=(2,2,2). (C) (S1,S2,S3)=(2,0,2), (C1,C2,C3)=(0,2,0). (D) (S1,S2,S3)=(2,2,0), (C1,C2,C3)=(2,0,2). (E) (S1,S2,S3)=(2,2,0), (C1,C2,C3)=(2,2,2).
Fig. 4.
Fig. 4.
The effect on monophyly probabilities of pooling lineages from separate species. (A–D) Model species trees. Labels record numbers of input lineages (S in blue, C in orange). (E–J) Probabilities of monophyly events. The trajectories represent species trees with six class-S lineages evenly distributed over one (A), two (B), three (C), and six (D) species. (E) ES (Eq. 7). (F) EC (Eq. 8). (G) ESC (Eq. 10). (H) ESC (Eq. 11). (I) ESC (Eq. 9). (J) ESC (Eq. 12).
Fig. 5.
Fig. 5.
Monophyly frequencies in maize and teosinte. (A) Model species tree. (B) Violin-plot distributions across lineage subsamples of monophyly frequencies for four clades. Means of the observed distributions (excluding outliers for the improved and parviglumis clades) appear as circles and theoretical values appear as triangles. Outliers appear for a single point at frequency 0.43 in the improved clade and for several points at frequency >0.17 in the parviglumis clade, with the cross indicating the mean of the parviglumis outliers (Supporting Information).

Similar articles

Cited by

References

    1. Tajima F. Evolutionary relationship of DNA sequences in finite populations. Genetics. 1983;105(2):437–460. - PMC - PubMed
    1. Takahata N, Nei M. Gene genealogy and variance of interpopulational nucleotide differences. Genetics. 1985;110(2):325–344. - PMC - PubMed
    1. Neigel J, Avise J. Phylogenetic relationships of mitochondrial DNA under various demographic models of speciation. In: Karlin S, Nevo E, editors. Evolutionary Processes and Theory. Academic Press; San Diego: 1986. pp. 515–534.
    1. Rosenberg NA. The shapes of neutral gene genealogies in two species: Probabilities of monophyly, paraphyly, and polyphyly in a coalescent model. Evolution. 2003;57(7):1465–1477. - PubMed
    1. Degnan JH, Salter LA. Gene tree distributions under the coalescent process. Evolution. 2005;59(1):24–37. - PubMed

Publication types

LinkOut - more resources