Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Mar;60(2):138-49.
doi: 10.1093/sysbio/syq084. Epub 2011 Jan 19.

Coalescent histories on phylogenetic networks and detection of hybridization despite incomplete lineage sorting

Affiliations

Coalescent histories on phylogenetic networks and detection of hybridization despite incomplete lineage sorting

Yun Yu et al. Syst Biol. 2011 Mar.

Abstract

Analyses of the increasingly available genomic data continue to reveal the extent of hybridization and its role in the evolutionary diversification of various groups of species. We show, through extensive coalescent-based simulations of multilocus data sets on phylogenetic networks, how divergence times before and after hybridization events can result in incomplete lineage sorting with gene tree incongruence signatures identical to those exhibited by hybridization. Evolutionary analysis of such data under the assumption of a species tree model can miss all hybridization events, whereas analysis under the assumption of a species network model would grossly overestimate hybridization events. These issues necessitate a paradigm shift in evolutionary analysis under these scenarios, from a model that assumes a priori a single source of gene tree incongruence to one that integrates multiple sources in a unifying framework. We propose a framework of coalescence within the branches of a phylogenetic network and show how this framework can be used to detect hybridization despite incomplete lineage sorting. We apply the model to simulated data and show that the signature of hybridization can be revealed as long as the interval between the divergence times of the species involved in hybridization is not too small. We reanalyze a data set of 106 loci from 7 in-group Saccharomyces species for which a species tree with no hybridization has been reported in the literature. Our analysis supports the hypothesis that hybridization occurred during the evolution of this group, explaining a large amount of the incongruence in the data. Our findings show that an integrative approach to gene tree incongruence and its reconciliation is needed. Our framework will help in systematically analyzing genomic data for the occurrence of hybridization and elucidating its evolutionary role.

PubMed Disclaimer

Figures

F<sc>IGURE</sc> 1.
FIGURE 1.
Phylogenetic networks and deep coalescence. Two valid coalescent histories that give rise to gene tree (A,(B,(C,D))) are shown within the branches of a phylogenetic network of four species. The effective population sizes, times t1 and t2, as well as the probability γ of a gene being inherited through hybridization, all determine the distribution of gene genealogies in the four species.1
F<sc>IGURE</sc> 2.
FIGURE 2.
The 26 possible gene tree topologies with four leaves. The numbers within the Newick format description of each tree correspond to the branch numbers in the phylogenetic network of Figure 1.2
F<sc>IGURE</sc> 3.
FIGURE 3.
The species tree of the yeast data set, as proposed by Rokas et al. a) The single optimal tree under the MDC criterion for the data set. The number of extra lineages resulting from reconciling all 106 gene trees within the branches of this tree is 127. b) The best suboptimal tree under the MDC criterion. The number of extra lineages resulting from reconciling all 106 gene trees within the branches of this tree is 134, which is just seven extra lineages away from the optimal value of 127 achieved by the tree in (a). The number on a branch indicates the number of extra lineages along that branch once all 106 gene trees are reconciled within the branches of the tree.3
F<sc>IGURE</sc> 4.
FIGURE 4.
Estimated γ (formula image in Equation (3)) from true gene trees. The variance of the estimates is lower than 0.001 in most cases, and reaches a maximum value of 0.008.4
F<sc>IGURE</sc> 5.
FIGURE 5.
Estimated γ (formula image in Equation (3)) from reconstructed gene trees. The variance of the estimates is lower than 0.001 in most cases, and reaches a maximum value of 0.008.
F<sc>IGURE</sc> 6.
FIGURE 6.
Theoretical distribution of 15 gene tree topologies. White bars: γ = 0; gray bars: γ = 0.3; black bars: γ = 0.5.
F<sc>IGURE</sc> 7.
FIGURE 7.
Three hybridization scenarios for the yeast data set. Each of the networks requires 69 extra lineages to reconcile all 106 gene trees, and depicts a slightly different hybridization scenario. The number on a branch indicates the number of extra lineages along that branch once all 106 gene trees are reconciled within the branches of the network.

References

    1. Arnold ML. Natural hybridization and evolution. Oxford: Oxford University Press; 1997.
    1. Bloomquist EW, Suchard MA. Unifying vertical and nonvertical evolution: a stochastic ARG-based framework. Syst. Biol. 2010;59:27–41. - PMC - PubMed
    1. Cranston KA, Hurwitz B, Ware D, Stein L, Wing RA. Species trees from highly incongruent gene trees in rice. Syst. Biol. 2009;58:489–500. - PubMed
    1. Degnan JH, Rosenberg NA. Gene tree discordance, phylogenetic inference and the multispecies coalescent. Trends Ecol. Evol. 2009;24:332–340. - PubMed
    1. Degnan JH, Salter LA. Gene tree distributions under the coalescent process. Evolution. 2005;59:24–37. - PubMed

Publication types