Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2014 Nov 18;111(46):16448-53.
doi: 10.1073/pnas.1407950111. Epub 2014 Nov 3.

Maximum likelihood inference of reticulate evolutionary histories

Affiliations
Comparative Study

Maximum likelihood inference of reticulate evolutionary histories

Yun Yu et al. Proc Natl Acad Sci U S A. .

Abstract

Hybridization plays an important role in the evolution of certain groups of organisms, adaptation to their environments, and diversification of their genomes. The evolutionary histories of such groups are reticulate, and methods for reconstructing them are still in their infancy and have limited applicability. We present a maximum likelihood method for inferring reticulate evolutionary histories while accounting simultaneously for incomplete lineage sorting. Additionally, we propose methods for assessing confidence in the amount of reticulation and the topology of the inferred evolutionary history. Our method obtains accurate estimates of reticulate evolutionary histories on simulated datasets. Furthermore, our method provides support for a hypothesis of a reticulate evolutionary history inferred from a set of house mouse (Mus musculus) genomes. As evidence of hybridization in eukaryotic groups accumulates, it is essential to have methods that infer reticulate evolutionary histories. The work we present here allows for such inference and provides a significant step toward putting phylogenetic networks on par with phylogenetic trees as a model of capturing evolutionary relationships.

Keywords: incomplete lineage sorting; maximum likelihood; phylogenetic networks; reticulate evolution.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig. 1.
Fig. 1.
Phylogenetic networks. Here, the MRCA of A and B split from its MRCA with C, and some time after A and B split, hybridization occurred between B and C. Four independent loci, ▲, ●, ■, and ♦, are illustrated, for which a single individual is sampled from each of A and C and six individuals are sampled from B. Two gene trees are depicted for the ▲ and ♦ loci, and both trees agree in terms of their shapes. However, the disagreement of the species splitting pattern with the gene tree in red is due to ILS, whereas the disagreement with the gene tree in blue is due to hybridization. Furthermore, the ▲ locus exhibits no evidence of hybridization in B, the ♦ locus has lost all signal of vertical inheritance from the MRCA of B with A, and the other two loci exhibit varying degrees of hybridization signal in the population. Locus-specific inheritance probabilities are needed to capture such scenarios.
Fig. 2.
Fig. 2.
Accuracy of the method on simulated data. (A) Data were generated down the phylogenetic network Ψ1 (all internal branches, except for the horizontal edge, have lengths of 1 coalescent unit, and the inheritance probability is 0.1 for all loci). Results based on gene tree topology estimates (B) and gene tree topology and branch length estimates (C) are shown. For every number of loci, the rightmost bar corresponds to inference from the true gene genealogies and the other three bars, from left to right, correspond to gene genealogies estimated (using 100 bootstrap replicates and Eq. 3) from sequences of lengths of 250, 500, and 1,000, respectively. The dark blue, cyan, and yellow regions correspond to the number of times each of the networks Ψ1, Ψ2, and Ψ3, respectively, in A was inferred. The maroon region corresponds to the number of times any other network with a single reticulation was inferred. Here, one individual was sampled per taxon for each of the loci.
Fig. 3.
Fig. 3.
Optimal phylogenetic network inferred on the house mouse (M. musculus) dataset. A single individual was sampled from each of five populations: M. m. domesticus from France (DF), M. m. domesticus from Germany (DG), M. m. musculus from the Czech Republic (MZ), M. m. musculus from Kazakhstan (MK), and M. m. musculus from China (MC). The analysis found multiple, almost equally optimal, phylogenetic networks with two reticulation events. These multiple networks all agreed on the recipient populations but disagreed on the donor populations. One hybridization (the top dashed horizontal arrow) involves the MRCA of DF and DG as a recipient population, yet seems to have involved MK, MC, or their MRCA as the donor population. The second hybridization (the bottom dashed horizontal arrow) involves MZ as a recipient population, yet seems to have involved DF, DG, or their MRCA as the donor population. Branch lengths in coalescent units (on the tree branches) and inheritance probabilities (on the horizontal edges) are shown (full details of the data and results are provided in SI Appendix).

References

    1. Bapteste E, et al. Networks: Expanding evolutionary thinking. Trends Genet. 2013;29(8):439–441. - PubMed
    1. Nakhleh L. Evolutionary phylogenetic networks: Models and issues. In: Heath L, Ramakrishnan N, editors. The Problem Solving Handbook for Computational Biology and Bioinformatics. Springer; Berlin: 2010. pp. 125–158.
    1. Huson D, Rupp R, Scornavacca C. Phylogenetic Networks: Concepts, Algorithms, and Applications. Cambridge Univ Press; New York: 2010.
    1. Nakhleh L. Computational approaches to species phylogeny inference and gene tree reconciliation. Trends Ecol Evol. 2013;28(12):719–728. - PMC - PubMed
    1. Green RE, et al. A draft sequence of the Neandertal genome. Science. 2010;328(5979):710–722. - PMC - PubMed

Publication types

LinkOut - more resources