Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Sep;62(5):689-706.
doi: 10.1093/sysbio/syt032. Epub 2013 May 7.

Inferring phylogeny and introgression using RADseq data: an example from flowering plants (Pedicularis: Orobanchaceae)

Affiliations

Inferring phylogeny and introgression using RADseq data: an example from flowering plants (Pedicularis: Orobanchaceae)

Deren A R Eaton et al. Syst Biol. 2013 Sep.

Abstract

Phylogenetic relationships among recently diverged species are often difficult to resolve due to insufficient phylogenetic signal in available markers and/or conflict among gene trees. Here we explore the use of reduced-representation genome sequencing, specifically in the form of restriction-site associated DNA (RAD), for phylogenetic inference and the detection of ancestral hybridization in non-model organisms. As a case study, we investigate Pedicularis section Cyathophora, a systematically recalcitrant clade of flowering plants in the broomrape family (Orobanchaceae). Two methods of phylogenetic inference, maximum likelihood and Bayesian concordance, were applied to data sets that included as many as 40,000 RAD loci. Both methods yielded similar topologies that included two major clades: a "rex-thamnophila" clade, composed of two species and several subspecies with relatively low floral diversity, and geographically widespread distributions at lower elevations, and a "superba" clade, composed of three species characterized by relatively high floral diversity and isolated geographic distributions at higher elevations. Levels of molecular divergence between subspecies in the rex-thamnophila clade are similar to those between species in the superba clade. Using Patterson's D-statistic test, including a novel extension of the method that enables finer-grained resolution of introgression among multiple candidate taxa by removing the effect of their shared ancestry, we detect significant introgression among nearly all taxa in the rex-thamnophila clade, but not between clades or among taxa within the superba clade. These results suggest an important role for geographic isolation in the emergence of species barriers, by facilitating local adaptation and differentiation in the absence of homogenizing gene flow.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Sampling localities and photographs of taxa in Pedicularis sect. Cyathophora. Codes for taxon names are listed in Table 1.
Figure 2
Figure 2
Coalescent model used to simulate RADseq-like data. Unidirectional migration (gene flow) from taxon Ci into Bi (solid gray arrow) occurred over 10,000 generations, beginning 50,000 generations in the past. Three data sets were created with different rates of gene flow, and one additional data set with gene flow from both C and Ci into Bi (dashed gray arrow).
FIGURE 3
FIGURE 3
A schematic description of Patterson’s four-taxon D-statistic test for hybridization and the five-taxon partitioned D-statistic test. a) The four-taxon test measures asymmetry in the occurrence of two incongruent allele patterns (shown in the box), which are expected to arise with equal frequencies in the absence of introgression, but to deviate from symmetry if P3 exchanged genes with either P1 or P2 to the exclusion of the other. b) The five-taxon test samples an additional population within the P3 lineage, the two samples now denoted P31 and P32, and measures asymmetry in three sets of allele patterns (the three boxes shown), thus measuring three separate D-statistics. The first (D1) measures asymmetry in incongruent alleles where the derived allele is present in P31 but not P32, the second (D2) where it is present in P32 but not P31, and the third where the derived allele is shared by both P31 and P32, having arisen in their ancestor (branch indicated by **).
FIGURE 4
FIGURE 4
Phylogeny of Pedicularis sect. Cyathophora inferred from RADseq data. ML trees were estimated from the sparse (minimum-taxa) supermatrix a) and the densely sampled (full-taxa) supermatrix b), yielding high bootstrap support (100 except where indicated). Primary concordance trees c, d) and population trees e, f) were inferred on 945 variable loci from the full-taxa supermatrix, at either α = 0.01 (c, e) or 100 (d, f). The 95% CI for CFs are shown on primary concordance trees, those in bold did not overlap with any conflicting CF. Quartet CFs are shown on population trees.
Figure 5
Figure 5
Censored data sets, in which one or more taxa have been removed to minimize the effect of their introgressed alleles, are used to compare alternative topologies within the Shimodeira–Hasegawa test. The ML topology with P. rex subsp. rex and P. rex subsp. lipskyana removed a) is compared with an alternative topology in which P. thamnophila is monophyletic b) Similarly, the ML topology with both subspecies of P. thamnophila removed c) is compared with two alternative resolutions for the three subspecies of P. rex d) and e).
Figure 6
Figure 6
Temporal and geographic reconstructions of population divergence and gene flow, as inferred from the partitioned D-statistic. Codes for taxon names are listed in Table 1. a) Phylogeny of rex-thamnophila based on censored subtree comparisons which account for hybrid introgression. Gray bars indicate distinct time periods discussed in the text, and arrows are the direction of introgressive gene flow. b) Schematic reconstruction of historical biogeography in the rex-thamnophila clade, inferred from present-day distributions and patterns of historical introgression. Overlapping edges indicate geographic overlap, and arrows represent introgression.

References

    1. Ané C. Reconstructing concordance trees and testing the coalescent model from genome-wide data sets. In: Knowles L.L., Kubatko L.S., editors. Estimating species trees: practical and theoretical aspects. Hoboken, NJ: John Wiley & Sons; 2010. pp. 35–51.
    1. Ané C., Larget B., Baum D.A., Smith S.D., Rokas A. Bayesian estimation of concordance among gene trees. Mol. Biol. Evol. 2007;24:412–426. - PubMed
    1. Armbruster S., Edwards M., Debevec E. Floral character displacement generates assemblage structure of Western Australian triggerplants (Stylidium) Ecology. 1994;75:315–329.
    1. Arnold M.L., Ballerini E.S., Brothers A.N. Hybrid fitness, adaptation and evolutionary diversification: lessons learned from Louisiana Irises. Heredity. 2012;108:159–166. - PMC - PubMed
    1. Baird N., Etter P., Atwood T., Currey M., Shiver A., Lewis Z., Selker E., Cresko W., Johnson E. Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS One. 2008;3:e3376. - PMC - PubMed

Publication types