Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Jan 26;2(1):9-21.
doi: 10.1002/evl3.37. eCollection 2018 Feb.

Ancient genomic variation underlies repeated ecological adaptation in young stickleback populations

Affiliations

Ancient genomic variation underlies repeated ecological adaptation in young stickleback populations

Thomas C Nelson et al. Evol Lett. .

Abstract

Adaptation in the wild often involves standing genetic variation (SGV), which allows rapid responses to selection on ecological timescales. However, we still know little about how the evolutionary histories and genomic distributions of SGV influence local adaptation in natural populations. Here, we address this knowledge gap using the threespine stickleback fish (Gasterosteus aculeatus) as a model. We extend restriction site-associated DNA sequencing (RAD-seq) to produce phased haplotypes approaching 700 base pairs (bp) in length at each of over 50,000 loci across the stickleback genome. Parallel adaptation in two geographically isolated freshwater pond populations consistently involved fixation of haplotypes that are identical-by-descent. In these same genomic regions, sequence divergence between marine and freshwater stickleback, as measured by dXY , reaches tenfold higher than background levels and genomic variation is structured into distinct marine and freshwater haplogroups. By combining this dataset with a de novo genome assembly of a related species, the ninespine stickleback (Pungitius pungitius), we find that this habitat-associated divergent variation averages six million years old, nearly twice the genome-wide average. The genomic variation that is involved in recent and rapid local adaptation in stickleback has therefore been evolving throughout the 15-million-year history since the two species lineages split. This long history of genomic divergence has maintained large genomic regions of ancient ancestry that include multiple chromosomal inversions and extensive linked variation. These discoveries of ancient genetic variation spread broadly across the genome in stickleback demonstrate how selection on ecological timescales is a result of genome evolution over geological timescales, and vice versa.

Keywords: Adaptation; RAD‐seq; evolutionary genomics; speciation islands.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Stickleback sampling and RAD sequencing to measure haplotype variation. (A) Threespine stickleback sampling locations in this study. Colors represent habitat type: red: marine; blue: freshwater. Number of haploid genome sampled is shown. (B–D) We modified the original RAD‐seq protocol to generate local haplotypes. Colored bars represent polymorphic sites. For a detailed description of haplotype construction, see Methods. (B) Overlapping paired‐end reads are anchored to PstI restriction sites. (C) Paired reads mapping to each half‐site are merged into contigs. Contigs mapping to the same restriction site are identified by alignment to the reference genome. (D) Sequences from each half of a restriction site are phased to generate a single RAD locus. RAD tags in the background represent multiple genotypes used in phasing.
Figure 2
Figure 2
The genealogical structure of parallel genomic divergence. (A) Genome‐wide FST for both marine‐freshwater comparisons was kernel‐smoothed using a normally distributed kernel with a window size of 500 kb. Inverted triangles indicate the locations of two genes known to show extensive marine‐freshwater haplotype divergence, eda and atp1a1. Three chromosomal inversions are highlighted in yellow. (B) Lineage sorting patterns were identified from maximum clade credibility trees for each RAD locus. Blue bars: haplotypes from both freshwater populations form a single monophyletic group; red: haplotypes from the marine population form a monophyletic group; black: A RAD locus is structured into reciprocally monophyletic marine and freshwater haplogroups.
Figure 3
Figure 3
Extensive sequence divergence between marine and freshwater haplogroups accompanies reciprocal monophyly. For each reciprocally monophyletic RAD locus, we calculated sequence variation (π) within and sequence divergence between habitat types (dXY). Each RAD locus is shown as a pair of lines connecting estimates of π and dXY. Boxplots show distributions across all reciprocally monophyletic RAD loci: Boxes are upper and lower quartiles, including the median; whiskers extend to 1.5 × interquartile range. Dashed lines are the genome‐wide medians. Single RAD loci from within the transcribed regions of eda and atp1a1 are shown as gold and green lines, respectively, and presented as haplotype networks. Dots represent mutational steps. Circle sizes indicate the number of haplotypes and colors indicate population of origin as in Figure 1. Each network = 28 haplotypes.
Figure 4
Figure 4
Marine‐freshwater divergence has evolved over millions of years, affecting large genomic regions. We performed Bayesian estimation of the time to the most recent common ancestor (TMRCA) of alleles at threespine stickleback RAD loci. We calibrated coalescence times within threespine stickleback by including a de novo genome assembly from the ninespine stickleback (Pungitius pungitius) and setting threespine‐ninespine divergence at 15 million years ago. (A) Maximum clade credibility RAD gene tree representative of the genome‐wide average TMRCA. Branches within threespine are colored by population of origin. (B) Kernel‐smoothed densities of TMRCA distributions for all RAD loci containing a monophyletic group of threespine stickleback alleles (light gray) and those structured into reciprocally monophyletic marine and freshwater haplogroups. (C) The genomic distribution of reciprocally monophyletic RAD loci (black, as in Fig. 2) is associated with increased TMRCA at a genomic scale. TMRCA outlier windows (those exceeding 99.9% of permuted genomic windows) are shown as gray bars. Genome‐wide TMRCA was kernel‐smoothed using a normally distributed kernel with a window size of 500 kb. Inverted triangles indicate the locations of eda and atp1a1. Three chromosomal inversions are highlighted in yellow.

References

    1. Aeschbacher, S. , Selby J. P., Willis J. H., and Coop G.. 2017. Population‐genomic inference of the strength and timing of selection against gene flow. Proc. Natl. Acad. Sci. USA 114:7061–7066. - PMC - PubMed
    1. Aldenhoven, J. T. , Miller M. A., Corneli P. S., and Shapiro M. D.. 2010. Phylogeography of ninespine sticklebacks (Pungitius pungitius) in North America: glacial refugia and the origins of adaptive traits. Mol. Ecol. 19:4061–4076. - PubMed
    1. Arnegard, M. E. , McGee M. D., Matthews B., Marchinko K. B., Conte G. L., Kabir S., Bedford N., Bergek S., Chan Y. F., Jones F. C., et al. 2014. Genetics of ecological divergence during speciation. Nature 511:307–311. - PMC - PubMed
    1. Arnold, B. , Corbett‐Detig R. B., Hartl D., and Bomblies K.. 2013. RADseq underestimates diversity and introduces genealogical biases due to nonrandom haplotype sampling. Mol. Ecol. 22:3179–3190. - PubMed
    1. Aronesty, E. 2011. ea‐utils: command‐line tools for processing biological sequencing data. Expression Analysis, Durham, NC.