Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Aug 1;34(8):1863-1877.
doi: 10.1093/molbev/msx154.

Soft Sweeps Are the Dominant Mode of Adaptation in the Human Genome

Affiliations

Soft Sweeps Are the Dominant Mode of Adaptation in the Human Genome

Daniel R Schrider et al. Mol Biol Evol. .

Abstract

The degree to which adaptation in recent human evolution shapes genetic variation remains controversial. This is in part due to the limited evidence in humans for classic "hard selective sweeps", wherein a novel beneficial mutation rapidly sweeps through a population to fixation. However, positive selection may often proceed via "soft sweeps" acting on mutations already present within a population. Here, we examine recent positive selection across six human populations using a powerful machine learning approach that is sensitive to both hard and soft sweeps. We found evidence that soft sweeps are widespread and account for the vast majority of recent human adaptation. Surprisingly, our results also suggest that linked positive selection affects patterns of variation across much of the genome, and may increase the frequencies of deleterious mutations. Our results also reveal insights into the role of sexual selection, cancer risk, and central nervous system development in recent human evolution.

Keywords: adaptation; machine learning; population genomics; selective sweeps; soft sweeps.

PubMed Disclaimer

Figures

<sc>Fig</sc>. 1.
Fig. 1.
Enrichment of various annotation features in regions classified as sweeps or linked to sweeps relative. The fold enrichment is the ratio of the number of base pairs in the intersection between windows assigned to a given class and an annotation feature divided by the mean of this intersection across the permuted data sets (see Materials and Methods). This was calculated separately for each population. (A) Enrichment of elements in windows classified as hard sweeps. (B) Same as A, but for soft sweeps. (C) Enrichment of elements in windows classified as affected by linked hard sweeps. (D) Linked soft sweeps.
<sc>Fig</sc>. 2.
Fig. 2.
The number of windows assigned to each class by S/HIC in each population.
<sc>Fig</sc>. 3.
Fig. 3.
Enrichment of pairs of interacting genes each falling within a window classified as a sweep. The fold enrichment is the ratio of the number of pairs of interacting genes overlapping a window classified as a sweep of a given type divided by the mean of this number across the permuted data sets (see Materials and Methods). This was calculated separately for each population. When no pairs of interacting sweep genes were observed in our true data set or a population, no bar was drawn. (A) Enrichment of pairs of genes encoding protein products that physically interact with each other (data from BioGRID) and both overlap hard sweep windows. (B) Same as A, but for soft sweeps. (C) Enrichment of pairs of genes, one of which is encodes a transcription factor that affects expression of the other (data from ORegAnno), where both overlap hard sweep windows. (D) Same as D, but for soft sweeps. (E) Enrichment of pairs of genes for which a genetic interaction has been observed (data from BioGRID) and both overlap hard sweep windows. (F) Same as E, but for soft sweeps.
<sc>Fig</sc>. 4.
Fig. 4.
Hard selective sweep near several SPATA31 spermatogenesis-associated genes. The S/HIC classification tracks show the raw classifier output for each population (red = hard sweep, blue = soft sweep, light red = hard-linked, light blue = soft-linked, black = neutral). We also show the values of various population genetic summary and test statistics (π, Tajima’s D, Kelly’s ZnS, and the SweepFinder composite likelihood ratio, or CLR). To avoid clutter, we only show statistics from CEU.
<sc>Fig</sc>. 5.
Fig. 5.
Soft selective sweeps near CADM1. The same tracks are shown as in figure 4.

References

    1. Akey JM. 2009. Constructing genomic maps of positive selection in humans: where do we go from here? Genome Res. 19:711–722. - PMC - PubMed
    1. Alexander DH, Novembre J, Lange K.. 2009. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19:1655–1664. - PMC - PubMed
    1. Allinen M, Peri L, Kujala S, Lahti‐Domenici J, Outila K, Karppinen SM, Launonen V, Winqvist R.. 2002. Analysis of 11q21–24 loss of heterozygosity candidate target genes in breast cancer: indications of TSLC1 promoter hypermethylation. Genes Chromosomes Cancer 34:384–389. - PubMed
    1. Amberger JS, Bocchini CA, Schiettecatte F, Scott AF, Hamosh A.. 2015. OMIM. org: online Mendelian inheritance in man (OMIM®), an online catalog of human genes and genetic disorders. Nucleic Acids Res. 43:D789–D798. - PMC - PubMed
    1. Andolfatto P. 2007. Hitchhiking effects of recurrent beneficial amino acid substitutions in the Drosophila melanogaster genome. Genome Res. 17:1755–1762. - PMC - PubMed

Publication types

LinkOut - more resources