Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jul 7;10(7):2317-2326.
doi: 10.1534/g3.120.401052.

Inferring Parameters of the Distribution of Fitness Effects of New Mutations When Beneficial Mutations Are Strongly Advantageous and Rare

Affiliations

Inferring Parameters of the Distribution of Fitness Effects of New Mutations When Beneficial Mutations Are Strongly Advantageous and Rare

Tom R Booker. G3 (Bethesda). .

Abstract

Characterizing the distribution of fitness effects (DFE) for new mutations is central in evolutionary genetics. Analysis of molecular data under the McDonald-Kreitman test has suggested that adaptive substitutions make a substantial contribution to between-species divergence. Methods have been proposed to estimate the parameters of the distribution of fitness effects for positively selected mutations from the unfolded site frequency spectrum (uSFS). Such methods perform well when beneficial mutations are mildly selected and frequent. However, when beneficial mutations are strongly selected and rare, they may make little contribution to standing variation and will thus be difficult to detect from the uSFS. In this study, I analyze uSFS data from simulated populations subject to advantageous mutations with effects on fitness ranging from mildly to strongly beneficial. As expected, frequent, mildly beneficial mutations contribute substantially to standing genetic variation and parameters are accurately recovered from the uSFS. However, when advantageous mutations are strongly selected and rare, there are very few segregating in populations at any one time. Fitting the uSFS in such cases leads to underestimates of the strength of positive selection and may lead researchers to false conclusions regarding the relative contribution adaptive mutations make to molecular evolution. Fortunately, the parameters for the distribution of fitness effects for harmful mutations are estimated with high accuracy and precision. The results from this study suggest that the parameters of positively selected mutations obtained by analysis of the uSFS should be treated with caution and that variability at linked sites should be used in conjunction with standing variability to estimate parameters of the distribution of fitness effects in the future.

Keywords: adaptation; beneficial mutations; distribution of fitness effects; population genetics.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Population genetic summary statistics collated across all simulated genes. αObs is the observed proportion of substitutions fixed by positive selection. πs0 is genetic diversity relative to neutral expectation (π0 = 0.01). SAdv./S is the proportion of segregating nonsynonymous sites that are advantageous in the simulated datasets.
Figure 2
Figure 2
The uSFS for advantageous mutations under different combinations of positive selection parameters. The three bar charts show observed uSFS from simulations that model positive selection parameters that yield similar α. The lines in each panel show the expected frequency spectra for different strengths of beneficial mutations and were obtained using Equation 2 from Tataru et al., (2017).
Figure 3
Figure 3
Estimates of the parameters of advantageous mutations and the proportion of adaptive substitutions they imply from simulated datasets. A) γa is the inferred selective effect of a new advantageous mutation; B) pa is the proportion of new mutations that are beneficial, the horizontal dashed gray lines indicate the simulated values in each case; C) αDFE is the proportion of adaptive substitutions expected under the inferred DFE, the dashed lines indicate αObs, the proportion of adaptive substitutions observed in the simulated datasets. Error bars indicate the 95% range of 100 bootstrap replicates.
Figure 4
Figure 4
The likelihood surface for the γa and pa parameters for three simulated datasets. Hue indicates differences in log likelihood between a particular parameter combination and the best-fitting model. Best fitting models are indicated by red points and the true parameters are given above the plots and indicated by the white plus signs on the likelihood surface. The relation γapa = 0.1 is shown as a turquoise line and is constant across the three datasets shown.

References

    1. Bailey S. F., and Bataillon T., 2016. Can the experimental evolution programme help us elucidate the genetic basis of adaptation in nature? Mol. Ecol. 25: 203–218. 10.1111/mec.13378 - DOI - PMC - PubMed
    1. Bank C., Hietpas R. T., Wong A., Bolon D. N., and Jensen J. D., 2014. A Bayesian MCMC approach to assess the complete distribution of fitness effects of new mutations: uncovering the potential for adaptive walks in challenging environments. Genetics 196: 841–852. 10.1534/genetics.113.156190 - DOI - PMC - PubMed
    1. Barton N. H., 2000. Genetic hitchhiking. Philos Trans R Soc L. B Biol Sci 355: 1553–1562. 10.1098/rstb.2000.0716 - DOI - PMC - PubMed
    1. Barton H. J., and Zeng K., 2018. New methods for inferring the distribution of fitness effects for INDELs and SNPs. Mol. Biol. Evol. 35: 1536–1546. 10.1093/molbev/msy054 - DOI - PMC - PubMed
    1. Böndel K. B., Kraemer S. A., Samuels T., McClean D., Lachapelle J. et al. , 2019. Inferring the distribution of fitness effects of spontaneous mutations in Chlamydomonas reinhardtii. PLoS Biol. 17: e3000192 10.1371/journal.pbio.3000192 - DOI - PMC - PubMed

LinkOut - more resources