Weak selection and protein evolution

Hiroshi Akashi¹, Naoki Osada, Tomoko Ohta

Affiliations

PMID: 22964835
PMCID: PMC3430532
DOI: 10.1534/genetics.112.140178

Review

Weak selection and protein evolution

Hiroshi Akashi et al. Genetics. 2012 Sep.

. 2012 Sep;192(1):15-31.

doi: 10.1534/genetics.112.140178.

Authors

Hiroshi Akashi¹, Naoki Osada, Tomoko Ohta

Affiliation

¹ Division of Evolutionary Genetics, Department of Population Genetics, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan. hakashi@nig.ac.jp

PMID: 22964835
PMCID: PMC3430532
DOI: 10.1534/genetics.112.140178

Abstract

The "nearly neutral" theory of molecular evolution proposes that many features of genomes arise from the interaction of three weak evolutionary forces: mutation, genetic drift, and natural selection acting at its limit of efficacy. Such forces generally have little impact on allele frequencies within populations from generation to generation but can have substantial effects on long-term evolution. The evolutionary dynamics of weakly selected mutations are highly sensitive to population size, and near neutrality was initially proposed as an adjustment to the neutral theory to account for general patterns in available protein and DNA variation data. Here, we review the motivation for the nearly neutral theory, discuss the structure of the model and its predictions, and evaluate current empirical support for interactions among weak evolutionary forces in protein evolution. Near neutrality may be a prevalent mode of evolution across a range of functional categories of mutations and taxa. However, multiple evolutionary mechanisms (including adaptive evolution, linked selection, changes in fitness-effect distributions, and weak selection) can often explain the same patterns of genome variation. Strong parameter sensitivity remains a limitation of the nearly neutral model, and we discuss concave fitness functions as a plausible underlying basis for weak selection.

PubMed Disclaimer

Figures

**Figure 1**
Polymorphism and divergence under weak selection. Expected levels of nucleotide diversity and DNA divergence (each relative to neutral mutations) are shown. The dotted line represents nucleotide diversity (probability of observing a polymorphism at a given nucleotide site in a pair of randomly chosen chromosomes) and is calculated using sampling formulas from Sawyer and Hartl (1992), assuming an infinite-sites mutation model and constant N_e. The solid line shows the fixation probabilities of mutations (Kimura 1962). The plots assume directional (genic) selection with fitness values 1, 1 + 1/2s, and 1 + s for the homozygote for the ancestral allele, heterozygote, and homozygote for a new mutation, respectively. The plots assume independent evolution among sites and are based on Kimura (1983, p. 44).

**Figure 2**
Example evolutionary patterns under slightly deleterious mutations. (A) A probability density function for negative selection coefficients (gamma distribution with shape parameter 0.2 and scale parameter 0.05). The area under the curve gives the proportion of mutations in a given fitness range. This distribution of s was chosen to allow substantial increases in the effectively neutral proportion for population sizes in the range 10²–10⁸ and is assumed in plots in B, C, and D. Under this distribution of selective effects (DSEs), <25% of newly arising mutations have s < −0.01 and <2% of mutations have s < −0.1. (B) Cumulative distribution function for selective effects of new mutations. y-axis values are the total areas under the curve in A for x < s < 0. f′_n, the proportion of “effectively neutral” mutations, −1 < N_es ≤ 0, for a given population size is the y-axis value at x = 1/N_e (values are marked for N_e of 10², 10⁴, 10⁶, 10⁸). (C) Cumulative distribution function for N_es. y-axis values are the areas under the DSE curve for x < N_es < 0 in A. Curves are shown for N_e of 10², 10⁴, 10⁶, and 10⁸ (thicker lines represent larger population sizes). f′_n values are indicated (as solid circles) for each population size. (D) Polymorphism and divergence as a function of N_e. Expected DNA diversity (π_N/π_S, dotted line) and divergence (d_N/d_S, solid line) are shown. The dashed line shows f′_n, and values for N_e of 10², 10⁴, 10⁶, and 10⁸ are marked. Expected divergence is smaller than f′_n because selection reduces fixation rates for slightly deleterious mutations within this range. π_N/π_S values are higher than f′_n because mutations in the range N_es < −1 contribute to polymorphism (Figure 1). These plots assume independent evolution among sites.

**Figure 3**
Example evolutionary patterns for slightly deleterious and advantageous mutations. (A) Probability density function for positive selection coefficients (gamma distribution with shape parameter 1 and scale parameter 5 × 10⁻⁷). (B) Cumulative distribution function for selective effects of new mutations. y-Axis values are the total areas under the curve for 0 < s < x in A. f′_n for a given population size is the y-axis value at x = 1/N_e (values are marked for N_e of 10⁴, 10⁶, 10⁸). Almost all beneficial mutations are effectively neutral in N_e of 10² and 10⁴, and almost none are effectively neutral in N_e of 10⁸. (C) Cumulative distribution function for N_es or scaled selective effects. y-Axis values are the areas under the DSE curve for 0 < N_es < x in A. Curves are shown for N_e values of 10², 10⁴, 10⁶, and 10⁸ (thicker lines represent larger population sizes). f′_n values are shown as solid circles for each population size. (D) Polymorphism and divergence as a function of N_e for a distribution of fitness effects that combines the density functions in Figure 2A (99% of new mutations) and Figure 3A (1% of new mutations). The dashed line shows the proportion of advantageous fixations. Expected DNA diversity (π_N/π_S, dotted line) and divergence (d_N/d_S, solid line) are shown. These predictions assume independent evolution among sites.

**Figure 4**
Levels of nonsynonymous and synonymous DNA polymorphism among populations. DNA diversity for nonsynonymous mutations (scaled to DNA diversity for synonymous mutations) is plotted against DNA diversity for synonymous mutations (an estimate of population size). π_S is a proxy for population size if mutation rates are similar among the species compared. Note that statistical analyses of such data must account for the contribution of π_S to both axes (Piganeau and Eyre-Walker 2009; Elyashiv *et al.* 2010). Common symbols in each plot indicate the same set of genes compared among species. Data are shown for taxa for which six or more independent populations have been sampled for ≥20 nuclear genes. See Table S1 for species names, sample numbers, number of loci, and references (as well as data for a more limited number of *Drosophila* species).

**Figure 5**
Concave fitness functions and near neutrality. The curve y = x/(1 + x) shows a hypothetical relationship between fitness and phenotypic values of a trait. The slope of the curve at a given point determines the fitness effect of small phenotypic changes (slopes are shown for phenotypic values of 1 and 3). The slope decreases as a function of the phenotypic value (*i.e.*, the distribution of s changes with character values). If a large fraction of mutations have small phenotypic effects and if the rate of mutation to deleterious alleles is higher than the rate to advantageous mutations, populations will evolve to a point on the curve where slightly deleterious mutations that move the population away from the optimum will be balanced by weak positive selection. The left and right points marked in the figure correspond to equilibrium points in species with small and large population sizes, respectively (this assumes constant mutation rates and population sizes).

See this image and copyright information in PMC

References

1. Abbot P., Moran N. A., 2002. Extremely low levels of genetic polymorphism in endosymbionts (Buchnera) of aphids (Pemphigus). Mol. Ecol. 11: 2649–2660 - PubMed
1. Agrawal A. F., Whitlock M. C., 2011. Inferences about the distribution of dominance drawn from yeast gene knockout data. Genetics 187: 553–566 - PMC - PubMed
1. Akashi H., 1995. Inferring weak selection from patterns of polymorphism and divergence at “silent” sites in Drosophila DNA. Genetics 139: 1067–1076 - PMC - PubMed
1. Akashi H., 1996. Molecular evolution between Drosophila melanogaster and D. simulans: reduced codon bias, faster rates of amino acid substitution, and larger proteins in D. melanogaster. Genetics 144: 1297–1307 - PMC - PubMed
1. Akashi H., 1999a. Inferring the fitness effects of DNA mutations from polymorphism and divergence data: statistical power to detect directional selection under stationarity and free recombination. Genetics 151: 221–238 - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Weak selection and protein evolution

Affiliation

Weak selection and protein evolution

Authors

Affiliation

Abstract

Figures

References

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources