Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Feb 16;107(7):2983-8.
doi: 10.1073/pnas.0910445107. Epub 2010 Jan 26.

Universal distribution of protein evolution rates as a consequence of protein folding physics

Affiliations

Universal distribution of protein evolution rates as a consequence of protein folding physics

Alexander E Lobkovsky et al. Proc Natl Acad Sci U S A. .

Abstract

The hypothesis that folding robustness is the primary determinant of the evolution rate of proteins is explored using a coarse-grained off-lattice model. The simplicity of the model allows rapid computation of the folding probability of a sequence to any folded conformation. For each robust folder, the network of sequences that share its native structure is identified. The fitness of a sequence is postulated to be a simple function of the number of misfolded molecules that have to be produced to reach a characteristic protein abundance. After fixation probabilities of mutants are computed under a simple population dynamics model, a Markov chain on the fold network is constructed, and the fold-averaged evolution rate is computed. The distribution of the logarithm of the evolution rates across distinct networks exhibits a peak with a long tail on the low rate side and resembles the universal empirical distribution of the evolutionary rates more closely than either distribution resembles the log-normal distribution. The results suggest that the universal distribution of the evolutionary rates of protein-coding genes is a direct consequence of the basic physics of protein folding.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig. 1.
Fig. 1.
Distribution (probability density function) of the logarithmic ratio of the probability of a sequence to fold into the native structure and the probability to fold to an alternative structure. See SI Materials and Methods for the values of parameters used to compute these distributions.
Fig. 2.
Fig. 2.
Distributions (probability densities) of the logarithm of the fold-averaged ER for three chain lengths N and typical values of abundance (A = 10−4), effective population size (Ne = 104), and per-monomer mistranslation probability (r = 0.02).
Fig. 3.
Fig. 3.
Comparison of the model-derived (N = 18, r = 0.02, Ne = 104) and observed distributions of ER. The quantile-quantile plot compares the empirical ER distributions for the sets of orthologous proteins taken from ref. for (A) Homo sapiens and Macaca mulatta (denoted Homsa, 16603 proteins) and (B) two strains of the bacterium Burkholderia sp (Bursp, 4014 proteins) with the distributions yielded by the model with three different abundance levels. Solid lines are linear fits to the A = 0.5 × 10−4 data.
Fig. 4.
Fig. 4.
Normal probability plot comparing the normalized model-derived and empirical ER distributions with the normal distribution. The distributions are the same as used for Fig. 3. The empirical ER distributions are more similar to each other and to the model distribution (all are convex upward) than to the normal distribution. A normal distribution would appear in a normal probability plot as a 45° line, shown here for reference.
Fig. 5.
Fig. 5.
Dependence of the mean, median, and Pearson’s skewness, defined as 3(median − mean)/(standard deviation), of the ER distribution derived from the model on abundance, effective population size, and per-monomer mistranslation probability. The chain length is N = 18. The dashed lines are the empirical Sk values computed for the Bursp (blue) and Homsa (green) samples.

Similar articles

Cited by

References

    1. Kimura M. The Neutral Theory of Molecular Evolution. Cambridge, UK: Cambridge Univ Press; 1983.
    1. Gillespie JH. The Causes of Molecular Evolution. Oxford: Oxford Univ Press; 1994.
    1. Grishin NV, Wolf YI, Koonin EV. From complete genomes to measures of substitution rate variability within and between proteins. Genome Res. 2000;10:991–1000. - PMC - PubMed
    1. Wolf YI, Novichkov PS, Karev GP, Koonin EV, Lipman DJ. Inaugural Article: The universal distribution of evolutionary rates of genes and distinct characteristics of eukaryotic genes of different apparent ages. Proc Natl Acad Sci USA. 2009;106:7273–7280. - PMC - PubMed
    1. Zuckerkandl E. Evolutionary processes and evolutionary noise at the molecular level. I. Functional density in proteins. J Mol Evol. 1976;7:167–183. - PubMed

Publication types

LinkOut - more resources