Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Oct 9;104(41):16152-7.
doi: 10.1073/pnas.0705366104. Epub 2007 Oct 3.

Protein stability imposes limits on organism complexity and speed of molecular evolution

Affiliations

Protein stability imposes limits on organism complexity and speed of molecular evolution

Konstantin B Zeldovich et al. Proc Natl Acad Sci U S A. .

Abstract

Classical population genetics a priori assigns fitness to alleles without considering molecular or functional properties of proteins that these alleles encode. Here we study population dynamics in a model where fitness can be inferred from physical properties of proteins under a physiological assumption that loss of stability of any protein encoded by an essential gene confers a lethal phenotype. Accumulation of mutations in organisms containing Gamma genes can then be represented as diffusion within the Gamma-dimensional hypercube with adsorbing boundaries determined, in each dimension, by loss of a protein's stability and, at higher stability, by lack of protein sequences. Solving the diffusion equation whose parameters are derived from the data on point mutations in proteins, we determine a universal distribution of protein stabilities, in agreement with existing data. The theory provides a fundamental relation between mutation rate, maximal genome size, and thermodynamic response of proteins to point mutations. It establishes a universal speed limit on rate of molecular evolution by predicting that populations go extinct (via lethal mutagenesis) when mutation rate exceeds approximately six mutations per essential part of genome per replication for mesophilic organisms and one to two mutations per genome per replication for thermophilic ones. Several RNA viruses function close to the evolutionary speed limit, whereas error correction mechanisms used by DNA viruses and nonmutant strains of bacteria featuring various genome lengths and mutation rates have brought these organisms universally approximately 1,000-fold below the natural speed limit.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig. 1.
Fig. 1.
A schematic representation of the model. (Top) Illustration of the effect of sequence depletion as energy of the native state of a protein decreases. Middle and Bottom show the accessible range of native-state energies within which proteins can mutate. It is limited from below by Emin, where complete sequence depletion occurs. From above, this range is limited by Emax when proteins become unstable, conferring a lethal phenotype to the organism carrying this protein's gene. The range of possible energies is broader for mesophilic organisms and narrows for thermophilic ones.
Fig. 2.
Fig. 2.
Distribution of stabilities of single-domain proteins and the prediction (smooth line) from analytical equation (6) with parameters h and D derived from mutation data on proteins as explained in the text and in SI Text. The stability data were collected from the ProTherm database (18). The value ΔGmax − ΔGmin is taken to be 20 kcal/mol.
Fig. 3.
Fig. 3.
The distribution of number of genes per viral genome. The red histogram corresponds to RNA viruses, whereas the black histogram is for dsDNA viruses. The data are taken from National Center for Biotechnology Information Genome database, www.ncbi.nlm.nih.gov/genomes/static/vis.html. The genomes of RNA viruses are much shorter than those of dsDNA viruses.
Fig. 4.
Fig. 4.
Distribution of the number of genes in 202 prokaryotic genomes for organisms with various optimal growth temperatures. Thermophilic and hyperthermophilic organisms systematically possess shorter genomes than mesophilic ones.
Fig. 5.
Fig. 5.
Histogram of protein point mutation data obtained from the ProTherm database (15). Two kinds of unfolding methods, ΔΔG and ΔGH2O, are considered separately, each with 2,804 and 2,418 different point mutations. The two groups of data produce the same shape of ΔΔG distribution. ΔΔG stands for the free energy of unfolding obtained with the Schellman equation (ΔΔG = dTm/dS) in the case of the thermal denaturation method, whereas ΔΔGH2O stands for the free energy of unfolding in water, determined by the denaturant (urea, guanidine·HCl, glutathione disulfide/GSH, and guanidine thiocyanate) denaturation of proteins and extrapolation of the data to zero concentration of denaturant; see ref. for more detailed explanations of experimental methods.

References

    1. Gillespie JH. The Causes of Molecular Evolution. Oxford: Oxford Univ Press; 1991.
    1. Shakhnovich EI. Chem Rev. 2006;106:1559–1588. - PMC - PubMed
    1. Fraser AG, Kamath RS, Zipperlen P, Martinez-Campos M, Sohrmann M, Ahringer J. Nature. 2000;408:325–330. - PubMed
    1. Herring CD, Blattner FR. J Bacteriol. 2004;186:2673–2681. - PMC - PubMed
    1. Dyson HJ, Wright PE. Nat Rev Mol Cell Biol. 2005;6:197–208. - PubMed

Publication types

LinkOut - more resources