Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Mar;27(3):735-41.
doi: 10.1093/molbev/msp270. Epub 2009 Nov 12.

Highly expressed and slowly evolving proteins share compositional properties with thermophilic proteins

Affiliations

Highly expressed and slowly evolving proteins share compositional properties with thermophilic proteins

Joshua L Cherry. Mol Biol Evol. 2010 Mar.

Abstract

The sequences of proteins encoded by a genome evolve at different rates. A correlate of a protein's evolutionary rate is its expression level: highly expressed proteins tend to evolve slowly. Some explanations of rate variation and the correlation between rate and expression predict that more slowly evolving and more highly expressed proteins have more favorable equilibrium constants for folding. Proteins from thermophiles generally have more stable folds than proteins from mesophiles, and it is known that there are systematic differences in amino acid content between thermophilic and mesophilic proteins. I examined whether there are analogous correlations of amino acid frequencies with evolutionary rate and expression level within genomes. In most of the organisms analyzed, there is a striking tendency for more slowly evolving proteins to be more thermophile-like in their amino acid compositions when adjustments are made for variation in GC content. More highly expressed proteins also tend to be more thermophile-like by the same criteria. These results suggest that part of the evolutionary rate variation among proteins is due to variation in the strength of selection for stability of the folded state. They also suggest that increasing strength of this selective force with expression level plays a role in the correlation between evolutionary rate and expression level.

PubMed Disclaimer

Figures

F<sc>IG</sc>. 1.
FIG. 1.
The relationship between amino acid frequency and evolutionary rate among human proteins. The curve for each amino acid conveys how its frequency varies with protein sequence distance when GC content is taken into account. Each curve was produced by smoothing the data with a Gaussian kernel with an SD of 1/ln(10). The raw data points were the residuals of cubic polynomial fits of amino acid frequencies and the logarithm (base 10) of sequence distance to GC fraction. (A) Amino acids that are overrepresented in thermophiles. (B) Amino acids that are underrepresented in thermophiles. (C) Other amino acids.
F<sc>IG</sc>. 2.
FIG. 2.
Summary of correlation results for dN across organisms. For each correlation, “+” indicates a statistically significant (P < 0.05) positive correlation, “−” indicates a significant negative correlation, and “ns” indicates that the correlation was not statistically significant. Negative correlations, which are predicted for amino acids more common are thermophiles, are also indicated by orange coloring, and positive correlations are indicated by blue coloring.

Similar articles

Cited by

References

    1. Akashi H. Gene expression and molecular evolution. Curr Opin Genet Develop. 2001;11:660–666. - PubMed
    1. Akashi H, Gojobori T. Metabolic efficiency and amino acid composition in the proteomes of Escherichia coli and Bacillus subtilis. Proc Natl Acad Sci. 2002;99:3695–3700. - PMC - PubMed
    1. Boyd D, Schierle C, Beckwith J. How many membrane proteins are there? Protein Sci. 1998;7:201–205. - PMC - PubMed
    1. Brunner E, Ahrens CH, Mohanty S, Baetschmann H, Loevenich S, et al. A high-quality catalog of the Drosophila melanogaster proteome. Nat Biotech. 2007;25:576–583. - PubMed
    1. Chen P, Shakhnovich EI. Lethal mutagenesis in viruses and bacteria. Genetics. 2009;183:639–650. - PMC - PubMed

Publication types