Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jul 3;15(7):evad112.
doi: 10.1093/gbe/evad112.

Highly Abundant Proteins Are Highly Thermostable

Affiliations

Highly Abundant Proteins Are Highly Thermostable

Agusto R Luzuriaga-Neira et al. Genome Biol Evol. .

Abstract

Highly abundant proteins tend to evolve slowly (a trend called E-R anticorrelation), and a number of hypotheses have been proposed to explain this phenomenon. The misfolding avoidance hypothesis attributes the E-R anticorrelation to the abundance-dependent toxic effects of protein misfolding. To avoid these toxic effects, protein sequences (particularly those of highly expressed proteins) would be under selection to fold properly. One prediction of the misfolding avoidance hypothesis is that highly abundant proteins should exhibit high thermostability (i.e., a highly negative free energy of folding, ΔG). Thus far, only a handful of analyses have tested for a relationship between protein abundance and thermostability, producing contradictory results. These analyses have been limited by 1) the scarcity of ΔG data, 2) the fact that these data have been obtained by different laboratories and under different experimental conditions, 3) the problems associated with using proteins' melting energy (Tm) as a proxy for ΔG, and 4) the difficulty of controlling for potentially confounding variables. Here, we use computational methods to compare the free energy of folding of pairs of human-mouse orthologous proteins with different expression levels. Even though the effect size is limited, the most highly expressed ortholog is often the one with a more negative ΔG of folding, indicating that highly expressed proteins are often more thermostable.

Keywords: expression levels; misfolding avoidance hypothesis; protein thermostability; rates of evolution; translational robustness hypothesis.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Changes in free energy of folding due to mutations (ΔΔG of mutations) used in the study. For each term, the direction of the arrow indicates the initial sequence and the final sequence. Positive ΔΔG values indicate that the final sequence is less stable than the initial sequence. Negative ΔΔG values indicate that the final sequence is more stable than the initial sequence.
Fig. 2.
Fig. 2.
Correlation between the difference in the free energy of folding of human and mouse proteins and the difference in the abundance of human and mouse. Each dot corresponds to a pair of human–mouse orthologs (n = 47). The shaded area represents the 95% confidence interval. Spearman's rank correlation test significance level: *, P < 0.05.
Fig. 3.
Fig. 3.
Correlation between changes in free energy of folding (ΔΔG) and changes in protein abundance at different branches of the primate and rodent phylogeny. Each dot corresponds to a human or mouse protein. The shaded areas represent the 95% confidence intervals. Spearman's rank correlation test significance level: *, P < 0.05.
Fig. 4.
Fig. 4.
Correlation between the differences in the free energy of folding due to all possible translation errors (ΔΔGt) and protein abundance. Each dot corresponds to one human or mouse protein structure (n = 51). Average and median ΔΔGt values were estimated by substituting every amino acid position with the other 19 standard amino acids. The percent of destabilizing translation errors was computed as the fraction of translation errors with ΔΔGt > 1. The shaded areas represent the 95% confidence intervals.

References

    1. Alvarez-Ponce D. 2014. Why proteins evolve at different rates: the determinants of proteins’ rates of evolution. In: Fares MA, editor. Natural selection: methods and applications. London: CRC Press (Taylor & Francis). p. 126–178.
    1. Ancien F, Pucci F, Godfroid M, Rooman M. 2018. Prediction and interpretation of deleterious coding variants in terms of protein structural stability. Sci Rep. 8:4480. - PMC - PubMed
    1. Apweiler R, et al. . 2004. Uniprot: the universal protein knowledgebase. Nucleic Acids Res. 32:D115–D119. - PMC - PubMed
    1. Ashkenazy H, et al. . 2012. FastML: a web server for probabilistic reconstruction of ancestral sequences. Nucleic Acids Res. 40:W580–W584. - PMC - PubMed
    1. Baugh EH, et al. . 2016. Robust classification of protein variation using structural modelling and large-scale data integration. Nucleic Acids Res. 44:2501–2513. - PMC - PubMed

Publication types