Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jun 17;465(7300):922-6.
doi: 10.1038/nature09105. Epub 2010 May 19.

Sequence space and the ongoing expansion of the protein universe

Affiliations

Sequence space and the ongoing expansion of the protein universe

Inna S Povolotskaya et al. Nature. .

Abstract

The need to maintain the structural and functional integrity of an evolving protein severely restricts the repertoire of acceptable amino-acid substitutions. However, it is not known whether these restrictions impose a global limit on how far homologous protein sequences can diverge from each other. Here we explore the limits of protein evolution using sequence divergence data. We formulate a computational approach to study the rate of divergence of distant protein sequences and measure this rate for ancient proteins, those that were present in the last universal common ancestor. We show that ancient proteins are still diverging from each other, indicating an ongoing expansion of the protein sequence universe. The slow rate of this divergence is imposed by the sparseness of functional protein sequences in sequence space and the ruggedness of the protein fitness landscape: approximately 98 per cent of sites cannot accept an amino-acid substitution at any given moment but a vast majority of all sites may eventually be permitted to evolve when other, compensatory, changes occur. Thus, approximately 3.5 x 10(9) yr has not been enough to reach the limit of divergent evolution of proteins, and for most proteins the limit of sequence similarity imposed by common function may not exceed that of random sequences.

PubMed Disclaimer

References

    1. Nature. 2008 Jan 31;451(7178):541-4 - PubMed
    1. Curr Opin Struct Biol. 2000 Jun;10(3):355-8 - PubMed
    1. J Mol Evol. 2006 Oct;63(4):513-25 - PubMed
    1. Prog Biophys Mol Biol. 2000;73(5):321-37 - PubMed
    1. Science. 2006 Apr 7;312(5770):111-4 - PubMed