Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 1998 Apr 28;95(9):4987-90.
doi: 10.1073/pnas.95.9.4987.

Are protein folds atypical?

Affiliations

Are protein folds atypical?

H Li et al. Proc Natl Acad Sci U S A. .

Abstract

Protein structures are a very special class among all possible structures. It has been suggested that a "designability principle" plays a crucial role in nature's selection of protein sequences and structures. Here, we provide a theoretical base for such a selection principle, using a simple model of protein folding based on hydrophobic interactions. A structure is reduced to a string of 0s and 1s, which represent the surface and core sites, respectively, as the backbone is traced. Each structure is therefore associated with one point in a high dimensional space. Sequences are represented by strings of their hydrophobicities and thus can be mapped into the same space. A sequence that lies closer to a particular structure in this space than to any other structures will have that structure as its ground state. Atypical structures, namely those far away from other structures in the high dimensional space, have more sequences that fold into them and are thermodynamically more stable. We argue that the most common folds of proteins are the most atypical in the space of possible structures.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Structures are represented by strings s of 0s and 1s, according to whether a site is on the surface or in the core, respectively. Shown are two examples of compact 6 × 6 lattice structures. (a) A typical structure. Dotted lines indicate local changes that can be performed to transform it to other compact structures. Note that the change at the lower right corner does not change the string pattern, so this structure is a degenerate one. (b) The most designable structure.
Figure 2
Figure 2
Schematic plot of the sequence and the structure spaces and the Voronoi construction. The Voronoi polytope is the shaded region.
Figure 3
Figure 3
Histogram of the designability obtained by random sampling by using 19,492,200 sequences.
Figure 4
Figure 4
Correlation functions for the structures.
Figure 5
Figure 5
Number of structures vs. the Hamming distance for three structures with: low (circles), intermediate (triangles), and high (squares) designability.

Similar articles

Cited by

References

    1. Levitt M, Chothia C. Nature (London) 1976;261:552–558. - PubMed
    1. Richardson J S. Proc Natl Acad Sci USA. 1976;73:2619–2623. - PMC - PubMed
    1. Richardson J S. Adv Protein Chem. 1981;34:167–339. - PubMed
    1. Murzin A G, Brenner S E, Hubbard T, Chothia C. J Mol Biol. 1995;247:536–540. - PubMed
    1. Orengo C A, Jones D T, Thornton J M. Nature (London) 1994;372:631–634. - PubMed

LinkOut - more resources