Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2003 Mar 4;100(5):2386-90.
doi: 10.1073/pnas.2628030100. Epub 2003 Feb 26.

A global representation of the protein fold space

Affiliations

A global representation of the protein fold space

Jingtong Hou et al. Proc Natl Acad Sci U S A. .

Abstract

One of the principal goals of the structural genomics initiative is to identify the total repertoire of protein folds and obtain a global view of the "protein structure universe." Here, we present a 3D map of the protein fold space in which structurally related folds are represented by spatially adjacent points. Such a representation reveals a high-level organization of the fold space that is intuitively interpretable. The shape of the fold space and the overall distribution of the folds are defined by three dominant trends: secondary structure class, chain topology, and protein domain size. Random coil-like structures of small proteins and peptides are mapped to a region where the three trends converge, offering an interesting perspective on both the demography of fold space and the evolution of protein structures.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The first 20 eigen values of the metric matrix calculated from the 498 × 498 dali structural alignment scores.
Figure 2
Figure 2
A 3D representation of the protein fold space. Each sphere represents a protein fold family. Protein folds that belong to the α, β, and α/β scop classes are clustered mostly around three separate axes, α (red), β (yellow), and α/β (cyan). Most protein folds of the α+β class (blue) fall between the α and β axes and on or near the plane defined by the α and β axes. The α and β axes approximately intersect at one point indicated by the green ball.
Figure 3
Figure 3
The three factorial axes defined by the eigen values of the metric matrix. (A) The first two axes discriminate among α, β, and α/β folds (color scheme the same as Fig. 2). (B) The third factorial axis correlates with the change in protein domain size, corresponding to a dominant trend in the distribution of folds in the fold space.
Figure 4
Figure 4
Comparing the fold usages between two species in the eubacterial domain (Chlamydia versus Aquifex, A) and between two species representing two different domains (Chlamydia of bacteria versus Halobacterium of archaea, B). Structural annotations of the ORFs encoded by each genome are obtained by a blast search against the scop sequence database. Percentage of the annotated ORFs in a genome that adopts a particular fold defines the usage of the fold by the organism. The usages of the 498 folds by the second organism are subtracted from the fold usages by the first organism. A contour surface (mesh) is then constructed and set at the values of 0.4% for blue and −0.4% for red. Regions within the blue contour include folds that appear more frequently in the first organism, whereas regions within the red contour include folds that occur more frequently in the second organism.

Similar articles

Cited by

References

    1. Richardson J S. Adv Protein Chem. 1981;34:167–339. - PubMed
    1. Chothia C. Nature. 1992;357:543–544. - PubMed
    1. Zhang C, DeLisi C. J Mol Biol. 1998;284:1301–1305. - PubMed
    1. Kim S H. Nat Struct Biol. 1998;5,Suppl.:643–645. - PubMed
    1. Sanchez R, Pieper U, Melo F, Eswar N, Marti-Renom M A, Madhusudhan M S, Mirkovic N, Sali A. Nat Struct Biol. 2000;7,Suppl.:986–990. - PubMed

Publication types

LinkOut - more resources