Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Jan;42(Database issue):D310-4.
doi: 10.1093/nar/gkt1242. Epub 2013 Nov 29.

SCOP2 prototype: a new approach to protein structure mining

Affiliations

SCOP2 prototype: a new approach to protein structure mining

Antonina Andreeva et al. Nucleic Acids Res. 2014 Jan.

Erratum in

  • Nucleic Acids Res. 2014 Oct;42(18):11847

Abstract

We present a prototype of a new structural classification of proteins, SCOP2 (http://scop2.mrc-lmb.cam.ac.uk/), that we have developed recently. SCOP2 is a successor to the Structural Classification of Proteins (SCOP, http://scop.mrc-lmb.cam.ac.uk/scop/) database. Similarly to SCOP, the main focus of SCOP2 is to organize structurally characterized proteins according to their structural and evolutionary relationships. SCOP2 was designed to provide a more advanced framework for protein structure annotation and classification. It defines a new approach to the classification of proteins that is essentially different from SCOP, but retains its best features. The SCOP2 classification is described in terms of a directed acyclic graph in which nodes form a complex network of many-to-many relationships and are represented by a region of protein structure and sequence. The new classification project is expected to ensure new advances in the field and open new areas of research.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
SCOP and SCOP2 graphs compared. (Left) A section of the SCOP hierarchical tree, showing the classification into the six obligatory levels: Protein species (SP), Protein (PR), Family (FA), Superfamily (SF), Fold (CF) and Class (CL). The homologous proteins with distinct folds, e.g., the Cro repressors (25), are compulsorily assigned in the same family and so progressively to the same superfamily, fold and class. There is an obligatory node at every level from the root to a leaf, even if such node does not represent any actual relationship, e.g., ‘singleton’ protein family consisting of a single protein that only has a relationship with itself. (Right) In SCOP2, the structural and evolutionary relationships are separated, allowing the classification of the homologous proteins into different folds and structural classes while keeping them in the same evolutionary family and superfamily. Non-obligatory single-child nodes are skipped to emphasize that relatives of a ‘singleton’ protein exist only at the superfamily level. The new category specific to SCOP2, 'other' relationships (IR), is also shown on the graph. These relationships include but are not limited to non-hierarchical relationships between homologous and non-homologous proteins with different folds sharing a large common substructure or motif (see also Figure 2).
Figure 2.
Figure 2.
Exploring SCOP2 with a custom graph viewer tool. A screenshot of a SCOP2-graph webpage showing a combined ancestor chart of two orthologous Cro repressors from bacteriophages lambda and p22. (The URL to view this page is http://scop2.mrc-lmb.cam.ac.uk/graph/example1). In addition to the protein relationships, schematically shown in Figure 1, the chart includes additional ‘ontology’ categories. The displayed chart can be expanded or collapsed via the node popup window. The node popup window also displays additional information about the selected node, in particular the associated structural and sequence domains and their boundaries representing the relationship. In this example, the domain, representing the common subfold of the different Cro types, is shown in the context of the full-length protein sequence and structure.

References

    1. Murzin AG, Brenner SE, Hubbard T, Chothia C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 1995;247:536–540. - PubMed
    1. Levitt M, Chothia C. Structural patterns in globular proteins. Nature. 1976;261:552–558. - PubMed
    1. Richardson JS. The anatomy and taxonomy of protein structure. Adv. Protein Chem. 1981;34:167–339. - PubMed
    1. Chothia C. Principles that determine the structure of proteins. Annu. Rev. Biochem. 1984;53:537–572. - PubMed
    1. Janin J, Chothia C. Domains in proteins: definitions, location, and structural principles. Methods Enzymol. 1985;115:420–430. - PubMed

Publication types