Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Feb 3;429(3):348-355.
doi: 10.1016/j.jmb.2016.11.023. Epub 2016 Nov 30.

SCOPe: Manual Curation and Artifact Removal in the Structural Classification of Proteins - extended Database

Affiliations

SCOPe: Manual Curation and Artifact Removal in the Structural Classification of Proteins - extended Database

John-Marc Chandonia et al. J Mol Biol. .

Abstract

SCOPe (Structural Classification of Proteins-extended, http://scop.berkeley.edu) is a database of relationships between protein structures that extends the Structural Classification of Proteins (SCOP) database. SCOP is an expert-curated ordering of domains from the majority of proteins of known structure in a hierarchy according to structural and evolutionary relationships. SCOPe classifies the majority of protein structures released since SCOP development concluded in 2009, using a combination of manual curation and highly precise automated tools, aiming to have the same accuracy as fully hand-curated SCOP releases. SCOPe also incorporates and updates the ASTRAL compendium, which provides several databases and tools to aid in the analysis of the sequences and structures of proteins classified in SCOPe. SCOPe continues high-quality manual classification of new superfamilies, a key feature of SCOP. Artifacts such as expression tags are now separated into their own class, in order to distinguish them from the homology-based annotations in the remainder of the SCOPe hierarchy. SCOPe 2.06 contains 77,439 Protein Data Bank entries, double the 38,221 structures classified in SCOP.

Keywords: database; protein evolution; structure classification.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1. Changes to SCOP(e) design and size
All stable SCOP and SCOPe releases since the introduction of stable identifiers (SCOP 1.55, July 2001) are shown. The height of the vertical line for each release represents the number of PDB entries classified. The angle of the blue baseline between releases reflects the degree of divergence from comprehensive and fully manually curated releases. SCOP2 [31] is a major redesign of SCOP that enables curators to annotate a richer set of evolutionary relationships between proteins, providing a more precise and accurate characterization of protein relationships. SCOP2 is currently available as a prototype that classifies 995 proteins. A dashed line indicates that the SCOP2 prototype is partially based on SCOP 1.75.
Fig. 2
Fig. 2. Examples of manual curation in SCOPe
A) Top and side views of F1 and V1 ATP synthase subunits; the “top” view is oriented towards the membrane. Conserved N-terminal, middle, and C-terminal subunits of the alpha and beta subunits of F1 (A and B subunits of V1) are shown in blue, orange, and green, respectively. The “bulge” domain of V1, which represents a new SCOPe superfamily in 2.06, is shown in red. Other subunits of F1 and V1 are shown in light grey. B) Four homologous domains from bacterial toxins are colored in a spectrum ranging from blue at the N-terminal end to red at the C-terminal end. Other domains from the structures are shown in grey. C) Two homologous domains from the PA14 superfamily are colored in a spectrum ranging from blue at the N-terminal end to red at the C-terminal end. Other domains from the structures are shown in grey. PA14 was part of the Anthrax Protective Antigen fold (the entire structure shown on the left) in versions of SCOP and SCOPe prior to SCOPe 2.04.

Similar articles

Cited by

References

    1. Murzin AG, Brenner SE, Hubbard T, Chothia C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol. 1995;247:536–540. doi: 10.1006/jmbi.1995.0159. - DOI - PubMed
    1. Lo Conte L, Brenner SE, Hubbard TJP, Chothia C, Murzin AG. SCOP database in 2002: refinements accommodate structural genomics. Nucleic Acids Res. 2002;30:264–267. - PMC - PubMed
    1. Andreeva A, Howorth D, Brenner SE, Hubbard TJP, Chothia C, Murzin AG. SCOP database in 2004: refinements integrate structure and sequence family data. Nucleic Acids Res. 2004;32:D226–229. doi: 10.1093/nar/gkh039. - DOI - PMC - PubMed
    1. Andreeva A, Howorth D, Chandonia JM, Brenner SE, Hubbard TJP, Chothia C, Murzin AG. Data growth and its impact on the SCOP database: new developments. Nucleic Acids Res. 2008;36:D419–425. doi: 10.1093/nar/gkm993. - DOI - PMC - PubMed
    1. Fox NK, Brenner SE, Chandonia JM. SCOPe: Structural Classification of Proteins--extended, integrating SCOP and ASTRAL data and classification of new structures. Nucleic Acids Res. 2014;42:D304–309. doi: 10.1093/nar/gkt1240. - DOI - PMC - PubMed

Publication types