SCOPe: Structural Classification of Proteins--extended, integrating SCOP and ASTRAL data and classification of new structures

Naomi K Fox¹, Steven E Brenner, John-Marc Chandonia

Affiliations

Affiliation

¹ Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA and Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA.

PMID: 24304899
PMCID: PMC3965108
DOI: 10.1093/nar/gkt1240

SCOPe: Structural Classification of Proteins--extended, integrating SCOP and ASTRAL data and classification of new structures

Naomi K Fox et al. Nucleic Acids Res. 2014 Jan.

. 2014 Jan;42(Database issue):D304-9.

doi: 10.1093/nar/gkt1240. Epub 2013 Dec 3.

Authors

Naomi K Fox¹, Steven E Brenner, John-Marc Chandonia

Affiliation

¹ Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA and Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA.

PMID: 24304899
PMCID: PMC3965108
DOI: 10.1093/nar/gkt1240

Abstract

Structural Classification of Proteins-extended (SCOPe, http://scop.berkeley.edu) is a database of protein structural relationships that extends the SCOP database. SCOP is a manually curated ordering of domains from the majority of proteins of known structure in a hierarchy according to structural and evolutionary relationships. Development of the SCOP 1.x series concluded with SCOP 1.75. The ASTRAL compendium provides several databases and tools to aid in the analysis of the protein structures classified in SCOP, particularly through the use of their sequences. SCOPe extends version 1.75 of the SCOP database, using automated curation methods to classify many structures released since SCOP 1.75. We have rigorously benchmarked our automated methods to ensure that they are as accurate as manual curation, though there are many proteins to which our methods cannot be applied. SCOPe is also partially manually curated to correct some errors in SCOP. SCOPe aims to be backward compatible with SCOP, providing the same parseable files and a history of changes between all stable SCOP and SCOPe releases. SCOPe also incorporates and updates the ASTRAL database. The latest release of SCOPe, 2.03, contains 59 514 Protein Data Bank (PDB) entries, increasing the number of structures classified in SCOP by 55% and including more than 65% of the protein structures in the PDB.

PubMed Disclaimer

Figures

**Figure 1.**
Errors identified during benchmarking. We detected errors in 70 manually curated domains by running benchmarking and manually inspecting predicted domains that did not sufficiently match the manually annotated domains. These errors in domain boundaries in multi-domain chains were manually fixed in SCOPe 2.03. We also detected and fixed inconsistencies in 5054 domains that had been predicted and classified with the SCOP 1.73 automated method. We review some of the types of errors detected. (a) The SCOP 1.73 automated method used to predict domain d2p8qa1 had included approximately half the residues in the chain. This was inconsistent with all other manually curated entries in its species-level clade that included the entire chain. (b) A strand of beta sheet was included in the d1tqya2 domain by manual curation. (c) All of chain I from 1oyv had been placed into a single domain. (d) The manually curated domain d1seja2 excluded the first helix in the chain.

**Figure 2.**
Automated curation example. This figure depicts an example of applying the automated method for domain prediction and classification to 1vj5, chain A, released on 2004-04-27. We attempted to automatically classify it into SCOP 1.67, based only on domains defined in SCOP 1.65. 1vj5A has 554 residues, of which residues 2-547 are observed (found in the ATOM records in PDB data). Two significant BLAST hits were found to the classified chain 1ek1A, which has a distinct sequence from 1vj5A but also has 554 residues, of which residues 4-19, 48-66 and 90-544 are observed. The two BLAST hits include residues 2-224 and 226-544 in 1vj5A. The final predicted domains in 1vj5A are 2-225 and 226-547. The manually annotated domains for 1vj5A are 2-223 and 224-547. Since the end of each predicted domain differs from the manually annotated domain by at most 10 residues, this domain prediction is deemed to fall within the error tolerance for validation.

See this image and copyright information in PMC

References

1. Murzin AG, Brenner SE, Hubbard T, Chothia C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 1995;247:536–540. - PubMed
1. Brenner SE, Chothia C, Hubbard TJ, Murzin AG. Understanding protein structure: using scop for fold interpretation. Methods Enzymol. 1996;266:635–643. - PubMed
1. Lo Conte L, Brenner SE, Hubbard T, Chothia C, Murzin AG. SCOP database in 2002: refinements accommodate structural genomics. Nucleic Acids Res. 2002;30:264–267. - PMC - PubMed
1. Andreeva A, Howorth D, Chandonia JM, Brenner SE, Hubbard TJ, Chothia C, Murzin AG. Data growth and its impact on the SCOP database: new developments. Nucleic Acids Res. 2008;36:D419–D425. - PMC - PubMed
1. Brenner SE, Koehl P, Levitt M. The ASTRAL compendium for protein structure and sequence analysis. Nucleic Acids Res. 2000;28:254–256. - PMC - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions

Substances

Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

SCOPe: Structural Classification of Proteins--extended, integrating SCOP and ASTRAL data and classification of new structures

Affiliation

SCOPe: Structural Classification of Proteins--extended, integrating SCOP and ASTRAL data and classification of new structures

Authors

Affiliation

Abstract

Figures

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources