A systematic comparison of protein structure classifications: SCOP, CATH and FSSP
- PMID: 10508779
- DOI: 10.1016/s0969-2126(99)80177-4
Abstract
Background: Several methods of structural classification have been developed to introduce some order to the large amount of data present in the Protein Data Bank. Such methods facilitate structural comparisons and provide a greater understanding of structure and function. The most widely used and comprehensive databases are SCOP, CATH and FSSP, which represent three unique methods of classifying protein structures: purely manual, a combination of manual and automated, and purely automated, respectively. In order to develop reliable template libraries and benchmarks for protein-fold recognition, a systematic comparison of these databases has been carried out to determine their overall agreement in classifying protein structures.
Results: Approximately two-thirds of the protein chains in each database are common to all three databases. Despite employing different methods, and basing their systems on different rules of protein structure and taxonomy, SCOP, CATH and FSSP agree on the majority of their classifications. Discrepancies and inconsistencies are accounted for by a small number of explanations. Other interesting features have been identified, and various differences between manual and automatic classification methods are presented.
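The overlap figure above is a set intersection over chain identifiers. A minimal Python sketch of that calculation follows. The function name `common_fraction` and the chain identifiers are hypothetical toy values, not taken from the paper or from the actual databases.

```python
def common_fraction(chain_sets):
    """Return, per database, the fraction of its chains present in every database."""
    # Chains that appear in all of the supplied databases.
    shared = set.intersection(*chain_sets.values())
    return {name: len(shared) / len(chains) for name, chains in chain_sets.items()}


# Made-up chain identifiers (PDB code plus chain letter), for illustration only.
toy = {
    "SCOP": {"1abcA", "2xyzA", "3defB"},
    "CATH": {"1abcA", "2xyzA", "4ghiA"},
    "FSSP": {"1abcA", "2xyzA", "5jklC"},
}
fractions = common_fraction(toy)
```

In this toy input two of the three chains in each set are shared, so every database reports the same fraction. Real databases differ in size, so the per-database fractions would generally differ.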
Conclusions: Using these databases requires an understanding of the rules upon which they are based; each method offers certain advantages depending on the biological requirements and knowledge of the user. The degree of discrepancy between the systems also affects the reliability of prediction methods that employ these schemes as benchmarks. To generate accurate fold templates for threading, we extract information from a consensus database, encompassing agreements between SCOP, CATH and FSSP.
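One way to picture a consensus of this kind is to keep only the pairs of chains that every scheme places in the same group. The Python sketch below illustrates that idea under simplifying assumptions: each scheme is reduced to a single group label per chain (FSSP, for instance, is actually built from pairwise structural similarity scores), and all names and labels are hypothetical. The abstract does not describe the authors' procedure, so this should not be read as their method.

```python
from itertools import combinations


def same_group_pairs(assignment):
    """Unordered chain pairs that one scheme places in the same group."""
    return {
        frozenset((a, b))
        for a, b in combinations(sorted(assignment), 2)
        if assignment[a] == assignment[b]
    }


def consensus_pairs(assignments):
    """Chain pairs grouped together by every scheme, over chains common to all."""
    common = set.intersection(*(set(a) for a in assignments))
    restricted = [{c: a[c] for c in common} for a in assignments]
    return set.intersection(*(same_group_pairs(a) for a in restricted))


# Hypothetical labels: a SCOP-like fold, a CATH-like topology, an FSSP-like family.
scop = {"c1": "foldA", "c2": "foldA", "c3": "foldB", "c4": "foldB"}
cath = {"c1": "topo1", "c2": "topo1", "c3": "topo2", "c4": "topo3"}
fssp = {"c1": "fam9", "c2": "fam9", "c3": "fam7", "c4": "fam7"}

consensus = consensus_pairs([scop, cath, fssp])
```

Comparing pairs rather than labels avoids having to map one scheme's group names onto another's. Here only the pair (c1, c2) survives, because the CATH-like labels separate c3 from c4.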