Structural descriptor database: a new tool for sequence-based functional site prediction
- PMID: 19032768
- PMCID: PMC2612011
- DOI: 10.1186/1471-2105-9-492
Structural descriptor database: a new tool for sequence-based functional site prediction
Abstract
Background: The Structural Descriptor Database (SDDB) is a web-based tool that predicts the function of proteins and functional site positions based on the structural properties of related protein families. Structural alignments and functional residues of a known protein set (defined as the training set) are used to build special Hidden Markov Models (HMM) called HMM descriptors. SDDB uses previously calculated and stored HMM descriptors for predicting active sites, binding residues, and protein function. The database integrates biologically relevant data filtered from several databases such as PDB, PDBSUM, CSA and SCOP. It accepts queries in fasta format and predicts functional residue positions, protein-ligand interactions, and protein function, based on the SCOP database.
Results: To assess the SDDB performance, we used different data sets. The Trypsion-like Serine protease data set assessed how well SDDB predicts functional sites when curated data is available. The SCOP family data set was used to analyze SDDB performance by using training data extracted from PDBSUM (binding sites) and from CSA (active sites). The ATP-binding experiment was used to compare our approach with the most current method. For all evaluations, significant improvements were obtained with SDDB.
Conclusion: SDDB performed better when trusty training data was available. SDDB worked better in predicting active sites rather than binding sites because the former are more conserved than the latter. Nevertheless, by using our prediction method we obtained results with precision above 70%.
Figures




References
-
- Chandonia J, Brenner S. The impact of structural genomics: expectations and outcomes. Science. 2006;311:347–351. - PubMed
-
- Bateman A, Valencia A. Structural genomics meets computational biology. Bioinformatics. 2006;22:2319. - PubMed
-
- Kim S, Shin D, Choi I, Gahmen U, Chen S, Kim R. Structure-based functional inference in structural genomics. J Struct Funct Genomics. 2003;4:129–135. - PubMed
-
- Watson J, Laskowski R, Thornton J. Predicting protein function from sequence and structural data. Current opinion in structural biology. 2005;15:275–284. - PubMed
-
- Baker E, Arcus V, Lott J. Protein structure prediction and analysis as a tool for functional genomics. Applied bioinformatics. 2003;2:S3–10. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources