A model for statistical significance of local similarities in structure
- PMID: 12595245
- DOI: 10.1016/s0022-2836(03)00045-7
A model for statistical significance of local similarities in structure
Abstract
Structural biology can provide three-dimensional structures for proteins of unknown function. When sequence or structure comparisons fail to suggest a function, insights can come from discovery of functionally important local structural patterns. Existing methods to detect such patterns lack rigorous statistics needed for widespread application. Here, we derive a formula to calculate statistical significance of the root-mean-square deviation between atoms in such patterns. When combined with a database search method, our statistics permit true functional or structural patterns in different folds to be discerned from noise. The approach is highly complementary to fold comparison for providing functional clues for new structures, and is key for the detection of recurrences of any new pattern.
Similar articles
-
Recognition of analogous and homologous protein folds: analysis of sequence and structure conservation.J Mol Biol. 1997 Jun 13;269(3):423-39. doi: 10.1006/jmbi.1997.1019. J Mol Biol. 1997. PMID: 9199410
-
Annotation in three dimensions. PINTS: Patterns in Non-homologous Tertiary Structures.Nucleic Acids Res. 2003 Jul 1;31(13):3341-4. doi: 10.1093/nar/gkg506. Nucleic Acids Res. 2003. PMID: 12824322 Free PMC article.
-
Supersites within superfolds. Binding site similarity in the absence of homology.J Mol Biol. 1998 Oct 2;282(4):903-18. doi: 10.1006/jmbi.1998.2043. J Mol Biol. 1998. PMID: 9743635
-
Assigning amino acid sequences to 3-dimensional protein folds.FASEB J. 1996 Jan;10(1):126-36. doi: 10.1096/fasebj.10.1.8566533. FASEB J. 1996. PMID: 8566533 Review.
-
Protein structure comparison: implications for the nature of 'fold space', and structure and function prediction.Curr Opin Struct Biol. 2006 Jun;16(3):393-8. doi: 10.1016/j.sbi.2006.04.007. Epub 2006 May 4. Curr Opin Struct Biol. 2006. PMID: 16678402 Review.
Cited by
-
The LabelHash algorithm for substructure matching.BMC Bioinformatics. 2010 Nov 11;11:555. doi: 10.1186/1471-2105-11-555. BMC Bioinformatics. 2010. PMID: 21070651 Free PMC article.
-
A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery.Bioinformatics. 2009 Jun 15;25(12):i305-12. doi: 10.1093/bioinformatics/btp220. Bioinformatics. 2009. PMID: 19478004 Free PMC article.
-
Comprehensive evaluation of protein structure alignment methods: scoring by geometric measures.J Mol Biol. 2005 Mar 4;346(4):1173-88. doi: 10.1016/j.jmb.2004.12.032. Epub 2005 Jan 16. J Mol Biol. 2005. PMID: 15701525 Free PMC article.
-
Modeling regionalized volumetric differences in protein-ligand binding cavities.Proteome Sci. 2012 Jun 21;10 Suppl 1(Suppl 1):S6. doi: 10.1186/1477-5956-10-S1-S6. Proteome Sci. 2012. PMID: 22759583 Free PMC article.
-
Exhaustive comparison and classification of ligand-binding surfaces in proteins.Protein Sci. 2013 Oct;22(10):1379-91. doi: 10.1002/pro.2329. Epub 2013 Sep 4. Protein Sci. 2013. PMID: 23934772 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources