Similarity search for local protein structures at atomic resolution by exploiting a database management system
- PMID: 27857569
- PMCID: PMC5036654
- DOI: 10.2142/biophysics.3.75
Abstract
A method to search for local structural similarities in proteins at atomic resolution is presented. It is demonstrated that a huge amount of structural data can be handled within a reasonable CPU time by using a conventional relational database management system with appropriate indexing of geometric data. This method, which we call geometric indexing, can enumerate ligand binding sites that are structurally similar to sub-structures of a query protein among more than 160,000 possible candidates within a few hours of CPU time on an ordinary desktop computer. After a set of high-scoring ligand binding sites has been detected by the geometric indexing search, structural alignments at atomic resolution are constructed by iteratively applying the Hungarian algorithm, and the statistical significance of the final score is estimated from an empirical model based on a gamma distribution. Applications of this method to several protein structures clearly show that significant similarities can be detected between local structures of non-homologous as well as homologous proteins.
Keywords: Hungarian algorithm; geometric indexing; ligand binding sites; relational database; structural alignment.
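The idea of delegating the geometric lookup to a relational database can be illustrated with a minimal sketch. It is not the authors' indexing scheme: the key used here (a pair of atom types plus a quantized inter-atomic distance), the 1 Å bin width, the table layout, and the names `pair_keys`, `build_index`, and `search` are all assumptions made for illustration. Each stored binding site is decomposed into keys, the keys are indexed, and a query is ranked by how many of its distinct keys each site shares.

```python
import itertools
import math
import sqlite3
from collections import Counter

BIN_WIDTH = 1.0  # angstroms; hypothetical quantization step, not from the paper


def pair_keys(atoms):
    """Yield (type_a, type_b, distance_bin) for every atom pair.

    `atoms` is a list of (atom_type, (x, y, z)) tuples. Distances are
    invariant under rotation and translation, so no superposition is needed.
    """
    for (t1, p1), (t2, p2) in itertools.combinations(atoms, 2):
        a, b = sorted((t1, t2))  # order-independent key
        yield a, b, int(math.dist(p1, p2) // BIN_WIDTH)


def build_index(sites):
    """Store the keys of every site in an indexed in-memory SQLite table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE geom (site_id TEXT, type_a TEXT, type_b TEXT, dist_bin INTEGER)"
    )
    for site_id, atoms in sites.items():
        conn.executemany(
            "INSERT INTO geom VALUES (?, ?, ?, ?)",
            [(site_id, *key) for key in pair_keys(atoms)],
        )
    # The composite index lets the database answer key lookups without a full scan.
    conn.execute("CREATE INDEX geom_key ON geom (type_a, type_b, dist_bin)")
    return conn


def search(conn, query_atoms):
    """Rank sites by the number of distinct query keys they contain."""
    votes = Counter()
    for key in set(pair_keys(query_atoms)):
        rows = conn.execute(
            "SELECT DISTINCT site_id FROM geom "
            "WHERE type_a = ? AND type_b = ? AND dist_bin = ?",
            key,
        )
        votes.update(site_id for (site_id,) in rows)
    return votes.most_common()


# Toy data (invented for this example): two stored sites and one query.
SITES = {
    "A": [("N", (0.0, 0.0, 0.0)), ("O", (3.0, 0.0, 0.0)), ("C", (0.0, 4.0, 0.0))],
    "B": [("N", (0.0, 0.0, 0.0)), ("O", (1.5, 0.0, 0.0)), ("C", (0.0, 4.2, 0.0))],
}
QUERY = [("N", (1.0, 1.0, 1.0)), ("O", (4.2, 1.0, 1.0)), ("C", (1.0, 5.3, 1.0))]

hits = search(build_index(SITES), QUERY)
print(hits)
```

In this toy case site A shares all three query keys and site B shares one, so A ranks first. In a pipeline like the one described in the abstract, such a ranked list would only be a coarse filter; the high-scoring candidates would then go on to atomic-level alignment and significance estimation.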
Similar articles
- GIRAF: a method for fast search and flexible alignment of ligand binding interfaces in proteins at atomic resolution. Biophysics (Nagoya-shi). 2012 May 31;8:79-94. doi: 10.2142/biophysics.8.79. eCollection 2012. PMID: 27493524. Free PMC article.
- PocketAlign a novel algorithm for aligning binding sites in protein structures. J Chem Inf Model. 2011 Jul 25;51(7):1725-36. doi: 10.1021/ci200132z. Epub 2011 Jun 21. PMID: 21662242.
- Potential for dramatic improvement in sequence alignment against structures of remote homologous proteins by extracting structural information from multiple structure alignment. J Mol Biol. 2003 Sep 5;332(1):127-42. doi: 10.1016/s0022-2836(03)00858-1. PMID: 12946352.
- Automated protein sequence database classification. I. Integration of compositional similarity search, local similarity search, and multiple sequence alignment. Bioinformatics. 1998;14(2):164-73. doi: 10.1093/bioinformatics/14.2.164. PMID: 9545449.
- Integrated search and alignment of protein structures. Bioinformatics. 2008 Dec 15;24(24):2872-9. doi: 10.1093/bioinformatics/btn545. Epub 2008 Oct 22. PMID: 18945684.
Cited by
- Analysis of substructural variation in families of enzymatic proteins with applications to protein function prediction. BMC Bioinformatics. 2010 May 11;11:242. doi: 10.1186/1471-2105-11-242. PMID: 20459833. Free PMC article.
- MICAN: a protein structure alignment algorithm that can handle Multiple-chains, Inverse alignments, C(α) only models, Alternative alignments, and Non-sequential alignments. BMC Bioinformatics. 2013 Jan 18;14:24. doi: 10.1186/1471-2105-14-24. PMID: 23331634. Free PMC article.
- GIRAF: a method for fast search and flexible alignment of ligand binding interfaces in proteins at atomic resolution. Biophysics (Nagoya-shi). 2012 May 31;8:79-94. doi: 10.2142/biophysics.8.79. eCollection 2012. PMID: 27493524. Free PMC article.
- Composite structural motifs of binding sites for delineating biological functions of proteins. PLoS One. 2012;7(2):e31437. doi: 10.1371/journal.pone.0031437. Epub 2012 Feb 8. PMID: 22347478. Free PMC article.
- Exhaustive comparison and classification of ligand-binding surfaces in proteins. Protein Sci. 2013 Oct;22(10):1379-91. doi: 10.1002/pro.2329. Epub 2013 Sep 4. PMID: 23934772. Free PMC article.