Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces
- PMID: 18474114
- PMCID: PMC2413245
- DOI: 10.1186/1471-2105-9-234
Abstract
Background: Protein-protein interactions are ubiquitous and essential for all cellular processes. High-resolution X-ray crystallographic structures of protein complexes can reveal the details of their function and provide a basis for many computational and experimental approaches. Distinguishing biological from non-biological contacts and reconstructing the intact complex are challenging computational problems. A successful solution can provide additional insights into the fundamental principles of biological recognition and reduce errors in the many algorithms and databases that use interaction information extracted from the Protein Data Bank (PDB).
Results: We have developed a method for identifying protein complexes in PDB X-ray structures by a four-step procedure: (1) comprehensively collecting all protein-protein interfaces; (2) clustering similar protein-protein interfaces together; (3) estimating the probability that each cluster is biologically relevant based on a diverse set of properties; and (4) combining these scores for each PDB entry in order to predict the complex structure. The resulting clusters of biologically relevant interfaces provide a reliable catalog of evolutionarily conserved protein-protein interactions. These interfaces, as well as the predicted protein complexes, are available from the Protein Interface Server (PInS) website (see Availability and requirements section).
Conclusion: Our method demonstrates an almost two-fold reduction of the annotation error rate as evaluated on a large benchmark set of complexes validated from the literature. We also estimate relative contributions of each interface property to the accurate discrimination of biologically relevant interfaces and discuss possible directions for further improving the prediction method.
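The four-step procedure described in the Results can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Interface` record, the contact-signature strings, the Jaccard similarity threshold, the single-linkage clustering, and the logistic weights in `cluster_probability` are all assumptions invented for illustration, and the abstract does not specify which interface properties or which clustering method were actually used.

```python
from dataclasses import dataclass
from itertools import combinations
from math import exp


@dataclass(frozen=True)
class Interface:
    """Step 1 output: one protein-protein interface (hypothetical record)."""
    pdb_id: str
    chains: tuple        # pair of chain identifiers, e.g. ("A", "B")
    contacts: frozenset  # hypothetical residue-pair signature
    buried_area: float   # hypothetical buried surface area


def jaccard(a, b):
    """Overlap of two contact signatures, between 0.0 and 1.0."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_interfaces(interfaces, threshold=0.5):
    """Step 2: single-linkage clustering using a union-find structure."""
    parent = list(range(len(interfaces)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i, j in combinations(range(len(interfaces)), 2):
        if jaccard(interfaces[i].contacts, interfaces[j].contacts) >= threshold:
            parent[find(i)] = find(j)

    clusters = {}
    for i, iface in enumerate(interfaces):
        clusters.setdefault(find(i), []).append(iface)
    return list(clusters.values())


def cluster_probability(cluster):
    """Step 3: toy logistic score; the weights are invented, not fitted."""
    n_entries = len({iface.pdb_id for iface in cluster})
    mean_area = sum(iface.buried_area for iface in cluster) / len(cluster)
    z = 1.0 * (n_entries - 1) + 0.004 * (mean_area - 800.0)
    return 1.0 / (1.0 + exp(-z))


def predict_complexes(interfaces, cutoff=0.5):
    """Step 4: per PDB entry, merge chains joined by likely-relevant interfaces."""
    relevant = [
        iface
        for cluster in cluster_interfaces(interfaces)
        if cluster_probability(cluster) >= cutoff
        for iface in cluster
    ]
    # Start with every chain in its own group.
    complexes = {}
    for iface in interfaces:
        groups = complexes.setdefault(iface.pdb_id, [])
        for chain in iface.chains:
            if not any(chain in g for g in groups):
                groups.append({chain})
    # Merge the groups connected by a relevant interface.
    for iface in relevant:
        groups = complexes[iface.pdb_id]
        a, b = iface.chains
        ga = next(g for g in groups if a in g)
        gb = next(g for g in groups if b in g)
        if ga is not gb:
            ga |= gb
            groups[:] = [g for g in groups if g is not gb]
    return complexes


# Made-up example data: identifiers and values are not real PDB entries.
example = [
    Interface("1abc", ("A", "B"), frozenset({"c1", "c2", "c3", "c4"}), 1200.0),
    Interface("2xyz", ("A", "B"), frozenset({"c1", "c2", "c3", "c5"}), 1100.0),
    Interface("1abc", ("A", "C"), frozenset({"c10", "c11"}), 300.0),
]
result = predict_complexes(example)
print(result)
```

In this toy input, the A-B interface recurs in two entries with a large buried area, so its cluster scores above the cutoff and chains A and B are merged into one complex, while the small, unrepeated A-C contact is treated as a likely crystal-packing artifact and chain C stays separate.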
Similar articles
- SCOWLP: a web-based database for detailed characterization and visualization of protein interfaces. BMC Bioinformatics. 2006 Mar 2;7:104. doi: 10.1186/1471-2105-7-104. PMID: 16512892. Free PMC article.
- PIBASE: a comprehensive database of structurally defined protein interfaces. Bioinformatics. 2005 May 1;21(9):1901-7. doi: 10.1093/bioinformatics/bti277. Epub 2005 Jan 18. PMID: 15657096.
- Uncovering the structural basis of protein interactions with efficient clustering of 3-D interaction interfaces. Comput Syst Bioinformatics Conf. 2007;6:287-97. doi: 10.1142/9781860948732_0030. PMID: 17951832.
- Protein complexes: structure prediction challenges for the 21st century. Curr Opin Struct Biol. 2005 Feb;15(1):15-22. doi: 10.1016/j.sbi.2005.01.012. PMID: 15718128. Review.
- Characterization and prediction of protein interfaces to infer protein-protein interaction networks. Curr Pharm Biotechnol. 2008 Apr;9(2):67-76. doi: 10.2174/138920108783955191. PMID: 18393863. Review.
Cited by
- DASMI: exchanging, annotating and assessing molecular interaction data. Bioinformatics. 2009 May 15;25(10):1321-8. doi: 10.1093/bioinformatics/btp142. PMID: 19420069. Free PMC article.
- Comparison of tertiary structures of proteins in protein-protein complexes with unbound forms suggests prevalence of allostery in signalling proteins. BMC Struct Biol. 2012 May 3;12:6. doi: 10.1186/1472-6807-12-6. PMID: 22554255. Free PMC article.
- Non-redundant unique interface structures as templates for modeling protein interactions. PLoS One. 2014 Jan 27;9(1):e86738. doi: 10.1371/journal.pone.0086738. eCollection 2014. PMID: 24475173. Free PMC article.
- PIE-efficient filters and coarse grained potentials for unbound protein-protein docking. Proteins. 2010 Feb 1;78(2):400-19. doi: 10.1002/prot.22550. PMID: 19768784. Free PMC article.
- IDDI: integrated domain-domain interaction and protein interaction analysis system. Proteome Sci. 2012 Jun 21;10 Suppl 1(Suppl 1):S9. doi: 10.1186/1477-5956-10-S1-S9. PMID: 22759586. Free PMC article.