How many protein-protein interactions types exist in nature?
- PMID: 22719985
- PMCID: PMC3374795
- DOI: 10.1371/journal.pone.0038913
How many protein-protein interactions types exist in nature?
Abstract
"Protein quaternary structure universe" refers to the ensemble of all protein-protein complexes across all organisms in nature. The number of quaternary folds thus corresponds to the number of ways proteins physically interact with other proteins. This study focuses on answering two basic questions: Whether the number of protein-protein interactions is limited and, if yes, how many different quaternary folds exist in nature. By all-to-all sequence and structure comparisons, we grouped the protein complexes in the protein data bank (PDB) into 3,629 families and 1,761 folds. A statistical model was introduced to obtain the quantitative relation between the numbers of quaternary families and quaternary folds in nature. The total number of possible protein-protein interactions was estimated around 4,000, which indicates that the current protein repository contains only 42% of quaternary folds in nature and a full coverage needs approximately a quarter century of experimental effort. The results have important implications to the protein complex structural modeling and the structure genomics of protein-protein interactions.
Conflict of interest statement
Figures




Similar articles
-
NMR in structural genomics to increase structural coverage of the protein universe: Delivered by Prof. Kurt Wüthrich on 7 July 2013 at the 38th FEBS Congress in St. Petersburg, Russia.FEBS J. 2016 Nov;283(21):3870-3881. doi: 10.1111/febs.13751. Epub 2016 Jun 9. FEBS J. 2016. PMID: 27154589 Free PMC article.
-
The number of protein folds and their distribution over families in nature.Proteins. 2004 Feb 15;54(3):491-9. doi: 10.1002/prot.10514. Proteins. 2004. PMID: 14747997
-
Exploring dynamics of protein structure determination and homology-based prediction to estimate the number of superfamilies and folds.BMC Struct Biol. 2006 Mar 20;6:6. doi: 10.1186/1472-6807-6-6. BMC Struct Biol. 2006. PMID: 16549009 Free PMC article.
-
Structural analyses reveal two distinct families of nucleoside phosphorylases.Biochem J. 2002 Jan 1;361(Pt 1):1-25. doi: 10.1042/0264-6021:3610001. Biochem J. 2002. PMID: 11743878 Free PMC article. Review.
-
GWIDD: a comprehensive resource for genome-wide structural modeling of protein-protein interactions.Hum Genomics. 2012 Jul 11;6(1):7. doi: 10.1186/1479-7364-6-7. Hum Genomics. 2012. PMID: 23245398 Free PMC article. Review.
Cited by
-
The challenge and promise of glycomics.Chem Biol. 2014 Jan 16;21(1):1-15. doi: 10.1016/j.chembiol.2013.12.010. Chem Biol. 2014. PMID: 24439204 Free PMC article. Review.
-
Non-redundant unique interface structures as templates for modeling protein interactions.PLoS One. 2014 Jan 27;9(1):e86738. doi: 10.1371/journal.pone.0086738. eCollection 2014. PLoS One. 2014. PMID: 24475173 Free PMC article.
-
Replica exchange improves sampling in low-resolution docking stage of RosettaDock.PLoS One. 2013 Aug 29;8(8):e72096. doi: 10.1371/journal.pone.0072096. eCollection 2013. PLoS One. 2013. PMID: 24009670 Free PMC article.
-
Algorithmic approaches to protein-protein interaction site prediction.Algorithms Mol Biol. 2015 Feb 15;10:7. doi: 10.1186/s13015-015-0033-9. eCollection 2015. Algorithms Mol Biol. 2015. PMID: 25713596 Free PMC article.
-
In silico identification of essential proteins in Corynebacterium pseudotuberculosis based on protein-protein interaction networks.BMC Syst Biol. 2016 Nov 4;10(1):103. doi: 10.1186/s12918-016-0346-4. BMC Syst Biol. 2016. PMID: 27814699 Free PMC article.
References
-
- Chothia C. Proteins. One thousand families for the molecular biologist. Nature. 1992;357:543–544. - PubMed
-
- Zhang C, DeLisi C. Estimating the number of protein folds. J Mol Biol. 1998;284:1301–1305. - PubMed
-
- Govindarajan S, Recabarren R, Goldstein RA. Estimating the total number of protein folds. Proteins. 1999;35:408–414. - PubMed
-
- Liu X, Fan K, Wang W. The number of protein folds and their distribution over families in nature. Proteins. 2004;54:491–499. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Molecular Biology Databases