Nature of the protein universe
- PMID: 19541617
- PMCID: PMC2698892
- DOI: 10.1073/pnas.0905029106
Abstract
The protein universe is the set of all proteins of all organisms. Here, all currently known sequences are analyzed in terms of families that have single-domain or multidomain architectures and whether they have a known three-dimensional structure. Growth of new single-domain families is very slow: Almost all growth comes from new multidomain architectures that are combinations of domains characterized by approximately 15,000 sequence profiles. Single-domain families are mostly shared by the major groups of organisms, whereas multidomain architectures are specific and account for species diversity. There are known structures for a quarter of the single-domain families, and >70% of all sequences can be partially modeled thanks to their membership in these families.
Conflict of interest statement
The author declares no conflict of interest.