The language of the protein universe
- PMID: 26451980
- PMCID: PMC4695241
- DOI: 10.1016/j.gde.2015.08.010
Abstract
Proteins, the main cellular machinery, play a major role in nearly every cellular process and have always been a central focus in biology. We live in the post-genomic era, and inferring information from massive data sets is a steadily growing, universal challenge. The increasing availability of fully sequenced genomes can be regarded as the 'Rosetta Stone' of the protein universe: it allows us to understand genomes and their evolution, just as the original Rosetta Stone allowed Champollion to decipher ancient Egyptian hieroglyphics. In this review, we consider aspects of the repertoire of protein domain architectures that closely resemble features of human languages, and we aim to provide some insights into the language of proteins.
Copyright © 2015 Elsevier Ltd. All rights reserved.
