Proteomic signatures: amino acid and oligopeptide compositions differentiate among phyla
- PMID: 14705021
- DOI: 10.1002/prot.10559
Proteomic signatures: amino acid and oligopeptide compositions differentiate among phyla
Abstract
Availability of complete genome sequences allows in-depth comparison of single-residue and oligopeptide compositions of the corresponding proteomes. We have used principal component analysis (PCA) to study the landscape of compositional motifs across more than 70 genera from all three superkingdoms. Unexpectedly, the first two principal components clearly differentiate archaea, eubacteria, and eukaryota from each other. In particular, we contrast compositional patterns typical of the three superkingdoms and characterize differences between species and phyla, as well as among patterns shared by all compositional proteomic signatures. These species-specific patterns may even extend to subsets of the entire proteome, such as proteins pertaining to individual yeast chromosomes. We identify factors that affect compositional signatures, such as living habitat, and detect strong eukaryotic preference for homopeptides and palindromic tripeptides. We further detect oligopeptides that are either universally over- or underabundant across the whole proteomic landscape, as well as oligopeptides whose over- or underabundance is phylum- or species-specific. Finally, we report that species composition signatures preserve evolutionary memory, providing a new method to compare phylogenetic relationships among species that avoids problems of sequence alignment and ortholog detection.
Copyright 2003 Wiley-Liss, Inc.
Similar articles
-
[Comparative analysis of internal repeating segments in proteins of species from the three kingdoms of life].Yi Chuan Xue Bao. 2005 Mar;32(3):315-21. Yi Chuan Xue Bao. 2005. PMID: 15931794 Chinese.
-
Evolution of proteomes: fundamental signatures and global trends in amino acid compositions.BMC Genomics. 2006 Dec 5;7:307. doi: 10.1186/1471-2164-7-307. BMC Genomics. 2006. PMID: 17147802 Free PMC article.
-
A tree of life based on protein domain organizations.Mol Biol Evol. 2007 May;24(5):1181-9. doi: 10.1093/molbev/msm034. Epub 2007 Mar 1. Mol Biol Evol. 2007. PMID: 17331957
-
Modeling sequence evolution.Methods Mol Biol. 2008;452:255-85. doi: 10.1007/978-1-60327-159-2_13. Methods Mol Biol. 2008. PMID: 18566769 Review.
-
Potential implications of availability of short amino acid sequences in proteins: an old and new approach to protein decoding and design.Biotechnol Annu Rev. 2008;14:109-41. doi: 10.1016/S1387-2656(08)00004-5. Biotechnol Annu Rev. 2008. PMID: 18606361 Review.
Cited by
-
Global chemical modifications comparison of human plasma proteome from two different age groups.Sci Rep. 2020 Sep 14;10(1):14998. doi: 10.1038/s41598-020-72196-z. Sci Rep. 2020. PMID: 32929118 Free PMC article.
-
Use of a multi-way method to analyze the amino acid composition of a conserved group of orthologous proteins in prokaryotes.BMC Bioinformatics. 2006 May 18;7:257. doi: 10.1186/1471-2105-7-257. BMC Bioinformatics. 2006. PMID: 16709240 Free PMC article.
-
Combining machine learning and homology-based approaches to accurately predict subcellular localization in Arabidopsis.Plant Physiol. 2010 Sep;154(1):36-54. doi: 10.1104/pp.110.156851. Epub 2010 Jul 20. Plant Physiol. 2010. PMID: 20647376 Free PMC article.
-
Tracing the birth and intrinsic disorder of loops and domains in protein evolution.Biophys Rev. 2024 Nov 20;16(6):723-735. doi: 10.1007/s12551-024-01251-0. eCollection 2024 Dec. Biophys Rev. 2024. PMID: 39830125 Free PMC article. Review.
-
The oligodeoxynucleotide sequences corresponding to never-expressed peptide motifs are mainly located in the non-coding strand.BMC Bioinformatics. 2010 Jul 20;11:383. doi: 10.1186/1471-2105-11-383. BMC Bioinformatics. 2010. PMID: 20646284 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases