Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Apr;52(2):195-205.
doi: 10.1002/jobm.201100067. Epub 2011 Jul 21.

GC content-independent amino acid patterns in bacteria and archaea

Affiliations

GC content-independent amino acid patterns in bacteria and archaea

Andre Schmidt et al. J Basic Microbiol. 2012 Apr.

Abstract

Every organism can be characterized by the amino acid composition of its proteome. So far it was assumed that these compositions are determined by the GC content of the DNA or, in some cases, by extreme lifestyles, like thermophily or halophily. Here, we focussed our analysis on eight amino acids, each of which is encoded by both, GC and AT rich codons, to identify finer amino acid patterns beyond the GC dominance. We investigated the conceptually translated proteomes of 1029 bacterial and archaeal strains with sequenced genomes for amino acid composition. Using correspondence analysis, we found that phylogenetic groups within bacteria and archaea generally can be discriminated from other groups due to their amino acid composition. In some cases, single organisms, e.g. Treponema pallidum strains or Mycoplasma penetrans, are characterized by extreme amino acid compositions. We assume that our data could provide a basis for a new approach to analyze evolution of bacterial and archaeal groups. Furthermore, for single organisms, the detailed knowledge of the amino acid composition of the entire proteome encoded in the genome could lead to a better understanding, important for pharmaceutical or biotechnological applications. We recommend that information about amino acid compositions should be provided in databases, comparable to the GC content of genomes.

PubMed Disclaimer

Publication types

LinkOut - more resources