Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Dec 12:9:29.
doi: 10.1186/s13062-014-0029-2.

The fundamental tradeoff in genomes and proteomes of prokaryotes established by the genetic code, codon entropy, and physics of nucleic acids and proteins

Affiliations

The fundamental tradeoff in genomes and proteomes of prokaryotes established by the genetic code, codon entropy, and physics of nucleic acids and proteins

Alexander Goncearenco et al. Biol Direct. .

Abstract

Background: Mutations in nucleotide sequences provide a foundation for genetic variability, and selection is the driving force of the evolution and molecular adaptation. Despite considerable progress in the understanding of selective forces and their compositional determinants, the very nature of underlying mutational biases remains unclear.

Results: We explore here a fundamental tradeoff, which analytically describes mutual adjustment of the nucleotide and amino acid compositions and its possible effect on the mutational biases. The tradeoff is determined by the interplay between the genetic code, optimization of the codon entropy, and demands on the structure and stability of nucleic acids and proteins.

Conclusion: The tradeoff is the unifying property of all prokaryotes regardless of the differences in their phylogenies, life styles, and extreme environments. It underlies mutational biases characteristic for genomes with different nucleotide and amino acid compositions, providing foundation for evolution and adaptation.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Distribution of the GC CB and GC NCB content and the derived theoretical limits for 1364 prokaryotic genomes. A, the highest (red) and lowest (green) limits on nucleotide composition obtained by replacing natural codons with synonymous GC-richest and GC-poorest ones. B, the theoretical limits (red and green) on amino acid composition (GCNCB), see also Additional file 1: Figure S1.
Figure 2
Figure 2
The tradeoff between nucleotide and amino acid compositions. Theoretical model. Black circles represent the genomes. The lower (green) and upper (red) limits for GCCB are calculated in the same way as in Figure 1a. The tradeoff describes the relation between the two components of GC depicted by an orange curve (with the coefficients a = 20.82, b = −16.28, c = −49.4, r = 0.255). The colored circles illustrate three pairs of genomes with the same GCNAT (45 percent – magenta, 51 – yellow, and 63 – blue) obtained by combining the GCCB and GCNCB in different proportions (see Additional file 1: Table S7 for details).
Figure 3
Figure 3
Simulations of the tradeoff dynamics via random nucleotide mutations. A, B, simulations for two genomes (Nocardiopsis dassonvillei subs. dassonvillei DSM 43111, GCNAT = 72.7; and Streptobacillus moniliformis DSM 12112, GCNAT = 26.3) with distorted codon bias. A, changes in the GCCB/GCNCB pairs related to the tradeoff. B, the increase of the genomes’ codon entropy in relation to the distribution of entropies in all genomes. C and D, simulations of mutations in 1364 natural genomes; the simulation starts at points marked by filled circles and continues along the lines. The traces are colored by average amino acid depth C, changes in the GCCB/GCNCB pairs related to the tradeoff. D, behavior of the codon entropy.
Figure 4
Figure 4
Environmental factors in relation to the tradeoff: A, Temperature; B, Habitat; C, Aerobicity; D, Domain of Life.
Figure 5
Figure 5
Limits of the codon entropy in genomes with the natural amino acid composition preserved. A, Natural nucleotide composition (black); synonymous codons substituted with the GC-richest ones (red) and the GC-poorest ones (green); synonymous codons were used uniformly (blue), GCNCB. B, Natural nucleotide composition (black); reduced synonymous codons (orange) – in cases when there are several synonymous codons with the same GC saturation, only one codon was used.
Figure 6
Figure 6
Dependence of the purine/pyrimidine ratio (A) and the average amino acid depth (B) on the GC content of protein-coding sequences.
Figure 7
Figure 7
Dependence of the purine-purine RpR (A) pyrimidine-pyrimidine YpY (B), pyrimidine-purine YpR (C), purine-pyrimidine RpY (D) dinucleotides on the genomic GC NAT .
Figure 8
Figure 8
Dependence of the adenine-adenine ApA (A), guanine-guanine GpG (B), adenine-guanine ApG (C), guanine-adenine GpA (D) dinucleotides on the genomic GC NAT .

Similar articles

Cited by

References

    1. Berezovsky IN, Zeldovich KB, Shakhnovich EI. Positive and negative design in stability and thermal adaptation of natural proteins. PLoS Comput Biol. 2007;3(3):e52. doi: 10.1371/journal.pcbi.0030052. - DOI - PMC - PubMed
    1. Goncearenco A, Ma BG, Berezovsky IN. Molecular mechanisms of adaptation emerging from the physics and evolution of nucleic acids and proteins. Nucleic Acids Res. 2014;42(5):2879–2892. doi: 10.1093/nar/gkt1336. - DOI - PMC - PubMed
    1. Ma BG, Goncearenco A, Berezovsky IN. Thermophilic adaptation of protein complexes inferred from proteomic homology modeling. Structure. 2010;18(7):819–828. doi: 10.1016/j.str.2010.04.004. - DOI - PubMed
    1. Pe’er I, Felder CE, Man O, Silman I, Sussman JL, Beckmann JS. Proteomic signatures: amino acid and oligopeptide compositions differentiate among phyla. Proteins. 2004;54(1):20–40. doi: 10.1002/prot.10559. - DOI - PubMed
    1. Zeldovich KB, Berezovsky IN, Shakhnovich EI. Protein and DNA sequence determinants of thermophilic adaptation. PLoS Comput Biol. 2007;3(1):e5. doi: 10.1371/journal.pcbi.0030005. - DOI - PMC - PubMed

LinkOut - more resources