Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Nov;4(11):e1000256.
doi: 10.1371/journal.pgen.1000256. Epub 2008 Nov 14.

Synonymous genes explore different evolutionary landscapes

Affiliations

Synonymous genes explore different evolutionary landscapes

Guillaume Cambray et al. PLoS Genet. 2008 Nov.

Abstract

The evolutionary potential of a gene is constrained not only by the amino acid sequence of its product, but by its DNA sequence as well. The topology of the genetic code is such that half of the amino acids exhibit synonymous codons that can reach different subsets of amino acids from each other through single mutation. Thus, synonymous DNA sequences should access different regions of the protein sequence space through a limited number of mutations, and this may deeply influence the evolution of natural proteins. Here, we demonstrate that this feature can be of value for manipulating protein evolvability. We designed an algorithm that, starting from an input gene, constructs a synonymous sequence that systematically includes the codons with the most different evolutionary perspectives; i.e., codons that maximize accessibility to amino acids previously unreachable from the template by point mutation. A synonymous version of a bacterial antibiotic resistance gene was computed and synthesized. When concurrently submitted to identical directed evolution protocols, both the wild type and the recoded sequence led to the isolation of specific, advantageous phenotypic variants. Simulations based on a mutation isolated only from the synthetic gene libraries were conducted to assess the impact of sub-functional selective constraints, such as codon usage, on natural adaptation. Our data demonstrate that rational design of synonymous synthetic genes stands as an affordable improvement to any directed evolution protocol. We show that using two synonymous DNA sequences improves the overall yield of the procedure by increasing the diversity of mutants generated. These results provide conclusive evidence that synonymous coding sequences do experience different areas of the corresponding protein adaptive landscape, and that a sequence's codon usage effectively constrains the evolution of the encoded protein.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Standard genetic code and codon evolutionary landscapes.
Every codon can access nine other codons by single mutation, which corresponds to only 5.8 proximal aa on average due to the code redundancy. These evolutionary perspectives are highlighted in yellow and blue and for the arginine codons CGG and CGT, respectively (green indicates codons accessible from both). Amino acids reachable by only one of the two codons are bold and underlined.
Figure 2
Figure 2. Evolutionary landscape comparison of the aacWT and aacELP synonymous genes.
aacELP has been designed by the ELP software to maximize evolutionary perspective divergence from aacWT when subjected to single mutation. The 184 successive positions along AAC(6′) are represented as columns. Each row stands for an aa, as indicated by the one-letter code in first column. Intersection between aa and position are coloured according to the following code: black, aa encoded at that position; blue, aa specifically accessible by aacWT; red, aa specifically accessible by aacELP; yellow, aa directly accessible from both aac genes; grey, aa inaccessible from either aac genes through single mutation. Blue and red patterns, denoting different evolutionary perspectives, are widely and evenly distributed along the sequence. The histogram on the bottom summarizes the number of aa directly accessible only by aacWT (blue squares) or aacELP (red squares) for each position. Mutations isolated in this study are indicated above the diagram and corresponding positions are black lined. Mutations' outcomes are highlighted with bright green borders. Asterisks denote mutations isolated together.
Figure 3
Figure 3. Comparison of the amino acid substitutions identified in alleles sampled from the mutant libraries before selection.
This diagram is reminiscent of Figure 2. The 184 successive positions along the AAC(6′)-Ib protein are represented as columns. Each row stands for an aa, as indicated by the one-letter code in first column. Black cells show the WT sequence, other coloured cells highlight the substitutions identified by sequencing 297 aacWT and 267 aacELP alleles, independent and randomly chosen before any selection. The color code is the following: blue, observed substitution only directly accessible from aacWT; red, observed substitution only directly accessible from aacELP; yellow, observed substitution directly accessible from both genes; green, observed substitution resulting from a double mutation in the same codon. Overall, we identified 524 substitutions out of which 110 were only accessible from one of the version of the gene and not the other. The mutations specific of each gene versions were only observed from the cognate template. The histogram on the bottom summarizes the number of aa substitution specific to each version that were sampled at each position (aacWT, blue squares; aacELP, red squares). The shaded area recalls the theoretical pattern as shown in Figure 4.
Figure 4
Figure 4. Impact of the low-usage leucine codon CTA on L55Q evolutionary pathways.
The L55Q mutation has only been isolated from aacELP libraries and has never been described in natural isolates. In aacWT, L55 is encoded by TTA. Panel A: two main pathways can be followed to access Gln under selection, one with a single CTA intermediate, which might be associated with a lower fitness (top, selective pathway) and the other with two neutral intermediates (bottom, neutral pathway). Every genotype presented in these schematic landscapes has been constructed and assessed for growth on isepamicin. The colouring scheme, with darker colours associated with lower fitness, derives from these results except for the weakly used codon CTA for which the putative fitness effect was not measurable. Panel B: Monte-Carlo simulations of the evolution of TTA alleles populations were carried out, assuming various fitness f of CTA. s denotes the selective coefficient associated with the CTA codon. The cumulative number of simulations where CAA (top, selective pathway) or CAG (bottom, neutral pathway) reached fixation is plotted over time in generations (2,000 repetitions). Even weak selective coefficient against CTA (e.g. 5 10−4) greatly affects the use of the selective pathway. Although longer, the neutral pathway is preferred over the selective one from selective coefficients as low as 2.5 10−3.

Similar articles

Cited by

References

    1. Aharoni A, Gaidukov L, Khersonsky O, Gould SM, Roodveldt C, et al. The ‘evolvability’ of promiscuous protein functions. Nat Genet. 2005;37:73–76. - PubMed
    1. Weinreich DM, Delaney NF, DePristo MA, Hartl DL. Darwinian Evolution Can Follow Only Very Few Mutational Paths to Fitter Proteins. Science. 2006;312:111–114. - PubMed
    1. Sprinzak D, Elowitz MB. Reconstruction of genetic circuits. Nature. 2005;438:443. - PubMed
    1. Peisajovich SG, Tawfik DS. Protein engineers turned evolutionists. Nature Methods. 2007:991–994. - PubMed
    1. Maynard Smith J. Natural selection and the concept of a protein space. Nature. 1970;225:563–564. - PubMed

Publication types

Substances

Associated data