Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Mar 1;10(3):816-825.
doi: 10.1093/gbe/evy046.

Glutamine Codon Usage and polyQ Evolution in Primates Depend on the Q Stretch Length

Affiliations

Glutamine Codon Usage and polyQ Evolution in Primates Depend on the Q Stretch Length

Pablo Mier et al. Genome Biol Evol. .

Abstract

Amino acid usage in a proteome depends mostly on its taxonomy, as it does the codon usage in transcriptomes. Here, we explore the level of variation in the codon usage of a specific amino acid, glutamine, in relation to the number of consecutive glutamine residues. We show that CAG triplets are consistently more abundant in short glutamine homorepeats (polyQ, four to eight residues) than in shorter glutamine stretches (one to three residues), leading to the evolutionary growth of the repeat region in a CAG-dependent manner. The length of orthologous polyQ regions is mostly stable in primates, particularly the short ones. Interestingly, given a short polyQ the CAG usage is higher in unstable-in-length orthologous polyQ regions. This indicates that CAG triplets produce the necessary instability for a glutamine stretch to grow. Proteins related to polyQ-associated diseases behave in a more extreme way, with longer glutamine stretches in human and evolutionarily closer nonhuman primates, and an overall higher CAG usage. In the light of our results, we suggest an evolutionary model to explain the glutamine codon usage in polyQ regions.

PubMed Disclaimer

Figures

<sc>Fig</sc>. 1.
Fig. 1.
—Characterization of glutamine stretches in complete proteomes. (a) Number of glutamine stretches per number of proteins per proteome, depending on the stretch length. (b) Percentage of CAG triplets in glutamine stretches per proteome, depending on the stretch length. (c) Percentage of CAG triplets in glutamine stretches of lengths 4–8 compared with lengths 1–3; the result for each proteome is colored depending on the pairwise divergence time with human. The discontinuous line represents x = y values. (d) Overall CAG percentages in primates in glutamine stretches of varying lengths.
<sc>Fig</sc>. 2.
Fig. 2.
—Length differences of glutamine stretches between primates. Length differences from the reference glutamine stretch to the rest of the orthologous glutamine stretches, when the length of the reference stretch is (a) 0–3, (b) 4–8, and (c) >8.
<sc>Fig</sc>. 3.
Fig. 3.
—Codon purity of glutamine stretches in primates. Percentage of glutamine stretches per length encoded by (a) only CAG codons, (b) a mix of CAG and CAA codons, (c) a mix of CAG, CAA, and other interrupting codons, and (d) only CAA codons. Considering only glutamine stretches encoded by more than one different triplet, maximum number of consecutive (e) CAG, and (f) CAA per length.
<sc>Fig</sc>. 4.
Fig. 4.
—Length-stability of glutamine stretches. Percentage of orthologous glutamine stretch regions with a stable length in at least half of the orthologs.
<sc>Fig</sc>. 5.
Fig. 5.
—Glutamine codon usage in stable- versus unstable-in-length polyQ. Codon usage calculated in stable-in-length (S) and unstable-in-length (U), short (4–8 Q), and long (>8 Q) polyQ stretches, in 12 primates.
<sc>Fig</sc>. 6.
Fig. 6.
—PolyQ lengths and glutamine codon usage in proteins related to polyQ-expansion diseases. Divergence time for each organism and human is measured in million years (Myr). The tree on top relates the species based on their NCBI taxonomy. Each protein is appended with its related disease. The overall Q lengths (in blue) and CAG percentage prevalence (in red) plots take into account the results per species of the nine proteins shown above.

Similar articles

Cited by

References

    1. Abascal F, Zardoya R, Telford MJ.. 2010. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations. Nucleic Acids Res. 38(Suppl_2):W7–13. - PMC - PubMed
    1. Albà MM, Guigó R.. 2004. Comparative analysis of amino acid repeats in rodents and humans. Genome Res. 144:549–554. - PMC - PubMed
    1. Athey J, et al.2017. A new and updated resource for codon usage tables. BMC Bioinformatics 181. - PMC - PubMed
    1. Barik S. 2017. Amino acid repeats avert mRNA folding through conservative substitutions and synonymous codons, regardless of codon bias. Heliyon 3:12. - PMC - PubMed
    1. Barton S, Jacak R, Khare SD, Ding F, Dokholyan NV.. 2007. The length dependence of the polyQ-mediated protein aggregation. J Biol Chem. 28235:25487–25492. - PubMed

Publication types

LinkOut - more resources