Low-usage codons in Escherichia coli, yeast, fruit fly and primates
- PMID: 1937008
- DOI: 10.1016/0378-1119(91)90514-c
Low-usage codons in Escherichia coli, yeast, fruit fly and primates
Abstract
Codon usage is compared between four classes of species, with an emphasis on characterization of low-usage codons. The classes of species analyzed include the bacterium Escherichia coli (ECO), the yeast Saccharomyces cerevisiae (YSC), the fruit fly Drosophila melanogaster (DRO), and several species of primates (PRI) (taken as a group; includes eleven species for which nucleotide sequence data have been reported to GenBank, however, greater than 90% of the sequences were from Homo sapiens). The number of protein-coding sequences analyzed were 968 for ECO, 484 for YSC, 244 for DRO, and 1518 for PRI. Three methods have been used to determine low-usage codons in these species. The first and most common way of assessing codon usage is by summing the number of time codons appear in reading frames of the genome in question. The second way is to examine the distribution of usage in different genes by scoring the number of protein reading frames in which a particular codon does not appear. The third way starts with a similar notion, but instead considers combinations of codons that are missing from the maximum number of genes. These three methods give very similar results. Each species has a unique combination of eight least-used codons, but all species contain the arginine codons, CGA and CGG. The agreement between YSC and PRI is particularly striking as they share six low-usage codons. All six carry the dinucleotide sequence, CG. The eight least-used codons in PRI include all codons that contain the CG dinucleotide sequence. Low-usage codons are clearly avoided in genes encoding abundant proteins for ECO, YSC DRO. In all species, proteins containing a high percentage of low-usage codons could be characterized as cases where an excess of the protein could be detrimental. Low codon usage is relatively insensitive to gross base composition. However, dinucleotide usage can sometimes influence codon usage. This is particularly notable in the case of CG dinucleotides in PRI.
Similar articles
-
Codon usage bias is correlated with gene expression levels in the fission yeast Schizosaccharomyces pombe.Genes Cells. 2009 Apr;14(4):499-509. doi: 10.1111/j.1365-2443.2009.01284.x. Genes Cells. 2009. PMID: 19335619
-
An evaluation of measures of synonymous codon usage bias.J Mol Evol. 1998 Sep;47(3):268-74. doi: 10.1007/pl00006384. J Mol Evol. 1998. PMID: 9732453
-
Codon usage and tRNA genes in eukaryotes: correlation of codon usage diversity with translation efficiency and with CG-dinucleotide usage as assessed by multivariate analysis.J Mol Evol. 2001 Oct-Nov;53(4-5):290-8. doi: 10.1007/s002390010219. J Mol Evol. 2001. PMID: 11675589
-
Codon usage patterns in Escherichia coli, Bacillus subtilis, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Drosophila melanogaster and Homo sapiens; a review of the considerable within-species diversity.Nucleic Acids Res. 1988 Sep 12;16(17):8207-11. doi: 10.1093/nar/16.17.8207. Nucleic Acids Res. 1988. PMID: 3138659 Free PMC article. Review.
-
Analysis of synonymous codon usage patterns in the edible fungus Volvariella volvacea.Biotechnol Appl Biochem. 2017 Mar;64(2):218-224. doi: 10.1002/bab.1538. Epub 2016 Dec 15. Biotechnol Appl Biochem. 2017. PMID: 27696508 Review.
Cited by
-
Dynamic changes in translational efficiency are deduced from codon usage of the transcriptome.Nucleic Acids Res. 2012 Nov 1;40(20):10053-63. doi: 10.1093/nar/gks772. Epub 2012 Aug 31. Nucleic Acids Res. 2012. PMID: 22941644 Free PMC article.
-
A dual-reporter system for investigating and optimizing protein translation and folding in E. coli.Nat Commun. 2021 Oct 19;12(1):6093. doi: 10.1038/s41467-021-26337-1. Nat Commun. 2021. PMID: 34667164 Free PMC article.
-
The impact of ribosomal interference, codon usage, and exit tunnel interactions on translation elongation rate variation.PLoS Genet. 2018 Jan 16;14(1):e1007166. doi: 10.1371/journal.pgen.1007166. eCollection 2018 Jan. PLoS Genet. 2018. PMID: 29337993 Free PMC article.
-
Characterisation of full-length cDNA sequences provides insights into the Eimeria tenella transcriptome.BMC Genomics. 2012 Jan 13;13:21. doi: 10.1186/1471-2164-13-21. BMC Genomics. 2012. PMID: 22244352 Free PMC article.
-
Synonymous codon usage in different protein secondary structural classes of human genes: implication for increased non-randomness of GC3 rich genes towards protein stability.J Biosci. 2007 Aug;32(5):947-63. doi: 10.1007/s12038-007-0095-z. J Biosci. 2007. PMID: 17914237
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Research Materials