Internal correspondence analysis of codon and amino-acid usage in thermophilic bacteria
- PMID: 12817570
Abstract
Starting from two datasets of codon usage in coding sequences from mesophilic and thermophilic bacteria, we used internal correspondence analysis to study the variability of codon usage within and between species, and within and between amino acids. The first dataset included 18,958,458 codons from 58,482 coding sequences from completely sequenced genomes of 25 species, along with 6,793,581 dinucleotides from 21,876 intergenic spaces. The second dataset, with partially sequenced genomes, included 97,095,873 codons from 293 bacterial species. Results were consistent between the two datasets. The trend for the amino-acid composition of thermophilic proteins was found to be under the control of a pressure at the nucleic acid level, not a selection at the protein level. This effect was not present in intergenic spaces, ruling out a pressure at the DNA level. The pattern at the mRNA level was more complex than a simple purine enrichment of the sense strand of coding sequences. Outliers in the partial genome dataset introduced a note of caution about the interpretation of temperature as the direct determinant of the trend observed in thermophiles. The surprising lack of selection on the amino-acid content of thermophilic proteins suggests that the amino-acid repertoire was set up in a hot environment.
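The distinction between variability "between amino acids" and "within amino acids" can be illustrated with a small inertia decomposition. The sketch below is not the authors' internal correspondence analysis and uses no data from the study: the counts, the three unnamed species, and the helper names `inertia` and `collapse` are all assumptions made for illustration. It relies only on a standard property of chi-square inertia: when the codon columns of a species-by-codon table are summed by amino acid, the inertia of the collapsed table is the between-amino-acid part, and the remainder is the within-amino-acid (synonymous codon) part.

```python
# Toy counts, invented for illustration only (not from the study).
# Rows: three hypothetical species. Columns: four codons for two amino acids.
codons = ["AAA", "AAG", "GAA", "GAG"]
codon_to_aa = ["Lys", "Lys", "Glu", "Glu"]
counts = [
    [30, 10, 40, 20],
    [20, 20, 20, 20],
    [10, 30, 10, 20],
]


def inertia(table):
    """Chi-square statistic of a contingency table divided by its grand total."""
    n = sum(map(sum, table))
    row = [sum(r) for r in table]
    col = [sum(c) for c in zip(*table)]
    chi2 = 0.0
    for i, r in enumerate(table):
        for j, observed in enumerate(r):
            expected = row[i] * col[j] / n
            chi2 += (observed - expected) ** 2 / expected
    return chi2 / n


def collapse(table, labels):
    """Sum the columns that share a label (here: codons of one amino acid)."""
    groups = sorted(set(labels))
    return [
        [sum(x for x, lab in zip(r, labels) if lab == g) for g in groups]
        for r in table
    ]


total = inertia(counts)                           # codon-level variability
between = inertia(collapse(counts, codon_to_aa))  # amino-acid composition
within = total - between                          # synonymous codon usage

print(f"total inertia:              {total:.4f}")
print(f"between amino acids:        {between:.4f}")
print(f"within amino acids (synon.): {within:.4f}")
```

In this toy setting, a large "between" share would mean that species differ mainly in amino-acid composition, while a large "within" share would mean they differ mainly in their choice among synonymous codons. The full method described in the abstract additionally separates within-species from between-species variability, which this sketch does not attempt.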