Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Feb 15;35(4):579-583.
doi: 10.1093/bioinformatics/bty678.

Global importance of RNA secondary structures in protein-coding sequences

Affiliations

Global importance of RNA secondary structures in protein-coding sequences

Markus Fricke et al. Bioinformatics. .

Abstract

Motivation: The protein-coding sequences of messenger RNAs are the linear template for translation of the gene sequence into protein. Nevertheless, the RNA can also form secondary structures by intramolecular base-pairing.

Results: We show that the nucleotide distribution within codons is biased in all taxa of life on a global scale. Thereby, RNA secondary structures that require base-pairing between the position 1 of a codon with the position 1 of an opposing codon (here named RNA secondary structure class c1) are under-represented. We conclude that this bias may result from the co-evolution of codon sequence and mRNA secondary structure, suggesting that RNA secondary structures are generally important in protein-coding regions of mRNAs. The above result also implies that codon position 2 has a smaller influence on the amino acid choice than codon position 1.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
(A) Three base pair/RNA secondary structure classes that represent all possible base-pairs between positions 1, 2 and 3 of one codon with positions 1, 2 and 3 of an opposing codon in RNA secondary structures. The classes are based on the following groups c1={1:1, 2:3, 3:2}, c2={1:3, 2:2, 3:1} and c3={1:2, 2:1, 3:3}. (B) An artificial example RNA secondary structure with the different classes corresponding to A. Shifts between the different classes are possible, introduced through loops and bulges
Fig. 2.
Fig. 2.
Distribution of class c1, c2 and c3 within all selected coding sequences (sCDS, see methods for selection procedure) except mitochondria and chloroplasts for each taxonomic group. #species – number of different species; #CDS – number of coding sequence (CDS); #sCDS – number of selected CDS; #nt/sCDS – number of nucleotides per selected CDS
Fig. 3.
Fig. 3.
(A) Class distribution and (B) nucleotide distribution of each codon positions of all human mRNAs with less stable (mean MFE>−8 kcal/mol over all folding windows, #7104 mRNAs, TOP) and rather stable (mean MFE  −8 kcal/mol over all folding windows, #15022 mRNAs, BOTTOM) RNA secondary structures. The nucleotide distributions in codon positions 2 and 3 of the less stable mRNAs are similar, since base-pairings of 2:3 and 3:2 are likely. In contrast, mRNAs with more stable RNA secondary structure show an increased GC content at codon position 1 and in particular at position 3. This increases the G:C and C:G base-pairings 1:3, 3:1 and 3:3 (Supplementary Fig. S6) and consequently increases the frequency of classes c2 and c3
Fig. 4.
Fig. 4.
Base pair/RNA secondary structure class distribution in mitochondria and chloroplasts

Similar articles

Cited by

References

    1. Allen J.F. (2015) Why chloroplasts and mitochondria retain their own genomes and genetic systems: colocation for redox regulation of gene expression. Proc. Natl. Acad. Sci. USA, 112, 10231–10238. - PMC - PubMed
    1. Anderson C.P. et al. (2012) Mammalian iron metabolism and its control by iron regulatory proteins. Biochim. Biophys. Acta, 1823, 1468–1483. - PMC - PubMed
    1. Ball L.A. (1973) Secondary structure and coding potential of the coat protein gene of bacteriophage MS2. Nature New Biol., 242, 44–45. - PubMed
    1. Berleant D. et al. (2009) The genetic code–more than just a table. Cell Biochem. Biophys., 55, 107–116. - PubMed
    1. Carlini D.B. et al. (2001) The relationship between third-codon position nucleotide content, codon bias, mRNA secondary structure and gene expression in the drosophilid alcohol dehydrogenase genes Adh and Adhr. Genetics, 159, 623–633. - PMC - PubMed

Publication types