The genomic features that affect the lengths of 5' untranslated regions in multicellular eukaryotes
- PMID: 22152105
- PMCID: PMC3283318
- DOI: 10.1186/1471-2105-12-S9-S3
The genomic features that affect the lengths of 5' untranslated regions in multicellular eukaryotes
Abstract
Background: The lengths of 5'UTRs of multicellular eukaryotes have been suggested to be subject to stochastic changes, with upstream start codons (uAUGs) as the major constraint to suppress 5'UTR elongation. However, this stochastic model cannot fully explain the variations in 5'UTR length. We hypothesize that the selection pressure on a combination of genomic features is also important for 5'UTR evolution. The ignorance of these features may have limited the explanatory power of the stochastic model. Furthermore, different selective constraints between vertebrates and invertebrates may lead to differences in the determinants of 5'UTR length, which have not been systematically analyzed.
Methods: Here we use a multiple linear regression model to delineate the correlation between 5'UTR length and the combination of a series of genomic features (G+C content, observed-to-expected (OE) ratios of uAUGs, upstream stop codons (uSTOPs), methylation-related CG/UG dinucleotides, and mRNA-destabilizing UU/UA dinucleotides) in six vertebrates (human, mouse, rat, chicken, African clawed frog, and zebrafish) and four invertebrates (fruit fly, mosquito, sea squirt, and nematode). The relative contributions of each feature to the variation of 5'UTR length were also evaluated.
Results: We found that 14%~33% of the 5'UTR length variations can be explained by a linear combination of the analyzed genomic features. The most important genomic features are the OE ratios of uSTOPs and G+C content. The surprisingly large weightings of uSTOPs highlight the importance of selection on upstream open reading frames (which include both uAUGs and uSTOPs), rather than on uAUGs per se. Furthermore, G+C content is the most important determinants for most invertebrates, but for vertebrates its effect is second to uSTOPs. We also found that shorter 5'UTRs are affected more by the stochastic process, whereas longer 5'UTRs are affected more by selection pressure on genomic features.
Conclusions: Our results suggest that upstream open reading frames may be the real target of selection, rather than uAUGs. We also show that the selective constraints on genomic features of 5'UTRs differ between vertebrates and invertebrates, and between longer and shorter 5'UTRs. A more comprehensive model that takes these findings into consideration is needed to better explain 5'UTR length evolution.
Figures

Similar articles
-
Exploring the selective constraint on the sizes of insertions and deletions in 5' untranslated regions in mammals.BMC Evol Biol. 2011 Jul 5;11:192. doi: 10.1186/1471-2148-11-192. BMC Evol Biol. 2011. PMID: 21726469 Free PMC article.
-
Combining a prioritization strategy and functional studies nominates 5'UTR variants underlying inherited retinal disease.Genome Med. 2024 Jan 6;16(1):7. doi: 10.1186/s13073-023-01277-1. Genome Med. 2024. PMID: 38184646 Free PMC article.
-
Evolution of alternative and constitutive regions of mammalian 5'UTRs.BMC Genomics. 2009 Apr 16;10:162. doi: 10.1186/1471-2164-10-162. BMC Genomics. 2009. PMID: 19371439 Free PMC article.
-
The Functional Meaning of 5'UTR in Protein-Coding Genes.Int J Mol Sci. 2023 Feb 3;24(3):2976. doi: 10.3390/ijms24032976. Int J Mol Sci. 2023. PMID: 36769304 Free PMC article. Review.
-
Machine Learning for Designing Next-Generation mRNA Therapeutics.Acc Chem Res. 2022 Jan 4;55(1):24-34. doi: 10.1021/acs.accounts.1c00621. Epub 2021 Dec 14. Acc Chem Res. 2022. PMID: 34905691 Review.
Cited by
-
Functionally Significant Features in the 5' Untranslated Region of the ABCA1 Gene and Their Comparison in Vertebrates.Cells. 2019 Jun 21;8(6):623. doi: 10.3390/cells8060623. Cells. 2019. PMID: 31234415 Free PMC article.
-
Ribosome profiling reveals translational regulation of mammalian cells in response to hypoxic stress.BMC Genomics. 2017 Aug 21;18(1):638. doi: 10.1186/s12864-017-3996-8. BMC Genomics. 2017. PMID: 28826393 Free PMC article.
-
Control of Translation at the Initiation Phase During Glucose Starvation in Yeast.Int J Mol Sci. 2019 Aug 19;20(16):4043. doi: 10.3390/ijms20164043. Int J Mol Sci. 2019. PMID: 31430885 Free PMC article. Review.
-
Expression and regulation of mouse selenoprotein P transcript variants differing in non-coding RNA.RNA Biol. 2012 Nov;9(11):1361-9. doi: 10.4161/rna.22290. Epub 2012 Oct 12. RNA Biol. 2012. PMID: 23064117 Free PMC article.
-
Generation of Transcript Length Variants and Reprogramming of mRNA Splicing During Atherosclerosis Progression in ApoE-Deficient Mice.Biomedicines. 2024 Nov 26;12(12):2703. doi: 10.3390/biomedicines12122703. Biomedicines. 2024. PMID: 39767610 Free PMC article.
References
-
- Osada N, Hirata M, Tanuma R, Kusuda J, Hida M, Suzuki Y, Sugano S, Gojobori T, Shen CK, Wu CI, Hashimoto K. Substitution rate and structural divergence of 5'UTR evolution: comparative analysis between human and cynomolgus monkey cDNAs. Mol Biol Evol. 2005;22(10):1976–1982. doi: 10.1093/molbev/msi187. - DOI - PubMed
-
- Iacono M, Mignone F, Pesole G. uAUG and uORFs in human and rodent 5'untranslated mRNAs. Gene. 2005;349:97–105. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources