Origination of the split structure of spliceosomal genes from random genetic sequences
- PMID: 18941625
- PMCID: PMC2565106
- DOI: 10.1371/journal.pone.0003456
Origination of the split structure of spliceosomal genes from random genetic sequences
Abstract
The mechanism by which protein-coding portions of eukaryotic genes came to be separated by long non-coding stretches of DNA, and the purpose for this perplexing arrangement, have remained unresolved fundamental biological problems for three decades. We report here a plausible solution to this problem based on analysis of open reading frame (ORF) length constraints in the genomes of nine diverse species. If primordial nucleic acid sequences were random in sequence, functional proteins that are innately long would not be encoded due to the frequent occurrence of stop codons. The best possible way that a long protein-coding sequence could have been derived was by evolving a split-structure from the random DNA (or RNA) sequence. Results of the systematic analyses of nine complete genome sequences presented here suggests that perhaps the major underlying structural features of split-genes have evolved due to the indigenous occurrence of split protein-coding genes in primordial random nucleotide sequence. The results also suggest that intron-rich genes containing short exons may have been the original form of genes intrinsically occurring in random DNA, and that intron-poor genes containing long exons were perhaps derived from the original intron-rich genes.
Conflict of interest statement
Figures







Similar articles
-
An ancient spliceosomal intron in the ribosomal protein L7a gene (Rpl7a) of Giardia lamblia.BMC Evol Biol. 2005 Aug 18;5:45. doi: 10.1186/1471-2148-5-45. BMC Evol Biol. 2005. PMID: 16109161 Free PMC article.
-
Features of coding and noncoding sequences based on 3-tuple distributions.Yi Chuan Xue Bao. 2005 Oct;32(10):1018-26. Yi Chuan Xue Bao. 2005. PMID: 16252696
-
Origin of eukaryotic introns: a hypothesis, based on codon distribution statistics in genes, and its implications.Proc Natl Acad Sci U S A. 1986 Apr;83(7):2133-7. doi: 10.1073/pnas.83.7.2133. Proc Natl Acad Sci U S A. 1986. PMID: 3457379 Free PMC article.
-
Coevolution of genomic intron number and splice sites.Trends Genet. 2007 Jul;23(7):321-5. doi: 10.1016/j.tig.2007.04.001. Epub 2007 Apr 18. Trends Genet. 2007. PMID: 17442445 Review.
-
Steady progress and recent breakthroughs in the accuracy of automated genome annotation.Nat Rev Genet. 2008 Jan;9(1):62-73. doi: 10.1038/nrg2220. Nat Rev Genet. 2008. PMID: 18087260 Review.
Cited by
-
Reverse transcriptase and intron number evolution.Stem Cell Investig. 2014 Sep 28;1:17. doi: 10.3978/j.issn.2306-9759.2014.08.01. eCollection 2014. Stem Cell Investig. 2014. PMID: 27358863 Free PMC article.
-
Human prefoldin modulates co-transcriptional pre-mRNA splicing.Nucleic Acids Res. 2021 Jun 21;49(11):6267-6280. doi: 10.1093/nar/gkab446. Nucleic Acids Res. 2021. PMID: 34096575 Free PMC article.
-
Targeted genome-wide enrichment of functional regions.PLoS One. 2010 Jun 16;5(6):e11138. doi: 10.1371/journal.pone.0011138. PLoS One. 2010. PMID: 20585402 Free PMC article.
-
RNA-Seq Analysis Reveals Localization-Associated Alternative Splicing across 13 Cell Lines.Genes (Basel). 2020 Jul 18;11(7):820. doi: 10.3390/genes11070820. Genes (Basel). 2020. PMID: 32708427 Free PMC article.
References
-
- Roy SW, Gilbert W. The evolution of spliceosomal introns: patterns, puzzles and progress. Nature Rev Genet. 2006;7:211–221. - PubMed
-
- Rogozin IB, Wolf YI, Sorokin AV, Mirkin BG, Koonin EV. Remarkable interkingdom conservation of intron positions and massive, lineage-specific intron loss and gain in eukaryotic evolution. Curr Biol. 2003;13:1512–1517. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical