Testing the "proto-splice sites" model of intron origin: evidence from analysis of intron phase correlations
- PMID: 11110894
- DOI: 10.1093/oxfordjournals.molbev.a026279
Testing the "proto-splice sites" model of intron origin: evidence from analysis of intron phase correlations
Abstract
A few nucleotide sites of nuclear exons that flank introns are often conserved. A hypothesis has suggested that these sites, called "proto-splice sites," are remnants of recognition signals for the insertion of introns in the early evolution of eukaryotic genes. This notion of proto-splice sites has been an important basis for the insertional theory of introns. This hypothesis predicts that the distribution of proto-splice sites would determine the distribution of intron phases, because the positions of introns are just a subset of the proto-splice sites. We previously tested this prediction by examining the proportions of the phases of proto-splice sites, revealing nothing in these proportion distributions similar to observed proportions of intron phases. Here, we provide a second independent test of the proto-splice site hypothesis, with regard to its prediction that the proto-splice sites would mimic intron phase correlations, using a CDS database we created from GenBank. We tested four hypothetical proto-splice sites G / G, AG / G, AG / GT, and C/AAG / R. Interestingly, while G / G and AG / GT site phase distributions are not consistent with actual introns, we observed that AG / G and C/AAG / R sites have a symmetric phase excess. However, the patterns of the excess are quite different from the actual intron phase distribution. In addition, particular amino acid repeats in proteins were found to partially contribute to the excess of symmetry at these two types of sites. The phase associations of all four sites are significantly different from those of intron phases. Furthermore, a general model of intron insertion into proto-splice sites was simulated by Monte Carlo simulation to investigate the probability that the random insertion of introns into AG / G and C/AAG / R sites could generate the observed intron phase distribution. The simulation showed that (1) no observed correlation of intron phases was statistically consistent with the phase distribution of proto-splice sites in the simulated virtual genes; (2) most conservatively, no simulation in 10,000 Monte Carlo experiments gave a pattern with an excess of symmetric (1, 1) exons larger than those of (0, 0) and (2, 2), a major statistical feature of intron phase distribution that is consistent with the directly observed cases of exon shuffling. Thus, these results reject the null hypothesis that introns are randomly inserted into preexisting proto-splice sites, as suggested by the insertional theory of introns.
Similar articles
-
Relationship between "proto-splice sites" and intron phases: evidence from dicodon analysis.Proc Natl Acad Sci U S A. 1998 Jan 6;95(1):219-23. doi: 10.1073/pnas.95.1.219. Proc Natl Acad Sci U S A. 1998. PMID: 9419356 Free PMC article.
-
Intron phase correlations and the evolution of the intron/exon structure of genes.Proc Natl Acad Sci U S A. 1995 Dec 19;92(26):12495-9. doi: 10.1073/pnas.92.26.12495. Proc Natl Acad Sci U S A. 1995. PMID: 8618928 Free PMC article.
-
Can codon usage bias explain intron phase distributions and exon symmetry?J Mol Evol. 2005 Jan;60(1):99-104. doi: 10.1007/s00239-004-0032-9. J Mol Evol. 2005. PMID: 15696372
-
Analysis of evolution of exon-intron structure of eukaryotic genes.Brief Bioinform. 2005 Jun;6(2):118-34. doi: 10.1093/bib/6.2.118. Brief Bioinform. 2005. PMID: 15975222 Review.
-
Evolution of the intron-exon structure of eukaryotic genes.Curr Opin Genet Dev. 1995 Dec;5(6):774-8. doi: 10.1016/0959-437x(95)80010-3. Curr Opin Genet Dev. 1995. PMID: 8745076 Review.
Cited by
-
Phylogenetic distribution of intron positions in alpha-amylase genes of bilateria suggests numerous gains and losses.PLoS One. 2011;6(5):e19673. doi: 10.1371/journal.pone.0019673. Epub 2011 May 17. PLoS One. 2011. PMID: 21611157 Free PMC article.
-
Comprehensive genomic analyses with 115 plastomes from algae to seed plants: structure, gene contents, GC contents, and introns.Genes Genomics. 2020 May;42(5):553-570. doi: 10.1007/s13258-020-00923-x. Epub 2020 Mar 21. Genes Genomics. 2020. PMID: 32200544
-
Genome-wide analysis of the chalcone synthase superfamily genes of Physcomitrella patens.Plant Mol Biol. 2010 Feb;72(3):247-63. doi: 10.1007/s11103-009-9565-z. Epub 2009 Oct 31. Plant Mol Biol. 2010. PMID: 19876746
-
Intron distribution difference for 276 ancient and 131 modern genes suggests the existence of ancient introns.Proc Natl Acad Sci U S A. 2001 Nov 6;98(23):13177-82. doi: 10.1073/pnas.231491498. Epub 2001 Oct 30. Proc Natl Acad Sci U S A. 2001. PMID: 11687643 Free PMC article.
-
Origination of the split structure of spliceosomal genes from random genetic sequences.PLoS One. 2008;3(10):e3456. doi: 10.1371/journal.pone.0003456. Epub 2008 Oct 20. PLoS One. 2008. PMID: 18941625 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Miscellaneous