Statistical analysis of the exon-intron structure of higher and lower eukaryote genes
- PMID: 10563578
- DOI: 10.1080/07391102.1999.10508361
Statistical analysis of the exon-intron structure of higher and lower eukaryote genes
Abstract
Statistics of the exon-intron structure and splicing sites of several diverse eukaryotes was studied. The yeast exon-intron structures have a number of unique features. A yeast gene usually have at most one intron. The branch site is strongly conserved, whereas the polypirimidine tract is short. Long yeast introns tend to have stronger acceptor sites. In other species the branch site is less conserved and often cannot be determined. In non-yeast samples there is an almost universal correlation between lengths of neighboring exons (all samples excluding protists) and correlation between lengths of neighboring introns (human, drosophila, protists). On the average first introns are longer, and anomalously long introns are usually first introns in a gene. There is a universal preference for exons and exon pairs with the (total) length divisible by 3. Introns positioned between codons are preferred, whereas those positioned between the first and second positions in codon are avoided. The choice of A or G at the third position of intron (the donor splice sites generally prefer purines at this position) is correlated with the overall GC-composition of the gene. In all samples dinucleotide AG is avoided in the region preceding the acceptor site.
Similar articles
-
[Statistical analysis of the exon-intron structure of higher eukaryote genes].Biofizika. 1999 Jul-Aug;44(4):595-600. Biofizika. 1999. PMID: 10544807 Russian.
-
Intron-exon structures of eukaryotic model organisms.Nucleic Acids Res. 1999 Aug 1;27(15):3219-28. doi: 10.1093/nar/27.15.3219. Nucleic Acids Res. 1999. PMID: 10454621 Free PMC article.
-
Differential GC content between exons and introns establishes distinct strategies of splice-site recognition.Cell Rep. 2012 May 31;1(5):543-56. doi: 10.1016/j.celrep.2012.03.013. Epub 2012 May 3. Cell Rep. 2012. PMID: 22832277
-
Advances in the Exon-Intron Database (EID).Brief Bioinform. 2006 Jun;7(2):178-85. doi: 10.1093/bib/bbl003. Epub 2006 Mar 9. Brief Bioinform. 2006. PMID: 16772261 Review.
-
Analysis of evolution of exon-intron structure of eukaryotic genes.Brief Bioinform. 2005 Jun;6(2):118-34. doi: 10.1093/bib/6.2.118. Brief Bioinform. 2005. PMID: 15975222 Review.
Cited by
-
Conservation/Mutation in the splice sites of cytokine receptor genes of mouse and human.Int J Evol Biol. 2013;2013:818954. doi: 10.1155/2013/818954. Epub 2013 Dec 17. Int J Evol Biol. 2013. PMID: 24455408 Free PMC article.
-
Effect of 5'UTR introns on gene expression in Arabidopsis thaliana.BMC Genomics. 2006 May 19;7:120. doi: 10.1186/1471-2164-7-120. BMC Genomics. 2006. PMID: 16712733 Free PMC article.
-
Xpro: database of eukaryotic protein-encoding genes.Nucleic Acids Res. 2004 Jan 1;32(Database issue):D59-63. doi: 10.1093/nar/gkh051. Nucleic Acids Res. 2004. PMID: 14681359 Free PMC article.
-
HybGFS: a hybrid method for genome-fingerprint scanning.BMC Bioinformatics. 2006 Oct 29;7:479. doi: 10.1186/1471-2105-7-479. BMC Bioinformatics. 2006. PMID: 17069662 Free PMC article.
-
An intron-containing glycoside hydrolase family 9 cellulase gene encodes the dominant 90 kDa component of the cellulosome of the anaerobic fungus Piromyces sp. strain E2.Biochem J. 2002 Jul 1;365(Pt 1):193-204. doi: 10.1042/BJ20011866. Biochem J. 2002. PMID: 12071852 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Molecular Biology Databases
Miscellaneous