Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2003 Jun;13(6B):1290-300.
doi: 10.1101/gr.1017303.

Impact of alternative initiation, splicing, and termination on the diversity of the mRNA transcripts encoded by the mouse transcriptome

Affiliations
Comparative Study

Impact of alternative initiation, splicing, and termination on the diversity of the mRNA transcripts encoded by the mouse transcriptome

Mihaela Zavolan et al. Genome Res. 2003 Jun.

Abstract

We analyzed the FANTOM2 clone set of 60,770 RIKEN full-length mouse cDNA sequences and 44,122 public mRNA sequences. We developed a new computational procedure to identify and classify the forms of splice variation evident in this data set and organized the results into a publicly accessible database that can be used for future expression array construction, structural genomics, and analyses of the mechanism and regulation of alternative splicing. Statistical analysis shows that at least 41% and possibly as much as 60% of multiexon genes in mouse have multiple splice forms. Of the transcription units with multiple splice forms, 49% contain transcripts in which the apparent use of an alternative transcription start (stop) is accompanied by alternative splicing of the initial (terminal) exon. This implies that alternative transcription may frequently induce alternative splicing. The fact that 73% of all exons with splice variation fall within the annotated coding region indicates that most splice variation is likely to affect the protein form. Finally, we compared the set of constitutive (present in all transcripts) exons with the set of cryptic (present only in some transcripts) exons and found statistically significant differences in their length distributions, the nucleotide distributions around their splice junctions, and the frequencies of occurrence of several short sequence motifs.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Distribution of distances between alternative splice sites: (A) 5′-splice sites; (B) 3′-splice sites. Note that we only show the distribution up to a distance of 100 nt.
Figure 2
Figure 2
Estimated frequency of lengths of constitutive (black) and cryptic (red) exons. For each value of the length L we determined the frequency in our data set of exons with length between 0.8 L and 1.2 L.
Figure 3
Figure 3
Nucleotide distribution in the (A,C) 5′- and (B,D) 3′-splice signals flanking constitutive (upper panels) and cryptic (lower panels) exons. The relative sizes of the letters indicate the relative frequencies of the nucleotides at that distance from the splice junctions. The absolute sizes of the letters correspond to the information score of the nucleotide distribution at that position.
Figure 4
Figure 4
Position-specific probability that the nucleotides at (A) 5′- and (B) 3′-splice signals of cryptic exons were generated from different versus the same underlying distribution as those of the constitutive exons.
Figure 5
Figure 5
Splice variants of polypyrimidine-tract-binding protein.
Figure 6
Figure 6
Annotation of variant exons. The splice sites responsible for the annotation of the associated exons as variant are indicated by vertical bars. Exons with invariant splice sites are shown in green, exons with alternative 5′ sites are shown in yellow, and those with alternative 3′ sites in blue. Cryptic exons are indicated by a black box surrounding the exon. Introns are shown in red.

References

    1. Basu, A., Dong, B., Krainer, C., and Howe, A.R. 1997. The intracisternal A-particle proximal enhancer-binding protein activates transcription and is identical to the RNA- and DNA-binding protein p54nrb/NonO. Mol. Cell. Biol. 17: 677-686. - PMC - PubMed
    1. Berget, S. 1995. Exon recognition in vertebrate splicing. J. Biol. Chem. 270: 2411-2414. - PubMed
    1. Blencowe, B., Bauren, G., Eldridge, A., Issner, R., Nickerson, J., Rosonina, E., and Sharp, P. 2000. The SRm160/300 splicing coactivator subunits. RNA 6: 111-120. - PMC - PubMed
    1. Brett, D., Hanke, J., Lehmann, G., Haase, S., Delbruck, S., Krueger, S., Reich, J., and Bork, P. 2000. EST comparison indicates 38% of human mRNAs contain possible alternative splice forms. FEBS Lett. 474: 83-86. - PubMed
    1. Brett, D., Pospisil, H., Valcarcel, J., Reich, J., and Bork, P. 2002. Alternative splicing and genome complexity. Nat. Genet. 30: 29-30. - PubMed

WEB SITE REFERENCES

    1. ftp://wolfram.wi.mit.edu/pub/mousecontigs/MGSCV3; draft of the mouse genome sequence.
    1. http://facts.gsc.riken.go.jp; Functional Association/annotation of cDNA clones from Text/sequence Sources (FACTS).
    1. http://genomes.rockefeller.edu/MouSDB; database of alternative splice forms in the mouse transcriptome.
    1. http://smart.embl-heidelberg.de; Simple Modular Architecture Research Tool (SMART). - PMC - PubMed

Publication types

MeSH terms

LinkOut - more resources