Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008;9(9):R141.
doi: 10.1186/gb-2008-9-9-r141.

Exon creation and establishment in human genes

Affiliations

Exon creation and establishment in human genes

André Corvelo et al. Genome Biol. 2008.

Abstract

Background: A large proportion of species-specific exons are alternatively spliced. In primates, Alu elements play a crucial role in the process of exon creation but many new exons have appeared through other mechanisms. Despite many recent studies, it is still unclear which are the splicing regulatory requirements for de novo exonization and how splicing regulation changes throughout an exon's lifespan.

Results: Using comparative genomics, we have defined sets of exons with different evolutionary ages. Younger exons have weaker splice-sites and lower absolute values for the relative abundance of putative splicing regulators between exonic and adjacent intronic regions, indicating a less consolidated splicing regulation. This relative abundance is shown to increase with exon age, leading to higher exon inclusion. We show that this local difference in the density of regulators might be of biological significance, as it outperforms other measures in real exon versus pseudo-exon classification. We apply this new measure to the specific case of the exonization of anti-sense Alu elements and show that they are characterized by a general lack of exonic splicing silencers.

Conclusions: Our results suggest that specific sequence environments are required for exonization and that these can change with time. We propose a model of exon creation and establishment in human genes, in which splicing decisions depend on the relative local abundance of regulatory motifs. Using this model, we provide further explanation as to why Alu elements serve as a major substrate for exon creation in primates. Finally, we discuss the benefits of integrating such information in gene prediction.

PubMed Disclaimer

Figures

Figure 1
Figure 1
EST inclusion level and symmetry. (a) EST inclusion levels for the three age groups. The x-axis shows the inclusion levels in ranges of 10, and the y-axis shows the proportion of exons from each subset falling within each range. For each exon, the EST inclusion level is defined as Ni/(Ni + Ns) × 100%, where Ni is the number of ESTs including the exon and Ns the number of ESTs skipping the exon. Only exons with Ni + Ns ≥ 10 were considered. On the left of the dashed line we plot the frequencies for exons with zero EST inclusion level. (b) Percentage of symmetric exons (length multiple of three) for each age group. (c) Percentage of symmetric exons by EST inclusion level category for each age group. Only alternative spliced exons with Ni + Ns ≥ 10 were considered.
Figure 2
Figure 2
Intronic densities for the main classes of repetitive elements. (a) Primate specific, (b) mammalian specific and (c) vertebrate and older. At each intronic position, the density was calculated as the proportion of cases in which the base was covered by a given type of repetitive element. We give on the x-axis the relative position from the splice junctions as negative if upstream of the acceptor site or positive if downstream of the donor site.
Figure 3
Figure 3
Performance comparison in real/pseudo-exon discrimination between different measures. ROC curves (vertically averaged) for exonic density, intronic density and ERA, using (a) ESEcomb, (b) SRall and (c) ESScomb as informative features. The average was calculated from 10 different subsets of the data (see text for details). (d) The corresponding AUCs. The error bars represent the standard error. FPR, false positive rate; TPR, true positive rate.
Figure 4
Figure 4
SRE ERA changes with age. Mean exonic relative abundance values for the three age groups (PS, MS and VO) and a set of pseudo-exons not overlapping any repeats (pseudo-INT) calculated for the three motif sets (ESEcomb, ESScomb and SRall). Exons overlapping Alu elements were excluded from the PS set. The standard error is also shown.
Figure 5
Figure 5
SRE functional conservation between human and mouse. SRE FCS between human and mouse of exonic regions covered by ESEcomb, SRall and ESScomb motifs for mammalian specific and vertebrate or older exons. See Materials and methods section for formula.
Figure 6
Figure 6
SRE exonic relative abundance and EST inclusion levels. Cumulative plot of ERA variation (y-axis) for bins of increasing maximum EST inclusion levels (x-axis) for (a) ESEcomb, (b) SRall and (c) ESScomb. The standard errors are also shown.
Figure 7
Figure 7
Alu's unique cis-regulatory context. Exonic and intronic densities of (a) ESEcomb, (b) SRall and (c) ESScomb motifs on primate specific exons overlapping Alu elements (PS-Alu) and on Alu pseudo-exons (pseudo-Alu). (d) Exonic relative abundance of ESEcomb, SRall and ESScomb motifs for primate specific exons overlapping Alu elements (PS-Alu) and for Alu pseudo-exons (pseudo-Alu). (e) Exonic relative abundance for the same sets of motifs in pseudo-exons overlapping other classes of repeats, namely DNA, LTR, LINE and SINE non-Alu (MIR) repeats. The error bars represent the standard error.

Similar articles

Cited by

References

    1. Zhang XH, Chasin LA. Comparison of multiple vertebrate genomes reveals the birth and evolution of human exons. Proc Natl Acad Sci USA. 2006;103:13427–13432. doi: 10.1073/pnas.0603042103. - DOI - PMC - PubMed
    1. Alekseyenko AV, Kim N, Lee CJ. Global analysis of exon creation versus loss and the role of alternative splicing in 17 vertebrate genomes. RNA. 2007;13:661–670. doi: 10.1261/rna.325107. - DOI - PMC - PubMed
    1. Modrek B, Lee CJ. Alternative splicing in the human, mouse and rat genomes is associated with an increased frequency of exon creation and/or loss. Nat Genet. 2003;34:177–180. doi: 10.1038/ng1159. - DOI - PubMed
    1. Nurtdinov RN, Artamonova II, Mironov AA, Gelfand MS. Low conservation of alternative splicing patterns in the human and mouse genomes. Hum Mol Genet. 2003;12:1313–1320. doi: 10.1093/hmg/ddg137. - DOI - PubMed
    1. Iida K, Akashi H. A test of translational selection at 'silent' sites in the human genome: base composition comparisons in alternatively spliced genes. Gene. 2000;261:93–105. doi: 10.1016/S0378-1119(00)00482-0. - DOI - PubMed

Publication types

LinkOut - more resources