Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Nov 29;119(48):e2209766119.
doi: 10.1073/pnas.2209766119. Epub 2022 Nov 23.

Transposable elements drive intron gain in diverse eukaryotes

Affiliations

Transposable elements drive intron gain in diverse eukaryotes

Landen Gozashti et al. Proc Natl Acad Sci U S A. .

Abstract

There is massive variation in intron numbers across eukaryotic genomes, yet the major drivers of intron content during evolution remain elusive. Rapid intron loss and gain in some lineages contrast with long-term evolutionary stasis in others. Episodic intron gain could be explained by recently discovered specialized transposons called Introners, but so far Introners are only known from a handful of species. Here, we performed a systematic search across 3,325 eukaryotic genomes and identified 27,563 Introner-derived introns in 175 genomes (5.2%). Species with Introners span remarkable phylogenetic diversity, from animals to basal protists, representing lineages whose last common ancestor dates to over 1.7 billion years ago. Aquatic organisms were 6.5 times more likely to contain Introners than terrestrial organisms. Introners exhibit mechanistic diversity but most are consistent with DNA transposition, indicating that Introners have evolved convergently hundreds of times from nonautonomous transposable elements. Transposable elements and aquatic taxa are associated with high rates of horizontal gene transfer, suggesting that this combination of factors may explain the punctuated and biased diversity of species containing Introners. More generally, our data suggest that Introners may explain the episodic nature of intron gain across the eukaryotic tree of life. These results illuminate the major source of ongoing intron creation in eukaryotic genomes.

Keywords: comparative genomics; evolution; genome structure; intron; splicing.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interest.

Figures

Fig. 1.
Fig. 1.
Diversity and characteristics of Introners across eukaryotes. Results are shown from 130 genomes representing 32 lineages with putatively independent acquisitions of Introners (different colors). Leaf tip colors indicate the total number of predicted Introners for each genome. Proportion in genes is shown by the red mark, which consistently exceeds the expected values as determined by randomization within each genome (black box plots; center line denotes median; box limits denote upper and lower quartiles; whiskers denote 1.5x interquartile range). Heat maps represent predicted nucleosome occupancy for Introner insertion sites and surrounding genomic regions for genomes in which accurate nucleosome occupancy prediction was possible (see Methods). Introner insertion sites consistently show reduced histone occupancy (dark) relative to surrounding regions. For genomes within the same genus, multiple genomes are shown only if the genomes have different complements of Introner families (see Methods).
Fig. 2.
Fig. 2.
Examples of diverse Introner intron creation mechanisms. Splice sites are shown in bold, Introner boundaries are denoted by a vertical bar, and introns and exons are represented by lines and boxes, respectively. (A) Introners in Alternaria alternata do not exhibit specific sequence features associated with known DNA transposition mechanisms and appear to replicate via direct insertion. (B) Introners in Symbiodinium microadriaticum show clear evidence of 4 bp TSDs (shown in green) and TIRs (underlined), consistent with many known DNA transposons, carry their 3′ splice site, and co-opt their 5′ splice site from their TSDs upon insertion. (C) Introners in Chrysochromulina sp. show evidence of 4 bp TSDs but no evidence of TIRs, carry their 5′ splice site, and co-opt their 3′ splice site. (D) Introners in Acanthoeca sp. show clear evidence of TIRs but no TSDs. (E) Introners in Florenciella sp. do not carry either splice site and instead co-opt both from their insertion site. (F) Introners in Aureococcus sp. carry both splice sites but add an extra 12 bp into the transcript upon insertion (4 bp from the Introner + 8 bp from the TSD), (G) resulting in the addition of four amino acids to the respective protein when compared to an ortholog from a different isolate which lacks an Introner at that position.
Fig. 3.
Fig. 3.
Relative to other introns, Introners are more efficiently spliced but less frequently found in highly expressed genes. (A) Explanation of PSI. Species with greater PSI coefficients (above 0, red) have Introners spliced in more frequently than other introns in the same genome. (B) Introners are more efficiently spliced than other introns in most species, as indicated by PSI coefficients less or equal to 0 for 31/36 species. (C) Introners are overrepresented in lowly expressed genes for most species. Relative log-normalized gene expression values for Introner-containing genes relative to other Intron-containing genes are shown.

References

    1. Irimia M., Roy S. W., Origin of spliceosomal introns and alternative splicing. Cold Spring Harb. Perspect. Biol. 6, a016071 (2014). - PMC - PubMed
    1. Graveley B. R., Alternative splicing: Increasing diversity in the proteomic world. Trends Genet. 17, 100–107 (2001). - PubMed
    1. Jaillon O., et al. , Translational control of intron splicing in eukaryotes. Nature 451, 359–362 (2008). - PubMed
    1. Scofield D. G., Hong X., Lynch M., Position of the final intron in full-length transcripts: Determined by NMD? Mol. Biol. Evol. 24, 896–899 (2007). - PubMed
    1. Niu D.-K., Protecting exons from deleterious R-loops: A potential advantage of having introns. Biol. Direct. 2, 11 (2007). - PMC - PubMed

Publication types

Substances