Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jun 8;8(6):e1000391.
doi: 10.1371/journal.pbio.1000391.

Mechanisms used for genomic proliferation by thermophilic group II introns

Affiliations

Mechanisms used for genomic proliferation by thermophilic group II introns

Georg Mohr et al. PLoS Biol. .

Abstract

Mobile group II introns, which are found in bacterial and organellar genomes, are site-specific retroelements hypothesized to be evolutionary ancestors of spliceosomal introns and retrotransposons in higher organisms. Most bacteria, however, contain no more than one or a few group II introns, making it unclear how introns could have proliferated to higher copy numbers in eukaryotic genomes. An exception is the thermophilic cyanobacterium Thermosynechococcus elongatus, which contains 28 closely related copies of a group II intron, constituting approximately 1.3% of the genome. Here, by using a combination of bioinformatics and mobility assays at different temperatures, we identified mechanisms that contribute to the proliferation of T. elongatus group II introns. These mechanisms include divergence of DNA target specificity to avoid target site saturation; adaptation of some intron-encoded reverse transcriptases to splice and mobilize multiple degenerate introns that do not encode reverse transcriptases, leading to a common splicing apparatus; and preferential insertion within other mobile introns or insertion elements, which provide new unoccupied sites in expanding non-essential DNA regions. Additionally, unlike mesophilic group II introns, the thermophilic T. elongatus introns rely on elevated temperatures to help promote DNA strand separation, enabling access to a larger number of DNA target sites by base pairing of the intron RNA, with minimal constraint from the reverse transcriptase. Our results provide insight into group II intron proliferation mechanisms and show that higher temperatures, which are thought to have prevailed on Earth during the emergence of eukaryotes, favor intron proliferation by increasing the accessibility of DNA target sites. We also identify actively mobile thermophilic introns, which may be useful for structural studies, gene targeting in thermophiles, and as a source of thermostable reverse transcriptases.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. T. elongatus group II Intron families and insertion sites.
The 25 intact introns are classified into six families (F1–F6) based on their EBS sequences. Three other introns are fragments (TeI3g retains ∼340 nts of the 3′ part of the intron starting in the En domain of the IEP; TeI3m lacks regions upstream of DIVa(3′); and TeI3n has a large internal deletion between DId(5′) and DIVa1). Colors highlight EBS sequences and complementary nucleotide residues in the IBS sequences. The EBS2 sequence of TeI4h could not be identified unambiguously from the secondary structure model and was defined by in vivo selections with donor and recipient plasmids in which potential EBS2 and IBS2 nucleotide residues were randomized (G.M. and A.M.L., unpublished data).
Figure 2
Figure 2. T. elongatus group II intron RNA secondary structure and IEP.
(A) Predicted secondary structure of TeI4h. Differences in TeI4c, TeI4f, and TeI3c are indicated in red, boxed, and blue letters, respectively. The structure consists of six conserved domains (DI–DVI). Subdomains and further subdivisions are denoted with letters followed by numbers (e.g., DIc1). Greek letters indicate nucleotide sequences involved in long-range tertiary interactions ; 5′ and 3′ exon (E1 and E2, respectively) are boxed; and splice sites are indicated by open arrowheads. The gray boxes show a region of DIII that is replaced by a different sequence in TeI3c (blue, inset). (B) Secondary structure of DIV of ORF-containing TeI4 introns. The figure shows the secondary structure of DIV of TeI4h, with differences in TeI4c, 4f, and 4g indicated in red, boxed, and white letters in black boxes, respectively. The two potential start codons and the stop codon of the intron ORF are circled, and the arrow between the two potential start codons indicates the site at which TeI3c and other F3 introns insert into TeI4c and other F1 introns, resulting in the formation of twintrons. Regions that differ substantially in the ORF-less TeI3 introns are shaded gray. (C) Secondary structure of DIV of the ORF-less TeI3 introns. The figure shows the secondary structure of TeI3c, with differences in TeI3f, 3k, and 3l indicated in orange, green, and purple, respectively. Regions that differ from the ORF-containing TeI4 introns are shaded gray. Potential base pairings between the DIVa1 and DIVa2 loops are indicated at the upper right. A red circle highlights the extra U residue in DIVa1 of ORF-less introns (see also Figure S2). (D) Schematic of the TeI4h IEP. Conserved protein domains are: RT, containing conserved amino acid sequence blocks RT1–7 characteristic of the finger and palm regions of retroviral RTs; X/Thumb, region associated with maturase activity and corresponding in part to the RT thumb; D, DNA binding; and En, DNA endonuclease; RT-0 is a region conserved in the RTs of non-LTR retroelements ,. Multiple sequence alignments of the TeI4h and other IEPs are shown in Figure S1.
Figure 3
Figure 3. Phylogeny of T. elongatus introns.
The figure shows a phylogram for all 25 intact T. elongatus introns. TeI4 introns were aligned with TeI3 introns by deleting ORF sequences in DIVb (positions 755–2290 of TeI4h). RNA sequences were aligned with ClustalX , and the alignment was refined manually and used as input for Phylip (ver. 3.69, with default parameters [50]). The phylogenies were generated with program modules DNAdist and DNAcomp using all of the Distance settings (F84, Kimura, Jukes-Cantor, LogDet) independently and varying the out-group (EcI5 or random Te intron). Trees were visualized with Treeview , and were essentially the same regardless of distance or out-group settings. Support for the major groupings of the phylogram was obtained by bootstrapping 1,000 data sets (using Seqboot from Phylip ver. 3.69) and using these as input for DNAdist. The output of the latter program was then used to obtain a consensus tree with Consense. The numbers indicate the percentage of times a particular grouping occurred in the 1,000 data sets.
Figure 4
Figure 4. TeI4h intron mobility assays.
(A) E. coli genetic assay of intron mobility. The CapR donor plasmid uses a T7lac promoter (PT7lac) to express a ΔORF intron (I-ΔORF) with short flanking 5′ and 3′ exons (E1 and E2, respectively) and the IEP downstream of E2. The intron, which carries a T7 promoter (PT7) in DIVb, integrates into a target site (ligated E1–E2 sequences) cloned in an AmpR recipient plasmid upstream of a promoterless tetR gene, thereby activating that gene. The donor and recipient plasmids are derivatives of pACD2X and pBRR-tet, respectively (see Materials and Methods). The assays are done in E. coli HMS174(DE3), which contains an IPTG-inducible T7 RNA polymerase, with intron expression induced with 500 µM IPTG for 1 h at different temperatures. Mobility efficiencies are calculated as the ratio of (TetR+AmpR)/AmpR colonies. (B) Mobility efficiency of the TeI4h-ΔORF (blue) and Ll.LtrB-ΔORF (red) introns as a function of induction temperature. The donor plasmid for the Ll.LtrB-ΔORF intron was pACD2X .
Figure 5
Figure 5. Identification of critical nucleotide residues in the distal 5′-exon and 3′-exon regions of the DNA target sites of T. elongatus introns.
(A) Intron donor plasmid TeI4h*/4h* at 37°C. Intron donor plasmid TeI4h*/4h* at 48°C. (C) Intron donor plasmid TeI4c/4c at 48°C. (D) Intron donor plasmid TeI3c/4c at 48°C. Selection experiments were done in E. coli HMS174(DE3) containing the indicated intron donor plasmid and a recipient plasmid library randomized at the positions shown, as described in Materials and Methods. After selection by plating on LB medium containing antibiotics, AmpR+TetR colonies were analyzed by colony PCR and sequencing of the 5′- and 3′-integration junctions to identify nucleotide residues in active target sites. The WebLogo representation depicts nucleotide frequencies at each randomized position in ∼100 selected target sites, corrected for biases in the initial pool based on sequences of ∼100 randomly chosen recipient plasmids . The sequence of the intron-insertion site in the T. elongatus genome is shown, with white bases on black background indicating randomized nucleotides belonging to IBS2. Summarized below are nucleotide frequencies (percent) at each randomized position in (i) active target sites after intron insertion (“selected”), (ii) randomly chosen recipient plasmids from the original pool (“pool”), and (iii) active target sites corrected for nucleotide frequency biases in the initial pools (“corrected”). The latter were used to generate the WebLogos. In some cases, percentage totals do not equal 100 due to rounding off.

Comment in

References

    1. Lambowitz A. M, Zimmerly S. Mobile group II introns. Annu Rev Genet. 2004;38:1–35. - PubMed
    1. Pyle A. M, Lambowitz A. M. Group II introns: ribozymes that splice RNA and invade DNA. In: Gesteland R. F, Cech T, Atkins J. F, editors. The RNA world, third edition. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press; 2006. pp. 469–505.
    1. Michel F, Ferat J. L. Structure and activities of group II introns. Annu Rev Biochem. 1995;64:435–461. - PubMed
    1. Toor N, Keating K. S, Taylor S. D, Pyle A. M. Crystal structure of a self-spliced group II intron. Science. 2008;320:77–82. - PMC - PubMed
    1. Peebles C. L, Perlman P. S, Mecklenburg K. L, Petrillo M. L, Tabor J. H, et al. A self-splicing RNA excises an intron lariat. Cell. 1986;44:213–223. - PubMed

Publication types