Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Nov 1;3(6):e27755.
doi: 10.4161/mge.27755. Epub 2014 Jan 10.

Continuing analysis of microRNA origins: Formation from transposable element insertions and noncoding RNA mutations

Affiliations

Continuing analysis of microRNA origins: Formation from transposable element insertions and noncoding RNA mutations

Justin T Roberts et al. Mob Genet Elements. .

Abstract

MicroRNAs (miRs) are small noncoding RNAs that typically act as regulators of gene expression by base pairing with the 3' UTR of messenger RNAs (mRNAs) and either repressing their translation or initiating degradation. As of this writing over 24,500 distinct miRs have been identified, but the functions of the vast majority of these remain undescribed. This paper represents a summary of our in depth analysis of the genomic origins of miR loci, detailing the formation of 1,213 of the 7,321 recently identified miRs and thereby bringing the total number of miR loci with defined molecular origin to 3,605. Interestingly, our analyses also identify evidence for a second, novel mechanism of miR locus generation through describing the formation of 273 miR loci from mutations to other forms of noncoding RNAs. Importantly, several independent investigations of the genomic origins of miR loci have now supported the hypothesis that miR hairpins are formed by the adjacent genomic insertion of two complementary transposable elements (TEs) into opposing strands. While our results agree that subsequent transcription over such TE interfaces leads to the formation of the majority of functional miR loci, we now also find evidence suggesting that a subset of miR loci were actually formed by an alternative mechanism-point mutations in other structurally complex, noncoding RNAs (e.g., tRNAs and snoRNAs).

Keywords: LINE; SINE; miR; miRNA; microRNA; noncoding RNA; repetitive; retrotransposon; transposable; transposon.

PubMed Disclaimer

Figures

None
Figure 1. Formation of miRs typically occurs when two complementary TEs inserted into opposing strands are then subsequently transcribed. (A) Cartoon representation illustrating the proposed origin of many miRs. A miR hairpin is depicted just above an arrow indicating read through transcription from a positive strand transposable element (TE) into an adjacent negative strand TE. Transcriptional read through would result in an imperfect RNA hairpin being produced which could potentially be recognized and processed by the RNAi machinery with each stem corresponding to the terminal nucleotides of the contributing TEs. (B) Pan troglodytes miR-95 alignment to the RepBase data set. All repetitive elements taken from RepBase (indicated by open rectangles) occurring within 200 bp (5′ and 3′) have been included in the scale diagram. The repetitive element annotations are described immediately beneath the diagram as “Element 1, Element 2, etc…” as they occur 5′ to 3′. “Base Positions” refers to base pair location in the genome occupied by the miR hairpin (according to the current Ensembl assembly; www.ensembl.org). All loci have been shown with respect to the positive strand and the orientation of internal repetitive elements illustrated by their relative position above (5′ to 3′) or below (3′ to 5′) the center line. Repetitive element base pair positions are relative to the distance (±) from the first nucleotide of the pre-miR (as occurring on the positive strand). Figure directly adapted from reference .
None
Figure 2. Cartoon illustration of the genomic events believed to be responsible for the formation of many miRs. Origin of numerous miRs occurs when random TE insertions gives rise to a beneficial regulatory adaptation within the organism. Subsequent transcription of this TE interface by RNA polymerase followed by RISC processing can lead to miR establishment if the resulting small RNA confers some advantage in gene expression.
None
Figure 3. Alignment of microRNA-444 family. Alignment of the eight miR-444 hairpins is illustrated. Each individual hairpin sequence is shown with the associated species on the right and the miRBase identifier on the left. In all, six unique miR-444 hairpin origins were characterized by familial inclusion. *, indicates 100% conservation of the nucleotide. Grey shading indicates specific miR hairpins initially identified as bearing significant sequence complementarity to Gypsy repeats through sequence based alignment.
None
Figure 4. Depiction of various noncoding RNAs where distinct mutations resulted in microRNA formation. The distribution of 273 noncoding RNA miR alignments indicating non-transposable element origins identified in our analyses are depicted. In all, 60 corresponded to snoRNAs, 16 to scaRNAs, 41 to rRNAs, 84 to tRNAs, 11 to snRNAs, 38 to various ribozymes and 23 to other forms of noncoding RNAs such as antisense RNAs, long noncoding RNAs, tmRNAs, and sRNAs.
None
Figure 5. Formation of a miR through tRNA mutation. (A) Structural diagram illustrating the potential effects of a single point mutation to tRNA secondary structure. The most thermodynamically stable conformations of two tRNA sequences are shown. The two sequences resulting in the distinct structural conformations are identical except that the uracil at position 50 in the endogenous tRNA shown on the left has been replaced by a cytosine at position 50 on the right. Secondary structures and thermodynamic stabilities were computed using Mfold. (B) Caenorhabditis elegans miR-4937 alignment to the RFAM data set. All repetitive elements and noncoding RNAs (open rectangles) occurring within 500 bp (5′ and 3′) have been included in the scale diagram as depicted in Figure 1B. While we find numerous loci arising by the mechanism depicted in Figure 1A, we find others (like miR-4937) do not. Significantly aligning to a C. elegans tRNA (tRNAArgTCG), we propose an additional mechanism (point mutation(s) resulting in an alteration of normal tRNA secondary structure gave rise to pre-miR-4937.
None
Figure 6. Alignments of miRs with predicted targets. Illustration shows a predicted miR target 3′UTRs on top, a consensus transposable element in the middle, and the corresponding miR sequence on the bottom. Mature miRs are highlighted in gray. Open boxes indicate perfect seed matches. To qualify as a 3′UTR match alignments were required to 1) contain a perfect seed match, 2) match ≥ 50% of the flanking sequence used in the target query, and 3) occur within a 3′UTR sequence aligning to a miR’s progenitor TE sequence. Vertical lines indicate base identity with the TE consensus sequence. Dotted lines indicate purine/pyrimidine conservation. PAH, Gallus gallus phenylalanine-4-hydroxylase ENSGALG00000012754. PTGIR, Sus scrufa prostaglandin I2 (prostacyclin) receptor ENSSSCG00000026602. RC, reverse complemented
None
Figure 7. Noncoding RNA miR origins. Alignments showing the high degree of sequence conservation between select miRs (top) and progenitor noncoding RNA (bottom) sequences. miR-1839, cgr-mir-1839 mi0020426; scaRNA, RF00426 SCARNA15 ABDC01642996.1/97-188; miR-6173, hbr-MIR6173 mi0021483; rRNA, RF01959 SSU rRNA AF166114.1/117516-116014; miR-1903, cgr-mir-1903 mi0020435; tRNA, RF00005 tRNA AAHX01000286.1/15293-15364; miR-1291, ggo-mir-1291 mi0020755; snoRNA, RF00410 SNORA2 CEC01039678.1/1523-1387; miR-3473b, mmu-mir-3473b mi0016997; U4, RF00015 U4 AAHX01028334.1/65497-65654.

Similar articles

Cited by

References

    1. Lee RC, Feinbaum RL, Ambros V. The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14. Cell. 1993;75:843–54. doi: 10.1016/0092-8674(93)90529-Y. - DOI - PubMed
    1. Hutvágner G, Zamore PD. A microRNA in a multiple-turnover RNAi enzyme complex. Science. 2002;297:2056–60. doi: 10.1126/science.1073827. - DOI - PubMed
    1. Fire A, Xu S, Montgomery MK, Kostas SA, Driver SE, Mello CC. Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans. Nature. 1998;391:806–11. doi: 10.1038/35888. - DOI - PubMed
    1. Cai X, Hagedorn CH, Cullen BR. Human microRNAs are processed from capped, polyadenylated transcripts that can also function as mRNAs. RNA. 2004;10:1957–66. doi: 10.1261/rna.7135204. - DOI - PMC - PubMed
    1. Smalheiser NR, Torvik VI. Complications in mammalian microRNA target prediction. Methods Mol Biol. 2006;342:115–27. - PubMed

LinkOut - more resources