Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010;11(4):R39.
doi: 10.1186/gb-2010-11-4-r39. Epub 2010 Apr 6.

miRTRAP, a computational method for the systematic identification of miRNAs from high throughput sequencing data

Affiliations

miRTRAP, a computational method for the systematic identification of miRNAs from high throughput sequencing data

David Hendrix et al. Genome Biol. 2010.

Abstract

MicroRNAs (miRs) have been broadly implicated in animal development and disease. We developed a novel computational strategy for the systematic, whole-genome identification of miRs from high throughput sequencing information. This method, miRTRAP, incorporates the mechanisms of miR biogenesis and includes additional criteria regarding the prevalence and quality of small RNAs arising from the antisense strand and neighboring loci. This program was applied to the simple chordate Ciona intestinalis and identified nearly 400 putative miR loci.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Outline of the miRTRAP program, Ciona abundance versus conservation, neighbor window. (a) Schematic illustration of the miRTRAP program. The algorithm first identifies read regions that do not overlap repeats or tRNAs. The genomic region up to 150 nucleotides around the individual read is folded using RNAfold. Then, all read products within the hairpin window are identified as 5p-miR/3p-miR, 5p-moR/3p-moR or loop based on their positions relative to the hairpin and loop. Each read region is then evaluated by a set of filters to remove those incompatible with the biochemical rules of miR biogenesis. All the rejected read regions are used to filter the initial set of candidate loci to produce a list of positive predictions. (b) Average antisense product displacement (AAPD) score distribution from the Ciona dataset shows that the majority of known miRs have an AAPD score of zero, while non-miR loci have a broad distribution and peaks at 8 and 10. (c) The difference between the non-miR neighbor counts within windows centered at known miRs and non-miR loci in Ciona. Whereas non-miR neighbor counts centered around non-miR loci increases sharply as window sizes expand, all known miR loci have non-miR neighbor counts equal or fewer than 10.
Figure 2
Figure 2
Comparison of miRTRAP with miRDeep. (a) miRTRAP out-performed miRDeep for the Ciona library data set, identifying approximately five times more miRs. In addition, it identified 11 mirtron/half-mirtrons, while miRDeep found only 1. (b) For the Drosophila small RNA data set, miRTRAP identified 25 more known fly miRs than miRDeep. In particular, miRTRAP found 12 out of 14 mirtrons, while miRDeep identified only 3. (c) Example of a novel Drosophila mirtron predicted by miRTRAP. (d) A novel Drosophila miR/miR* containing locus predicted by miRTRAP.
Figure 3
Figure 3
Phylogeny of Ciona miRNA families in the deuterostome lineage. Newly identified conserved Ciona miR families (shaded circles) and previously known Ciona miRs (dark circles) are grouped with homologous miR families from representative deuterostome species (echinoderm, Strongylocentrotus purpuratus; hemichordate, Saccoglossus kowalevskii; Amphioxus, Branchiostoma floridae). Missing miRs are shown as empty circles. It is evident from the phylogenetic tree that the miR repertoire from Ciona is closely related to the vertebrate miRs.
Figure 4
Figure 4
Prevalence of antisense miRs in Ciona. (a) In the scaffold 20 11-miR cluster, three miR loci have antisense reads that exactly match the sense miR/miR* products. (b) Secondary structures of Ci-mir-2217-1 and its associated antisense locus, miR-2217-1-as, both form highly symmetric hairpins, on which the miR and miR* products are indicated as lines. (c) In one case, we observed the antisense locus of Ci-mir-2246 produces not only miR/miR*, but also a 5p-moR product. (d) Secondary structures and product distribution of Ci-mir-2246 and Ci-mir-2246-as.
Figure 5
Figure 5
Non-canonical miR examples from Ciona. (a) A classic example of mirtron Ci-mir-2219-2 shows the miR and miR* products are produced from the precisely spliced intron from gene ci100134440. (b) In some cases, only one of the miR/miR* products abuts the splice junction, while the other product is fully inside the intron. A so-called half-mirtron example, Ci-mir-2227, is represented. (c) Ci-mir-2233 produces a miR/miR* pair from a perfectly structured hairpin, which overlaps with a protein coding exon in the gene ci0100152310.

References

    1. He L, Hannon GJ. MicroRNAs: small RNAs with a big role in gene regulation. Nat Rev Genet. 2004;5:522–531. doi: 10.1038/nrg1379. - DOI - PubMed
    1. Liu J. Control of protein synthesis and mRNA degradation by microRNAs. Curr Opin Cell Biol. 2008;20:214–221. doi: 10.1016/j.ceb.2008.01.006. - DOI - PubMed
    1. Stefani G, Slack FJ. Small non-coding RNAs in animal development. Nat Rev Mol Cell Biol. 2008;9:219–230. doi: 10.1038/nrm2347. - DOI - PubMed
    1. Shi W, Hendrix D, Levine M, Haley B. A distinct class of small RNAs arises from pre-miRNA-proximal regions in a simple chordate. Nat Struct Mol Biol. 2009;16:183–189. doi: 10.1038/nsmb.1536. - DOI - PMC - PubMed
    1. Langenberger D, Bermudez-Santana C, Hertel J, Hoffmann S, Khaitovich P, Stadler PF. Evidence for human microRNA-offset RNAs in small RNA sequencing data. Bioinformatics. 2009;25:2298–2301. doi: 10.1093/bioinformatics/btp419. - DOI - PubMed

Publication types