Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2005 Jul;11(7):995-1003.
doi: 10.1261/rna.7290705. Epub 2005 May 31.

Weighted sequence motifs as an improved seeding step in microRNA target prediction algorithms

Affiliations
Comparative Study

Weighted sequence motifs as an improved seeding step in microRNA target prediction algorithms

Ola Saetrom et al. RNA. 2005 Jul.

Abstract

We present a new microRNA target prediction algorithm called TargetBoost, and show that the algorithm is stable and identifies more true targets than do existing algorithms. TargetBoost uses machine learning on a set of validated microRNA targets in lower organisms to create weighted sequence motifs that capture the binding characteristics between microRNAs and their targets. Existing algorithms require candidates to have (1) near-perfect complementarity between microRNAs' 5' end and their targets; (2) relatively high thermodynamic duplex stability; (3) multiple target sites in the target's 3' UTR; and (4) evolutionary conservation of the target between species. Most algorithms use one of the two first requirements in a seeding step, and use the three others as filters to improve the method's specificity. The initial seeding step determines an algorithm's sensitivity and also influences its specificity. As all algorithms may add filters to increase the specificity, we propose that methods should be compared before such filtering. We show that TargetBoost's weighted sequence motif approach is favorable to using both the duplex stability and the sequence complementarity steps. (TargetBoost is available as a Web tool from http://www.interagon.com/demo/.).

PubMed Disclaimer

Figures

FIGURE 1.
FIGURE 1.
Overall ROC-curves for each algorithm.
FIGURE 2.
FIGURE 2.
ROC-curves. A–D compare the performance of RNAhybrid, Nucleus, and TargetBoost at predicting the true target sites of let-7, lin-4, miR-13a, and bantam.
FIGURE 2.
FIGURE 2.
ROC-curves. A–D compare the performance of RNAhybrid, Nucleus, and TargetBoost at predicting the true target sites of let-7, lin-4, miR-13a, and bantam.
FIGURE 2.
FIGURE 2.
ROC-curves. A–D compare the performance of RNAhybrid, Nucleus, and TargetBoost at predicting the true target sites of let-7, lin-4, miR-13a, and bantam.
FIGURE 2.
FIGURE 2.
ROC-curves. A–D compare the performance of RNAhybrid, Nucleus, and TargetBoost at predicting the true target sites of let-7, lin-4, miR-13a, and bantam.
FIGURE 3.
FIGURE 3.
ROC-curves comparing different parameter settings on RNAhybrid (A) and Nucleus (B). We can see increased sensitivity for high-specificity values for RNAhybrid in A.
FIGURE 3.
FIGURE 3.
ROC-curves comparing different parameter settings on RNAhybrid (A) and Nucleus (B). We can see increased sensitivity for high-specificity values for RNAhybrid in A.
FIGURE 4.
FIGURE 4.
MicroRNA target query examples. (A) General pattern generated after the first boosting iteration in TargetBoost. This pattern has been translated using the let-7 miRNA in a complemented form. (B) The translated query matched against the lin-14 target site.
FIGURE 4.
FIGURE 4.
MicroRNA target query examples. (A) General pattern generated after the first boosting iteration in TargetBoost. This pattern has been translated using the let-7 miRNA in a complemented form. (B) The translated query matched against the lin-14 target site.
FIGURE 5.
FIGURE 5.
Aligned D. melanogaster target sites.
FIGURE 6.
FIGURE 6.
The grammar (A) and semantics (B) of the pattern language used by TargetBoost. The grammar and semantics are explained in the main text.
FIGURE 7.
FIGURE 7.
Two example patterns from our pattern language. Pi denotes nucleotide i in the miRNA counted from the 3′ end.

Similar articles

Cited by

References

    1. Bartel, D.P. 2004. MicroRNAs: Genomics, biogenesis, mechanism, and function. Cell 116: 281–297. - PubMed
    1. Boutla, A., Delidakis, C., and Tabler, M. 2003. Developmental defects by antisense-mediated inactivation of micro-RNAs 2 and 13 in Drosophila and the identification of putative target genes. Nucleic Acids Res. 31: 4973–4980. - PMC - PubMed
    1. Breiman, L., Friedman, J.H., Olshen, R.A., and Stone, C.J. 1984. Classification and regression trees. Wadsworth, Belmont, CA.
    1. Brennecke, J., Hipfner, D.R., Stark, A., Russell, R.B., and Cohen, S.M. 2003. bantam Encodes a developmentally regulated miRNA that controls cell proliferation and regulates the proapoptotic gene hid in Drosophila. Cell 113: 25–36. - PubMed
    1. Doench, J.G. and Sharp, P.A. 2004. Specificity of microRNA target selection in translational repression. Genes & Dev. 18: 504–511. - PMC - PubMed

Publication types

MeSH terms