Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013;14 Suppl 14(Suppl 14):S9.
doi: 10.1186/1471-2105-14-S14-S9. Epub 2013 Oct 9.

PHDcleav: a SVM based method for predicting human Dicer cleavage sites using sequence and secondary structure of miRNA precursors

PHDcleav: a SVM based method for predicting human Dicer cleavage sites using sequence and secondary structure of miRNA precursors

Firoz Ahmed et al. BMC Bioinformatics. 2013.

Abstract

Background: Dicer, an RNase III enzyme, plays a vital role in the processing of pre-miRNAs for generating the miRNAs. The structural and sequence features on pre-miRNA which can facilitate position and efficiency of cleavage are not well known. A precise cleavage by Dicer is crucial because an inaccurate processing can produce miRNA with different seed regions which can alter the repertoire of target genes.

Results: In this study, a novel method has been developed to predict Dicer cleavage sites on pre-miRNAs using Support Vector Machine. We used the dataset of experimentally validated human miRNA hairpins from miRBase, and extracted fourteen nucleotides around Dicer cleavage sites. We developed number of models using various types of features and achieved maximum accuracy of 66% using binary profile of nucleotide sequence taken from 5p arm of hairpin. The prediction performance of Dicer cleavage site improved significantly from 66% to 86% when we integrated secondary structure information. This indicates that secondary structure plays an important role in the selection of cleavage site. All models were trained and tested on 555 experimentally validated cleavage sites and evaluated using 5-fold cross validation technique. In addition, the performance was also evaluated on an independent testing dataset that achieved an accuracy of ~82%.

Conclusion: Based on this study, we developed a webserver PHDcleav (http://www.imtech.res.in/raghava/phdcleav/) to predict Dicer cleavage sites in pre-miRNA. This tool can be used to investigate functional consequences of genetic variations/SNPs in miRNA on Dicer cleavage site, and gene silencing. Moreover, it would also be useful in the discovery of miRNAs in human genome and design of Dicer specific pre-miRNAs for potent gene silencing.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Schematic diagram of pre-miRNA, hsa-mir-200c, predicted by quikfold software and patterns of Dicer cleavage site at 5p arm. (A) miR* derived from 5p arm and miR derived from 3p arm of hairpin, bases are represented in capital letter. CD-5p and CD-3p are cleavage sites of Dicer at 5' and 3' arm respectively. (B) Sequence of CP-5p cleavage pattern of 14 nucleotides having cleavage site CD-5p at center. Following each cleavage pattern, features of mononucleotide and binary used as input feature for SVM are given. (C) Structure of CP-5p cleavage pattern of 14 nucleotides having cleavage site CD-5p at center and its partially complementary strand. Base pairs are indicated with arrows. Zero (0) indicates that no base pairing occurs between complementary strands. The pattern of 14+14 is used to generate binary pattern. Mononucleotide having 4, sequence binary pattern having 56, and structure binary pattern having 112 dimensional vector. +1 is the class for cleavage pattern. Binary pattern is represented only for highlighted nucleotides.
Figure 2
Figure 2
Performance of various SVM models for Dicer cleavage site at 5p arm (CD-5p) shown by ROC plots. Bin [str, miRNA.str]: binary feature used for structure of CP-5p taken from miRNA.srt. Mono [str, quikfold]: mononucleotide composition used for structure of CP-5p taken from quikfold. Bin [seq, quikfold]: binary feature used for sequence of CP-5p taken from quikfold. Bin [str, quikfold]: binary feature used for structure of CP-5p taken from quikfold. The value indicates the AUC for the corresponding model.

References

    1. Chendrimada TP, Gregory RI, Kumaraswamy E, Norman J, Cooch N, Nishikura K, Shiekhattar R. TRBP recruits the Dicer complex to Ago2 for microRNA processing and gene silencing. Nature. 2005;14(7051):740–744. doi: 10.1038/nature03868. - DOI - PMC - PubMed
    1. Jaskiewicz L, Filipowicz W. Role of Dicer in posttranscriptional RNA silencing. Curr Top Microbiol Immunol. 2008;14:77–97. doi: 10.1007/978-3-540-75157-1_4. - DOI - PubMed
    1. Bartel DP. MicroRNAs: genomics, biogenesis, mechanism, and function. Cell. 2004;14(2):281–297. doi: 10.1016/S0092-8674(04)00045-5. - DOI - PubMed
    1. Yang WJ, Yang DD, Na S, Sandusky GE, Zhang Q, Zhao G. Dicer is required for embryonic angiogenesis during mouse development. J Biol Chem. 2005;14(10):9330–9335. - PubMed
    1. Harris KS, Zhang Z, McManus MT, Harfe BD, Sun X. Dicer function is essential for lung epithelium morphogenesis. Proc Natl Acad Sci USA. 2006;14(7):2208–2213. doi: 10.1073/pnas.0510839103. - DOI - PMC - PubMed

Publication types

LinkOut - more resources