Predicting microRNA precursors with a generalized Gaussian components based density estimation algorithm
- PMID: 20122227
- PMCID: PMC3009525
- DOI: 10.1186/1471-2105-11-S1-S52
Predicting microRNA precursors with a generalized Gaussian components based density estimation algorithm
Abstract
Background: MicroRNAs (miRNAs) are short non-coding RNA molecules, which play an important role in post-transcriptional regulation of gene expression. There have been many efforts to discover miRNA precursors (pre-miRNAs) over the years. Recently, ab initio approaches have attracted more attention because they do not depend on homology information and provide broader applications than comparative approaches. Kernel based classifiers such as support vector machine (SVM) are extensively adopted in these ab initio approaches due to the prediction performance they achieved. On the other hand, logic based classifiers such as decision tree, of which the constructed model is interpretable, have attracted less attention.
Results: This article reports the design of a predictor of pre-miRNAs with a novel kernel based classifier named the generalized Gaussian density estimator (G2DE) based classifier. The G2DE is a kernel based algorithm designed to provide interpretability by utilizing a few but representative kernels for constructing the classification model. The performance of the proposed predictor has been evaluated with 692 human pre-miRNAs and has been compared with two kernel based and two logic based classifiers. The experimental results show that the proposed predictor is capable of achieving prediction performance comparable to those delivered by the prevailing kernel based classification algorithms, while providing the user with an overall picture of the distribution of the data set.
Conclusion: Software predictors that identify pre-miRNAs in genomic sequences have been exploited by biologists to facilitate molecular biology research in recent years. The G2DE employed in this study can deliver prediction accuracy comparable with the state-of-the-art kernel based machine learning algorithms. Furthermore, biologists can obtain valuable insights about the different characteristics of the sequences of pre-miRNAs with the models generated by the G2DE based predictor.
Figures



Similar articles
-
Using a kernel density estimation based classifier to predict species-specific microRNA precursors.BMC Bioinformatics. 2008 Dec 12;9 Suppl 12(Suppl 12):S2. doi: 10.1186/1471-2105-9-S12-S2. BMC Bioinformatics. 2008. PMID: 19091019 Free PMC article.
-
Ab initio identification of human microRNAs based on structure motifs.BMC Bioinformatics. 2007 Dec 18;8:478. doi: 10.1186/1471-2105-8-478. BMC Bioinformatics. 2007. PMID: 18088431 Free PMC article.
-
Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine.BMC Bioinformatics. 2005 Dec 29;6:310. doi: 10.1186/1471-2105-6-310. BMC Bioinformatics. 2005. PMID: 16381612 Free PMC article.
-
Predicting novel microRNA: a comprehensive comparison of machine learning approaches.Brief Bioinform. 2019 Sep 27;20(5):1607-1620. doi: 10.1093/bib/bby037. Brief Bioinform. 2019. PMID: 29800232 Review.
-
Popular Computational Tools Used for miRNA Prediction and Their Future Development Prospects.Interdiscip Sci. 2020 Dec;12(4):395-413. doi: 10.1007/s12539-020-00387-3. Epub 2020 Sep 21. Interdiscip Sci. 2020. PMID: 32959233 Review.
Cited by
-
The prediction of the porcine pre-microRNAs in genome-wide based on support vector machine (SVM) and homology searching.BMC Genomics. 2012 Dec 27;13:729. doi: 10.1186/1471-2164-13-729. BMC Genomics. 2012. PMID: 23268561 Free PMC article.
-
The discriminant power of RNA features for pre-miRNA recognition.BMC Bioinformatics. 2014 May 2;15:124. doi: 10.1186/1471-2105-15-124. BMC Bioinformatics. 2014. PMID: 24884650 Free PMC article.
-
Genome-wide approaches in the study of microRNA biology.Wiley Interdiscip Rev Syst Biol Med. 2011 Sep-Oct;3(5):491-512. doi: 10.1002/wsbm.128. Epub 2010 Dec 31. Wiley Interdiscip Rev Syst Biol Med. 2011. PMID: 21197653 Free PMC article. Review.
-
Integrated sequence-structure motifs suffice to identify microRNA precursors.PLoS One. 2012;7(3):e32797. doi: 10.1371/journal.pone.0032797. Epub 2012 Mar 15. PLoS One. 2012. PMID: 22438883 Free PMC article.
-
Automatic learning of pre-miRNAs from different species.BMC Bioinformatics. 2016 May 28;17(1):224. doi: 10.1186/s12859-016-1036-3. BMC Bioinformatics. 2016. PMID: 27233515 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources