Systematic Curation of miRBase Annotation Using Integrated Small RNA High-Throughput Sequencing Data for C. elegans and Drosophila
- PMID: 22303321
- PMCID: PMC3268580
- DOI: 10.3389/fgene.2011.00025
Systematic Curation of miRBase Annotation Using Integrated Small RNA High-Throughput Sequencing Data for C. elegans and Drosophila
Abstract
MicroRNAs (miRNAs) are a class of 20-23 nucleotide small RNAs that regulate gene expression post-transcriptionally in animals and plants. Annotation of miRNAs by the miRNA database (miRBase) has largely relied on computational approaches. As a result, many miRBase entries lack experimental validation, and discrepancies between miRBase annotation and actual miRNA sequences are often observed. In this study, we integrated the small RNA sequencing (smRNA-seq) datasets in Caenorhabditis elegans and Drosophila melanogaster and devised an analytical pipeline coupled with detailed manual inspection to curate miRNA annotation systematically in miRBase. Our analysis reveals 19 (17.0%) and 51 (31.3%) miRNAs entries with detectable smRNA-seq reads have mature sequence discrepancies in C. elegans and D. melanogaster, respectively. These discrepancies frequently occur either for conserved miRNA families whose mature sequences were predicted according to their homologous counterparts in other species or for miRNAs whose precursor miRNA (pre-miRNA) hairpins produce an abundance of multiple miRNA isoforms or variants. Our analysis shows that while Drosophila pre-miRNAs, on average, produce less than 60% accurate mature miRNA reads in addition to their 5' and 3' variant isoforms, the precision of miRNA processing in C. elegans is much higher, at over 90%. Based on the revised miRNA sequences, we analyzed expression patterns of the more conserved (MC) and less conserved (LC) miRNAs and found that, whereas MC miRNAs are often co-expressed at multiple developmental stages, LC miRNAs tend to be expressed specifically at fewer stages.
Keywords: database curation; deep sequencing; microRNA.
Figures







Similar articles
-
Improved annotation of C. elegans microRNAs by deep sequencing reveals structures associated with processing by Drosha and Dicer.RNA. 2011 Apr;17(4):563-77. doi: 10.1261/rna.2432311. Epub 2011 Feb 9. RNA. 2011. PMID: 21307183 Free PMC article.
-
Sequence relationships among C. elegans, D. melanogaster and human microRNAs highlight the extensive conservation of microRNAs in biology.PLoS One. 2008 Jul 30;3(7):e2818. doi: 10.1371/journal.pone.0002818. PLoS One. 2008. PMID: 18665242 Free PMC article.
-
Deep parallel sequencing reveals conserved and novel miRNAs in gill and hepatopancreas of giant freshwater prawn.Fish Shellfish Immunol. 2013 Oct;35(4):1061-9. doi: 10.1016/j.fsi.2013.06.017. Epub 2013 Jun 29. Fish Shellfish Immunol. 2013. PMID: 23816854
-
Creating and maintaining a high-confidence microRNA repository for crop research: A brief review and re-examination of the current crop microRNA registries.J Plant Physiol. 2022 Mar;270:153636. doi: 10.1016/j.jplph.2022.153636. Epub 2022 Feb 2. J Plant Physiol. 2022. PMID: 35124290 Review.
-
Bioinformatics of cardiovascular miRNA biology.J Mol Cell Cardiol. 2015 Dec;89(Pt A):3-10. doi: 10.1016/j.yjmcc.2014.11.027. Epub 2014 Dec 5. J Mol Cell Cardiol. 2015. PMID: 25486579 Review.
Cited by
-
Bias-minimized quantification of microRNA reveals widespread alternative processing and 3' end modification.Nucleic Acids Res. 2019 Mar 18;47(5):2630-2640. doi: 10.1093/nar/gky1293. Nucleic Acids Res. 2019. PMID: 30605524 Free PMC article.
-
Selective inhibition of miR-21 by phage display screened peptide.Nucleic Acids Res. 2015 Apr 30;43(8):4342-52. doi: 10.1093/nar/gkv185. Epub 2015 Mar 30. Nucleic Acids Res. 2015. PMID: 25824952 Free PMC article.
-
MirGeneDB 2.0: the metazoan microRNA complement.Nucleic Acids Res. 2020 Jan 8;48(D1):D132-D141. doi: 10.1093/nar/gkz885. Nucleic Acids Res. 2020. PMID: 31598695 Free PMC article.
-
An estimate of the total number of true human miRNAs.Nucleic Acids Res. 2019 Apr 23;47(7):3353-3364. doi: 10.1093/nar/gkz097. Nucleic Acids Res. 2019. PMID: 30820533 Free PMC article.
-
miRDis: a Web tool for endogenous and exogenous microRNA discovery based on deep-sequencing data analysis.Brief Bioinform. 2018 May 1;19(3):415-424. doi: 10.1093/bib/bbw140. Brief Bioinform. 2018. PMID: 28073746 Free PMC article.
References
Grants and funding
LinkOut - more resources
Full Text Sources
Molecular Biology Databases
Research Materials