Identification of putative noncoding RNAs among the RIKEN mouse full-length cDNA collection
- PMID: 12819127
- PMCID: PMC403720
- DOI: 10.1101/gr.1011603
Abstract
With the sequencing and annotation of genomes and transcriptomes of several eukaryotes, the importance of noncoding RNA (ncRNA), that is, RNA molecules that are not translated to protein products, has become more evident. A subclass of ncRNA transcripts is encoded by highly regulated, multi-exon transcriptional units, is processed like typical protein-coding mRNAs, and is increasingly implicated in the regulation of many cellular functions in eukaryotes. This study describes the identification of candidate functional ncRNAs from among the RIKEN mouse full-length cDNA collection, which contains 60,770 sequences, by using a systematic computational filtering approach. We initially searched for previously reported ncRNAs and found nine murine ncRNAs and homologs of several previously described nonmouse ncRNAs. Through our computational approach to filter artifact-free clones that lack protein-coding potential, we extracted 4280 transcripts as the largest candidate set. Many clones in the set had EST hits, potential CpG islands surrounding the transcription start sites, and homologies with the human genome. This implies that many candidates are indeed transcribed in a regulated manner. Our results demonstrate that ncRNAs are a major functional subclass of processed transcripts in mammals.
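The abstract does not spell out how protein-coding potential was assessed. As a rough illustration of one step such a filter might include, the sketch below flags cDNA sequences whose longest forward-strand open reading frame (ORF) falls below a length cutoff. The function names, the cutoff of 100 codons, and the choice to scan only the three forward frames are assumptions made for this example, not the procedure used in the study.

```python
# Illustrative sketch only: names and the 100-codon cutoff are assumptions,
# not the filtering criteria reported by the authors.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf_codons(seq: str) -> int:
    """Return the length, in codons, of the longest ATG-to-stop ORF.

    Only the three forward reading frames are scanned. The count includes
    the ATG start codon and excludes the stop codon. ORFs that run off the
    end of the sequence without a stop codon are ignored.
    """
    seq = seq.upper().replace("U", "T")
    best = 0
    for frame in range(3):
        start = None  # codon index of the ATG that opened the current ORF
        codons = [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]
        for idx, codon in enumerate(codons):
            if start is None and codon == "ATG":
                start = idx
            elif start is not None and codon in STOP_CODONS:
                best = max(best, idx - start)
                start = None
    return best


def lacks_coding_potential(seq: str, max_orf_codons: int = 100) -> bool:
    """True if no complete forward ORF reaches the (assumed) length cutoff."""
    return longest_orf_codons(seq) < max_orf_codons


if __name__ == "__main__":
    # Toy sequence: ATG, five GCT codons, then a TAA stop.
    toy = "ATG" + "GCT" * 5 + "TAA"
    print(longest_orf_codons(toy))
    print(lacks_coding_potential(toy))
```

A filter like this would be only one component of a pipeline such as the one the abstract outlines; similarity searches against known protein sequences and checks for cloning artifacts would be handled separately.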
WEB SITE REFERENCES
- http://biobases.ibch.poznan.pl/ncRNA/; Noncoding RNAs database.
- ftp://us.expasy.org/databases/sp_tr_nrdb/; data set for known protein sequences.
- ftp://ftp.ncbi.nih.gov/blast/db/; database of mouse EST sequences and human EST sequences.
- ftp://ftp.ncbi.nih.gov/genomes/R_norvegicus/; database of rat EST sequences.
- http://www.ncbi.nlm.nih.gov/blast; executable files of BLASTN and BLASTX.