Structured RNAs in the ENCODE selected regions of the human genome
- PMID: 17568003
- PMCID: PMC1891344
- DOI: 10.1101/gr.5650707
Structured RNAs in the ENCODE selected regions of the human genome
Abstract
Functional RNA structures play an important role both in the context of noncoding RNA transcripts as well as regulatory elements in mRNAs. Here we present a computational study to detect functional RNA structures within the ENCODE regions of the human genome. Since structural RNAs in general lack characteristic signals in primary sequence, comparative approaches evaluating evolutionary conservation of structures are most promising. We have used three recently introduced programs based on either phylogenetic-stochastic context-free grammar (EvoFold) or energy directed folding (RNAz and AlifoldZ), yielding several thousand candidate structures (corresponding to approximately 2.7% of the ENCODE regions). EvoFold has its highest sensitivity in highly conserved and relatively AU-rich regions, while RNAz favors slightly GC-rich regions, resulting in a relatively small overlap between methods. Comparison with the GENCODE annotation points to functional RNAs in all genomic contexts, with a slightly increased density in 3'-UTRs. While we estimate a significant false discovery rate of approximately 50%-70% many of the predictions can be further substantiated by additional criteria: 248 loci are predicted by both RNAz and EvoFold, and an additional 239 RNAz or EvoFold predictions are supported by the (more stringent) AlifoldZ algorithm. Five hundred seventy RNAz structure predictions fall into regions that show signs of selection pressure also on the sequence level (i.e., conserved elements). More than 700 predictions overlap with noncoding transcripts detected by oligonucleotide tiling arrays. One hundred seventy-five selected candidates were tested by RT-PCR in six tissues, and expression could be verified in 43 cases (24.6%).
Figures
References
-
- Bentwich I., Avniel A.A., Karov Y., Aharonov R., Gilad S., Barad O., Barzilai A., Einat P., Einav U., Meiri E., Avniel A.A., Karov Y., Aharonov R., Gilad S., Barad O., Barzilai A., Einat P., Einav U., Meiri E., Karov Y., Aharonov R., Gilad S., Barad O., Barzilai A., Einat P., Einav U., Meiri E., Aharonov R., Gilad S., Barad O., Barzilai A., Einat P., Einav U., Meiri E., Gilad S., Barad O., Barzilai A., Einat P., Einav U., Meiri E., Barad O., Barzilai A., Einat P., Einav U., Meiri E., Barzilai A., Einat P., Einav U., Meiri E., Einat P., Einav U., Meiri E., Einav U., Meiri E., Meiri E., et al. Identification of hundreds of conserved and nonconserved human microRNAs. Nat. Genet. 2005;37:766–770. - PubMed
-
- Bertone P., Stoc V., Royce T.E., Rozowsky J.S., Urban A.E., Zhu X., Rinn J.L., Tongprasit W., Samanta M., Weissman S., Stoc V., Royce T.E., Rozowsky J.S., Urban A.E., Zhu X., Rinn J.L., Tongprasit W., Samanta M., Weissman S., Royce T.E., Rozowsky J.S., Urban A.E., Zhu X., Rinn J.L., Tongprasit W., Samanta M., Weissman S., Rozowsky J.S., Urban A.E., Zhu X., Rinn J.L., Tongprasit W., Samanta M., Weissman S., Urban A.E., Zhu X., Rinn J.L., Tongprasit W., Samanta M., Weissman S., Zhu X., Rinn J.L., Tongprasit W., Samanta M., Weissman S., Rinn J.L., Tongprasit W., Samanta M., Weissman S., Tongprasit W., Samanta M., Weissman S., Samanta M., Weissman S., Weissman S., et al. Global identification of human transcribed sequences with genome tiling arrays. Science. 2004;306:2242–2246. - PubMed
-
- Blanchette M., Kent W.J., Riemer C., Elnitski L., Smit A.F., Roskin K.M., Baertsch R., Rosenbloom K., Clawson H., Green E.D., Kent W.J., Riemer C., Elnitski L., Smit A.F., Roskin K.M., Baertsch R., Rosenbloom K., Clawson H., Green E.D., Riemer C., Elnitski L., Smit A.F., Roskin K.M., Baertsch R., Rosenbloom K., Clawson H., Green E.D., Elnitski L., Smit A.F., Roskin K.M., Baertsch R., Rosenbloom K., Clawson H., Green E.D., Smit A.F., Roskin K.M., Baertsch R., Rosenbloom K., Clawson H., Green E.D., Roskin K.M., Baertsch R., Rosenbloom K., Clawson H., Green E.D., Baertsch R., Rosenbloom K., Clawson H., Green E.D., Rosenbloom K., Clawson H., Green E.D., Clawson H., Green E.D., Green E.D., et al. Aligning multiple genomic sequences with the threaded blockset aligner. Genome Res. 2004;14:708–715. - PMC - PubMed
-
- Blankenberg D., Taylor J., Schenck I., He J., Zhang Y., Ghent M., Veeraraghavan N., Albert I., Miller W., Makova K., Taylor J., Schenck I., He J., Zhang Y., Ghent M., Veeraraghavan N., Albert I., Miller W., Makova K., Schenck I., He J., Zhang Y., Ghent M., Veeraraghavan N., Albert I., Miller W., Makova K., He J., Zhang Y., Ghent M., Veeraraghavan N., Albert I., Miller W., Makova K., Zhang Y., Ghent M., Veeraraghavan N., Albert I., Miller W., Makova K., Ghent M., Veeraraghavan N., Albert I., Miller W., Makova K., Veeraraghavan N., Albert I., Miller W., Makova K., Albert I., Miller W., Makova K., Miller W., Makova K., Makova K., et al. A framework for collaborative analysis of ENCODE data: Making large-scale analyses biologist-friendly. Genome Res. 2007 doi: 10.1101/gr.5578007. (this issue) - DOI - PMC - PubMed
-
- Bompfünewerer A.F., Flamm C., Fried C., Fritzsch G., Hofacker I.L., Lehmann J., Missal K., Mosig A., Müller B., Prohaska S.J., Flamm C., Fried C., Fritzsch G., Hofacker I.L., Lehmann J., Missal K., Mosig A., Müller B., Prohaska S.J., Fried C., Fritzsch G., Hofacker I.L., Lehmann J., Missal K., Mosig A., Müller B., Prohaska S.J., Fritzsch G., Hofacker I.L., Lehmann J., Missal K., Mosig A., Müller B., Prohaska S.J., Hofacker I.L., Lehmann J., Missal K., Mosig A., Müller B., Prohaska S.J., Lehmann J., Missal K., Mosig A., Müller B., Prohaska S.J., Missal K., Mosig A., Müller B., Prohaska S.J., Mosig A., Müller B., Prohaska S.J., Müller B., Prohaska S.J., Prohaska S.J., et al. Evolutionary patterns of non-coding RNAs. Theory Biosci.. 2005;123:301–369. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous