Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Apr;82(4):971-81.
doi: 10.1016/j.ajhg.2008.02.016.

Mapping of small RNAs in the human ENCODE regions

Affiliations

Mapping of small RNAs in the human ENCODE regions

Christelle Borel et al. Am J Hum Genet. 2008 Apr.

Abstract

The elucidation of the largely unknown transcriptome of small RNAs is crucial for the understanding of genome and cellular function. We report here the results of the analysis of small RNAs (< 50 nt) in the ENCODE regions of the human genome. Size-fractionated RNAs from four different cell lines (HepG2, HelaS3, GM06990, SK-N-SH) were mapped with the forward and reverse ENCODE high-density resolution tiling arrays. The top 1% of hybridization signals are termed SmRfrags (Small RNA fragments). Eight percent of SmRfrags overlap the GENCODE genes (CDS), given that the majority map to intergenic regions (34%), intronic regions (53%), and untranslated regions (UTRs) (5%). In addition, 9.6% and 16.8% of SmRfrags in the 5' UTR regions overlap significantly with His/Pol II/TAF250 binding sites and DNase I Hypersensitive sites, respectively (compared to the 5.3% and 9% expected). Interestingly, 17%-24% (depending on the cell line) of SmRfrags are sense-antisense strand pairs that show evidence of overlapping transcription. Only 3.4% and 7.2% of SmRfrags in intergenic regions overlap transcribed fragments (Txfrags) in HeLa and GM06990 cell lines, respectively. We hypothesized that a fraction of the identified SmRfrags corresponded to microRNAs. We tested by Northern blot a set of 15 high-likelihood predictions of microRNA candidates that overlap with smRfrags and validated three potential microRNAs ( approximately 20 nt length). Notably, most of the remaining candidates showed a larger hybridizing band ( approximately 100 nt) that could be a microRNA precursor. The small RNA transcriptome is emerging as an important and abundant component of the genome function.

PubMed Disclaimer

Figures

Figure 1
Figure 1
SmRfrags Map in CDS ENCODE Regions, in 5′ UTR ENCODE Regions, and across the First Six Exons of ENCODE Genes in Different Cell Lines All panels show the percentage of total SmRfrags (CDS ENCODE regions [A], 5′ UTR ENCODE regions [B], across the first six exons of ENCODE genes [C]). “Encode Regions” indicates the ENCODE-array content in each category. p value was determined by Hypergeometric testing. Statistical significance is labeled by ∗∗∗ for p values < 0.005. The analysis was performed on the forward data, but there were no identifiable differences between SmRfrags distributions on the two strands (data not shown). “RA” indicates retinoic-acid treatment (6 μM, 48 hr).
Figure 2
Figure 2
SmRfrags Localization Relative to TSSs, His.Pol.TAF Sites, and Proximal DNase Hypersensitive Sites The data are shown for SmRfrags mapping in the 5′ UTRs of annotated genes (TSSs [A], His.Pol.TAF sites [B], proximal DNase Hypersensitive sites [C]). Statistical significance is labeled by for p values < 0.05, by ∗∗ for p values < 0.01, and by ∗∗∗ for p values < 0.005, via Hypergeometric test. “A” indicates retinoic-acid treatment (6 μM, 48 hr).
Figure 3
Figure 3
Correlation of Gene Expression and SmRfrags in TSSs Colors in the pie charts indicate the proportions of expressed genes (green [HeLa S3] and purple [GM06990]) and nonexpressed genes (gray) annotated in ENCODE regions. p value was determined with a Chi-square test.
Figure 4
Figure 4
SmRfrags in Different Cell Lines The fraction of SmRfrags detected in the indicated cell lines is shown. Percentage is expressed as the percentage of total SmRfrags.
Figure 5
Figure 5
Sense-Antisense SmRfrags, in Different Cell Lines In brackets, the numbers of sense and antisense pairs are indicated for each cell line. Percentage of total SmRfrags is shown in bold.
Figure 6
Figure 6
Three Potential New MicroRNAs in the ENCODE Regions Genomic localization of SmRfrags and Northern-blot analysis: Color bars depict the position of SmRfrags with their respective signal intensities (log2). The cutoff for the top1% positive signals ranges from 2.5 (log2) to 3.5 (log2), depending on the cell lines. Arrows indicate the strand direction. Grey bar represents the Northern-blot probe. The conservation pattern (“conservation” track) is based on the UCSC phastCons scores. This track shows evolutionary conservation in 17 vertebrates, including mammalian, amphibian, bird, and fish species, on the basis of phastCons, a phylogenetic hidden Markov model. Multiz alignments of the assemblies were used to generate this track (generated with UCSC genome browser). The conservation is visualized by a blue scale density gradient and sequence annotation specific for each species. Northern-blot validation of microRNAs: Probes are 25 mer LNA oligonucleotides (see Material and Methods). Each blot contains a positive control lane (PC), which is a 25-mer oligonucleotide with the complementary sequence used for the probe. Lanes (1), (2), (3), and (4) correspond to total RNA from HeLa S3, GM06990, SK-N-SH, and HepG2, respectively. Two specific bands of 25 and 70 nt were detected, corresponding to the mature and the precursor forms of the putative microRNA, respectively.

Similar articles

  • Transcriptional and post-transcriptional profile of human chromosome 21.
    Nikolaev SI, Deutsch S, Genolet R, Borel C, Parand L, Ucla C, Schütz F, Duriaux Sail G, Dupré Y, Jaquier-Gubler P, Araud T, Conne B, Descombes P, Vassalli JD, Curran J, Antonarakis SE. Nikolaev SI, et al. Genome Res. 2009 Aug;19(8):1471-9. doi: 10.1101/gr.089425.108. Epub 2009 Jul 6. Genome Res. 2009. PMID: 19581486 Free PMC article.
  • Structured RNAs in the ENCODE selected regions of the human genome.
    Washietl S, Pedersen JS, Korbel JO, Stocsits C, Gruber AR, Hackermüller J, Hertel J, Lindemeyer M, Reiche K, Tanzer A, Ucla C, Wyss C, Antonarakis SE, Denoeud F, Lagarde J, Drenkow J, Kapranov P, Gingeras TR, Guigó R, Snyder M, Gerstein MB, Reymond A, Hofacker IL, Stadler PF. Washietl S, et al. Genome Res. 2007 Jun;17(6):852-64. doi: 10.1101/gr.5650707. Genome Res. 2007. PMID: 17568003 Free PMC article.
  • Integrated analysis of experimental data sets reveals many novel promoters in 1% of the human genome.
    Trinklein ND, Karaöz U, Wu J, Halees A, Force Aldred S, Collins PJ, Zheng D, Zhang ZD, Gerstein MB, Snyder M, Myers RM, Weng Z. Trinklein ND, et al. Genome Res. 2007 Jun;17(6):720-31. doi: 10.1101/gr.5716607. Genome Res. 2007. PMID: 17567992 Free PMC article.
  • A high-resolution map of transcription in the yeast genome.
    David L, Huber W, Granovskaia M, Toedling J, Palm CJ, Bofkin L, Jones T, Davis RW, Steinmetz LM. David L, et al. Proc Natl Acad Sci U S A. 2006 Apr 4;103(14):5320-5. doi: 10.1073/pnas.0601091103. Epub 2006 Mar 28. Proc Natl Acad Sci U S A. 2006. PMID: 16569694 Free PMC article.
  • In-depth query of large genomes using tiling arrays.
    Samanta MP, Tongprasit W, Stolc V. Samanta MP, et al. Methods Mol Biol. 2007;377:163-74. doi: 10.1007/978-1-59745-390-5_10. Methods Mol Biol. 2007. PMID: 17634616 Review.

Cited by

References

    1. Cheng J., Kapranov P., Drenkow J., Dike S., Brubaker S., Patel S., Long J., Stern D., Tammana H., Helt G. Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution. Science. 2005;308:1149–1154. - PubMed
    1. Kapranov P., Drenkow J., Cheng J., Long J., Helt G., Dike S., Gingeras T.R. Examples of the complex architecture of the human transcriptome revealed by RACE and high-density tiling arrays. Genome Res. 2005;15:987–997. - PMC - PubMed
    1. Kapranov P., Cawley S.E., Drenkow J., Bekiranov S., Strausberg R.L., Fodor S.P., Gingeras T.R. Large-scale transcriptional activity in chromosomes 21 and 22. Science. 2002;296:916–919. - PubMed
    1. Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C. The transcriptional landscape of the mammalian genome. Science. 2005;309:1559–1563. - PubMed
    1. Kapranov P., Cheng J., Dike S., Nix D.A., Duttagupta R., Willingham A.T., Stadler P.F., Hertel J., Hackermuller J., Hofacker I.L. RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science. 2007;316:1484–1488. - PubMed

Publication types