Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2007 May;17(5):556-65.
doi: 10.1101/gr.6036807. Epub 2007 Mar 26.

Functionality or transcriptional noise? Evidence for selection within long noncoding RNAs

Affiliations
Comparative Study

Functionality or transcriptional noise? Evidence for selection within long noncoding RNAs

Jasmina Ponjavic et al. Genome Res. 2007 May.

Abstract

Long transcripts that do not encode protein have only rarely been the subject of experimental scrutiny. Presumably, this is owing to the current lack of evidence of their functionality, thereby leaving an impression that, instead, they represent "transcriptional noise." Here, we describe an analysis of 3122 long and full-length, noncoding RNAs ("macroRNAs") from the mouse, and compare their sequences and their promoters with orthologous sequence from human and from rat. We considered three independent signatures of purifying selection related to substitutions, sequence insertions and deletions, and splicing. We find that the evolution of the set of noncoding RNAs is not consistent with neutralist explanations. Rather, our results indicate that purifying selection has acted on the macroRNAs' promoters, primary sequence, and consensus splice site motifs. Promoters have experienced the greatest elimination of nucleotide substitutions, insertions, and deletions. The proportion of conserved sequence (4.1%-5.5%) in these macroRNAs is comparable to the density of exons within protein-coding transcripts (5.2%). These macroRNAs, taken together, thus possess the imprint of purifying selection, thereby indicating their functionality. Our findings should now provide an incentive for the experimental investigation of these macroRNAs' functions.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Nucleotide substitution and transversion rates are suppressed within macroRNA transcripts. Panels show the cumulative distributions of substitution (A,B) and transversion rates (C,D) as measured on macroRNA transcripts (red curves), and the same rates measured on nearby nonoverlapping AR sequence of matched length (black curves). (A) Mouse–human substitution rates; (B) mouse–rat substitution rates; (C,D) mouse–human and mouse–rat transversion rates. All macroRNA rates are significantly different from, and lower than, the putatively neutral AR rates (Kolmogorov-Smirnov test, P < 10−15 for all panels). The suppression of transversion rates in macroRNAs compared to AR sequence demonstrates that these observations are not a consequence of a higher density of highly mutable CpG sites within ARs, since the associated mutations are mainly transitions rather than transversions.
Figure 2.
Figure 2.
Density of indel-purified segments in macroRNA transcripts. Shown are the IPS densities within macroRNA transcripts (red line) for 10 G+C content bins (horizontal axis), and the expected density based on the intergenic distribution of IPSs (black line; the gray band indicates 95% confidence intervals obtained by randomization; see Methods for details). The IPS densities within macroRNAs exceed significantly the levels expected in G+C-matched intergenic sequence, indicating the past action of purifying selection.
Figure 3.
Figure 3.
Strong conservation of macroRNA promoters. Panels show the cumulative distributions of substitution (A,B) and transversion rates (C,D), as measured on the core putative promoter regions of macroRNA transcripts (red curves; 0–400 bp upstream of transcription start site), and the same rates on nearby AR sequence of the same length (black curves). Mouse macroRNA putative core promoter regions exhibit significantly suppressed substitution and transversion rates compared to selectively neutral ARs (P < 10−15 for all panels). This is true both for mouse–human (A,C) and mouse–rat (B,D) comparisons.
Figure 4.
Figure 4.
Density of indel-purified segments in macroRNA promoters. Shown are the IPS densities within 400-bp regions upstream of macroRNA transcripts (red line) for 10 G+C content bins (horizontal axis), and the expected density based on the intergenic distribution of IPSs (black line; the gray band indicates 95% confidence intervals). IPS densities are substantially and significantly higher within putative core promoter regions.

Similar articles

Cited by

References

    1. Allen J.E., Pertea M., Salzberg S.L., Pertea M., Salzberg S.L., Salzberg S.L. Computational gene prediction using multiple sources of evidence. Genome Res. 2004;14:142–148. - PMC - PubMed
    1. Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J., Gish W., Miller W., Myers E.W., Lipman D.J., Miller W., Myers E.W., Lipman D.J., Myers E.W., Lipman D.J., Lipman D.J. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. - PubMed
    1. Badger J.H., Olsen G.J., Olsen G.J. CRITICA: Coding region identification tool invoking comparative analysis. Mol. Biol. Evol. 1999;16:512–524. - PubMed
    1. Bertone P., Stolc V., Royce T.E., Rozowsky J.S., Urban A.E., Zhu X., Rinn J.L., Tongprasit W., Samanta M., Weissman S., Stolc V., Royce T.E., Rozowsky J.S., Urban A.E., Zhu X., Rinn J.L., Tongprasit W., Samanta M., Weissman S., Royce T.E., Rozowsky J.S., Urban A.E., Zhu X., Rinn J.L., Tongprasit W., Samanta M., Weissman S., Rozowsky J.S., Urban A.E., Zhu X., Rinn J.L., Tongprasit W., Samanta M., Weissman S., Urban A.E., Zhu X., Rinn J.L., Tongprasit W., Samanta M., Weissman S., Zhu X., Rinn J.L., Tongprasit W., Samanta M., Weissman S., Rinn J.L., Tongprasit W., Samanta M., Weissman S., Tongprasit W., Samanta M., Weissman S., Samanta M., Weissman S., Weissman S., et al. Global identification of human transcribed sequences with genome tiling arrays. Science. 2004;306:2242–2246. - PubMed
    1. Birney E., Andrews D., Caccamo M., Chen Y., Clarke L., Coates G., Cox T., Cunningham F., Curwen V., Cutts T., Andrews D., Caccamo M., Chen Y., Clarke L., Coates G., Cox T., Cunningham F., Curwen V., Cutts T., Caccamo M., Chen Y., Clarke L., Coates G., Cox T., Cunningham F., Curwen V., Cutts T., Chen Y., Clarke L., Coates G., Cox T., Cunningham F., Curwen V., Cutts T., Clarke L., Coates G., Cox T., Cunningham F., Curwen V., Cutts T., Coates G., Cox T., Cunningham F., Curwen V., Cutts T., Cox T., Cunningham F., Curwen V., Cutts T., Cunningham F., Curwen V., Cutts T., Curwen V., Cutts T., Cutts T., et al. Ensembl 2006. Nucleic Acids Res. 2006;34:D556–D561. - PMC - PubMed

Publication types