Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Jul;40(12):e95.
doi: 10.1093/nar/gks241. Epub 2012 Mar 19.

R-CHIE: a web server and R package for visualizing RNA secondary structures

Affiliations

R-CHIE: a web server and R package for visualizing RNA secondary structures

Daniel Lai et al. Nucleic Acids Res. 2012 Jul.

Abstract

Visually examining RNA structures can greatly aid in understanding their potential functional roles and in evaluating the performance of structure prediction algorithms. As many functional roles of RNA structures can already be studied given the secondary structure of the RNA, various methods have been devised for visualizing RNA secondary structures. Most of these methods depict a given RNA secondary structure as a planar graph consisting of base-paired stems interconnected by roundish loops. In this article, we present an alternative method of depicting RNA secondary structure as arc diagrams. This is well suited for structures that are difficult or impossible to represent as planar stem-loop diagrams. Arc diagrams can intuitively display pseudo-knotted structures, as well as transient and alternative structural features. In addition, they facilitate the comparison of known and predicted RNA secondary structures. An added benefit is that structure information can be displayed in conjunction with a corresponding multiple sequence alignments, thereby highlighting structure and primary sequence conservation and variation. We have implemented the visualization algorithm as a web server R-chie as well as a corresponding R package called R4RNA, which allows users to run the software locally and across a range of common operating systems.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
An example of a ‘double structure arc diagram’, showing the Cripavirus Internal Ribosomal Entry Site [family RF00458 from the Rfam database (20)]. The RNA secondary structure shown above the horizontal sequence line has been predicted by Transat (11). Every arc corresponds to one base pair whose colour indicates its P-value, where dark blue is ≤1e-06, light blue is ≤1e-05, orange is ≤1e-04 and red is ≤1e-03 (P-value threshold). The RNA structure shown below the horizontal sequence line shows the consensus RNA structure from Rfam.
Figure 2.
Figure 2.
An example of a ‘overlapping structure arc diagram’, overlapping the Transat predicted structure and the Rfam consensus structure of family RF00458 presented in Figure 1. The structure shown above the horizontal sequence is the known structure in black, coloured by P-value if correctly predicted by Transat (best in blue and worst in red, see Figure 1 for exact P-value colours). The arcs below the line represent novel base pairs predicted by Transat not found in the known Rfam structure. Such a diagram can give} a qualitative description of a predicted structure's performance, where high sensitivity would result in a high proportion of top helices being coloured, and high specificity would result in a majority of helices above the line. On the other hand, the novel base pairs observed on the bottom half, may indicate alternative structural elements not yet experimentally verified, but worth investigating, especially in light of strong evolutionary evidence (Figure 3).
Figure 3.
Figure 3.
An example of an ‘overlapping covariance arc diagram’, of the Transat predicted structure, overlapping the Rfam consensus structure of family RF00458 (see Figure 2 for a detailed explanation of the known top and novel bottom arcs). The colouring of arcs indicates their P-value (detailed in Figure 1) from best (blue) to worst (red) and gray for conflicting base pairs. Between the arcs are two covariance blocks representing the same seven sequences in a gapped multiple sequence alignment from Rfam. Unpaired nucleotides are in black and gaps are in gray. For columns at the ends of a single non-conflicting arc, bases are assigned green if they are valid base pairs (G:C, A:U, G:U and the reversed pairings), or else they are red. For the green valid base pairs, if the base pair varies from the most commonly observed base pair in the column, then it is coloured blue to signify co-variation, or compensatory mutations to retain the base-pairing potential of the positions (dark blue for two sided, and light blue for one sided). The colouring of base pairs for arcs gives a qualitative representation of how well the base pair is conserved, and can be used to infer the validity of predicted base pairs, or the quality of an alignment given a known structure. For example, while most of the novel base pairs predicted to exist are simply extensions or slight shifts of known helices, the three largest red helices in the bottom show very high sequence conservation.

References

    1. Shapiro BA, Lipkin LE, Maizel J. An interactive technique for the display of nucleic acid secondary structure. Nucleic Acids Res. 1982;10:7041–7052. - PMC - PubMed
    1. Nussinov R, Pieczenik G, Griggs JR, Kleitman DJ. Algorithms for loop matchings. SIAM J. Appl. Math. 1978;35:68.
    1. Jacobson AB, Zuker M. Structural analysis by energy dot plot of a large mRNA. J. Mol. Biol. 1993;233:261–269. - PubMed
    1. Tinoco I, Uhlenbeck OC, Levine MD. Estimation of secondary structure in ribonucleic acids. Nature. 1971;230:362–367. - PubMed
    1. Hofacker IL, Fontana W, Stadler PF, Bonhoeffer LS, Tacker M, Schuster P. Fast folding and comparison of RNA secondary structures. Monatsh. Chem. 1994;125:167–188.

Publication types