Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Jan 9;46(1):314-323.
doi: 10.1093/nar/gkx1057.

Modeling RNA secondary structure folding ensembles using SHAPE mapping data

Affiliations

Modeling RNA secondary structure folding ensembles using SHAPE mapping data

Aleksandar Spasic et al. Nucleic Acids Res. .

Abstract

RNA secondary structure prediction is widely used for developing hypotheses about the structures of RNA sequences, and structure can provide insight about RNA function. The accuracy of structure prediction is known to be improved using experimental mapping data that provide information about the pairing status of single nucleotides, and these data can now be acquired for whole transcriptomes using high-throughput sequencing. Prior methods for using these experimental data focused on predicting structures for sequences assuming that they populate a single structure. Most RNAs populate multiple structures, however, where the ensemble of strands populates structures with different sets of canonical base pairs. The focus on modeling single structures has been a bottleneck for accurately modeling RNA structure. In this work, we introduce Rsample, an algorithm for using experimental data to predict more than one RNA structure for sequences that populate multiple structures at equilibrium. We demonstrate, using SHAPE mapping data, that we can accurately model RNA sequences that populate multiple structures, including the relative probabilities of those structures. This program is freely available as part of the RNAstructure software package.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Overview of Rsample and experimental data. (A) The Rsample procedure. (B) The three conformations of a designed FMN riboswitch, color coded with SHAPE reactivity from Cordero et al. (43). The observed SHAPE reactivities are the weighted means of the reactivity of each structure. In the natural mixture in solution, the reactivity of each structure cannot be observed. (C) Distribution of normalized SHAPE reactivities for unpaired nucleotides, nucleotides paired at helix ends, and nucleotides paired and not at helix ends. The distributions show significant overlap and peak at a reactivity of 0, although the distribution for unpaired nucleotides skews farther towards higher reactivities. Nucleotides at the ends of helices also have some skew to higher reactivities than nucleotides in the interior of helices.
Figure 2.
Figure 2.
Performance of methods for predicting the ratio of known conformations. Experimental ratios are compared to the predicted ratios for the method without using SHAPE restraints (No SHAPE), using our original method for including restraints (Pseudo ΔG) (21), and the method introduced here (Rsample). (A) The predicted centroid structures for the FMN riboswitch in the absence of ligand using Rsample. The three accepted structures are shown in Figure 1B. The remaining plots show the models for each sequence, FMN riboswitch (–FMN; B), FMN riboswitch (+FMN, C), HIV RRA (D), bistable sequence (E), 16S rRNA (F), ADD riboswitch (-adenine; G), ADD riboswitch (+adenine; H), and medloop (I). To illustrate the performance of the whole predicted ensemble predicted ratio for structures that were not close to any of the known conformations (‘other’) is also given.
Figure 3.
Figure 3.
Performance of our method (Rsample) compared to other methods for predicting single conformation. The results are given as geometric mean of sensitivity and positive predictive value. RNAprobing predictions for 16S rRNA, 23S rRNA and its average over all sequences are not included because RNAprobing does not handle sequences longer than 1, 000 nucleotides.

Similar articles

Cited by

References

    1. Cech T.R., Steitz J.A.. The noncoding RNA revolution-trashing old rules to forge new ones. Cell. 2014; 157:77–94. - PubMed
    1. Ban N., Nissen P., Hansen J., Moore P.B., Steitz T.A.. The complete atomic structure of the large ribosomal subunit at 2.4 Å resolution. Science. 2000; 289:905–920. - PubMed
    1. Wu L., Belasco J.G.. Let me count the ways: mechanisms of gene regulation by miRNAs and siRNAs. Mol. Cell. 2008; 29:1–7. - PubMed
    1. Matera A.G., Terns R.M., Terns M.P.. Non-coding RNAs: lessons from the small nuclear and small nucleolar RNAs. Nat. Rev. Mol. Cell Biol. 2007; 8:209–220. - PubMed
    1. Serganov A., Nudler E.. A decade of riboswitches. Cell. 2013; 152:17–24. - PMC - PubMed

Publication types