Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2025 Apr 11:2024.12.11.627843.
doi: 10.1101/2024.12.11.627843.

Analysis of natural structures and chemical mapping data reveals local stability compensation in RNA

Affiliations

Analysis of natural structures and chemical mapping data reveals local stability compensation in RNA

Robert L Cornwell-Arquitt et al. bioRxiv. .

Update in

Abstract

RNA molecules adopt complex structures that perform essential biological functions across all forms of life, making them promising candidates for therapeutic applications. However, our ability to design new RNA structures remains limited by an incomplete understanding of their folding principles. While global metrics such as the minimum free energy are widely used, they are at odds with naturally occurring structures and incompatible with established design rules. Here, we introduce local stability compensation (LSC), a principle that RNA folding is governed by the local balance between destabilizing loops and their stabilizing adjacent stems, challenging the focus on global energetic optimization. Analysis of over 100,000 RNA structures revealed that LSC signatures are particularly pronounced in bulges and their adjacent stems, with distinct patterns across different RNA families that align with their biological functions. To validate LSC experimentally, we systematically analyzed thousands of RNA variants using DMS chemical mapping. Our results demonstrate that stem folding, as measured by reactivity, correlates with LSC (R2 = 0.458 for hairpin loops) and that instabilities show no significant effect on folding for distal stems. These findings demonstrate that LSC can be a guiding principle for understanding RNA function and for the rational design of custom RNAs.

PubMed Disclaimer

Conflict of interest statement

Conflict of interest The authors declare no conflicts of interest.

Figures

Figure 1.
Figure 1.. Local stability compensation in bpRNA-1m90.
(A) Hairpin, (B) bulge, and (C) internal loop ΔGs from refolded bpRNA-1m90 structures are binned and the corresponding stem ΔGs are plotted as violins with linear regressions drawn for the median (blue) and 5% quantile (red). (D) bpRNA-1m90 net free energy kernel density estimates for hairpin loops and (E) bulges and (F) internal loops belonging to different RNA types.
Figure 2.
Figure 2.. Net ΔG distributions are distinct for RNA types and particular RNA loops.
(A) C4 antisense RNA example secondary structure and (B) hairpin net ΔG distribution (H1 in blue, H2 in red) and other hairpins in bpRNA-1m90 (grey). (C) TwoAYGGAY RNA example secondary structure and (D) net ΔG distribution (H1 in blue, H2 in red) and other hairpins in bpRNA-1m90 (grey). (E) Example tRNA secondary structure (bpRNA_RFAM_1424) and (F) two dimensional box plots for the D-loop, T-loop, and anticodon loop in red, green, and blue respectively. (G) pre-miRNA example secondary structure and (H) net ΔG distribution (blue) and other hairpins in bpRNA-1m90 (grey).
Figure 3.
Figure 3.. Differences in net ΔG by phylogeny for select RNA types.
(A) plant pre-miRNA hairpin net ΔGs (green) and metazoan (grey) net ΔGs before folding with constraints and (B) after folding with constraints. (C) plant pre-miRNA bulge ΔGs (green) and metazoan (grey) net ΔGs after folding with constraints. (D) plant pre-miRNA internal loop ΔGs (green) and metazoan (grey) net ΔGs after folding with constraints. (E) Thermophilic (orange) hairpin net ΔGs and non-thermophilic net ΔGs (blue). (F) Thermophilic (orange) bulge net ΔGs and non-thermophilic net ΔGs (blue). (G) Thermophilic (orange) internal loop net ΔGs and non-thermophilic net ΔGs (blue).
Figure 4.
Figure 4.. Library design and local structure AUROC heatmaps.
(A) Designed structure examples for hairpins, bulges, internal loops, (top, middle, bottom respectively), dashed lines indicate the variable region. Average AUROC and structure count heatmaps for cells of (B) hairpin, (C) bulge, and (D) internal loop ΔG by adjacent stem ΔG, yellow cells show low AUROC or poor agreement of DMS data with the designed structure, while dark blue cells show high AUROC with strong agreement, and black cells contain no data. Count heatmaps show the number of RNAs contributing to each cell with darker red coloring indicating a higher number of RNAs.
Figure 5.
Figure 5.. Local vs distal stem reactivity correlations.
(A) Example structure from the hairpin library with adjacent and distal stems to the hairpin loop labeled and local stem base pair indices labeled. (B) Linear regression between hairpin net ΔG and log average local stem reactivity, compared to average distal (C) stem reactivity. (D) Average net ΔG bin reactivity per position away from the hairpin loop. (E) Example structure from the bulge library with adjacent and distal stems to the bulge labeled and local stem base pair indices labeled. (F) Linear regression between bulge net ΔG and log average local stem reactivity, compared to average distal (G) stem reactivity. (H) Average net ΔG bin reactivity per position away from the bulge, where an index is present on both stems, the average is used. (I) Example structure from the internal loop library with adjacent and distal stems to the internal loop labeled and local stem base pair indices labeled. (J) Linear regression between internal loop net ΔG and log average local stem reactivity, compared to average distal (K) stem reactivity. (L) Average net ΔG bin reactivity per position away from the internal loop, where an index is present on both stems, the average is used.
Figure 6.
Figure 6.. Local structure fidelity curve fitting.
(A) Hairpin net ΔG vs average local AUROC, points indicated by black, magenta, and cyan are shown in (B), raw reactivity values color-coded by nucleotide for three example designed hairpins, the boxed region is involved in AUROC calculation. (C,D,E) Hill equation curves fit to net ΔG bins (0.2 kcal/mol) versus the mean AUROC for each bin.

References

    1. Fabian M.R., Sonenberg N. and Filipowicz W. (2010) Regulation of mRNA Translation and Stability by microRNAs. Annu. Rev. Biochem., 79, 351–379. - PubMed
    1. Mortimer S.A., Kidwell M.A. and Doudna J.A. (2014) Insights into RNA structure and function from genome-wide studies. Nat. Rev. Genet., 15, 469–479. - PubMed
    1. Statello L., Guo C.-J., Chen L.-L. and Huarte M. (2021) Gene regulation by long non-coding RNAs and its biological functions. Nat. Rev. Mol. Cell Biol., 22, 96–118. - PMC - PubMed
    1. Delebecque C.J., Lindner A.B., Silver P.A. and Aldaye F.A. (2011) Organization of Intracellular Reactions with Rationally Designed RNA Assemblies. Science, 333, 470–474. - PubMed
    1. Sachdeva G., Garg A., Godding D., Way J.C. and Silver P.A. (2014) In vivo co-localization of enzymes on RNA scaffolds increases metabolic production in a geometrically dependent manner. Nucleic Acids Res., 42, 9493–9503. - PMC - PubMed

Publication types

LinkOut - more resources