Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug 17;118(33):e2109085118.
doi: 10.1073/pnas.2109085118.

High-throughput dissection of the thermodynamic and conformational properties of a ubiquitous class of RNA tertiary contact motifs

Affiliations

High-throughput dissection of the thermodynamic and conformational properties of a ubiquitous class of RNA tertiary contact motifs

Steve L Bonilla et al. Proc Natl Acad Sci U S A. .

Abstract

Despite RNA's diverse secondary and tertiary structures and its complex conformational changes, nature utilizes a limited set of structural "motifs"-helices, junctions, and tertiary contact modules-to build diverse functional RNAs. Thus, in-depth descriptions of a relatively small universe of RNA motifs may lead to predictive models of RNA tertiary conformational landscapes. Motifs may have different properties depending on sequence and secondary structure, giving rise to subclasses that expand the universe of RNA building blocks. Yet we know very little about motif subclasses, given the challenges in mapping conformational properties in high throughput. Previously, we used "RNA on a massively parallel array" (RNA-MaP), a quantitative, high-throughput technique, to study thousands of helices and two-way junctions. Here, we adapt RNA-MaP to study the thermodynamic and conformational properties of tetraloop/tetraloop receptor (TL/TLR) tertiary contact motifs, analyzing 1,493 TLR sequences from different classes. Clustering analyses revealed variability in TL specificity, stability, and conformational behavior. Nevertheless, natural GAAA/11ntR TL/TLRs, while varying in tertiary stability by ∼2.5 kcal/mol, exhibited conserved TL specificity and conformational properties. Thus, RNAs may tune stability without altering the overall structure of these TL/TLRs. Furthermore, their stability correlated with natural frequency, suggesting thermodynamics as the dominant selection pressure. In contrast, other TL/TLRs displayed heterogenous conformational behavior and appear to not be under strong thermodynamic selection. Our results build toward a generalizable model of RNA-folding thermodynamics based on the properties of isolated motifs, and our characterized TL/TLR library can be used to engineer RNAs with predictable thermodynamic and conformational behavior.

Keywords: RNA folding; RNA nanotechnology; RNA structure; high-throughput biochemistry; tertiary motifs.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interest.

Figures

Fig. 1.
Fig. 1.
Modular RNA structure and energetics. (A) Tertiary structure of the canonical GAAA/11ntR TL/TLR. Crystal structures of GAAA/11ntR from RNase P RNA (Top Left; Protein Data Bank [PDB] 1NBS), P4-P6 domain (Top Right; PDB 1GID), and Azoarcus group I intron (Bottom; PDB 1ZZN), superimposed using the PyMOL molecular graphics system (version 2.0 Schrödinger, LLC). (B) Reconstitution Model of RNA folding (13). Folding free energy (∆Gfold) is decomposed into an internal conformational search (∆Galign) and the formation of stabilizing tertiary interactions (∆Gtert). ∆Galign can be further decomposed into a contribution from electrostatics and ion interactions (∆G+/-) and a contribution from the conformational properties of helices, junctions, and tertiary contact motifs comprising the RNA (∆GHJH; HJH stands for helix-junction-helix). Thus, in this model, the conformational ensemble of the motifs is dictated by properties intrinsic to the motif such as sequence and topology, while environmental factors such as ions affect the relative stabilities of the conformations within the ensemble.
Fig. 2.
Fig. 2.
High-throughput studies of RNA structural motifs. (A) Tertiary structure of tectoRNA dimer. Structure shown is that of homodimer with two identical GAAA/11ntRwt TL/TLRs (49). One of the GAAA/11ntRwt is replaced by an orthogonal GGAA/R1 TL/TLR in our studies to ensure heterodimer formation (28). (B) Pathways for the formation of bound tectoRNA dimer. After the formation of one of the tertiary contacts, an internal conformational search aligns the second TL to its cognate TLR. (C) High-throughput RNA-MaP platform used to characterize RNA structural motifs. Tens of thousands of “chip” piece sequences are transcribed from sequenced DNA on an Illumina chip (37). Stars represent fluorescent probes. Junctions and/or short helical elements are inserted into the helical domain of the chip piece (26). (D) Sequence, secondary, and tertiary structure of representative TL/TLRs belonging to different types. Tertiary structures are from GCGA TL docking into CUG-CAG (Protein Data Bank [PDB] 3IGI), GGAA/11ntRwt (PDB 1GID), and GAAA/Vc2wt (PDB 3IRW). The GAAA/C7.2wt structure was modeled computationally (50). (E) Sequence of IC3wt TLR motif. (F) Schematic for characterization of the TL/TLR library via RNA-MaP. Varying TLR sequences were inserted in place of the 11ntRwt sequence in the chip piece. Scaffold variants for the insertion of each TLR sequence were generated by altering the length, sequence, and/or secondary structure of the helical domain of the chip piece. (G) Examples of scaffolds of different lengths. The length of the chip piece is defined as the number of bp between the GGAA TL and the TLR, including canonical and noncanonical (i.e., mismatches) bp. Residues without opposing residues in complementary strand (e.g., bulges) do not contribute to the reported length. (H–J) Distribution of TLR sequences in the library. Natural 11ntR and 12ntR TLR variants were obtained from databases and published sequence alignments of functional RNAs (–42).
Fig. 3.
Fig. 3.
Thermodynamic fingerprints report on TL/TLR 3D conformational properties. (A) Structural changes to the scaffold affect ∆Gbind by altering the internal conformational search to align the TL/TLRs. The figure shows the schematic comparing the internal conformational search undergone by a short scaffold (9 bp, Top) and a long scaffold (11 bp, Bottom). The large gray circle represents the conformational space explored by a particular scaffold during the internal conformational search, and the smaller darker gray area represents the part of this space where the TL and TLR are “correctly” aligned for the formation of the TL/TLR. In this simplified schematic (with equal probabilities throughout the 2D gray space), the fraction of conformational space, leading to productive formation of the TL/TLR, is greater for the 9 bp than for the 11-bp scaffold. Thus, the 9-bp scaffold has a greater probability of aligning the TL/TLR, resulting in higher affinity compared to the 11-bp scaffold. In actual cases, the relative occupancy across the available conformations also affects the probability of binding with each scaffold. (B) Measurements of ∆Gbind across distinct scaffolds produce thermodynamic fingerprints that report on the alignment properties of the TL/TLRs. Each datapoint corresponds to a distinct scaffold varying in length, secondary structure, and/or sequence. Depending on its conformational properties, each scaffold has a distinct probability of aligning the TL/TLRs, resulting in the distinct ∆Gbind values. (C) Differences between TL/TLR variants can affect ∆Gbind by affecting the strength of the tertiary interactions and/or the alignment preferences. Cartoons represent three distinct TL/TLR sequences embedded within the same scaffold. TL/TLR1 and TL/TLR2 have the same alignment preferences but differ in the strength of their tertiary interactions. In this case, the difference in stability between the TL/TLRs is independent of the structural context in which the comparison is made. TL/TLR3 has different alignment preferences compared to the other two TL/TLRs. In this case, the difference in stability between TL/TL3 and the other TL/TLRs depends on the structural context in which the comparison is made. (D) TL/TLRs with the same alignment preferences produce thermodynamic fingerprints with the same shape. The strength differences between TL/TLR1 and TL/TLR2 are expressed by a constant offset in their stability (∆∆Gbind). (E) TL/TLRs that differ in alignment preferences produce thermodynamic fingerprints with different shapes, such that ∆∆Gbind depends on the scaffold identity.
Fig. 4.
Fig. 4.
Thermodynamic and conformational properties of representative TL/TLRs across scaffolds. (A) ∆Gbind for TLRs binding to GAAA TL across scaffolds. Dashed gray line indicates the threshold of –7.1 kcal/mol. Open symbols are ∆Gbind above this threshold. Dashed colored lines serve as guides for the thermodynamic fingerprints. To compare the fingerprints, they were superimposed to that of GAAA/11ntRwt by shifting them by a constant ∆Gbind while minimizing rmsd. This minimal rmsd is shown as is the correlation constant, r. Values above threshold were not used to calculate the correlation coefficient (r). ∆Gavg is the median. Values of r for TLRs with ∆Gavg > –7.1 kcal/mol are not reliable and therefore were not considered. Solution conditions: 89 mM Tris-Borate, pH 8.0, 30 mM MgCl2, 0.01 mg/mL yeast tRNA, and 0.01% Tween 20. Measurements at different ionic conditions are provided in SI Appendix, Fig. S3. (B) ∆∆Gbind relative to GAAA/11ntRwt, calculated as the differences in ∆Gbind showed in A. Open symbols correspond values below stability threshold and are lower limits. (C) ∆Gbind for GAAA versus GUAA TLs for each representative TLR across its scaffolds. Dashed gray lines indicate a threshold of –7.1 kcal/mol, as above. ∆∆Gavg is the average difference in stability between binding to GUAA and GAAA in units of kcal/mol (e.g., on average, constructs with GAAA/11ntRwt are 3.55 kcal/mol more stable than constructs with GUAA/11ntRwt).
Fig. 5.
Fig. 5.
Overview of TLR thermodynamic behavior. (A) Secondary structure and sequence of five scaffolds common to the entire TLR library. (B) Distribution of ∆Gavg for TLR sequences across five common scaffolds in A and two TL sequences. TLRs with ∆Gavg > –7.1 kcal/mol (nweak = 387) were not included in the clustering analysis. (C) Fraction of the variance represented by each of the PCs. (D) Hierarchical clustering of the first two PCs using Ward’s method implemented in Python. (E) Hierarchical clustering of TLRs across five common scaffolds and TL sequences GAAA and GUAA. Prior to clustering, ∆Gbind values for each of the TLRs was scaled by ∆Gmean to produce ∆∆Grel values. The ∆∆Grel values were used in a PCA (C above), and the first two PCs were hierarchically clustered to generate the clustergrams shown. (F) Distribution of TLR mutants according to type (i.e., 11ntR, IC3, and C7.2) across clusters. (G and H) Sequence motifs representing variability among 11ntR (G) and IC3 (H) variants in the individual subgroups revealed by hierarchical clustering. Sequence motifs were generated using Weblogo (51). Subgroups 1 and 2 were mostly composed of 11ntR variants and subgroup 5 mostly of IC3 variants. (I) PCA loading plot showing the influence of the sequence of the binding TL and the secondary structure of the scaffold on the first two PCs. (J) Pseudofingerprints showing the average behavior of the TLR subgroups across TL sequence and scaffold secondary structure.
Fig. 6.
Fig. 6.
Thermodynamic analysis of natural GAAA/11ntR variants. (A) Average relative stability and Pearson correlation coefficients of the fingerprints. ∆∆Gavg values are relative to GAAA/11ntRwt. Larger symbols are for variants that were compared across 50 scaffolds and smaller symbols for those compared across five scaffolds. The red dashed line represents the decrease in correlation expected for theoretically identical fingerprints from error as stability decreases (SI Appendix, Fig. S2). (B and C) Correlation between relative stability in 30 mM MgCl2 (B) or in 5 mM MgCl2/150 mM KCl (C) and the frequency of TLR variants in natural RNA sequences.

References

    1. Warf M. B., Berglund J. A., Role of RNA structure in regulating pre-mRNA splicing. Trends Biochem. Sci. 35, 169–178 (2010). - PMC - PubMed
    1. Rinn J. L., Chang H. Y., Genome regulation by long noncoding RNAs. Annu. Rev. Biochem. 81, 145–166 (2012). - PMC - PubMed
    1. Mercer T. R., Mattick J. S., Structure and function of long noncoding RNAs in epigenetic regulation. Nat. Struct. Mol. Biol. 20, 300–307 (2013). - PubMed
    1. Cech T. R., Steitz J. A., The noncoding RNA revolution-trashing old rules to forge new ones. Cell 157, 77–94 (2014). - PubMed
    1. Herschlag D., Bonilla S., Bisaria N., The story of RNA folding, as told in epochs. Cold Spring Harb. Perspect. Biol. 10, a032433 (2018). - PMC - PubMed

Publication types

LinkOut - more resources