Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2021 Mar 30:2021.03.29.437587.
doi: 10.1101/2021.03.29.437587.

Combinatorial optimization of mRNA structure, stability, and translation for RNA-based therapeutics

Affiliations

Combinatorial optimization of mRNA structure, stability, and translation for RNA-based therapeutics

Kathrin Leppek et al. bioRxiv. .

Update in

  • Combinatorial optimization of mRNA structure, stability, and translation for RNA-based therapeutics.
    Leppek K, Byeon GW, Kladwang W, Wayment-Steele HK, Kerr CH, Xu AF, Kim DS, Topkar VV, Choe C, Rothschild D, Tiu GC, Wellington-Oguri R, Fujii K, Sharma E, Watkins AM, Nicol JJ, Romano J, Tunguz B, Diaz F, Cai H, Guo P, Wu J, Meng F, Shi S, Participants E, Dormitzer PR, Solórzano A, Barna M, Das R. Leppek K, et al. Nat Commun. 2022 Mar 22;13(1):1536. doi: 10.1038/s41467-022-28776-w. Nat Commun. 2022. PMID: 35318324 Free PMC article.

Abstract

Therapeutic mRNAs and vaccines are being developed for a broad range of human diseases, including COVID-19. However, their optimization is hindered by mRNA instability and inefficient protein expression. Here, we describe design principles that overcome these barriers. We develop a new RNA sequencing-based platform called PERSIST-seq to systematically delineate in-cell mRNA stability, ribosome load, as well as in-solution stability of a library of diverse mRNAs. We find that, surprisingly, in-cell stability is a greater driver of protein output than high ribosome load. We further introduce a method called In-line-seq, applied to thousands of diverse RNAs, that reveals sequence and structure-based rules for mitigating hydrolytic degradation. Our findings show that "superfolder" mRNAs can be designed to improve both stability and expression that are further enhanced through pseudouridine nucleoside modification. Together, our study demonstrates simultaneous improvement of mRNA stability and protein expression and provides a computational-experimental platform for the enhancement of mRNA medicines.

PubMed Disclaimer

Conflict of interest statement

DECLARATION OF INTEREST

Stanford University has submitted provisional patent applications related to use of the Hoxa9 P4 stem-loop and the CoV2 5’ UTR, computational design of mRNAs, chemically modified nucleotides to stabilize RNA therapeutics, and PERSIST-seq.

Figures

Figure 1.
Figure 1.. PERSIST-seq overview and illustrative ribosome load insights.
(A) Overview of the mRNA optimization workflow. Literature mined and rationally designed 5’ and 3’ UTRs were combined with Eterna and algorithmically designed coding sequences. All sequences were then experimentally tested in parallel for in-solution and in-cell stability as well as ribosome load. The mRNA design included unique, 6–9 nt barcodes in the 3’ UTR for tag counting by short-read sequencing. (B) Experimental design for testing in-solution and in-cell stability and ribosome load in parallel. mRNAs were in vitro transcribed, 5’ capped, and polyadenylated in a pooled format before transfection into HEK293T cells or being subjected to in-solution degradation. Transfected cells were then harvested for sucrose gradient fractionation or in-cell degradation analysis. (C) Polysome trace from transfected HEK293T cells with 233-mRNA pool. (D) 5’ UTR variants display a higher variance in mean ribosome load per construct as determined from polysome sequencing. The formula for ribosome load is given. (E) Heatmaps from polysome profiles of mRNA designs selected from the top, middle, and bottom five mRNAs (by ribosome load) from each design category. (F) Secondary structure model of the SARS-CoV-2 5’ UTR. Introduced mutations and substitutions are highlighted. (G) Heatmaps of SARS-CoV-2 5’ UTR variants’ polysome profiles sorted by ribosome load.
Figure 2.
Figure 2.. In-cell RNA stability drives downstream protein expression levels.
(A) In-cell half-life of each mRNA design in HEK293T cells. (B) Higher polysome load correlates with decreased in-cell half-life. Correlation between in-cell half-life and mean ribosome load across the entire profile (left), monosome-to-free subunit ratio (center), or polysome-to-monosome ratio (right). (C) In-cell half-life and mean ribosome load for individual mRNA designs with varying UTRs. (D) Kinetic model for predicting protein expression from mRNA half-life and ribosome load. P(t) is protein quantity at time t; m0 is the mass of mRNA present at t=0; l is mRNA length; kt is translation rate; and km and kp are rates of mRNA and protein decay, respectively. (E) Protein expression predicted using the kinetic model in (D) on the basis of mRNA half-life and ribosome load. Top: predicted protein expression of each UTR variant; note closer similarity to in-cell half-life data than to ribosome load in (C). Bottom: predicted protein expression normalized by mRNA length (corresponding to transfecting equal masses of each mRNA). (F) Correlation of predicted protein expression and Nluc/Fluc activity in HEK293T cells. (G) In-solution half-life of various mRNA design variants. mRNA lifetimes are strongly dependent on mRNA length and designed structures, revealed by time courses of mRNA degradation under accelerated aging conditions (10 mM MgCl2, 50 mM Na-CHES, pH 10.0). (H) Nucleotide-resolution in vitro DMS mapping confirms large differences in structural accessibility between a highly structured JEV-HA-Nluc mRNA construct, “LinearDesign-1” and a highly unstructured construct “Yellowstone”. The 5’ and 3’ UTRs (hHBB) were kept constant between designs. (I) Nucleotide DMS accessibility mapped on to structures from DMS-directed structure prediction.
Figure 3.
Figure 3.. High-throughput in-line hydrolysis uncovers principles of in-solution RNA degradation.
(A) Eterna participants were asked to design 68-nucleotide RNA fragments maximizing sequence and structure diversity. 3030 constructs were characterized and probed using high-throughput in-line degradation (In-line-seq). (B) Nucleotide-resolution degradation of 2165 68-nt RNA sequences (filtered for signal quality), probed by In-line-seq, sorted by hierarchical clustering on degradation profiles. (C) Sequences span a diverse set of secondary structure motifs, revealing patterns in degradation based on both sequence (i.e., linkages ending at 3’ uridine are particularly reactive) and structure (symmetric internal loops, circled, have suppressed hydrolytic degradation compared to asymmetric internal loops). (D) The ridge regression model “DegScore” was trained to predict per-nucleotide degradation from sequence and loop assignment information. Coefficients with the largest magnitude corresponded to sequence identity immediately after the link, with U being most disfavored. (E) DegScore showed improved predictive power on mRNAs over two other metrics previously posited to predict RNA stability. (F) Introduction of pseudouridine (ψ) modifications stabilizes selected short RNAs at U nucleotides in both loop motifs and in fully unstructured RNAs. (G) Capillary electrophoresis characterization of fragmentation time courses of Nluc mRNA molecules designed with extensive structure (LinearDesign-1) and relatively less structure (Yellowstone), synthesized with standard nucleotides and with ψ modifications. The full-length mRNA band is indicated with a red asterisk. The Tetrahymena ribozyme P4-P6 domain RNA was included after degradation as a control. (H) Exponential fits of capillary electrophoresis measurements of intact RNA over ten time points confirm >3x difference between in-solution lifetimes of LinearDesign-1 and Yellowstone Nluc mRNAs. Inset: Calculated half-lives. Error bars represent standard deviations from bootstrapped exponential fits (n = 1000).
Figure 4.
Figure 4.. Integration of 5’/3’ UTRs, structure-optimized CDSs, and pseudouridine (ψ) together enhance mRNA stability and translational output.
(A) CDS and 5’/3’UTR combinations differentially impact protein synthesis. Six mRNA constructs were in vitro synthesized and luciferase activity was measured 6 or 24 hrs post-transfection. Inclusion of ψ was tested on two selected constructs. (B) Workflow for different approaches to design the CDS variants tested in (C). (C) Variations in CDS design facilitates high in-solution stability and differential protein expression. In vitro transcribed mRNAs (24 in total) were subjected to in-solution degradation or transfected into HEK293T cells for 6 and 24 hrs. In-solution half-lives and luciferase activity are normalized to the Nluc START reference construct. Predicted secondary structures are shown for select constructs with colors indicating DegScore at each nucleotide. Designs derived from LinearDesign solutions are marked with a purple triangle. (D) Predicted secondary structure overview of Ribotree_LinearDesign_degscoreall_1. Zoomed boxes indicate sequence optimizations and subsequent structural changes made by DegScore to the reference LinearDesign construct. (E) Increased in-solution half-life and enhanced luciferase expression at 24 hrs, but not 6 hrs, correlate with DegScore. (F) Schematic for testing the synergy between RNA modifications and mRNA design rules on downstream stability and protein output. mRNAs were in vitro synthesized with or without ψ and subjected to degradation conditions. Samples were collected overtime and the RNA was purified before being transfected into HEK293T cells. Luciferase activity was measured 24 hrs after transfection. (G) Luciferase activity of the reference Nluc sequence and DegScore-optimized CDS with or without ψ after being subjected to in-solution degradation.

References

    1. Kowalski P. S., Rudra A., Miao L. & Anderson D. G. Delivering the Messenger: Advances in Technologies for Therapeutic mRNA Delivery. Mol. Ther. 27, 710–728 (2019). - PMC - PubMed
    1. Pardi N., Hogan M. J., Porter F. W. & Weissman D. mRNA vaccines - a new era in vaccinology. Nat. Rev. Drug Discov. 17, 261–279 (2018). - PMC - PubMed
    1. Sahin U., Karikó K. & Türeci Ö. mRNA-based therapeutics--developing a new class of drugs. Nat. Rev. Drug Discov. 13, 759–780 (2014). - PubMed
    1. Jackson N. A. C., Kester K. E., Casimiro D., Gurunathan S. & DeRosa F. The promise of mRNA vaccines: a biotech and industrial perspective. npj Vaccines 5, 11 (2020). - PMC - PubMed
    1. Weng Y. et al. The challenge and prospect of mRNA therapeutics landscape. Biotechnol. Adv. 40, 107534 (2020). - PubMed

Publication types