Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Mar 2:10:e63431.
doi: 10.7554/eLife.63431.

tRNA sequences can assemble into a replicator

Affiliations

tRNA sequences can assemble into a replicator

Alexandra Kühnlein et al. Elife. .

Abstract

Can replication and translation emerge in a single mechanism via self-assembly? The key molecule, transfer RNA (tRNA), is one of the most ancient molecules and contains the genetic code. Our experiments show how a pool of oligonucleotides, adapted with minor mutations from tRNA, spontaneously formed molecular assemblies and replicated information autonomously using only reversible hybridization under thermal oscillations. The pool of cross-complementary hairpins self-selected by agglomeration and sedimentation. The metastable DNA hairpins bound to a template and then interconnected by hybridization. Thermal oscillations separated replicates from their templates and drove an exponential, cross-catalytic replication. The molecular assembly could encode and replicate binary sequences with a replication fidelity corresponding to 85-90 % per nucleotide. The replication by a self-assembly of tRNA-like sequences suggests that early forms of tRNA could have been involved in molecular replication. This would link the evolution of translation to a mechanism of molecular replication.

Keywords: Origin of tRNA; Self-assembly; computational biology; cross-catalytic replication; molecular biophysics; none; structural biology; systems biology.

Plain language summary

The genetic code stored within DNA contains the instructions for manufacturing all the proteins organisms need to develop, grow and survive. This requires molecular machines that ‘transcribe’ regions of the genetic code into RNA molecules which are then ‘translated’ into the string of amino acids that form the final protein. However, these molecular machines and other proteins are also needed to replicate and synthesize the sequences stored in DNA. This presents evolutionary biologists with a ‘chicken-and-egg’ situation: which came first, the DNA sequences needed to manufacture proteins or the proteins needed to transcribe and translate DNA? Understanding the order in which DNA replication and protein translation evolved is challenging as these processes are tightly intertwined in modern-day species. One theory, known as the ‘RNA world hypothesis’, suggests that all life on Earth began with a single RNA molecule that was able to make copies of itself, as DNA does today. To investigate this hypothesis, Kühnlein, Lanzmich and Braun studied a molecule called transfer RNA (or tRNA for short) which is responsible for translating RNA into proteins. tRNA is assumed to be one of the earliest evolved molecules in biology. Yet, why it was present in early life forms before it was needed for translation still remained somewhat of a mystery. To gain a better understanding of tRNA’s role early in evolution, Kühnlein, Lanzmich and Braun made small changes to its genetic code and then carried out tests on these tRNA-like sequences. The experiments showed these ‘early’ forms of tRNA can actually self-assemble into a molecule which is capable of replicating the information stored in its sequence. It suggests early forms of tRNA could have been involved in replication before modern tRNA developed its role in protein translation. With these experiments, Kühnlein, Lanzmich and Braun have identified a possible evolutionary link between DNA replication and protein translation, suggesting the two processes emerged through one shared pathway: tRNA. This deepens our understanding about the origins of early life, while taking biochemists one step closer to their distant goal of recreating self-replicating molecular machines in the laboratory.

PubMed Disclaimer

Conflict of interest statement

AK, SL, DB No competing interests declared

Figures

Figure 1.
Figure 1.. Heat-driven replication by hybridization using hairpin structures inspired from transfer RNA.
(a) Transfer RNA folds into a double-hairpin conformation upon very few base substitutions. In that configuration, the 3’-terminal amino acid binding site (green) is close to the anticodon (blue) and a double hairpin structure forms. A set of pairwise complementary double hairpins can encode and replicate sequences of information. A binary code implemented in the position of the anti-codon, the information domain, allows to encode and replicate binary sequences (red vs blue). Each strand (82-84 nt) comprises two hairpin loops (gray) and an interjacent unpaired information domain of 15 nt length (blue/red, here: 0D). The displayed structure of eight strands shows replication of a template corresponding to the binary code 0010. Note, that no covalent linkage is involved in the process. (b) Replication is driven by thermal oscillations in four steps: (0) The hairpins are activated into their closed conformation by fast cooling indicated by triangles. (1) Strands with matching information domain bind to the template. (2) Fluctuations in the bound strands’ hairpins facilitate the hybridization of neighboring strands. (3) Subsequent heating splits replica from template, while keeping the longer hairpin sequences connected, freeing both as templates for the next cycle.
Figure 1—figure supplement 1.
Figure 1—figure supplement 1.. Secondary structure predictions and free energy calculations for the replicator in DNA and RNA using NUPACK.
Combinations of sets of four complementary double hairpins are displayed for (a) DNA and (b) RNA at 25 °C, 1 M Na+ and 0.625 mM per strand. For (b) every 'T' in the DNA sequence is substituted by a 'U'. Due to computational limitations the only four strands could be calculated at a time. The predicted structures look as expected, forming the backbone (top and bottom from center) and the information domain bonds (left and right from center), which are flanked by the closed hairpins. The DNA (a) and RNA (b) version are identical. Also, there is no difference in structure or free energy between information domain '0' and information domain '1'. The calculated free energies are given below each structure and are averaged over each complete set.
Figure 2.
Figure 2.. Assembly of different subsets of the cross-replicating system of strands observed by native gel electrophoresis.
Samples contained strands at 200 nM concentration each and were slowly annealed as described in Materials and methods. Lane contents are indicated at the top of each lane. Comparison of different lanes allowed for the attribution of bands to complexes. Complexes incorporating all present strands are marked (•). The red channel shows the intensity 0A-Cy5, the cyan channel shows SYBR Green I fluorescence. Single information domain bonds (lane 2, 7) break during gel electrophoresis.
Figure 2—figure supplement 1.
Figure 2—figure supplement 1.. Gel mobilities of different complexes compared to linear dsDNA.
(a) Ferguson plot of differently sized complexes compared to linear dsDNA (gray lines) of comparable mass. The slopes of log(mobility) vs. gel concentration are proportional to the friction constants of the molecules (Rodbard and Chrambach, 1970). (b) Linear dsDNA shows significantly lower friction constants than any of the complexes of at least two molecules. Symbolic complexes are indicated next to the data points. This is due to the branched structure of the complexes and conforms with the suggested assembly geometry. Idealized tertiary structures of complexes 0A0B and 0A0B:0¯A0¯B, and 100 bp dsDNA are given as size reference.
Figure 3.
Figure 3.. Spontaneous self-assembly and sedimentation of matching hairpins.
(a) In a simple, sealed microfluidic chamber (Figure 3—figure supplement 2), the hairpin strands can self-assemble into agglomerates and sediment on a timescale of hours. The sample was initially heated to 95 °C for 10 s to ensure an unbound initial state, then rapidly (within 30 s) cooled to 25 °C, where self-assembly and sedimentation occured. Note, that agglomeration and sedimentation only occured if all eight matching hairpins were provided (top two rows) but not in the case of a knockout (-1D, bottom row). For quantification, the bulk and sediment intensities were normalized by the first frame after heating. Samples contained strands at total concentration of 5 µM, about threefold higher than in Figure 2 and the following replication experiments. (b) Time traces of concentration increase for sediment and bulk of different configurations, same examples as shown in a. The time traces of all further knockout permutations are shown in Figure 3—figure supplement 1b. (c) Final concentration increase of sediment, relative to first frame after heating, for all configurations. The final values (N≥3) for c/c0 are retrieved from fitting the time traces. For the full set of complementary hairpins, self-assembly and sedimentation is most pronounced.
Figure 3—figure supplement 1.
Figure 3—figure supplement 1.. Extended data on self-assembly and sedimentation.
(a) Original fluorescence microscopy images (left) and cuttings (right) as shown in Figure 3a. To calculate c/c0 time traces (Figure 3b), (c) the image stacks were stabilized and the whole flank was integrated, the cuttings serve barely illustrative purpose. (b) Time traces of concentration increase for sediment and bulk of all different configurations, including all knockout permutations. (c) Final concentration increase of sediment, relative to first frame after heating, for all configurations including labeling and random-sequence controls (bottom three rows). The sedimentation of the full system is independent of the label and its position. Random sequences do not show agglomeration nor sedimentation. (d) Concentration dependence of the sedimentation kinetics. The characteristic sedimentation time is determined by fitting the initial increase of c/c0 over time with an exponential function. The concentration dependence can be fitted with a power law function y(c) = a1+a2*cp, which yields an exponent of p = –1.06, fit with weighted error bars displayed.
Figure 3—figure supplement 2.
Figure 3—figure supplement 2.. Sketch of microfluidic chamber.
(a) One chamber contained five independent pockets of 20 µL capacity, which were accessed through a 0.5 mm channel. After inserting the sample, the pockets were sealed with Parafilm. An additional pocket contained a temperature sensor in water, which was used for the PID temperature regulation. (b) Image of one chamber. The chamber design is cut out of a 500 µm FEP foil and fit between two plane sapphires. Three Peltier elements are attached to the backside of the chamber which allow quick heating and cooling of the sample at rates > 2 K/s. This package is then screwed onto an aluminum base by a steel top frame. The sapphire allowed full visual access to the sample.
Figure 4.
Figure 4.. Isothermal template assisted product formation.
(a) Schematic representation of the templating step at constant temperature. (b) Kinetics of tetramer formation at 45 °C with different starting concentrations of template (c¯0). Data includes concentrations of all complexes containing tetramers. (c) Templating observed over a broad temperature range. Large circles show data for reactions at c¯0=120 nM of template 0¯A0¯B0¯C0¯D, small circles show the spontaneous formation (c¯0=0). The latter increases at T > 45 °C. Above 48 °C, binding of monomers to the template gets weaker, slowing down the rate of template assisted formation. This is consistent with the melting temperatures of the information domains (see Figure 4—figure supplement 1).
Figure 4—figure supplement 1.
Figure 4—figure supplement 1.. Determination of thermal oscillation temperatures.
(a) Melting temperatures of complementary information domains (red, strands 0A+0¯A, determined via quenching of 0A-Cy5) and backbone domains (blue, strands 0A+0B, determined via UV absorption). Dashed lines show simulation data. Dotted gray lines depict simulated melting curves for 0B+0C, 0C+0D, 0D+0A. The hatched area indicates the thermal oscillation range of 45–67 °C. (b) Simulated equilibrium concentrations in a reaction mixture containing 200 nM of each of 0A, 0B, 0C, 0¯A, 0¯B, 0¯C. The peak temperature of the thermal oscillations mostly melts the bond between the trimers 0A0B0C and 0¯A0¯B0¯C but is below the melting transition of 0A0B0C or 0¯A0¯B0¯C (yellow, green).
Figure 5.
Figure 5.. Exponential amplification of a restricted sequence subset with thermal oscillations.
(a) Amplification time traces for concentration c for sequence 0000 during the first four to six cycles (Tpeak = 67 °C) for template (0¯A0¯B0¯C0¯D) concentrations c¯0 from 0 to 45 nM. The data was fitted using the cross-catalytic model from equation (1). Strands 0A, 0¯A, 0B, …, 0¯D were used at 200 nM concentration each. Data points show concentrations of complexes 4:4. (b) Initial reaction velocity as a function of initial template concentration c¯0. The data points show good agreement with the line calculated from the fits in panel a. (c) Amplification proceeded for peak temperatures below 74 °C. Above, backbone duplexes start to melt, and the complexes are no longer stable. The base temperature was 45 °C, reactions initially contained 30 nM of complex 0¯A0¯B0¯C0¯D as template. (d) Serial transfer experiment. The reaction containing strands 0A, 0¯A, 0B, …, 0¯D (black circles) survived successive dilution by a factor of 1/2 every three cycles at almost constant concentration. In contrast, a reaction with the same amount of template 0¯A0¯B0¯C0¯D, but lacking monomers 0¯AD, fades out (open circles). The solid line shows the model from Equation 1.
Figure 6.
Figure 6.. Sequence replication with thermal oscillations and fidelity check by forcing mutations from '0' to '1' at different locations.
(a) Replication of sequence 0A0B0C0D. Reactions were started with 15 nM initial template 0¯A0¯B0¯C0¯D. All strands (0A, 0¯A, 1A, …, 1¯D) were present at 100 nM each. Native-PAGE results comparing the reaction of all 16 strands ('++++') with the reaction lacking strand 0D ('+++−'). The defective set '+++−' mostly produced 3:4 complexes instead of 4:4 complexes (see schematics on the right). The overall yield of tetramer-containing complexes was greatly reduced. As size reference, the marker lane contained complexes 0A0B0C0D, 0A0B0C, 0A0B, and monomers 0A. The complete gel is presented in Figure 6—figure supplement 1. (b) Product concentration over time for the complete sequence network (yellow) and three defective sets with missing strands. Data was integrated by quantitative image analysis from electrophoresis gels using covalent markers on the 0A-strand counting all product complexes containing tetramers. Mutations of information in the product from '0' to '1' were induced by defective reactions that lacked strands 0D ('+++−'), 0Cand0D ('++−−'), and 0Band0D ('+−+−'). All reactions were initiated with 15 nM of 0¯A0¯B0¯C0¯D. The solid line shows data from reaction '++++' without template. (c) End point comparison of reactions with templates 0¯A0¯B0¯C0¯D (panels a, b), 0¯A1¯B0¯C1¯D, and 0¯A0¯B1¯C1¯D after six cycles. Horizontal lines indicate averages of the three template sequences. A single missing strand reduced product yield to about 40 %, two missing strands to 15–20 %.
Figure 6—figure supplement 1.
Figure 6—figure supplement 1.. Extended electrophoresis gel image data.
(a) Assembly yields of different strands bound to tetramer template 0A0B0C0D. Lane contents are given above each lane, complexes are identified on the right. Samples were annealed as for Figure 2 and described in Materials and methods. The red channel shows the intensity of 0A-Cy5, the cyan channel shows SYBR Green I. Single or unconnected strands (lanes 2, 4) detached from the template during electrophoresis. Complexes of size 2:4, 3:4, and 4:4 were resolved. (b) Full PAGE result from Figure 6a. The gel shows replication of template 0¯A0¯B0¯C0¯D in a reaction containing all strands ('++++', left) with a reaction lacking strand 0D ('+++−', right). The marker lane contained complexes 0A0B0C0D, 0A0B0C, 0A0B, and monomer 0A, and was prepared as described in Materials and methods.
Figure 7.
Figure 7.. Sequence space analysis of information domain binding.
The binding energies quantify the ability of the replication mechanism to discriminate nucleotide mutations. (a) Cumulative free energy distributions of information domain duplexes 0:0¯ (red), 1:1¯ (light red), as well as all 0:0¯* and 1:1¯* with up to three point mutations in 0¯* and 1¯* (yellow, green, blue). 99 % of duplexes 0:0¯* with three point mutations have free energies ΔG ≥ -12.5 kcal/mol (dashed line), significantly weaker than that of 0:0¯ (ΔG = -15.4 kcal/mol). (b) Melting curves of information domain duplexes 0:0¯ (red), 1:1¯ (light red), and the two duplexes 0:0¯* indicated by arrows in panel a. Even the 0:0¯* duplex (i) at the low end of the ΔG distribution has a melting temperature of about 10 °C below that of 0:0¯. This difference in melting temperature destabilizes binding of the information domain and causes the replication mechanism to reject these sequences in the thermal oscillation regime between Tbase = 45 °C and Tpeak = 67 °C (gray box).
Figure 7—figure supplement 1.
Figure 7—figure supplement 1.. Information domain binding energy statistics split into information domains containing terminal mutations and those with internal mutations only.
Cumulative free energy distribution of duplexes 0:0¯* (as in Figure 7), split into information domains 0¯* containing mutations at terminal bases (thin lines) and those with internal mutations only (thick lines). The dashed line shows combined data for two to three internal mutations. The logarithmic plot shows the fast drop and thus small influence by special mutations in the energy landscape.

References

    1. Adamski P, Eleveld M, Sood A, Kun Ádám, Szilágyi A, Czárán T, Szathmáry E, Otto S. From self-replication to replicator systems en route to de novo life. Nature Reviews Chemistry. 2020;4:386–403. doi: 10.1038/s41570-020-0196-x. - DOI - PubMed
    1. Altay Y, Tezcan M, Otto S. Emergence of a new Self-Replicator from a dynamic combinatorial library requires a specific Pre-Existing replicator. Journal of the American Chemical Society. 2017;139:13612–13615. doi: 10.1021/jacs.7b07346. - DOI - PMC - PubMed
    1. Ashkenasy G, Hermans TM, Otto S, Taylor AF. Systems chemistry. Chemical Society Reviews. 2017;46:2543–2554. doi: 10.1039/C7CS00117G. - DOI - PubMed
    1. Attwater J, Wochner A, Holliger P. In-ice evolution of RNA polymerase ribozyme activity. Nature Chemistry. 2013;5:1011–1018. doi: 10.1038/nchem.1781. - DOI - PMC - PubMed
    1. Ball R, Brindley J. Hydrogen peroxide thermochemical oscillator as driver for primordial RNA replication. Journal of the Royal Society Interface. 2014;11:20131052. doi: 10.1098/rsif.2013.1052. - DOI - PMC - PubMed

Publication types

LinkOut - more resources