. 2004 Feb 27;32(4):1392-403.

doi: 10.1093/nar/gkh291. Print 2004.

Paradigms for computational nucleic acid design

Robert M Dirks¹, Milo Lin, Erik Winfree, Niles A Pierce

Affiliations

PMID: 14990744
PMCID: PMC390280
DOI: 10.1093/nar/gkh291

Paradigms for computational nucleic acid design

Robert M Dirks et al. Nucleic Acids Res. 2004.

. 2004 Feb 27;32(4):1392-403.

doi: 10.1093/nar/gkh291. Print 2004.

Authors

Robert M Dirks¹, Milo Lin, Erik Winfree, Niles A Pierce

Affiliation

¹ Chemistry Department, California Institute of Technology, Pasadena, CA 91125, USA.

PMID: 14990744
PMCID: PMC390280
DOI: 10.1093/nar/gkh291

Abstract

The design of DNA and RNA sequences is critical for many endeavors, from DNA nanotechnology, to PCR-based applications, to DNA hybridization arrays. Results in the literature rely on a wide variety of design criteria adapted to the particular requirements of each application. Using an extensively studied thermodynamic model, we perform a detailed study of several criteria for designing sequences intended to adopt a target secondary structure. We conclude that superior design methods should explicitly implement both a positive design paradigm (optimize affinity for the target structure) and a negative design paradigm (optimize specificity for the target structure). The commonly used approaches of sequence symmetry minimization and minimum free-energy satisfaction primarily implement negative design and can be strengthened by introducing a positive design component. Surprisingly, our findings hold for a wide range of secondary structures and are robust to modest perturbation of the thermodynamic parameters used for evaluating sequence quality, suggesting the feasibility and ongoing utility of a unified approach to nucleic acid design as parameter sets are refined further. Finally, we observe that designing for thermodynamic stability does not determine folding kinetics, emphasizing the opportunity for extending design criteria to target kinetic features of the energy landscape.

PubMed Disclaimer

Figures

**Figure 1**
(a) Feedback loop for evaluating nucleic acid sequence designs and methodologies. (b) Positive and negative design paradigms. Two sequences are evaluated using an empirical potential on both the desired target structure and an undesired structure. Using a positive design paradigm, sequence A would be selected since it exhibits a stronger affinity than sequence B for the target structure (i.e. lower ΔG). Using a negative design paradigm, sequence B would be selected since it exhibits specificity for the target structure while sequence A exhibits specificity for the undesired structure. To provide a common basis for comparison, ΔG = 0 for a strand with no base pairs. (c) Canonical loops of nucleic acid secondary structure: hairpin loops, stacked base pairs, a bulge loop, an interior loop and a multiloop. These loop structures are all nested (i.e. there are no crossing arcs in the corresponding polymer graph with the backbone drawn as a straight line). (d) A sample pseudoknot with base pairs a·f and c·h (with a < c) that fail to satisfy the nesting property a < c < h < f, yielding crossing arcs in the corresponding polymer graph.

**Figure 2**
RNA multiloop. (a) Histograms for 100 sequence designs based on probability of sampling the target graph, p(s∗). The color legend applies to all plots. (b) Histograms for the same 100 sequence designs based on average number of incorrect nucleotides, n(s∗). (c) Base-pairing probabilities P_i,j for the median sequence based on p(s∗). Square sizes correspond to P_i,j ≥ {0.5,0.05,0.005}, respectively. The target structure is identical to that obtained by optimizing probability (black) or the average number of incorrect nucleotides (not shown). (d) p(s∗) versus free energy, ΔG(s∗). Each dot corresponds to one of 100 sequences designed using each method. Each bold square corresponds to the median over the 100 sequences designed using each method. (e) p(s∗) versus median folding time, t(s∗), over 1000 kinetic trajectories starting from random coil initial conditions. Dots and squares are interpreted as in (d).

**Figure 3**
RNA model perturbation study. For the multiloop designs of Figure 2, the top-ranked sequence for each method based on p(s∗) is re-examined using 1000 randomized potential functions where every parameter is independently adjusted by an amount uniformly distributed on ±10%, ±20% or ±50%. The original probabilities are depicted as dashed lines.

**Figure 4**
RNA multiloop variations. Design performance based on (a) p(s∗) and (b) n(s∗) with stem α = (4,6,8) and single-stranded multiloop regions β = (0,2,4). Surfaces show the mean values plus and minus one standard deviation for 100 independently designed sequences. The results for optimizing average incorrect nucleotides (not shown) are nearly indistinguishable from those obtained by optimizing probability.

**Figure 5**
Large RNA multiloop. See caption for Figure 2a–c.

**Figure 6**
RNA pseudoknot. See caption for Figure 2a–c.

See this image and copyright information in PMC

Cited by

DNA tetrominoes: the construction of DNA nanostructures using self-organised heterogeneous deoxyribonucleic acids shapes.
Ong HS, Rahim MS, Firdaus-Raih M, Ramlan EI. Ong HS, et al. PLoS One. 2015 Aug 10;10(8):e0134520. doi: 10.1371/journal.pone.0134520. eCollection 2015. PLoS One. 2015. PMID: 26258940 Free PMC article.
Computational RNA secondary structure design: empirical complexity and improved methods.
Aguirre-Hernández R, Hoos HH, Condon A. Aguirre-Hernández R, et al. BMC Bioinformatics. 2007 Jan 31;8:34. doi: 10.1186/1471-2105-8-34. BMC Bioinformatics. 2007. PMID: 17266771 Free PMC article.
Computational design and experimental verification of pseudoknotted ribozymes.
Najeh S, Zandi K, Kharma N, Perreault J. Najeh S, et al. RNA. 2023 Jun;29(6):764-776. doi: 10.1261/rna.079148.122. Epub 2023 Mar 3. RNA. 2023. PMID: 36868786 Free PMC article.
Topological constraints in nucleic acid hybridization kinetics.
Bois JS, Venkataraman S, Choi HM, Spakowitz AJ, Wang ZG, Pierce NA. Bois JS, et al. Nucleic Acids Res. 2005 Jul 25;33(13):4090-5. doi: 10.1093/nar/gki721. Print 2005. Nucleic Acids Res. 2005. PMID: 16043632 Free PMC article.
Generation of DNA oligomers with similar chemical kinetics via in-silico optimization.
Tobiason M, Yurke B, Hughes WL. Tobiason M, et al. Commun Chem. 2023 Oct 18;6(1):226. doi: 10.1038/s42004-023-01026-w. Commun Chem. 2023. PMID: 37853171 Free PMC article.

See all "Cited by" articles

References

1. Seeman N.C. (1982) Nucleic acid junctions and lattices. J. Theor. Biol., 99, 237–247. - PubMed
1. Seeman N.C. (1999) DNA engineering and its application to nanotechnology. Trends Biotechnol., 17, 437–443. - PubMed
1. Winfree E., Liu,F., Wenzler,L.A. and Seeman,N.C. (1998) Design and self-assembly of two-dimensional DNA crystals. Nature, 394, 539–544. - PubMed
1. Kallenbach R.K., Ma,R.-I. and Seeman,N.C. (1983) An immobile nucleic acid junction constructed from oligonucleotides. Nature, 305, 829–831.
1. Chen J. and Seeman,N.C. (1991) The synthesis from DNAs of a molecule with the connectivity of a cube. Nature, 350, 631–633. - PubMed

Publication types

Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Paradigms for computational nucleic acid design

Affiliation

Paradigms for computational nucleic acid design

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Other Literature Sources

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

LinkOut - more resources

Full Text Sources

Other Literature Sources