Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Aug 21;4(8):939-43.
doi: 10.1021/acssynbio.5b00068. Epub 2015 Jul 1.

Predicting the Genetic Stability of Engineered DNA Sequences with the EFM Calculator

Affiliations

Predicting the Genetic Stability of Engineered DNA Sequences with the EFM Calculator

Benjamin R Jack et al. ACS Synth Biol. .

Abstract

Unwanted evolution can rapidly degrade the performance of genetically engineered circuits and metabolic pathways installed in living organisms. We created the Evolutionary Failure Mode (EFM) Calculator to computationally detect common sources of genetic instability in an input DNA sequence. It predicts two types of mutational hotspots: deletions mediated by homologous recombination and indels caused by replication slippage on simple sequence repeats. We tested the performance of our algorithm on genetic circuits that were previously redesigned for greater evolutionary reliability and analyzed the stability of sequences in the iGEM Registry of Standard Biological Parts. More than half of the parts in the Registry are predicted to experience >100-fold elevated mutation rates due to the inclusion of unstable sequence configurations. We anticipate that the EFM Calculator will be a useful negative design tool for avoiding volatile DNA encodings, thereby increasing the evolutionary lifetimes of synthetic biology devices.

Keywords: computer-aided design (CAD); design-build-test cycle; genetic engineering; genetic robustness; hypermutable site; metabolic engineering.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
(a) The EFM Calculator accepts a DNA sequence as input, predicts two types of hypermutable sites in this sequence, and summarizes the overall prediction of instability with a RIP score. It outputs an HTML file of hypermutable sites on an interactive drawing of the sequence and in a table. The RIP score represents the factor by which redesigning the input sequence to eliminate predicted SSR and RMD mutational hotspots (leaving only the baseline rate of BPS mutations) could theoretically reduce the overall rate of mutations that contribute to the evolutionary failure modes of this DNA sequence when it is deployed in a bacterial host. (b) The original version of a genetic circuit that expresses GFP (BioBrick part T9002) receives a RIP score of 486.3. A redesigned version (T9002-E), shown experimentally to have a longer evolutionary half-life by Sleight et al., receives a more stable RIP score of 101.6.
Figure 2.
Figure 2.
(a) Distribution of RIP scores predicted by the EFM Calculator for parts with lengths of >50 base pairs in the iGEM Registry. The colored shading of bars demarcates ten-fold increases in predicted mutational instability. (b) EFM Calculator output for BioBrick part E1010, one of the top ten most used coding sequences in the Registry. The RIP score of 178.3 indicates that the mutation rate in this part is predicted to be 178 times the baseline rate of base-pair substitutions. Three simple sequence repeats contribute to this genetic instability prediction.

References

    1. Arkin AP, and Fletcher DA (2006) Fast, cheap and somewhat in control. Genome Biol. 7, 114. - PMC - PubMed
    1. Renda BA, Hammerling MJ, and Barrick JE (2014) Engineering reduced evolutionary potential for synthetic biology. Mol. Biosyst 10, 1668–1678. - PMC - PubMed
    1. Canton B, Labno A, and Endy D (2008) Refinement and standardization of synthetic biological parts and devices. Nat. Biotechnol 26, 787–793. - PubMed
    1. Sleight SC, Bartley BA, Lieviant JA, and Sauro HM (2010) Designing and engineering evolutionary robust genetic circuits. J. Biol. Eng 4, 12. - PMC - PubMed
    1. Oliveira PH, Lemos F, Monteiro GA, and Prazeres DMF (2008) Recombination frequency in plasmid DNA containing direct repeats—predictive correlation with repeat and intervening sequence length. Plasmid 60, 159–165. - PubMed

Publication types