Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Feb 20;47(3):e17.
doi: 10.1093/nar/gky1182.

Start-Stop Assembly: a functionally scarless DNA assembly system optimized for metabolic engineering

Affiliations

Start-Stop Assembly: a functionally scarless DNA assembly system optimized for metabolic engineering

George M Taylor et al. Nucleic Acids Res. .

Abstract

DNA assembly allows individual DNA constructs or libraries to be assembled quickly and reliably. Most methods are either: (i) Modular, easily scalable and suitable for combinatorial assembly, but leave undesirable 'scar' sequences; or (ii) bespoke (non-modular), scarless but less suitable for construction of combinatorial libraries. Both have limitations for metabolic engineering. To overcome this trade-off we devised Start-Stop Assembly, a multi-part, modular DNA assembly method which is both functionally scarless and suitable for combinatorial assembly. Crucially, 3 bp overhangs corresponding to start and stop codons are used to assemble coding sequences into expression units, avoiding scars at sensitive coding sequence boundaries. Building on this concept, a complete DNA assembly framework was designed and implemented, allowing assembly of up to 15 genes from up to 60 parts (or mixtures); monocistronic, operon-based or hybrid configurations; and a new streamlined assembly hierarchy minimizing the number of vectors. Only one destination vector is required per organism, reflecting our optimization of the system for metabolic engineering in diverse organisms. Metabolic engineering using Start-Stop Assembly was demonstrated by combinatorial assembly of carotenoid pathways in Escherichia coli resulting in a wide range of carotenoid production and colony size phenotypes indicating the intended exploration of design space.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Schematic overview of Start-Stop Assembly. (A) Architecture of Level 1 assembly of expression units from genetic parts, showing detail for the CDS part in Level 0. Parts stored in Level 0 are flanked by inward-facing SapI restriction sites. Each type of part (promoter, RBS, CDS, terminator) uses different and unique fusion sites (α-ϵ sites) that allow the correct assembly of expression units. Coding sequences use the start codon ATG (γ site) and stop codon TAA (δ site) as fusion sites allowing functionally scarless assembly of expression units. (B) Schematic illustration showing the overall framework for Start-Stop Assembly. Up to 15 expression units can be hierarchically assembled from basic parts.
Figure 2.
Figure 2.
Assessment of fidelity and bias of multi-part Level 1 assembly. Constructs were assembled from Level 0 parts (in storage plasmids) into Level 1 vector pStA1AZ. Promoters, in descending order of strength: P1 = J23100, P2 = J23102, P3 = J23118, P4 = J23107, P5 = J23116, P6 = J23113. RBSs, in descending order of strength: R1 = RBSc44, R2 = RBSc33, R3 = RBSc13, R4 = RBSc58, R5 = RBSc42, R6 = RBSc36. CDS: e = eyfp. Terminator: T1 = L3S2P55. Representative images of E. coli DH10B colonies transformed with assembly reactions are shown illuminated by blue light to visualize fluorescence. In each experiment ten randomly-selected colonies (c1-c10) were analysed by sequencing (with primers oligoGT234 and oligoGT235) and flow cytometry. Sequencing results show expected intact parts and fusion sites. Flow cytometry histograms show fluorescence intensity of 10,000 events (cells) normalised to the maximum (in order to visualize distribution rather than absolute values) for wild-type E. coli DH10B as a negative reference (−), the most fluorescent of the ten clones as a positive reference (+) and a pool of several hundred transformants (P). (A) Assessment of assembly fidelity using assembly of an individual P3-R3-e-T1 expression unit. Two colonies were randomly selected for analysis from each of five replicate assemblies. The flow cytometry positive reference (+) strain is c3. (B) Assessment of assembly bias by combinatorial assembly of EYFP expression units using six promoters P1-P6, six RBSs R1-R6, eyfp and terminator T1. Ten colonies from one assembly were randomly selected for analysis. The flow cytometry positive reference (+) strain is c1. In the sequencing results the red cross indicates a misassembly between β and γ in place of the UTR/RBS. Two further replicates of this experiment are shown in Supplementary Figure S24.
Figure 3.
Figure 3.
Alternative hierarchies achieved by alternative organization of Level 1 donor and Level 2 acceptor fusion sites. Rounded rectangles represent vectors. Letters represent Level 1 donor fusion sites and Level 2 acceptor fusion sites. (A) Start-Stop Assembly uses alternative ‘Z vectors’ at Level 1 to minimize the number of destination vectors at Level 2. By assembling the last expression unit (of those being assembled at Level 1) in a Z vector at Level 1, in which the last donor fusion site is Z, then the recipient Level 2 vector can always use the same acceptor fusion sites, A and Z. While this requires more Level 1 vectors, it requires only one Level 2 vector. If alternative destination vectors are required for different organisms or contexts, only one new Level 2 vector needs to be constructed. (B) The typical Golden Gate framework requires multiple Level 2 vectors per context to accommodate all the possible Level 1 fusion site configurations. Therefore if an alternative destination vector is required several new vectors will need to be constructed per context or organism.
Figure 4.
Figure 4.
Detail of multi-part hierarchical assembly and fusion sites used in Start-Stop Assembly. Up to 15 expression units can be hierarchically assembled from basic parts (promoters, RBSs, CDSs and terminators) stored at Level 0. Level 1 expression units are assembled from four Level 0 parts, Level 2 constructs are assembled from up to five Level 1 expression units, and Level 3 constructs are assembled from up to three Level 2 constructs. Different type IIS restriction endonucleases (SapI, BsaI or BbsI) and different antibiotic-resistance markers (AmpR, TetR, KanR or CamR) are used for each assembly level as shown. Acceptor fusion sites are shown inside donor fusion sites at Level 1 and Level 2. Only donor fusion sites are shown at Level 0, and only acceptor fusion sites are shown at Level 3. As described in the text, alternative sets of vectors are used at Level 1 depending upon the number of expression units being assembled, such that the first and last Level 1 donor fusion sites are always A and Z, respectively, which are always the Level 2 acceptor fusion sites. Multiple Level 2 vectors (pStA212, pStA223, pStA234) with different donor fusion sites are only needed if the assembly will be continued to Level 3, in order to assemble more than five expression units. Alternative Level 3 vectors (pStA313, pStA314) with different acceptor fusion sites allow for assembly of two or three Level 2 constructs.
Figure 5.
Figure 5.
Combinatorial assembly of astaxanthin pathway using Start-Stop Assembly. (A) Astaxanthin pathway showing the endogenous enzymes of the E. coli MEP pathway and heterologous enzymes. Coloured products are shown in boxes, enzymes included in the combinatorial assembly are shown above arrows. Abbreviations: G3P (glyceraldehyde 3-phosphate), FPP (Farnesyl pyrophosphate), GGPP (Geranylgeranyl pyrophosphate). (B) Design of astaxanthin pathway library pGT535, composed of eight expression units combinatorially and hierarchically assembled into Level 3 vector pStA314, as outlined in the text and shown in detail in Figure S20. Mixtures of six parts of varying strengths were used at each promoter and RBS position shown, as described in the text. These are shown as n in the figure to reflect the sample of the combinatorial design space present after hierarchical assembly. Assembled insert size = 9,984 bp. Total size of plasmid including assembled insert = 12,691 bp. (C) Phenotypic variation among E. coli clones of the astaxanthin pathway library (pGT535) was compared to controls pStA314 (empty vector) and an isolated clone pGT536 from the astaxanthin pathway library pGT535. Phenotypic variation is shown using representative images of colonies, histograms of colony size (measured as colony area in pixels, x-axis values represent the upper limit of each histogram bin) and the distributions of colony colours (represented using red, green and blue values extracted from colony images).
Figure 6.
Figure 6.
Lycopene content of E. coli clones from the astaxanthin library. Ten E. coli colonies were randomly selected from the combinatorially-assembled astaxanthin pathway library pGT535 (Figure 5, Figure S20). Each clone was grown for 24 h in LB broth before lycopene content was determined as mg lycopene per dry cell weight (DCW). Error bars represent the standard deviation of three independent biological replicates. A photograph of a sample of the culture of each clone was taken (shown to left of graph), which shows the visible variation of colour in the ten cultures, attributed to their different profiles of the carotenoids produced in the astaxanthin pathway.

References

    1. Casini A., Storch M., Baldwin G.S., Ellis T.. Bricks and blueprints: methods and standards for DNA assembly. Nat. Rev. Mol. Cell Biol. 2015; 16:568–576. - PubMed
    1. Engler C., Kandzia R., Marillonnet S.. A one pot, one step, precision cloning method with high throughput capability. PLoS One. 2008; 3:e3647. - PMC - PubMed
    1. Engler C., Gruetzner R., Kandzia R., Marillonnet S.. Golden gate shuffling: a one-pot DNA shuffling method based on type IIs restriction enzymes. PLoS One. 2009; 4:e5553. - PMC - PubMed
    1. Weber E., Engler C., Gruetzner R., Werner S., Marillonnet S.. A modular cloning system for standardized assembly of multigene constructs. PLoS One. 2011; 6:e16765. - PMC - PubMed
    1. Sarrion-Perdigones A., Falconi E.E., Zandalinas S.I., Juárez P., Fernández-del-Carmen A., Granell A., Orzaez D.. GoldenBraid: an iterative cloning system for standardized assembly of reusable genetic modules. PLoS One. 2011; 6:e21622. - PMC - PubMed

Publication types

MeSH terms