Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 May;38(8):2617-23.
doi: 10.1093/nar/gkq093. Epub 2010 Mar 7.

CLONEQC: lightweight sequence verification for synthetic biology

Affiliations

CLONEQC: lightweight sequence verification for synthetic biology

Pablo A Lee et al. Nucleic Acids Res. 2010 May.

Abstract

Synthetic biology projects aim to produce physical DNA that matches a designed target sequence. Chemically synthesized oligomers are generally used as the starting point for building larger and larger sequences. Due to the error rate of chemical synthesis, these oligomers can have many differences from the target sequence. As oligomers are joined together to make larger and larger synthetic intermediates, it becomes essential to perform quality control to eliminate intermediates with errors and retain only those DNA molecules that are error free with respect to the target. This step is often performed by transforming bacteria with synthetic DNA and sequencing colonies until a clone with a perfect sequence is identified. Here we present CloneQC, a lightweight software pipeline available as a free web server and as source code that performs quality control on sequenced clones. Input to the server is a list of desired sequences and forward and reverse reads for each clone. The server generates summary statistics (error rates and success rates target-by-target) and a detailed report of perfect clones. This software will be useful to laboratories conducting in-house DNA synthesis and is available at http://cloneqc.thruhere.net/ and as Berkeley Software Distribution (BSD) licensed source.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Flowchart of the CloneQC sequence validation pipeline. See ‘Materials and Methods’ section for details.
Figure 2.
Figure 2.
Summary results for CloneQC run as a web application. ‘STATISTICS’ provides summary statistics for each synthetic target matched by at least one clone. Following are summary tables for each target sequence (only the first two shown), giving the identities of clones that contain perfect physical DNA for the target (Passing Clones), have discrepancies between reads but may have perfect physical DNA (Check Clones) or have errors that are fixable by reamplification (Fixable Clones).

References

    1. Dymond JS, Scheifele LZ, Richardson S, Lee P, Chandrasegaran S, Bader JS, Boeke JD. Teaching synthetic biology, bioinformatics and engineering to undergraduates: the interdisciplinary Build-a-Genome course. Genetics. 2009;181:13–21. - PMC - PubMed
    1. Richardson SM, Wheelan SJ, Yarrington RM, Boeke JD. GeneDesign: rapid, automated design of multikilobase synthetic genes. Genome Res. 2006;16:550–556. - PMC - PubMed
    1. Richardson SM, Nunley PW, Yarrington RM, Boeke JD, Bader JS. Genedesign 3.0: An updated synthetic biology toolkit. Nucleic Acids Res. 2010 (in press) - PMC - PubMed
    1. Carr PA, Park JS, Lee Y-J, Yu T, Zhang S, Jacobson JM. Protein-mediated error correction for de novo DNA synthesis. Nucleic Acids Res. 2004;32:e162. - PMC - PubMed
    1. Tian J, Gong H, Sheng N, Zhou X, Gulari E, Gao X, Church G. Accurate multiplex gene synthesis from programmable DNA microchips. Nature. 2004;432:1050–1054. - PubMed

Publication types