Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jul 1:6:28973.
doi: 10.1038/srep28973.

CRISP-ID: decoding CRISPR mediated indels by Sanger sequencing

Affiliations

CRISP-ID: decoding CRISPR mediated indels by Sanger sequencing

Jonas Dehairs et al. Sci Rep. .

Abstract

The advent of next generation gene editing technologies has revolutionized the fields of genome engineering in allowing the generation of gene knockout models and functional gene analysis. However, the screening of resultant clones remains challenging due to the simultaneous presence of different indels. Here, we present CRISP-ID, a web application which uses a unique algorithm for genotyping up to three alleles from a single Sanger sequencing trace, providing a robust and readily accessible platform to directly identify indels and significantly speed up the characterization of clones.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interests.

Figures

Figure 1
Figure 1. Input, processing and output of the CRISP-ID application.
After uploading a trace file, CRISP-ID draws a chromatogram that displays the sequence peaks and base calls. The user can tweak the base calling by trimming the start and the end of the chromatogram and by adjusting the background cut-off. To obtain the sequences, a “frame primer” is constructed, containing only the homozygous base calls after the start of the spectral shift (A). This frame primer runs over the entire length of the overlapping spectrum. Provided that the frame primer is sufficiently long, it will align with the reference sequence once for each sequence that is present in the mix (B). If no alignments can be found initially, the primer is iteratively trimmed until either alignments are found, or a minimal size of 10 homozygous base calls is reached. Trimming the 3′ end of the frame primer might be necessary due to poor quality base calls near the end of the sequence run, trimming the 5′ end is necessary in case of insertions, and is set to 10 bases by default (not shown in figure). Initially, a “first guess” of the sequences is constructed based on peak height, with the first sequence containing the highest peaks (C). The first sequence is then compared to the reference sequence according to the first frame found in step B. In case of a base mismatch, the base is swapped with the second sequence. During this process of matching the first sequence to the reference sequence, the swapping inherently results in simultaneously solving the second sequence (D). Finally, the user is presented with a multiple alignment of the de-convoluted sequences and the reference sequence, revealing the size and locus of the indels.

References

    1. Urnov F. D. et al. Highly efficient endogenous human gene correction using designed zinc-finger nucleases. Nature 435, 646–651 (2005). - PubMed
    1. Cermak T. et al. Efficient design and assembly of custom TALEN and other TAL effector-based constructs for DNA targeting. Nucleic Acids Res. 39, e82 (2011). - PMC - PubMed
    1. Mali P. et al. RNA-Guided Human Genome Engineering via Cas9. Science 339, 823–826 (2013). - PMC - PubMed
    1. Wang H. et al. One-Step Generation of Mice Carrying Mutations in Multiple Genes by CRISPR/Cas-Mediated Genome Engineering. Cell 153, 910–918 (2013). - PMC - PubMed
    1. Hwang W. Y. et al. Efficient genome editing in zebrafish using a CRISPR-Cas system. Nat. Biotechnol. 31, 227–229 (2013). - PMC - PubMed

Publication types