Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 May 24;12(1):3042.
doi: 10.1038/s41467-021-22417-4.

CRISPECTOR provides accurate estimation of genome editing translocation and off-target activity from comparative NGS data

Affiliations

CRISPECTOR provides accurate estimation of genome editing translocation and off-target activity from comparative NGS data

Ido Amit et al. Nat Commun. .

Abstract

Controlling off-target editing activity is one of the central challenges in making CRISPR technology accurate and applicable in medical practice. Current algorithms for analyzing off-target activity do not provide statistical quantification, are not sufficiently sensitive in separating signal from noise in experiments with low editing rates, and do not address the detection of translocations. Here we present CRISPECTOR, a software tool that supports the detection and quantification of on- and off-target genome-editing activity from NGS data using paired treatment/control CRISPR experiments. In particular, CRISPECTOR facilitates the statistical analysis of NGS data from multiplex-PCR comparative experiments to detect and quantify adverse translocation events. We validate the observed results and show independent evidence of the occurrence of translocations in human cell lines, after genome editing. Our methodology is based on a statistical model comparison approach leading to better false-negative rates in sites with weak yet significant off-target activity.

PubMed Disclaimer

Conflict of interest statement

G.K., M.S.M., and G.R.R. are employees of Integrated DNA Technologies (IDT), which sells reagents similar to some described in the manuscript. All other authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Quantifying off-target activity by CRISPECTOR.
a, b Hypothetical indels at the expected cut-site in the M experiments. a Out of 226 off-target sites that we have examined in our experimental data, we have found 31 sites to have indel frequency higher than 0.1% (fraction = 0.001) in the M experiments. M indel frequencies at the cut-site were measured by a direct calculation, not using a comparative set-up. b These 31 noisy off-target sites come from different gRNAs examined, as depicted. (c) CRISPECTOR workflow. CRISPECTOR assigns each read in the Tx and M FASTQ files to a specific locus of interest or a putative translocation. Then, a Bayesian inference classifier accurately estimates the indel editing activity, and a hypergeometric test is performed to detect translocation reads.
Fig. 2
Fig. 2. Off-target activity levels estimates by CRISPECTOR.
a Site 51 of RAG1, XT 2p in HEK293-Cas9. High rate of deletions of length 1 at the expected cut-site masks the real editing signal (Tx=17, M = 41, solid circle) if only subtraction is used. Also, the PAM site is GGG. Thus, we can find an alternative cut-site 1-bp to the right of the expected cut-site. CRISPECTOR points to five reads with deletions of length 1 on this alternative cut-site (dashed circle). Editing activity estimation—CRISPECTOR = 0.113%, CRISPResso2 = 0.068% and ampliCan = 0.070%. Note that reference is depicted 3’→5’ since the gRNA cuts the negative strand in this locus. The notation Del k stands for deletions of length k. b Editing activity by CRISPECTOR, ampliCan and CRISPResso2 for RAG1, XT 2p, HEK293-Cas9 sample. For many instances with low editing activity levels, there is a disagreement between tools with respect to the active (≥0.1%) or inactive (<0.1%) classification. In this experiment, CRISPECTOR reported four sites as ≥0.1% (84, 61, 43, and 51, denoted with asterisks), where both ampliCan and CRISPResso2 determined the sites as ≤0.1%. Error bars represent confidence intervals of CRISPECTOR results. This activity was validated by human examination of the individual actual reads (see “Discussion” and Supplementary Note 2). c Tool comparison for the active off-target site classification. Off-target site classification results for CRISPECTOR, CRISPResso2 (CS2), and ampliCan (Amp). In the left column squares, the number of instances with agreement and disagreement in the classification task are depicted. Each square refers to a comparison of two tools. P (Positive) and N (Negative) stand for active and inactive sites, respectively. The two pie charts depict CRISPECTOR and CRISPResso2 classification results on the 19 validated sites. TP, TN, FP, and FN denote true-positive, true-negative, false-positive, and false-negative, respectively. True state determined by the actual reads.
Fig. 3
Fig. 3. CRISPECTOR detects and quantifies adverse translocation events.
a Schematic description of all four translocation types. Schematic description of all possible translocations for chromosomes 3 and 10 p arms. The expected cut-sites are colored red and ⊕ denotes the concatenation/fusion of the sequences. The four possible fusion types give rise to structures that are either single-centromeric (A & B), centromere-free (C), or double-centromeric (D). b Translocation reads heatmap for RAG2, XT 2p, HEK293-Cas9. 19 translocations with a Hypergeometric (FDR corrected) p-value < 0.05 are presented in the heatmap with the associated read counts. For example: 225 reads were found between RAG2_5 and RAG2_1, yielding a p-value of 1.53 × 10−65. c ddPCR Validation of translocation events. Specific translocation or structural variation events were individually experimentally measured using event-specific ddPCR primers and probes. The means of 8 technical repeats are also depicted above (black lines). WRS (Wilcoxon Rank Sum) one-sided p-values to support the higher Tx values measured are: 7.7 × 10−4, 0.29, 0.027, 0.074, 0.09, and 0.17, ordered as in the figure. Source data is provided as a source data file. d Validation of CRISPECTOR translocation detection process based on synthetic spike-in standards. Edited RAG2, XT 2p, HEK293-Cas9 genomic DNA was spiked-in with serial dilutions (0, 0.016, 0.16, 1.6 and 16%) of RAG2_10/RAG2_7 synthetic fusion construct (see text). Observed numbers of fusion reads for the spiked-in RAG2_10/RAG2_7 construct (in orange) and for the naturally occurring RAG2_1/RAG2_5 (in green) are presented.

References

    1. Porteus MH. A new class of medicines through DNA editing. N. Engl. J. Med. 2019;380:947–959. doi: 10.1056/NEJMra1800729. - DOI - PubMed
    1. Tsai SQ, Joung JK. Defining and improving the genome-wide specificities of CRISPR-Cas9 nucleases. Nat. Rev. Genet. 2016;17:300–312. doi: 10.1038/nrg.2016.28. - DOI - PMC - PubMed
    1. Bikard D, et al. Programmable repression and activation of bacterial gene expression using an RNA-guided DNA binding protein Supplementary Materials. Nucleic Acids Res. 2013;41:7429–7437. doi: 10.1093/nar/gkt520. - DOI - PMC - PubMed
    1. Zhang Y, Malzahn AA, Sretenovic S, Qi Y. The emerging and uncultivated potential of CRISPR technology in plant science. Nat. Plants. 2019;5:778–794. doi: 10.1038/s41477-019-0461-5. - DOI - PubMed
    1. Han R, et al. Functional CRISPR screen identifies AP1-associated enhancer regulating FOXF1 to modulate oncogene-induced senescence. Genome Biol. 2018;19:118. doi: 10.1186/s13059-018-1494-1. - DOI - PMC - PubMed

Publication types