Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Nov 23;27(1):108557.
doi: 10.1016/j.isci.2023.108557. eCollection 2024 Jan 19.

SWOffinder: Efficient and versatile search of CRISPR off-targets with bulges by Smith-Waterman alignment

Affiliations

SWOffinder: Efficient and versatile search of CRISPR off-targets with bulges by Smith-Waterman alignment

Ofir Yaish et al. iScience. .

Abstract

CRISPR/Cas9 technology is revolutionizing the field of gene editing. While this technology enables the targeting of any gene, it may also target unplanned loci, termed off-target sites (OTS), which are a few mismatches, insertions, and deletions from the target. While existing methods for finding OTS up to a given mismatch threshold are efficient, other methods considering insertions and deletions are limited by long runtimes, incomplete OTS lists, and partial support of versatile thresholds. Here, we developed SWOffinder, an efficient method based on Smith-Waterman alignment to find all OTS up to some edit distance. We implemented an original trace-back approach to find OTS under versatile criteria, such as separate limits on the number of insertions, deletions, and mismatches. Compared to state-of-the-art methods, only SWOffinder finds all OTS in the genome in just a few minutes. SWOffinder enables accurate and efficient genomic search of OTS, which will lead to safer gene editing.

Keywords: Algorithms; Bioinformatics; Biological constraints; Techniques in genetics.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

None
Graphical abstract
Figure 1
Figure 1
Off-target site alignment illustration An off-target site alignment with 1 DNA bulge, 1 RNA bulge, and 1 mismatch of a sgRNA sequence AUUGAGAUAGUGUGGGGAAG with NGG PAM and a genome window of the forward strand.
Figure 2
Figure 2
Overview of SWOffinder for searching off-target sites across a given genome First, in the SW-matrix calculation step, SWOffinder scans the genome for all end positions of sites with an edit distance that meets a user-specified threshold using a novel version of the Smith-Waterman alignment algorithm. Then, in the trace-back step, SWOffinder applies a novel recursive procedure on the subset of end positions obtained by the SW-matrix calculation step to filter sites, which do not meet the user-specified operation-specific thresholds, and find their alignment that meets those thresholds with minimum edit distance (Algorithm 1).
Figure 3
Figure 3
Off-target sites identification and runtime comparison of SWOffinder, CALITAS, and CRISPRitz (A) Comparison of the number of sites found by each method. We ran SWOffinder and CALITAS to find off-target sites up to edit distance 4, and CRISPRitz with up to 4 mismatches and 4 bugles (as it does not support an edit-distance threshold). (B) The overlap between sites that were identified by SWOffinder and sites that were found by CALITAS and CRISPRitz in a window. SWOffinder found all sites that were found by CALITAS and CRISPRitz. (C) Runtime comparison of SWOffinder, CALITAS, and CRISPRitz in searching off-target sites up to edit distance 4. CRISPRitz was run with up to 4 mismatches and 4 bulges (as it does not support an edit-distance threshold).
Figure 4
Figure 4
A comparison of the number of unique off-target sites found and the runtime between SWOffinder and CRISPRitz over various combinations of bulges and mismatches thresholds We did not compare to CALITAS in this benchmark as it does not support a search with a combination of separate thresholds. (A) A comparison of the unique off-target sites that were found. Note that the x-axis is on a logarithmic scale. (B) A comparison of the runtime. Note that the runtime of SWOffinder trace-back step was calculated by reducing the runtime of the SWOffinder SW-matrix calculation step from the total runtime.

Similar articles

Cited by

References

    1. Jiang F., Doudna J.A. CRISPR–Cas9 structures and mechanisms. Annu. Rev. Biophys. 2017;46:505–529. - PubMed
    1. Pineda M., Lear A., Collins J.P., Kiani S. Safe CRISPR: challenges and possible solutions. Trends Biotechnol. 2019;37:389–401. - PubMed
    1. Bae S., Park J., Kim J.-S. Cas-OFFinder: a fast and versatile algorithm that searches for potential off-target sites of Cas9 RNA-guided endonucleases. Bioinformatics. 2014;30:1473–1475. - PMC - PubMed
    1. Backurs A., Indyk P. Proceedings of the forty-seventh annual ACM symposium on Theory of computing. 2015. Edit distance cannot be computed in strongly subquadratic time (unless SETH is false) pp. 51–58.
    1. Cancellieri S., Canver M.C., Bombieri N., Giugno R., Pinello L. CRISPRitz: rapid, high-throughput and variant-aware in silico off-target site identification for CRISPR genome editing. Bioinformatics. 2020;36:2001–2008. - PMC - PubMed

LinkOut - more resources