Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Mar;32(3):279-284.
doi: 10.1038/nbt.2808. Epub 2014 Jan 26.

Improving CRISPR-Cas nuclease specificity using truncated guide RNAs

Affiliations

Improving CRISPR-Cas nuclease specificity using truncated guide RNAs

Yanfang Fu et al. Nat Biotechnol. 2014 Mar.

Abstract

Clustered, regularly interspaced, short palindromic repeat (CRISPR) RNA-guided nucleases (RGNs) are highly efficient genome editing tools. CRISPR-associated 9 (Cas9) RGNs are directed to genomic loci by guide RNAs (gRNAs) containing 20 nucleotides that are complementary to a target DNA sequence. However, RGNs can induce mutations at sites that differ by as many as five nucleotides from the intended target. Here we report that truncated gRNAs, with shorter regions of target complementarity <20 nucleotides in length, can decrease undesired mutagenesis at some off-target sites by 5,000-fold or more without sacrificing on-target genome editing efficiencies. In addition, use of truncated gRNAs can further reduce off-target effects induced by pairs of Cas9 variants that nick DNA (paired nickases). Our results delineate a simple, effective strategy to improve the specificities of Cas9 nucleases or paired nickases.

PubMed Disclaimer

Figures

Figure 1
Figure 1. On-target genome editing activities of truncated gRNAs and Cas9 nuclease in human cells
(a) Target site in the EGFP gene used to assess the activities of gRNAs that are truncated at their 5' end. Note that Gs near the 5' end (underlined) enable gRNAs with complementarity lengths of 15, 17, 19 and 20 nt to be expressed from a U6 promoter in human cells. The degenerate base of the protospacer adjacent motif (PAM) sequence (in this case, a C nucleotide) is shown in lowercase. (b) Efficiencies of EGFP disruption in human cells mediated by Cas9 and gRNAs bearing variable length complementarity regions (shown in nts) for the target site of (a). Ctrl = control gRNA lacking a complementarity region. Error bars indicate standard errors of the mean (s.e.m.), n = 2. (c) Efficiencies of EGFP disruption in human cells mediated by Cas9 and full-length or shortened gRNAs for four target sites in the EGFP reporter gene (sites #1 – #4). Lengths and sequences of the gRNA complementarity regions are shown. Ctrl, control gRNA lacking a complementarity region; nt, nucleotides. Error bars represent s.e.m., n = 2. (d) Efficiencies of targeted indel mutations introduced at seven different human endogenous gene targets by matched standard and tru-RGNs. Lengths and sequences of gRNA complementarity regions are shown. Indel frequencies were measured by T7EI assay. Ctrl = control gRNA lacking a complementarity region. Three of the target sites used are named as previously described: VEGFA site 1 (aka Target site 1), VEGFA Site 3 (aka Target Site 3), and EMX1 Site 1 (aka Target Site 4). VEGFA= vascular endothelial growth factor A, EMX1= empty spiracles homolog 1. Error bars represent s.e.m., n = 2. (e) Efficiencies of precise alterations mediated by homology-directed repair (HDR) using single strand oligonucleotides (ssODN) at two endogenous human genes (VEGFA and EMX1) with either full-length RGNs or tru-RGNs. The percentage of alleles in which HDR had occurred (Percentage HDR) was measured using a BamHI restriction digest assay (Online Methods). Control gRNA = control gRNA lacking a complementarity region. Error bars represent s.e.m., n = 2.
Figure 1
Figure 1. On-target genome editing activities of truncated gRNAs and Cas9 nuclease in human cells
(a) Target site in the EGFP gene used to assess the activities of gRNAs that are truncated at their 5' end. Note that Gs near the 5' end (underlined) enable gRNAs with complementarity lengths of 15, 17, 19 and 20 nt to be expressed from a U6 promoter in human cells. The degenerate base of the protospacer adjacent motif (PAM) sequence (in this case, a C nucleotide) is shown in lowercase. (b) Efficiencies of EGFP disruption in human cells mediated by Cas9 and gRNAs bearing variable length complementarity regions (shown in nts) for the target site of (a). Ctrl = control gRNA lacking a complementarity region. Error bars indicate standard errors of the mean (s.e.m.), n = 2. (c) Efficiencies of EGFP disruption in human cells mediated by Cas9 and full-length or shortened gRNAs for four target sites in the EGFP reporter gene (sites #1 – #4). Lengths and sequences of the gRNA complementarity regions are shown. Ctrl, control gRNA lacking a complementarity region; nt, nucleotides. Error bars represent s.e.m., n = 2. (d) Efficiencies of targeted indel mutations introduced at seven different human endogenous gene targets by matched standard and tru-RGNs. Lengths and sequences of gRNA complementarity regions are shown. Indel frequencies were measured by T7EI assay. Ctrl = control gRNA lacking a complementarity region. Three of the target sites used are named as previously described: VEGFA site 1 (aka Target site 1), VEGFA Site 3 (aka Target Site 3), and EMX1 Site 1 (aka Target Site 4). VEGFA= vascular endothelial growth factor A, EMX1= empty spiracles homolog 1. Error bars represent s.e.m., n = 2. (e) Efficiencies of precise alterations mediated by homology-directed repair (HDR) using single strand oligonucleotides (ssODN) at two endogenous human genes (VEGFA and EMX1) with either full-length RGNs or tru-RGNs. The percentage of alleles in which HDR had occurred (Percentage HDR) was measured using a BamHI restriction digest assay (Online Methods). Control gRNA = control gRNA lacking a complementarity region. Error bars represent s.e.m., n = 2.
Figure 1
Figure 1. On-target genome editing activities of truncated gRNAs and Cas9 nuclease in human cells
(a) Target site in the EGFP gene used to assess the activities of gRNAs that are truncated at their 5' end. Note that Gs near the 5' end (underlined) enable gRNAs with complementarity lengths of 15, 17, 19 and 20 nt to be expressed from a U6 promoter in human cells. The degenerate base of the protospacer adjacent motif (PAM) sequence (in this case, a C nucleotide) is shown in lowercase. (b) Efficiencies of EGFP disruption in human cells mediated by Cas9 and gRNAs bearing variable length complementarity regions (shown in nts) for the target site of (a). Ctrl = control gRNA lacking a complementarity region. Error bars indicate standard errors of the mean (s.e.m.), n = 2. (c) Efficiencies of EGFP disruption in human cells mediated by Cas9 and full-length or shortened gRNAs for four target sites in the EGFP reporter gene (sites #1 – #4). Lengths and sequences of the gRNA complementarity regions are shown. Ctrl, control gRNA lacking a complementarity region; nt, nucleotides. Error bars represent s.e.m., n = 2. (d) Efficiencies of targeted indel mutations introduced at seven different human endogenous gene targets by matched standard and tru-RGNs. Lengths and sequences of gRNA complementarity regions are shown. Indel frequencies were measured by T7EI assay. Ctrl = control gRNA lacking a complementarity region. Three of the target sites used are named as previously described: VEGFA site 1 (aka Target site 1), VEGFA Site 3 (aka Target Site 3), and EMX1 Site 1 (aka Target Site 4). VEGFA= vascular endothelial growth factor A, EMX1= empty spiracles homolog 1. Error bars represent s.e.m., n = 2. (e) Efficiencies of precise alterations mediated by homology-directed repair (HDR) using single strand oligonucleotides (ssODN) at two endogenous human genes (VEGFA and EMX1) with either full-length RGNs or tru-RGNs. The percentage of alleles in which HDR had occurred (Percentage HDR) was measured using a BamHI restriction digest assay (Online Methods). Control gRNA = control gRNA lacking a complementarity region. Error bars represent s.e.m., n = 2.
Figure 1
Figure 1. On-target genome editing activities of truncated gRNAs and Cas9 nuclease in human cells
(a) Target site in the EGFP gene used to assess the activities of gRNAs that are truncated at their 5' end. Note that Gs near the 5' end (underlined) enable gRNAs with complementarity lengths of 15, 17, 19 and 20 nt to be expressed from a U6 promoter in human cells. The degenerate base of the protospacer adjacent motif (PAM) sequence (in this case, a C nucleotide) is shown in lowercase. (b) Efficiencies of EGFP disruption in human cells mediated by Cas9 and gRNAs bearing variable length complementarity regions (shown in nts) for the target site of (a). Ctrl = control gRNA lacking a complementarity region. Error bars indicate standard errors of the mean (s.e.m.), n = 2. (c) Efficiencies of EGFP disruption in human cells mediated by Cas9 and full-length or shortened gRNAs for four target sites in the EGFP reporter gene (sites #1 – #4). Lengths and sequences of the gRNA complementarity regions are shown. Ctrl, control gRNA lacking a complementarity region; nt, nucleotides. Error bars represent s.e.m., n = 2. (d) Efficiencies of targeted indel mutations introduced at seven different human endogenous gene targets by matched standard and tru-RGNs. Lengths and sequences of gRNA complementarity regions are shown. Indel frequencies were measured by T7EI assay. Ctrl = control gRNA lacking a complementarity region. Three of the target sites used are named as previously described: VEGFA site 1 (aka Target site 1), VEGFA Site 3 (aka Target Site 3), and EMX1 Site 1 (aka Target Site 4). VEGFA= vascular endothelial growth factor A, EMX1= empty spiracles homolog 1. Error bars represent s.e.m., n = 2. (e) Efficiencies of precise alterations mediated by homology-directed repair (HDR) using single strand oligonucleotides (ssODN) at two endogenous human genes (VEGFA and EMX1) with either full-length RGNs or tru-RGNs. The percentage of alleles in which HDR had occurred (Percentage HDR) was measured using a BamHI restriction digest assay (Online Methods). Control gRNA = control gRNA lacking a complementarity region. Error bars represent s.e.m., n = 2.
Figure 2
Figure 2. tru-gRNAs exhibit enhanced specificities and function efficiently with Cas9 nuclease and paired Cas9 nickases in human cells
(a) Activities of RGNs targeted to three sites in EGFP using full-length (top) or tru-gRNAs (bottom) with single mismatches at each position (except at the 5'-most base which must remain a G for efficient expression from the U6 promoter). Grey boxes in the grids show the positions of Watson-Crick transversion mismatches. Empty gRNA control is a gRNA lacking a complementarity region. RGN activities were measured using the EGFP disruption assay and values shown represent the percentage of EGFP-negative observed relative to an RGN using a perfectly matched gRNA. Experiments were performed in duplicate and means with error bars representing s.e.m. are shown. Nucleotides present at each position of the gRNA complementarity region are shown at the bottom of each panel. (b) Activities of RGNs targeted to three sites in EGFP using full-length (top) or tru-gRNAs (bottom) with adjacent double mismatches at each position (except at the 5'-most base which must remain a G for efficient expression from the U6 promoter). Data presented as in (a). (c) The percentage of sequencing reads containing on- and off-target indel mutations induced by RGNs targeted to three different endogenous human gene sites as measured by deep sequencing. Indel frequencies are shown for the three target sites from cells in which targeted RGNs with a full-length gRNA, a tru-gRNA, or a control gRNA lacking a complementarity region were expressed. Sequences of the on-target and off-target (OT) sites for full-length and tru-gRNAs are shown in Table 1. The scale of the y-axis is the same for all three panels. Absolute counts of indel mutation data used to make these graphs can be found in Supplementary Table 2. On-target sites are named as previously described and as in Fig. 1d. (d) Fold-improvements in off-target site specificities of three tru-RGNs compared to matched standard RGNs. Values shown represent the ratio of on-target to off-target activities for tru-RGNs divided by the ratio of on-target to off-target activities of standard RGNs for the off-target sites shown, calculated using the data from (c) and Supplementary Table 2. For the sites marked with an asterisk (*), no indels were observed with the tru-RGN and therefore the values shown represent conservative statistical estimates for the fold-improvements in specificities for these off-target sites (see Online Methods). (e) Schematic illustrating locations of VEGFA sites 1 and 4 targeted by gRNAs and variant Cas9 nickases to generate paired double nicks. Target sites for the full-length gRNAs are underlined, with the first base in the PAM sequence shown in lowercase. Location of the BamHI restriction site inserted by homology-directed repair (HDR) with a ssODN donor is shown. (f) Substitution of a full-length gRNA for VEGFA site 1 with a tru-gRNA does not reduce the efficiency of indel mutations observed with a paired full-length gRNA for VEGFA site 4 and Cas9-D10A nickases. Control gRNA used is one lacking a complementarity region. The frequency of indel mutations induced by the VEGFA site 4 gRNA alone with Cas9 nickase is highlighted in red. (g) Substitution of a full-length gRNA for VEGFA site 1 with a tru-gRNA does not reduce the efficiency of BamHI sequence alterations introduced with a paired full-length gRNA for VEGFA site 4, Cas9-D10A nickase, and an ssODN donor template. Control gRNA used is one lacking a complementarity region.
Figure 2
Figure 2. tru-gRNAs exhibit enhanced specificities and function efficiently with Cas9 nuclease and paired Cas9 nickases in human cells
(a) Activities of RGNs targeted to three sites in EGFP using full-length (top) or tru-gRNAs (bottom) with single mismatches at each position (except at the 5'-most base which must remain a G for efficient expression from the U6 promoter). Grey boxes in the grids show the positions of Watson-Crick transversion mismatches. Empty gRNA control is a gRNA lacking a complementarity region. RGN activities were measured using the EGFP disruption assay and values shown represent the percentage of EGFP-negative observed relative to an RGN using a perfectly matched gRNA. Experiments were performed in duplicate and means with error bars representing s.e.m. are shown. Nucleotides present at each position of the gRNA complementarity region are shown at the bottom of each panel. (b) Activities of RGNs targeted to three sites in EGFP using full-length (top) or tru-gRNAs (bottom) with adjacent double mismatches at each position (except at the 5'-most base which must remain a G for efficient expression from the U6 promoter). Data presented as in (a). (c) The percentage of sequencing reads containing on- and off-target indel mutations induced by RGNs targeted to three different endogenous human gene sites as measured by deep sequencing. Indel frequencies are shown for the three target sites from cells in which targeted RGNs with a full-length gRNA, a tru-gRNA, or a control gRNA lacking a complementarity region were expressed. Sequences of the on-target and off-target (OT) sites for full-length and tru-gRNAs are shown in Table 1. The scale of the y-axis is the same for all three panels. Absolute counts of indel mutation data used to make these graphs can be found in Supplementary Table 2. On-target sites are named as previously described and as in Fig. 1d. (d) Fold-improvements in off-target site specificities of three tru-RGNs compared to matched standard RGNs. Values shown represent the ratio of on-target to off-target activities for tru-RGNs divided by the ratio of on-target to off-target activities of standard RGNs for the off-target sites shown, calculated using the data from (c) and Supplementary Table 2. For the sites marked with an asterisk (*), no indels were observed with the tru-RGN and therefore the values shown represent conservative statistical estimates for the fold-improvements in specificities for these off-target sites (see Online Methods). (e) Schematic illustrating locations of VEGFA sites 1 and 4 targeted by gRNAs and variant Cas9 nickases to generate paired double nicks. Target sites for the full-length gRNAs are underlined, with the first base in the PAM sequence shown in lowercase. Location of the BamHI restriction site inserted by homology-directed repair (HDR) with a ssODN donor is shown. (f) Substitution of a full-length gRNA for VEGFA site 1 with a tru-gRNA does not reduce the efficiency of indel mutations observed with a paired full-length gRNA for VEGFA site 4 and Cas9-D10A nickases. Control gRNA used is one lacking a complementarity region. The frequency of indel mutations induced by the VEGFA site 4 gRNA alone with Cas9 nickase is highlighted in red. (g) Substitution of a full-length gRNA for VEGFA site 1 with a tru-gRNA does not reduce the efficiency of BamHI sequence alterations introduced with a paired full-length gRNA for VEGFA site 4, Cas9-D10A nickase, and an ssODN donor template. Control gRNA used is one lacking a complementarity region.
Figure 2
Figure 2. tru-gRNAs exhibit enhanced specificities and function efficiently with Cas9 nuclease and paired Cas9 nickases in human cells
(a) Activities of RGNs targeted to three sites in EGFP using full-length (top) or tru-gRNAs (bottom) with single mismatches at each position (except at the 5'-most base which must remain a G for efficient expression from the U6 promoter). Grey boxes in the grids show the positions of Watson-Crick transversion mismatches. Empty gRNA control is a gRNA lacking a complementarity region. RGN activities were measured using the EGFP disruption assay and values shown represent the percentage of EGFP-negative observed relative to an RGN using a perfectly matched gRNA. Experiments were performed in duplicate and means with error bars representing s.e.m. are shown. Nucleotides present at each position of the gRNA complementarity region are shown at the bottom of each panel. (b) Activities of RGNs targeted to three sites in EGFP using full-length (top) or tru-gRNAs (bottom) with adjacent double mismatches at each position (except at the 5'-most base which must remain a G for efficient expression from the U6 promoter). Data presented as in (a). (c) The percentage of sequencing reads containing on- and off-target indel mutations induced by RGNs targeted to three different endogenous human gene sites as measured by deep sequencing. Indel frequencies are shown for the three target sites from cells in which targeted RGNs with a full-length gRNA, a tru-gRNA, or a control gRNA lacking a complementarity region were expressed. Sequences of the on-target and off-target (OT) sites for full-length and tru-gRNAs are shown in Table 1. The scale of the y-axis is the same for all three panels. Absolute counts of indel mutation data used to make these graphs can be found in Supplementary Table 2. On-target sites are named as previously described and as in Fig. 1d. (d) Fold-improvements in off-target site specificities of three tru-RGNs compared to matched standard RGNs. Values shown represent the ratio of on-target to off-target activities for tru-RGNs divided by the ratio of on-target to off-target activities of standard RGNs for the off-target sites shown, calculated using the data from (c) and Supplementary Table 2. For the sites marked with an asterisk (*), no indels were observed with the tru-RGN and therefore the values shown represent conservative statistical estimates for the fold-improvements in specificities for these off-target sites (see Online Methods). (e) Schematic illustrating locations of VEGFA sites 1 and 4 targeted by gRNAs and variant Cas9 nickases to generate paired double nicks. Target sites for the full-length gRNAs are underlined, with the first base in the PAM sequence shown in lowercase. Location of the BamHI restriction site inserted by homology-directed repair (HDR) with a ssODN donor is shown. (f) Substitution of a full-length gRNA for VEGFA site 1 with a tru-gRNA does not reduce the efficiency of indel mutations observed with a paired full-length gRNA for VEGFA site 4 and Cas9-D10A nickases. Control gRNA used is one lacking a complementarity region. The frequency of indel mutations induced by the VEGFA site 4 gRNA alone with Cas9 nickase is highlighted in red. (g) Substitution of a full-length gRNA for VEGFA site 1 with a tru-gRNA does not reduce the efficiency of BamHI sequence alterations introduced with a paired full-length gRNA for VEGFA site 4, Cas9-D10A nickase, and an ssODN donor template. Control gRNA used is one lacking a complementarity region.
Figure 2
Figure 2. tru-gRNAs exhibit enhanced specificities and function efficiently with Cas9 nuclease and paired Cas9 nickases in human cells
(a) Activities of RGNs targeted to three sites in EGFP using full-length (top) or tru-gRNAs (bottom) with single mismatches at each position (except at the 5'-most base which must remain a G for efficient expression from the U6 promoter). Grey boxes in the grids show the positions of Watson-Crick transversion mismatches. Empty gRNA control is a gRNA lacking a complementarity region. RGN activities were measured using the EGFP disruption assay and values shown represent the percentage of EGFP-negative observed relative to an RGN using a perfectly matched gRNA. Experiments were performed in duplicate and means with error bars representing s.e.m. are shown. Nucleotides present at each position of the gRNA complementarity region are shown at the bottom of each panel. (b) Activities of RGNs targeted to three sites in EGFP using full-length (top) or tru-gRNAs (bottom) with adjacent double mismatches at each position (except at the 5'-most base which must remain a G for efficient expression from the U6 promoter). Data presented as in (a). (c) The percentage of sequencing reads containing on- and off-target indel mutations induced by RGNs targeted to three different endogenous human gene sites as measured by deep sequencing. Indel frequencies are shown for the three target sites from cells in which targeted RGNs with a full-length gRNA, a tru-gRNA, or a control gRNA lacking a complementarity region were expressed. Sequences of the on-target and off-target (OT) sites for full-length and tru-gRNAs are shown in Table 1. The scale of the y-axis is the same for all three panels. Absolute counts of indel mutation data used to make these graphs can be found in Supplementary Table 2. On-target sites are named as previously described and as in Fig. 1d. (d) Fold-improvements in off-target site specificities of three tru-RGNs compared to matched standard RGNs. Values shown represent the ratio of on-target to off-target activities for tru-RGNs divided by the ratio of on-target to off-target activities of standard RGNs for the off-target sites shown, calculated using the data from (c) and Supplementary Table 2. For the sites marked with an asterisk (*), no indels were observed with the tru-RGN and therefore the values shown represent conservative statistical estimates for the fold-improvements in specificities for these off-target sites (see Online Methods). (e) Schematic illustrating locations of VEGFA sites 1 and 4 targeted by gRNAs and variant Cas9 nickases to generate paired double nicks. Target sites for the full-length gRNAs are underlined, with the first base in the PAM sequence shown in lowercase. Location of the BamHI restriction site inserted by homology-directed repair (HDR) with a ssODN donor is shown. (f) Substitution of a full-length gRNA for VEGFA site 1 with a tru-gRNA does not reduce the efficiency of indel mutations observed with a paired full-length gRNA for VEGFA site 4 and Cas9-D10A nickases. Control gRNA used is one lacking a complementarity region. The frequency of indel mutations induced by the VEGFA site 4 gRNA alone with Cas9 nickase is highlighted in red. (g) Substitution of a full-length gRNA for VEGFA site 1 with a tru-gRNA does not reduce the efficiency of BamHI sequence alterations introduced with a paired full-length gRNA for VEGFA site 4, Cas9-D10A nickase, and an ssODN donor template. Control gRNA used is one lacking a complementarity region.

References

    1. Wiedenheft B, Sternberg SH, Doudna JA. RNA-guided genetic silencing systems in bacteria and archaea. Nature. 2012;482:331–338. - PubMed
    1. Mali P, Esvelt KM, Church GM. Cas9 as a versatile tool for engineering biology. Nat Methods. 2013;10:957–963. - PMC - PubMed
    1. Charpentier E, Doudna JA. Biotechnology: Rewriting a genome. Nature. 2013;495:50–51. - PubMed
    1. Fu Y, et al. High-frequency off-target mutagenesis induced by CRISPR-Cas nucleases in human cells. Nat Biotechnol. 2013;31:822–826. - PMC - PubMed
    1. Hsu PD, et al. DNA targeting specificity of RNA-guided Cas9 nucleases. Nat Biotechnol. 2013;31:827–832. - PMC - PubMed

Publication types

MeSH terms

Associated data