Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 Sep 9:2024.09.09.612085.
doi: 10.1101/2024.09.09.612085.

CRISPR-CLEAR: Nucleotide-Resolution Mapping of Regulatory Elements via Allelic Readout of Tiled Base Editing

Affiliations

CRISPR-CLEAR: Nucleotide-Resolution Mapping of Regulatory Elements via Allelic Readout of Tiled Base Editing

Basheer Becerra et al. bioRxiv. .

Abstract

CRISPR tiling screens have advanced the identification and characterization of regulatory sequences but are limited by low resolution arising from the indirect readout of editing via guide RNA sequencing. This study introduces CRISPR-CLEAR, an end-to-end experimental assay and computational pipeline, which leverages targeted sequencing of CRISPR-introduced alleles at the endogenous target locus following dense base-editing mutagenesis. This approach enables the dissection of regulatory elements at nucleotide resolution, facilitating a direct assessment of genotype-phenotype effects.

PubMed Disclaimer

Conflict of interest statement

Competing interests Competing interests L.P. has financial interests in Edilytics, Inc., Excelsior Genomics, and SeQure Dx, Inc. L.P.’s interests were reviewed and are managed by Massachusetts General Hospital and Partners HealthCare in accordance with their conflict of interest policies. The remaining authors declare no competing interests.

Figures

Figure 1:
Figure 1:. A base-editor tiling screen with allele-based readout
a. Comparison of CRISPR-CLEAR workflow with standard sgRNA enrichment sequencing approach. The workflow illustrates the key steps from guide RNA design to data analysis. First, cells stably expressing a base editor are transduced with a library of guide RNAs tiling the regulatory sequence. After editing, cells are FACS-sorted based on the expression of the target protein. Genomic DNA is extracted from sorted cells. Next-generation libraries are prepared to quantify sgRNA counts and to measure the distribution of edits at the endogenous sequence in the sorted population of cells. The left pathway shows the standard approach using sgRNA count-based readout and the CRISPR-SURF pipeline for deconvolution of functional regions. The right pathway depicts the CRISPR-CLEAR approach using direct allele-based readout and the CRISPR-Millipede pipeline, enabling precise genotype-to-phenotype linkage through per-allele and per-nucleotide analysis. b. A putative proximal enhancer is located upstream of the CD19 promoter, based on H3K27ac and ATAC-seq of NALM6 cells, and sequence conservation. c. NALM6 clones with mono- or biallelic deletion of the CD19 enhancer show reduction in CD19 mRNA levels. d. Wild-type and enhancer KO clones were stained with a CD19 antibody. Flow cytometry indicated a reduction in CD19 protein levels. One-way ANOVA, replicates are shown in circles/squares/triangles (n=6–10), * = p<0.05).
Figure 2:
Figure 2:. Direct measurement of regulatory potential using allele-based readout
a. CRISPR-SURF analyses. Top: plot showing the fold enrichment of sgRNA in CD19 positive versus CD19 negative cells in the ABE8e-SpRY screen. Scores of guides from each of three replicates are shown in red, blue, or green. Scores of negative control sgRNAs (20mers not containing editable adenines) are shown in grey. Deconvolution score track with the region called significant at positions 220–230 by CRISPR-SURF demarcated with a black segment below. Bottom: plot showing the fold enrichment of guides from the evoCDA screen. Negative control guides (20mers lacking a cytosine within the expected editing window, with a non-NGG PAM) are shown in grey. Deconvolution score track with the regions called as significant at positions 140–150 and 220–230 by CRISPR-SURF demarcated with a black segment below. b. CRISPR-Millipede analyses. From top to bottom: plot showing the effect sizes obtained for A>G (green) and T>C (purple) substitutions (covariates) at given positions in the sequence from the CRISPR-Millipede analysis of the alleles in the CD19 positive and CD19 negative sorted populations from the ABE8e-SpRY screen. Positive effect size indicates variants leading to lower CD19 expression. Track showing the posterior inclusion probabilities (PIP) for each of the covariates in the ABE8e-SpRY screen. Track showing the editing rate of A>G and T>C substitutions in the CD19 positive (red) and CD19 negative sorted populations (blue). Plot showing the effect sizes obtained for G>A (yellow) and C>T (blue) substitutions (covariates) at given positions in the sequence from the CRISPR-Millipede analysis of the alleles in the CD19 positive and CD19 negative sorted populations from the evoCDA screen. Track showing the posterior inclusion probabilities (PIP) for each of the covariate in the evoCDA screen. Track showing the editing rate of G>A and C>T substitutions in the CD19 positive (red) and CD19 negative populations (blue). Two regions with overlap of hits in CRISPR-SURF and CRISPR-Millipede are highlighted as pink (region 1) and light blue (region 2). c. Top: TF motifs found at sequences with regulatory potential (high betas). Motifs were filtered based on the expression of the cognate transcription factors in NALM6 cells (>25 CPM) and significant genome-wide CRISPR screen scores (>0.5 or <−0.5) for modulation of CD19 from a past study. Bottom: PhyloP scores for the CD19 enhancer show that CRISPR-Millipede hits are highly conserved. d. PyDESEQ2 analysis: Differentially enriched alleles in CD19 negative or high populations in both screens. Pink dots correspond to alleles containing identified CRISPR-Millipede hits in region 1 (151A>G in the ABE8e-SpRy screen and 154G>A in the evoCDA screen). Light blue dots correspond to alleles containing identified CRISPR-Millipede hits in region 2 (223A>G, 230A>G in the ABE8e-SpRy screen, 227T>C, 229T>C in the evoCDA screen). Representative alleles are labeled. e. CRISPR-Millipede visualizations from top to bottom: board plot highlighting estimated nucleotide level effects on region 2 (chr16:28930891–28930931). The visualization consists of a heatmap showing CRISPR-Millipede effect sizes (square color), PIP (square size) and WT nucleotide with circles for the ABE8e-SpRy screen. Top substitutions with high effect size and PIP include 223A>G and 230A>G. Board plot for the evoCDA screen. Top substitutions with high effect size and PIP include 227T>C and 229T>C. Track showing the reference sequence for region 2. Track showing recovered effect sizes for both screens as logo track. Tracks showing candidate TF motifs including SPIB, IKZF1, and PAX5. Tracks showing the editing rate of A>G and T>C and the C>T and G>A substitutions in the two screens (Red: CD19 positive, Blue: CD19 negative).
Figure 3:
Figure 3:. Targeting Millipede hits downregulates CD19 and provides resistance to aCD19-CAR T cells.
a. Map of the recovered hits by CRISPR-Millipede on the CD19 enhancer sequence, showing TF binding motifs and sgRNAs used for validation experiments. Editable nucleotides are shown in green. b. Validation using individual sgRNAs and flow cytometry. While sg119, targeting a neutral sequence within the enhancer showed no effect, editing with sgRNAs 145, 217, 218, 220, 223 and 225 results in downregulation of CD19 MFI compared to sgAAVS1. c. Millipede analysis using sg218 highlights nucleotides 220, 223, 224 and 230. d. Genomics tracks of the CD19 locus: the CD19 enhancer is occupied by IKZF1 and PAX5. e. Targeting PAX5 and SPIB, but not MYB, resulted in downregulation of CD19. f. Schematics of the CAR-T co-culture experiments. Wild-type, BFP+ NALM6 cells are mixed 1:1 with GFP+ NALM6 cells carrying edits at the CD19 enhancer and co-cultured with aCD19 CAR T cells or mock T cells. High GFP/BFP ratio indicates that editing facilitates resistance to aCD19-mediated killing. g. NALM6 cells edited with sg145 are resistant to aCD19 CAR T. h. sg218 provides a milder, yet significant, accumulation of aCD19-resistant cells. One-way ANOVA, replicates are shown in circles (n=3), ** = p<0.01; *** = p<0.001; **** = p<0.0001 (b, e). Multiple unpaired t test, replicates are shown in circles (n=8–10), **** = p<0.000001 (g, h).

References

    1. Nasser J, Bergman DT, Fulco CP, Guckelberger P, Doughty BR, Patwardhan TA, Jones TR, Nguyen TH, Ulirsch JC, Lekschas F, Mualim K, Natri HM, Weeks EM, Munson G, Kane M, Kang HY, Cui A, Ray JP, Eisenhaure TM, Collins RL, Dey K, Pfister H, Price AL, Epstein CB, Kundaje A, Xavier RJ, Daly MJ, Huang H, Finucane HK, Hacohen N, Lander ES, Engreitz JM. Genome-wide enhancer maps link risk variants to disease genes. Nature. 2021. May;593(7858):238–243. - PMC - PubMed
    1. Canver MC, Smith EC, Sher F, Pinello L, Sanjana NE, Shalem O, Chen DD, Schupp PG, Vinjamur DS, Garcia SP, Luc S, Kurita R, Nakamura Y, Fujiwara Y, Maeda T, Yuan GC, Zhang F, Orkin SH, Bauer DE. BCL11A enhancer dissection by Cas9-mediated in situ saturating mutagenesis. Nature. 2015. Nov 12;527(7577):192–197. - PMC - PubMed
    1. Gasperini M, Hill AJ, McFaline-Figueroa JL, Martin B, Kim S, Zhang MD, Jackson D, Leith A, Schreiber J, Noble WS, Trapnell C, Ahituv N, Shendure J. A genome-wide framework for mapping gene regulation via cellular genetic screens. Cell. Elsevier BV; 2019. Jan 10;176(1–2):377–390.e19. - PMC - PubMed
    1. Clement K, Hsu JY, Canver MC, Joung JK, Pinello L. Technologies and Computational Analysis Strategies for CRISPR Applications. Mol Cell. 2020. Jul 2;79(1):11–29. - PMC - PubMed
    1. Fulco CP, Nasser J, Jones TR, Munson G, Bergman DT, Subramanian V, Grossman SR, Anyoha R, Doughty BR, Patwardhan TA, Nguyen TH, Kane M, Perez EM, Durand NC, Lareau CA, Stamenova EK, Aiden EL, Lander ES, Engreitz JM. Activity-by-contact model of enhancer-promoter regulation from thousands of CRISPR perturbations. Nat Genet. 2019. Dec;51(12):1664–1669. - PMC - PubMed

Publication types

LinkOut - more resources