Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jun 7;20(6):e1012214.
doi: 10.1371/journal.pcbi.1012214. eCollection 2024 Jun.

Strong association between genomic 3D structure and CRISPR cleavage efficiency

Affiliations

Strong association between genomic 3D structure and CRISPR cleavage efficiency

Shaked Bergman et al. PLoS Comput Biol. .

Abstract

CRISPR is a gene editing technology which enables precise in-vivo genome editing; but its potential is hampered by its relatively low specificity and sensitivity. Improving CRISPR's on-target and off-target effects requires a better understanding of its mechanism and determinants. Here we demonstrate, for the first time, the chromosomal 3D spatial structure's association with CRISPR's cleavage efficiency, and its predictive capabilities. We used high-resolution Hi-C data to estimate the 3D distance between different regions in the human genome and utilized these spatial properties to generate 3D-based features, characterizing each region's density. We evaluated these features based on empirical, in-vivo CRISPR efficiency data and compared them to 425 features used in state-of-the-art models. The 3D features ranked in the top 13% of the features, and significantly improved the predictive power of LASSO and xgboost models trained with these features. The features indicated that sites with lower spatial density demonstrated higher efficiency. Understanding how CRISPR is affected by the 3D DNA structure provides insight into CRISPR's mechanism in general and improves our ability to correctly predict CRISPR's cleavage as well as design sgRNAs for therapeutic and scientific use.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. The general structure of the study.
See details in the main text.
Fig 2
Fig 2. 3D features perform comparably to state-of-the-art features and encode new information.
(A) Boxplots of Pearson correlations (absolute values) between predictive features and CRISPR efficiency in the Leenay, TTISS and GUIDE-Seq datasets. Blue: 420 sequence features; orange: 5 thermodynamic features; green: 9 3D features. (B-C) Bar charts of Pearson correlations (blue) and partial correlations (orange) with CRISPR efficiency, in the Leenay, TTISS and GUIDE-Seq datasets. The partial correlation of each vector controls for all other vectors in the dataset’s subplot. (B) Epigenetic features and 3D feature (correlations appear in absolute value); (C) State-of-the-art models and 3D feature.
Fig 3
Fig 3. 3D features significantly improve CRISPR prediction.
Histograms of the correlations between measured and predicted efficiency in the Leenay and TTISS datasets, when testing LASSO and xgboost models. The blue/orange histogram indicates the model with/without the 3D feature and the average correlation is marked with a solid/dashed line, respectively. p-values were calculated using Wilcoxon’s signed rank test. (A) Models using all 425 classic features; (B) Models using the top 30 features, based on Pearson correlation with CRISPR efficiency.
Fig 4
Fig 4. 3D features rank highly compared to classic features.
Ranks of features used in the LASSO and xgboost models, averaged over the 1000 iterations; a higher rank indicates a more important feature. The 3D feature and classic features are marked with an orange/blue dot, respectively. (A) Number of times each feature was selected in the LASSO models. (B) Feature rank based on LASSO permutation importance. (C) Feature rank based on SHAP values of the xgboost models. (D) Feature rank based on xgboost permutation importance. (E) Feature rank based on xgboost “gain” importance.
Fig 5
Fig 5. 3D features improve prediction in non-matching cell types.
(A) Correlations between 3D features, generated from Hi-C in T/HEK293/GM12878 cells, and CRISPR efficiency. (B-C) Blue/orange/green bars represent 3D features generated from Hi-C in T/HEK293/GM12878 cells, respectively. (B) Relative change in xgboost r2 before and after adding a 3D feature to the model. (C) Rank of the 3D features relative to the other features, based on their average SHAP rank in the 1000 xgboost iterations; a higher rank is better. The dashed line represents the theoretical rank of a feature ranked first in all 1000 models.
Fig 6
Fig 6. Hypotheses explaining the inverse relation between genomic 3D density and CRISPR efficiency.
Unsuitable target sites are colored in orange, and suitable, fully complementary target sites are colored in green. pH/pL is the probability of binding to the suitable target site in the high-density/low-density region, respectively. (A) A Cas9-sgRNA complex searching for a target site in a dense region (left) encounters many unsuitable target sites; thus, the probability of finding the suitable target site in the dense region is lower than in a low-density region (right), where significantly fewer potential target sites exist. (B) Target sites in high-density regions (left) are less accessible physically to the Cas9-sgRNA complex, relative to low-density regions (right), because of DNA packing in the region.

Similar articles

Cited by

References

    1. Doudna JA, Charpentier E. The new frontier of genome engineering with CRISPR-Cas9. Science (80-) [Internet]. 2014. Nov 28;346(6213):1258096. Available from: http://science.sciencemag.org/content/346/6213/1258096.abstract - PubMed
    1. Li H, Yang Y, Hong W, Huang M, Wu M, Zhao X. Applications of genome editing technology in the targeted therapy of human diseases: mechanisms, advances and prospects. Signal Transduct Target Ther [Internet]. 2020;5(1):1. Available from: doi: 10.1038/s41392-019-0089-y - DOI - PMC - PubMed
    1. Pickar-Oliver A, Gersbach CA. The next generation of CRISPR–Cas technologies and applications. Nat Rev Mol Cell Biol [Internet]. 2019;20(8):490–507. Available from: doi: 10.1038/s41580-019-0131-5 - DOI - PMC - PubMed
    1. Moreno-Mateos MA, Vejnar CE, Beaudoin J-D, Fernandez JP, Mis EK, Khokha MK, et al.. CRISPRscan: designing highly efficient sgRNAs for CRISPR-Cas9 targeting in vivo. Nat Methods [Internet]. 2015;12(10):982–8. Available from: doi: 10.1038/nmeth.3543 - DOI - PMC - PubMed
    1. Singh R, Kuscu C, Quinlan A, Qi Y, Adli M. Cas9-chromatin binding information enables more accurate CRISPR off-target prediction. Nucleic Acids Res [Internet]. 2015. Oct 15;43(18):e118–e118. Available from: doi: 10.1093/nar/gkv575 - DOI - PMC - PubMed