Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 23;16(1):7891.
doi: 10.1038/s41467-025-63164-0.

Structure-guided engineering of type I-F CASTs for targeted gene insertion in human cells

Affiliations

Structure-guided engineering of type I-F CASTs for targeted gene insertion in human cells

George D Lampe et al. Nat Commun. .

Abstract

Conventional genome editing tools rely on DNA double-strand breaks (DSBs) and host recombination proteins to achieve large insertions, resulting in heterogeneous mixtures of undesirable outcomes. We recently leveraged a type I-F CRISPR-associated transposase, PseCAST, for DSB-free DNA integration in human cells, albeit at low efficiencies; multiple lines of evidence suggest DNA binding may be a bottleneck for higher efficiencies. Here we report structural determinants of DNA recognition by the PseCAST QCascade complex using single-particle cryogenic electron microscopy (cryoEM), revealing subtype-specific interactions and RNA-DNA heteroduplex features. By combining structural data, library screens, and rationally engineered mutants, we uncover variants with increased integration efficiencies and modified PAM stringencies. We further leverage transpososome structural predictions to build hybrid CASTs that combine orthogonal DNA binding and integration modules. Our work provides unique structural insights into type I-F CASTs and showcases diverse strategies to investigate and engineer RNA-guided transposase architectures for human genome editing applications.

PubMed Disclaimer

Conflict of interest statement

Competing interests: Columbia University has filed a patent application related to this work. G.D.L., A.R.L., D.J.Z., and S.H.S. are inventors on other patents and patent applications related to CRISPR–Cas systems and uses thereof (application numbers US20240279629, US20250163410, WO2020181264, WO2022261122, WO2022266492, WO2023245010, WO2024124048, WO2025029727, WO2025085782, WO2025085787). S.H.S. is a co-founder of and scientific advisor to Dahlia Biosciences, a scientific advisor to Prime Medicine and CrisprBits and an equity holder in Dahlia Biosciences and CrisprBits. The remaining authors declare no competing interests.

Figures

Fig. 1
Fig. 1. CryoEM structure of the TniQ-Cascade (QCascade) complex from PseCAST.
a Phylogenetic tree of type I-F CRISPR-associated transposons (CASTs), based on previous work in the lab. Systems with previously solved QCascade structures are marked with red arrows, while PseCAST is marked with a green arrow. Phylogenetic clades are colored. b Experimental design to investigate both DNA binding and overall integration activities for CAST systems in human cells. DNA binding is extrapolated from two different transcriptional activation assays, one in which VP64 is fused to Cas7 and one in which VP64 is fused to TnsC. Overall integration efficiencies are measured via amplicon sequencing. c Comparison of VchCAST and PseCAST across different assays in human cells. Although PseCAST exhibits consistently weak transcriptional activation compared to VchCAST, its absolute DNA integration activity at genomic target sites is approximately two orders of magnitude greater. DNA integration data is adapted from a previous publication. Data are shown as mean for n = 2 biologically independent samples for Cas7 and TnsC activation. Data are shown as mean ± s.d. for n = 3 biologically independent samples for DNA integration. Source data are provided as a Source Data file. d Operonic architecture of PseCAST components from the PseCAST transposon, with genes encoding the QCascade complex labeled accordingly. e Left, dominant reference-free 2D cryoEM class averages. Right, cryoEM densities with colored map regions corresponding to Cas8 (blue), Cas7 monomers 1–6 (light blue), Cas6 (purple), TniQ monomers 1–2 (orange, yellow), crRNA (gray), and target DNA (red) indicated. f Refined model for the Cas8 ɑ-helical domain and its positioning relative to the TniQ dimer interface.
Fig. 2
Fig. 2. The role of crRNA in the PAM-distal region of PseQCascade.
a Overall view of the cryoEM reconstruction of the PseCAST QCascade complex. b Magnified view of the dashed region in (a), highlighting the cryoEM density (colored and semi-transparent) for interactions between the indicated crRNA nucleotides and protein subunits. c Magnified view of the dashed regions in (b), highlighting interactions between the crRNA and Cas6 (left), TniQ.1 (middle), and both TniQ.2 and Cas7.6 (right). Key interacting residues are labeled. d Normalized RNA-guided DNA integration efficiency at the genomic AAVS1 locus in HEK293T cells, as measured by amplicon sequencing. The indicated alanine mutations were designed to perturb specific RNA-protein interactions highlighted in (c), and were compared to WT. NT, non-targeting crRNA. Data are shown as mean ± s.d. for n = 3 biologically independent samples. Source data are provided as a Source Data file. e Comparison of the crRNA conformation within the PAM-distal region, adjacent to the site of RNA hairpin stabilization by Cas6, for VchCAST (PDB: 6PIJ) and PseCAST (this study). The region around nucleotide G41 exhibits a distinct configuration for PseCAST, likely affecting the behavior of the adjacent TniQ dimer.
Fig. 3
Fig. 3. TniQ recruitment to the Cas6-Cas7.6 interface of Cascade requires hydrophobic and electrostatic interactions.
a Overall view of the PseCAST QCascade complex, oriented to highlight the TniQ dimer (dark/light orange). b Magnified view of the region indicated in (a), showing how TniQ.1 (dark orange) interacts with a hydrophobic cavity on Cas6. The two visual renderings are colored either by Cas6 surface (purple, top) or hydrophobicity (bottom). c Comparison of the hydrophobic interactions between TniQ.1 and Cas6 in PseCAST (left) and VchCAST (right, PDB: 6PIJ), with residues labeled. d Normalized RNA-guided DNA integration efficiency at the genomic AAVS1 locus in HEK293T cells, as measured by amplicon sequencing. The indicated arginine point mutations were designed to perturb TniQ.1-Cas6 hydrophobic interactions. NT, non-targeting crRNA. Source data are provided as a Source Data file. e Magnified views of hydrogen bonding (top) and electrostatic (bottom) interactions between Cas7.6 (blue) and TniQ.2 helix (yellow). f Normalized RNA-guided DNA integration efficiency at the genomic AAVS1 locus in HEK293T cells, as measured by amplicon sequencing. Alanine mutations perturbing Cas7.6-TniQ interactions are generally tolerated. Source data are provided as a Source Data file. Data in (d, f) are shown as mean ± s.d. for n = 3 biologically independent samples.
Fig. 4
Fig. 4. Structural and functional consequences of PAM and target DNA recognition by PseQCascade.
a Top, overall view of the PseCAST QCascade complex, oriented to highlight the target DNA recognition. Bottom, Magnified view of the experimental cryoEM density map around Cas7.1 and Cas7.2, showing interactions with the crRNA (gray) and DNA target strand (TS, red). NTS, DNA non-target strand. b Magnified views of the PAM binding pocket, with Cas8 and DNA shown in blue and red, respectively. Residues A243 and A244 lack any base-specific, hydrogen-bonding interactions with the DNA. c Normalized DNA integration efficiency at the genomic AAVS1 locus in HEK293T cells for the indicated Cas8 mutants (top), plotted above the WebLogo for PAM preferences in the –1 and -2 positions (bottom) derived from integration into pTarget. (For additional PAM specificity data, see Supplementary Fig. 7e.) Integration efficiency data are shown as mean ± s.d. for n = 3 biologically independent samples. Source data are provided as a Source Data file. d Overlay of the refined atomic model and cryoEM density (semi-transparent) for the seed region of QCascade bound to the DNA target strand. e Schematic representation showing angles for the first five RNA-DNA base pairs (BP 1–5) within the R-loop. f View of the RNA-DNA heteroduplex at right, highlighting the unfavorable base-pairing surrounding flipped out nucleobases within the first 18 base pairs of the R-loop. g Magnified view of the RNA-DNA heteroduplex segments aligned at the flipped out base pair, revealing consistent unfavorable angles at the adjacent base pairs. h Normalized RNA-guided DNA integration efficiency at the genomic AAVS1 locus in HEK293T cells for the indicated Cas7 mutations, as measured by amplicon sequencing. Data are shown as mean ± s.d. for n = 3 biologically independent samples. Source data are provided as a Source Data file.
Fig. 5
Fig. 5. AlphaFold-guided engineering of TnsABC to generate chimeric CAST systems.
a Schematic showing the approach to generate a chimeric CAST system by combining optimal DNA targeting and DNA integration machineries from distinct CAST systems. b AlphaFold-generated structure prediction of the TnsABC co-complex from PseCAST. The C-terminal ‘hook’ region of TnsB that putatively interacts with TnsC is marked. c Visualization of select TnsB graft points within the predicted PseTnsABC structure. Residues where Pse-Vch chimerism was introduced are colored in blue, and the three top performing graft points (V585, S589, Q594; PseTnsB numbering) from panel (e) are labeled. d Experimental workflow to test chimeric TnsAB constructs for RNA-guided DNA integration activity. E. coli BL21(DE3) cells containing a pEffector encoding VchQCascade and VchTnsC were transformed with a plasmid encoding a mini-transposon (mini-Tn) and TnsAB, with TnsAB derived from either VchCAST, PseCAST, or a chimeric combination thereof. Integration efficiency was measured by qPCR (bottom). e E. coli DNA integration efficiencies for each tested TnsAB chimera. The amino acid listed represents the position at which the reading frame was grafted from PseTnsB (red) to VchTnsB (blue). “Custom” denotes a variant in which multiple different VchTnsB sequences were substituted (see Supplementary Data 3 for details). Source data are provided as a Source Data file. Data are shown as mean for n = 2 biologically independent samples.

Update of

Similar articles

References

    1. Branzei, D. & Foiani, M. Regulation of DNA repair throughout the cell cycle. Nat. Rev. Mol. Cell Biol.9, 297–308 (2008). - PubMed
    1. Heyer, W.-D., Ehmsen, K. T. & Liu, J. RegulAtion Of Homologous Recombination In Eukaryotes. Annu. Rev. Genet.44, 113–139 (2010). - PMC - PubMed
    1. Pawelczak, K. S., Gavande, N. S., VanderVere-Carozza, P. S. & Turchi, J. J. Modulating DNA repair pathways to improve precision genome engineering. ACS Chem. Biol.13, 389–396 (2018). - PubMed
    1. Kanca, O. et al. An efficient CRISPR-based strategy to insert small and large fragments of DNA using short homology arms. eLife8, e51539 (2019). - PMC - PubMed
    1. Zuccaro, M. V. et al. Allele-specific chromosome removal after Cas9 cleavage in human embryos. Cell183, 1–15 (2020). - PubMed

LinkOut - more resources