scooby: modeling multimodal genomic profiles from DNA sequence at single-cell resolution
- PMID: 41125796
- PMCID: PMC12615262
- DOI: 10.1038/s41592-025-02854-5
scooby: modeling multimodal genomic profiles from DNA sequence at single-cell resolution
Abstract
Understanding how regulatory sequences shape gene expression across individual cells is a fundamental challenge in genomics. Joint RNA sequencing and epigenomic profiling provides opportunities to build models capturing sequence determinants across steps of gene expression. However, current models, developed primarily for bulk omics data, fail to capture the cellular heterogeneity and dynamic processes revealed by single-cell multimodal technologies. Here, we introduce scooby, a framework to model genomic profiles of single-cell RNA-sequencing coverage and single-cell assay for transposase-accessible chromatin using sequencing insertions from sequence at single-cell resolution. For this, we leverage the pretrained multiomics profile predictor Borzoi and equip it with a cell-specific decoder. Scooby recapitulates cell-specific expression levels of held-out genes and identifies regulators and their putative target genes. Moreover, scooby allows resolving single-cell effects of bulk expression quantitative trait loci and delineating their impact on chromatin accessibility and gene expression. We anticipate scooby to aid unraveling the complexities of gene regulation at the resolution of individual cells.
© 2025. The Author(s).
Conflict of interest statement
Competing interests: J.D.B. holds patents related to ATAC-seq and is an SAB member of Camp4 and seqWell. F.J.T. consults for Immunai, Singularity Bio, CytoReason and Omniscope, and has ownership interest in Dermagnostix and Cellarity. The other authors declare no competing interests.
Figures
Update of
-
scooby: Modeling multi-modal genomic profiles from DNA sequence at single-cell resolution.bioRxiv [Preprint]. 2025 Jul 16:2024.09.19.613754. doi: 10.1101/2024.09.19.613754. bioRxiv. 2025. Update in: Nat Methods. 2025 Nov;22(11):2275-2285. doi: 10.1038/s41592-025-02854-5. PMID: 39345504 Free PMC article. Updated. Preprint.
References
-
- Sasse, A., Chikina, M. & Mostafavi, S. Unlocking gene regulation with sequence-to-function models. Nat. Methods21, 1374–1377 (2024). - PubMed
-
- Agarwal, V. & Shendure, J. Predicting mRNA abundance directly from genomic sequence using deep convolutional neural networks. Cell Rep.31, 107663 (2020). - PubMed
-
- Alipanahi, B., Delong, A., Weirauch, M. T. & Frey, B. J. Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning. Nat. Biotechnol.33, 831–838 (2015). - PubMed
MeSH terms
Substances
Grants and funding
- 101054957/EC | EU Framework Programme for Research and Innovation H2020 | H2020 Priority Excellent Science | H2020 European Research Council (H2020 Excellent Science - European Research Council)
- UM1 HG011986/HG/NHGRI NIH HHS/United States
- 101118521/EC | EU Framework Programme for Research and Innovation H2020 | H2020 Priority Excellent Science | H2020 European Research Council (H2020 Excellent Science - European Research Council)
- 403584255/Deutsche Forschungsgemeinschaft (German Research Foundation)
- DP2 HL151353/HL/NHLBI NIH HHS/United States
LinkOut - more resources
Full Text Sources
