CZ CELLxGENE Discover: a single-cell data platform for scalable exploration, analysis and modeling of aggregated data
- PMID: 39607691
- PMCID: PMC11701654
- DOI: 10.1093/nar/gkae1142
CZ CELLxGENE Discover: a single-cell data platform for scalable exploration, analysis and modeling of aggregated data
Abstract
Hundreds of millions of single cells have been analyzed using high-throughput transcriptomic methods. The cumulative knowledge within these datasets provides an exciting opportunity for unlocking insights into health and disease at the level of single cells. Meta-analyses that span diverse datasets building on recent advances in large language models and other machine-learning approaches pose exciting new directions to model and extract insight from single-cell data. Despite the promise of these and emerging analytical tools for analyzing large amounts of data, the sheer number of datasets, data models and accessibility remains a challenge. Here, we present CZ CELLxGENE Discover (cellxgene.cziscience.com), a data platform that provides curated and interoperable single-cell data. Available via a free-to-use online data portal, CZ CELLxGENE hosts a growing corpus of community-contributed data of over 93 million unique cells. Curated, standardized and associated with consistent cell-level metadata, this collection of single-cell transcriptomic data is the largest of its kind and growing rapidly via community contributions. A suite of tools and features enables accessibility and reusability of the data via both computational and visual interfaces to allow researchers to explore individual datasets, perform cross-corpus analysis, and run meta-analyses of tens of millions of cells across studies and tissues at the resolution of single cells.
© The Author(s) 2024. Published by Oxford University Press on behalf of Nucleic Acids Research.
Figures







Similar articles
-
CellDepot: A Unified Repository for scRNA-seq Data and Visual Exploration.J Mol Biol. 2022 Jun 15;434(11):167425. doi: 10.1016/j.jmb.2021.167425. Epub 2021 Dec 28. J Mol Biol. 2022. PMID: 34971674
-
Iterative single-cell multi-omic integration using online learning.Nat Biotechnol. 2021 Aug;39(8):1000-1007. doi: 10.1038/s41587-021-00867-x. Epub 2021 Apr 19. Nat Biotechnol. 2021. PMID: 33875866 Free PMC article.
-
HelPredictor models single-cell transcriptome to predict human embryo lineage allocation.Brief Bioinform. 2021 Nov 5;22(6):bbab196. doi: 10.1093/bib/bbab196. Brief Bioinform. 2021. PMID: 34037706
-
Unlocking immune-mediated disease mechanisms with transcriptomics.Biochem Soc Trans. 2021 Apr 30;49(2):705-714. doi: 10.1042/BST20200652. Biochem Soc Trans. 2021. PMID: 33843974 Free PMC article. Review.
-
The promise of single-cell RNA sequencing for kidney disease investigation.Kidney Int. 2017 Dec;92(6):1334-1342. doi: 10.1016/j.kint.2017.06.033. Epub 2017 Oct 12. Kidney Int. 2017. PMID: 28893418 Free PMC article. Review.
Cited by
-
Data-driven fine-grained region discovery in the mouse brain with transformers.bioRxiv [Preprint]. 2025 Feb 26:2024.05.05.592608. doi: 10.1101/2024.05.05.592608. bioRxiv. 2025. PMID: 38766132 Free PMC article. Preprint.
-
Rare Variant Analyses in Ancestrally Diverse Cohorts Reveal Novel ADHD Risk Genes.medRxiv [Preprint]. 2025 Jan 17:2025.01.14.25320294. doi: 10.1101/2025.01.14.25320294. medRxiv. 2025. PMID: 39867378 Free PMC article. Preprint.
-
Heat Shock Protein Family A Member 1A Attenuates Apoptosis and Oxidative Stress via ERK/JNK Pathway in Hyperplastic Prostate.MedComm (2020). 2025 Mar 10;6(3):e70129. doi: 10.1002/mco2.70129. eCollection 2025 Mar. MedComm (2020). 2025. PMID: 40066224 Free PMC article.
-
A general strategy for generating expert-guided, simplified views of ontologies.bioRxiv [Preprint]. 2024 Dec 17:2024.12.13.628309. doi: 10.1101/2024.12.13.628309. bioRxiv. 2024. PMID: 39763856 Free PMC article. Preprint.
-
Consequences of training data composition for deep learning models in single-cell biology.bioRxiv [Preprint]. 2025 Feb 24:2025.02.19.639127. doi: 10.1101/2025.02.19.639127. bioRxiv. 2025. PMID: 40060416 Free PMC article. Preprint.
References
MeSH terms
LinkOut - more resources
Full Text Sources