scJoint integrates atlas-scale single-cell RNA-seq and ATAC-seq data with transfer learning
- PMID: 35058621
- PMCID: PMC9186323
- DOI: 10.1038/s41587-021-01161-6
Abstract
Single-cell multiomics data continues to grow at an unprecedented pace. Although several methods have demonstrated promising results in integrating multiple data modalities from the same tissue, the complexity and scale of data compositions present in cell atlases still pose a challenge. Here, we present scJoint, a transfer learning method to integrate atlas-scale, heterogeneous collections of scRNA-seq and scATAC-seq data. scJoint leverages information from annotated scRNA-seq data in a semisupervised framework and uses a neural network to train on labeled and unlabeled data simultaneously, allowing label transfer and joint visualization in an integrative framework. Using atlas data as well as multimodal datasets generated with ASAP-seq and CITE-seq, we demonstrate that scJoint is computationally efficient and consistently achieves substantially higher cell-type label accuracy than existing methods while providing meaningful joint visualizations. Thus, scJoint overcomes the heterogeneity of different data modalities to enable a more comprehensive understanding of cellular phenotypes.
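The abstract does not say how labels move from annotated scRNA-seq cells to unannotated scATAC-seq cells, so the sketch below is only a generic illustration of label transfer and of cell-type label accuracy, not scJoint's implementation. It assumes that both modalities have already been embedded in a shared low-dimensional space, and it swaps in a plain k-nearest-neighbour majority vote. The function names `knn_label_transfer` and `label_accuracy`, the value of `k`, and the toy coordinates are all hypothetical.

```python
import numpy as np


def knn_label_transfer(ref_emb, ref_labels, query_emb, k=5):
    """Give each query cell the majority label of its k nearest reference cells.

    Assumption: ref_emb (labeled, e.g. scRNA-seq) and query_emb (unlabeled,
    e.g. scATAC-seq) are rows of coordinates in the same embedding space.
    """
    ref_emb = np.asarray(ref_emb, dtype=float)
    query_emb = np.asarray(query_emb, dtype=float)
    ref_labels = np.asarray(ref_labels)

    # Squared Euclidean distance between every query cell and every reference cell.
    diff = query_emb[:, None, :] - ref_emb[None, :, :]
    dist2 = (diff ** 2).sum(axis=2)  # shape: (n_query, n_ref)

    # Indices of the k closest reference cells for each query cell.
    nearest = np.argsort(dist2, axis=1)[:, :k]

    predicted = []
    for row in nearest:
        values, counts = np.unique(ref_labels[row], return_counts=True)
        predicted.append(values[np.argmax(counts)])  # majority vote
    return np.array(predicted)


def label_accuracy(predicted, truth):
    """Fraction of cells whose transferred label matches the true label."""
    return float(np.mean(np.asarray(predicted) == np.asarray(truth)))


# Toy data (made up): two well-separated cell types in a 2-D embedding.
ref_emb_demo = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1],
                [5.0, 5.0], [5.1, 5.0], [5.0, 5.1]]
ref_labels_demo = ["T cell"] * 3 + ["B cell"] * 3
query_emb_demo = [[0.05, 0.05], [4.9, 5.0]]

transferred = knn_label_transfer(ref_emb_demo, ref_labels_demo, query_emb_demo, k=3)
accuracy = label_accuracy(transferred, ["T cell", "B cell"])
```

In this toy setting each query cell sits inside one cluster of reference cells, so the vote is unanimous. Real atlas data are far more heterogeneous, which is the difficulty the abstract describes.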
© 2022. The Author(s), under exclusive licence to Springer Nature America, Inc.
Conflict of interest statement
The authors declare no competing interests.
