Comprehensive Integration of Single-Cell Data
- PMID: 31178118
- PMCID: PMC6687398
- DOI: 10.1016/j.cell.2019.05.031
Comprehensive Integration of Single-Cell Data
Abstract
Single-cell transcriptomics has transformed our ability to characterize cell states, but deep biological understanding requires more than a taxonomic listing of clusters. As new methods arise to measure distinct cellular modalities, a key analytical challenge is to integrate these datasets to better understand cellular identity and function. Here, we develop a strategy to "anchor" diverse datasets together, enabling us to integrate single-cell measurements not only across scRNA-seq technologies, but also across different modalities. After demonstrating improvement over existing methods for integrating scRNA-seq data, we anchor scRNA-seq experiments with scATAC-seq to explore chromatin differences in closely related interneuron subsets and project protein expression measurements onto a bone marrow atlas to characterize lymphocyte populations. Lastly, we harmonize in situ gene expression and scRNA-seq datasets, allowing transcriptome-wide imputation of spatial gene expression patterns. Our work presents a strategy for the assembly of harmonized references and transfer of information across datasets.
Keywords: integration; multi-modal; scATAC-seq; scRNA-seq; single cell; single-cell ATAC sequencing; single-cell RNA sequencing.
Copyright © 2019 Elsevier Inc. All rights reserved.
Conflict of interest statement
Declaration of interests
The authors declare no competing interests.
Figures
Comment in
-
Integration of Single-Cell Genomics Datasets.Cell. 2019 Jun 13;177(7):1677-1679. doi: 10.1016/j.cell.2019.05.034. Cell. 2019. PMID: 31199914
References
-
- Achim K, Pettit J-B, Saraiva LR, Gavriouchkina D, Larsson T, Arendt D, Marioni JC, 2015. High-throughput spatial mapping of single-cell RNA-seq data to tissue of origin. Nature Biotechnology 33 (5), 503–509. URL http://www.nature.com/doifinder/10.1038/nbt.3209 - DOI - PubMed
-
- Arya S, Mount D, Kemp SE, Jefferis G, 2018. RANN: Fast Nearest Neighbour Search (Wraps ANN Library) Using L2 Metric . R package version 2.6. URL https://CRAN.R-project.org/package=RANN
-
- Baglama J, Reichel L, Lewis BW, 2018. irlba: Fast Truncated Singular Value Decomposition and Principal Components Analysis for Large Dense and Sparse Matrices. R package version 2.3.2. URL https://CRAN.R-project.org/package=irlba
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Research Materials
