Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jan 20;22(1):66.
doi: 10.1186/s12864-020-07358-4.

Systematic comparison of high-throughput single-cell RNA-seq methods for immune cell profiling

Affiliations

Systematic comparison of high-throughput single-cell RNA-seq methods for immune cell profiling

Tracy M Yamawaki et al. BMC Genomics. .

Abstract

Background: Elucidation of immune populations with single-cell RNA-seq has greatly benefited the field of immunology by deepening the characterization of immune heterogeneity and leading to the discovery of new subtypes. However, single-cell methods inherently suffer from limitations in the recovery of complete transcriptomes due to the prevalence of cellular and transcriptional dropout events. This issue is often compounded by limited sample availability and limited prior knowledge of heterogeneity, which can confound data interpretation.

Results: Here, we systematically benchmarked seven high-throughput single-cell RNA-seq methods. We prepared 21 libraries under identical conditions of a defined mixture of two human and two murine lymphocyte cell lines, simulating heterogeneity across immune-cell types and cell sizes. We evaluated methods by their cell recovery rate, library efficiency, sensitivity, and ability to recover expression signatures for each cell type. We observed higher mRNA detection sensitivity with the 10x Genomics 5' v1 and 3' v3 methods. We demonstrate that these methods have fewer dropout events, which facilitates the identification of differentially-expressed genes and improves the concordance of single-cell profiles to immune bulk RNA-seq signatures.

Conclusion: Overall, our characterization of immune cell mixtures provides useful metrics, which can guide selection of a high-throughput single-cell RNA-seq method for profiling more complex immune-cell heterogeneity usually found in vivo.

Keywords: High throughput sequencing; Immune-cell profiling; Single cell; Single-cell RNA-seq; Transcriptomics.

PubMed Disclaimer

Conflict of interest statement

The authors have read the journal’s policy and have the following conflicts: Tracy M. Yamawaki, Daniel R. Lu, Daniel C. Ellwanger, Dev Bhatt, Paolo Manzanillo, Vanessa, Arias, Hong Zhou, Oliver Homann, Songli Wang, and Chi-Ming Li are employees or contract workers at Amgen Inc. Oh Kyu Yoon was employed by Amgen Inc. while working on the study. All authors owned Amgen shares when the experiments were carried out. However, these do not alter the authors’ adherence to all journal policies on sharing data and materials.

Figures

Fig. 1
Fig. 1
Overview of high-throughput single-cell benchmarking experiments. Experiments were performed using four immune cell lines to benchmark cell recovery, transcript detection sensitivity, concordance to bulk RNA-seq and differentially-expressed gene identification
Fig. 2
Fig. 2
Library-pool and cell-capture efficiencies: a Cell capture efficiency was measured by the number of cell identifiers (CIDs) above the inflection point of the rank ordered reads/CID plot (knee plot) relative to the number of cells loaded on the instrument. Horizontal lines indicate theoretical capture efficiency based on bead/cell loading concentrations or manufacturer’s guidelines. b Library pool efficiency was measured by the number of reads in CIDs above the inflection point
Fig. 3
Fig. 3
Transcript detection sensitivity: a Distributions of unique molecular identifiers (UMIs) and b genes detected in EL4 cells by sample are plotted. c Numbers of UMIs or d genes detected versus numbers of reads per cell for each cell type are plotted. e Accumulated average numbers of genes detected from aggregated data of subsamples up to 50 cells are plotted. f Dropout modeling (dropout rate versus FPKM of bulk sequencing) for EL4 cells by method are shown. A left-shifted curve indicates higher sensitivity, that is, fewer dropouts at lower expression levels. Sensitivity of methods for EL4 cells ranked in the following order: 10x 3′ v3 > 10x 5′ v1 > 10x 3′ v2 > ddSEQ > Drop-seq > ICELL8 3′ DE > ICELL8 3′ DE-UMI. Cells with high mitochondrial expression rates were excluded from this calculation
Fig. 4
Fig. 4
Correlation to bulk RNA-seq: a Pearson correlation (r) of cell identifiers (CIDs) to bulk RNA-seq data using highly-expressed variable genes. Only r values above 0.2 were included in plot. b Average Pearson correlation using all genes for aggregated data of 50 subsamples of up to 50 cells are plotted
Fig. 5
Fig. 5
Differentially-expressed (DE) gene detection: a Fold change (FC) versus false discovery rate (FDR) calculated using a hurdle model (MAST) for mouse genes in EL4 vs IVA12 cells. Shown is a representative subsample of mouse cells (n=199) using the 10x 3′ v2 method demonstrating the criteria for declaring DE genes (FDR < 10− 4); DE genes are highlighted in red. b Number of significant DE genes calculated using MAST between EL4 and IVA12 cells by method. Error bars represent the 95% confidence interval. The total number of significant DE genes are plotted in red, the number of DE genes with > 1.5-fold difference in expression in bulk RNA-seq (5868 genes) are plotted in cyan. c Median bulk RNA-seq expression (FPKM) of all significant DE genes (red) or DE genes with > 1.5-fold difference (cyan). Error bars represent 95% confidence interval
Fig. 6
Fig. 6
Cell recovery by cell identifier (CID) thresholding: a Example of using the transposed log-log empirical cumulative density plot of the total counts of each CID to identify cell-containing droplets . Common thresholding points, the ‘knee’ and the ‘inflection’ are indicated with arrows. The knee is the point at which the signed curvature is minimized, the inflection is the point at which the first derivative is minimized. b The fraction of cells above the knee or inflection are plotted. c Fraction of cells below mitochondrial rate threshold (listed in Supplement Table 4) relative to knee point. Samples are colored by cell sample mixture listed in Supplement Table 2
Fig. 7
Fig. 7
Dropout rates by cell type: a Distribution of reads across cell types is plotted by method. b Dropout rate models for cell types are shown. c 10x 3′ v3 cells were binned by number of unique molecular identifiers (UMIs) and distributions of nUMIs for each cell type in each bin are plotted. d Gene Detection 50 (GD50) rates, expression level at 0.5 probability of the dropout model, are plotted for each cell type in 10x 3′ v3 experiments by bin. e Dropout models in each bin for EL4, IVA12 and Jurkat cells are plotted along with the model for TALL-104 cells in bin 1
Fig. 8
Fig. 8
mRNA capture variation across peripheral blood mononuclear cells (PBMCs). a Single-cell data generated with the 10x 3′ v3 and 10x 5′ v1 chemistries were projected onto an annotated PBMC CITE-Seq reference dataset. b Violin plots of log normalized expression of common immune-cell markers. c nGenes and d nUMIs detected for each cell class from each method. CTL-cytotoxic, TCM – T central memory, TEM – T effector memory, Treg – T regulatory cells, dnT – double negative T, gdT – gamma delta T, MAIT - Mucosal associated invariant T, pDC - Plasmacytoid dendritic cell, ASDC - AXL+ dendritic cell, cDC – classical dendritic cell, MC – monocyte, Eryth-Erythrocyte, ILC - Innate lymphoid cell, HSPC - Hematopoietic stem and progenitor cell

References

    1. Bandura DR, Baranov VI, Ornatsky OI, Antonov A, Kinach R, Lou X, Pavlov S, Vorobiev S, Dick JE, Tanner SD. Mass cytometry: technique for real time single cell multitarget immunoassay based on inductively coupled plasma time-of-flight mass spectrometry. Anal Chem. 2009;81(16):6813–6822. doi: 10.1021/ac901049w. - DOI - PubMed
    1. He H, Suryawanshi H, Morozov P, Gay-Mimbrera J, Del Duca E, Kim HJ, Kameyama N, Estrada Y, Der E, Krueger JG, et al. Single-cell transcriptome analysis of human skin identifies novel fibroblast subpopulation and enrichment of immune subsets in atopic dermatitis. J Allergy Clin Immunol. 2020;145(6):1615–28. - PubMed
    1. Sade-Feldman M, Yizhak K, Bjorgaard SL, Ray JP, de Boer CG, Jenkins RW, Lieb DJ, Chen JH, Frederick DT, Barzily-Rokni M, et al. Defining T cell states associated with response to checkpoint immunotherapy in melanoma. Cell. 2018;175(4):998–1013. doi: 10.1016/j.cell.2018.10.038. - DOI - PMC - PubMed
    1. Zhang Y, Zheng L, Zhang L, Hu X, Ren X, Zhang Z. Deep single-cell RNA sequencing data of individual T cells from treatment-naive colorectal cancer patients. Sci Data. 2019;6(1):131. doi: 10.1038/s41597-019-0131-5. - DOI - PMC - PubMed
    1. Zheng C, Zheng L, Yoo JK, Guo H, Zhang Y, Guo X, Kang B, Hu R, Huang JY, Zhang Q, et al. Landscape of infiltrating T cells in liver Cancer revealed by single-cell sequencing. Cell. 2017;169(7):1342–1356. doi: 10.1016/j.cell.2017.05.035. - DOI - PubMed