Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Jul;15(7):539-542.
doi: 10.1038/s41592-018-0033-z. Epub 2018 Jun 25.

SAVER: gene expression recovery for single-cell RNA sequencing

Affiliations

SAVER: gene expression recovery for single-cell RNA sequencing

Mo Huang et al. Nat Methods. 2018 Jul.

Abstract

In single-cell RNA sequencing (scRNA-seq) studies, only a small fraction of the transcripts present in each cell are sequenced. This leads to unreliable quantification of genes with low or moderate expression, which hinders downstream analysis. To address this challenge, we developed SAVER (single-cell analysis via expression recovery), an expression recovery method for unique molecule index (UMI)-based scRNA-seq data that borrows information across genes and cells to provide accurate expression estimates for all genes.

PubMed Disclaimer

Conflict of interest statement

Competing Financial Interests Statement

A.R. receives consulting income and A.R. and S.S. receive royalties related to Stellaris RNA FISH probes. All other authors declare no competing interests.

Figures

Figure 1
Figure 1
RNA FISH validation of SAVER results on Drop-seq data. (a) Overview of SAVER procedure. (b) Comparison of Gini coefficient for each gene between FISH and Drop-seq (left) and between FISH and SAVER recovered values (right) for n = 15 genes. (c) Kernel density estimates of cross-cell expression distribution of LMNA (upper) and CCNA2 (lower). (d) Scatterplots of expression levels between BABAM1 and LMNA. Pearson correlations were calculated across n = 17,095 cells for FISH and n = 8,498 cells for Drop-seq and SAVER.
Figure 2
Figure 2
Evaluation of SAVER by down-sampling and cell clustering. (a) Performance of algorithms measured by correlation with reference, on the gene level (left) and on the cell level (right). Number of genes and cells can be found in Supplementary Table 3. Box plots show the median (center line), interquartile range (hinges), and 1.5 times the interquartile range (whiskers); outlier data beyond this range are not shown. (b) Comparison of gene-to-gene (left) and cell-to-cell (right) correlation matrices of recovered values with the true correlation matrices, as measured by correlation matrix distance (CMD). (c) Differential expression (DE) analysis between CA1Pyr1 cells (n = 351) and CA1Py2 cells (n = 389) showing significant genes detected at FDR = 0.01 (left) and estimated FDR (right). (d) Cell clustering and t-SNE visualization of the Zeisel dataset (n = 1,799). Jaccard index of the down-sampled observed dataset and recovery methods as compared to the reference classification is shown. (e) t-SNE visualization of 7,387 mouse cortex cells for the observed data (left) and SAVER (right) colored by cell types determined by Hrvatin et al.

References

    1. Svensson V, et al. Power analysis of single-cell RNA-sequencing experiments. Nat Methods. 2017;14:381–387. - PMC - PubMed
    1. Kharchenko PV, Silberstein L, Scadden DT. Bayesian approach to single-cell differential expression analysis. Nat Methods. 2014;11:740–2. - PMC - PubMed
    1. Finak G, et al. MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data. Genome Biol. 2015;16:278. - PMC - PubMed
    1. Pierson E, Yau C. ZIFA: Dimensionality reduction for zero-inflated single-cell gene expression analysis. Genome Biol. 2015;16:241. - PMC - PubMed
    1. van Dijk D, et al. MAGIC: A diffusion-based imputation method reveals gene-gene interactions in single-cell RNA-sequencing data. bioRxiv. 2017

Methods-only References

    1. Satija Lab. Seurat – Guided Clustering Tutorial. at < http://satijalab.org/seurat/pbmc3k_tutorial.html>.
    1. Lun ATL, McCarthy DJ, Marioni JC. A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor. Bioconductor. at < https://bioconductor.org/help/workflows/simpleSingleCell/>. - PMC - PubMed
    1. Kiselev V, et al. Analysis of single cell RNA-seq data. at < https://hemberg-lab.github.io/scRNA.seq.course/index.html>.
    1. Wang J, et al. Gene Expression Distribution Deconvolution in Single Cell RNA Sequencing. bioRxiv. 2017:1–17. - PMC - PubMed
    1. Wagner F, Yan Y, Yanai I. K-nearest neighbor smoothing for single-cell RNA-Seq data. 2017

Publication types