Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2020 Feb;17(2):137-145.
doi: 10.1038/s41592-019-0654-x. Epub 2019 Dec 2.

Orchestrating single-cell analysis with Bioconductor

Affiliations
Review

Orchestrating single-cell analysis with Bioconductor

Robert A Amezquita et al. Nat Methods. 2020 Feb.

Erratum in

Abstract

Recent technological advancements have enabled the profiling of a large number of genome-wide features in individual cells. However, single-cell data present unique challenges that require the development of specialized methods and software infrastructure to successfully derive biological insights. The Bioconductor project has rapidly grown to meet these demands, hosting community-developed open-source software distributed as R packages. Featuring state-of-the-art computational methods, standardized data infrastructure and interactive data visualization tools, we present an overview and online book (https://osca.bioconductor.org) of single-cell methods for prospective users.

PubMed Disclaimer

Conflict of interest statement

Competing Financial Interests

RG declares ownership in CellSpace Biosciences.

Figures

Figure 1:
Figure 1:. Number of Bioconductor packages for the analysis of high-throughput sequencing data over ten years.
Bioconductor software packages associated with the analysis of sequencing data were tracked by date of submission over the course of ten years. Software packages were uniquely defined by their primary sequencing technology association, with examples of specific terms used for annotation in parentheses.
Figure 2:
Figure 2:. Overview of the SingleCellExperiment class.
The SingleCellExperiment class instantiates an object (SingleCellExperiment herein abbreviated sce) capable of storing various datatypes associated with single-cell assays. A sce object is organized into components (e.g. rowData, assays, colData, reducedDims). In the assays component the rows represent features such as genes (horizontal pink bands), and the columns represent cells (vertical yellow band). The rowData and colData components can hold information (such as metadata) about the features and cells, respectively. Note that in the colData and reducedDims components, cells are represented as rows (horizontal yellow bands) and the number of columns in the assays component must match the number of rows in the colData and reducedDims components.
Figure 3:
Figure 3:. Bioconductor workflow for analyzing single-cell data.
A typical analytical workflow using Bioconductor leads to the creation and evolution of a SingleCellExperiment (or sce) object during data processing and downstream statistical analysis (left column). An example of a sce object evolving throughout the course of a workflow is shown, including visualization, analysis, and annotation (right column).
Figure 4:
Figure 4:. Select visualizations derived from various Bioconductor workflows.
Various visualizations associated with preprocessing (blue boxes) and downstream statistical analyses (orange boxes). The example data set used throughout was generated as part of the Human Cell Atlas [21]). Details on the generation of these figures are described in our online companion book (https://osca.bioconductor.org)

References

    1. Huber Wolfgang, Vincent J Carey Robert Gentleman, Anders Simon, Carlson Marc, Benilton S Carvalho Hector Corrada Bravo, Davis Sean, Gatto Laurent, Girke Thomas, Gottardo Raphael, Hahne Florian, Hansen Kasper D, Irizarry Rafael A, Lawrence Michael, Michael I Love, MacDonald James, Obenchain Valerie, Oleś Andrzej K, Pagès Hervé, Reyes Alejandro, Shannon Paul, Smyth Gordon K, Tenenbaum Dan, Waldron Levi, and Morgan Martin. Orchestrating high-throughput genomic analysis with Bioconductor. Nat Methods, 12(2):115–21, 02 2015. doi:10.1038/nmeth.3252. - DOI - PMC - PubMed
    1. Robinson Mark D, McCarthy Davis J, and Smyth Gordon K. edgeR: A Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics, 26(1):139–40, 2010. doi:10.1093/bioinformatics/btp616. URL https://bioconductor.org/packages/edgeR. - DOI - PMC - PubMed
    1. Lawrence Michael, Huber Wolfgang, Hervé Pagès Patrick Aboyoun, Carlson Marc, Gentleman Robert, Morgan Martin T, and Carey Vincent J. Software for computing and annotating genomic ranges. PLoS Comput Biol, 9(8):e1003118, 2013. doi:10.1371/journal.pcbi.1003118. URL https://bioconductor.org/packages/IRanges. - DOI - PMC - PubMed
    1. Aryee Martin J, Jaffe Andrew E, Corrada-Bravo Hector, Ladd-Acosta Christine, Feinberg Andrew P, Hansen Kasper D, and Irizarry Rafael A. Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics, 30(10):1363–9, 2014. doi:10.1093/bioinformatics/btu049. URL https://bioconductor.org/packages/minfi. - DOI - PMC - PubMed
    1. Love Michael I, Huber Wolfgang, and Anders Simon. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol, 15(12):550, 2014. doi:10.1186/s13059-014-0550-8. URL https://bioconductor.org/packages/DESeq2. - DOI - PMC - PubMed

Publication types