Cell Syst. 2024 Oct 16;15(10):982-990.e5.
doi: 10.1016/j.cels.2024.09.003. Epub 2024 Oct 3.

Automated single-cell omics end-to-end framework with data-driven batch inference

Yuan Wang et al. Cell Syst. 2024.

Abstract

To facilitate single-cell multi-omics analysis and improve reproducibility, we present single-cell pipeline for end-to-end data integration (SPEEDI), a fully automated end-to-end framework for batch inference, data integration, and cell-type labeling. SPEEDI introduces data-driven batch inference and transforms the often heterogeneous data matrices obtained from different samples into a uniformly annotated and integrated dataset. Without requiring user input, it automatically selects parameters and executes pre-processing, sample integration, and cell-type mapping. It can also perform downstream analyses of differential signals between treatment conditions and gene functional modules. SPEEDI's data-driven batch-inference method works with widely used integration and cell-typing tools. By developing data-driven batch inference, providing full end-to-end automation, and eliminating parameter selection, SPEEDI improves reproducibility and lowers the barrier to obtaining biological insight from these valuable single-cell datasets. The SPEEDI interactive web application can be accessed at https://speedi.princeton.edu/. A record of this paper's transparent peer review process is included in the supplemental information.

Keywords: batch identification; cell-type mapping; information theory; integration; scATAC-seq; scRNA-seq; single-cell genomics.
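The abstract does not spell out how SPEEDI infers batches, so the following is not the authors' method. It is only a minimal sketch of one generic information-theoretic idea that the keywords hint at: scoring how evenly samples mix within each cluster using normalized Shannon entropy. The function name `sample_mixing_entropy`, the toy labels, and the scoring scheme itself are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def sample_mixing_entropy(cluster_labels, sample_labels):
    """Hypothetical helper: per-cluster normalized Shannon entropy of sample membership.

    A score near 1 means the samples are evenly mixed within the cluster.
    A score near 0 means the cluster is dominated by one sample, which can
    hint at a sample-specific (possibly batch-driven) effect.
    """
    n_samples = len(set(sample_labels))
    per_cluster = defaultdict(Counter)
    for cluster, sample in zip(cluster_labels, sample_labels):
        per_cluster[cluster][sample] += 1

    scores = {}
    for cluster, counts in per_cluster.items():
        total = sum(counts.values())
        entropy = -sum((c / total) * math.log(c / total) for c in counts.values())
        # Normalize by the maximum possible entropy, log(number of samples).
        scores[cluster] = entropy / math.log(n_samples) if n_samples > 1 else 0.0
    return scores


# Toy data (assumed): cluster "a" mixes two samples evenly, cluster "b" holds one sample only.
clusters = ["a", "a", "a", "a", "b", "b", "b", "b"]
samples = ["s1", "s2", "s1", "s2", "s1", "s1", "s1", "s1"]
scores = sample_mixing_entropy(clusters, samples)
```

In this toy case the evenly mixed cluster scores close to 1 and the single-sample cluster scores 0. A real analysis would operate on clusters computed from expression or accessibility data and would need to account for biological differences between samples, which this sketch ignores.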

Conflict of interest statement

Declaration of interests: S.C.S. is a consultant, equity owner, and interim chief scientific officer at GNOMX Corp. Patents were filed related to this work. O.G.T. is on the advisory board of Cell Systems.
