Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Feb 1;40(2):btae053.
doi: 10.1093/bioinformatics/btae053.

Scbean: a python library for single-cell multi-omics data analysis

Affiliations

Scbean: a python library for single-cell multi-omics data analysis

Haohui Zhang et al. Bioinformatics. .

Abstract

Summary: Single-cell multi-omics technologies provide a unique platform for characterizing cell states and reconstructing developmental process by simultaneously quantifying and integrating molecular signatures across various modalities, including genome, transcriptome, epigenome, and other omics layers. However, there is still an urgent unmet need for novel computational tools in this nascent field, which are critical for both effective and efficient interrogation of functionality across different omics modalities. Scbean represents a user-friendly Python library, designed to seamlessly incorporate a diverse array of models for the examination of single-cell data, encompassing both paired and unpaired multi-omics data. The library offers uniform and straightforward interfaces for tasks, such as dimensionality reduction, batch effect elimination, cell label transfer from well-annotated scRNA-seq data to scATAC-seq data, and the identification of spatially variable genes. Moreover, Scbean's models are engineered to harness the computational power of GPU acceleration through Tensorflow, rendering them capable of effortlessly handling datasets comprising millions of cells.

Availability and implementation: Scbean is released on the Python Package Index (PyPI) (https://pypi.org/project/scbean/) and GitHub (https://github.com/jhu99/scbean) under the MIT license. The documentation and example code can be found at https://scbean.readthedocs.io/en/latest/.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1.
Figure 1.
Overview of a multi-omics dataset pipeline employing Scbean. (A) Data collection, preprocessing, and loading data into APIs in Scbean. (B) The latent embedding obtained through the integration process can be leveraged for various subsequent analyses, including identification of marker genes, batch normalization, clustering, and trajectory analysis. Furthermore, results generated by VISGP can be instrumental not only in identifying spatially variable genes but also in elucidating the tissue’s structural anatomy.
Figure 2.
Figure 2.
Four APIs of SCBEAN. DAVAE supports integration of scRNA-seq, scATAC-seq, and spatial transcriptomics based on domain-adversarial and variational approximation. VIPCCA supports integration of unpaired single-cell multi-omics data, differential gene expression analysis based on non-linear canonical correlation analysis. VIMCCA supports joint-analysis of paired multi-modal single-cell datasets based on a multi-view latent variable model. VISGP supports the discovery of spatially variable genes exhibiting distinct expression patterns in spatial transcriptomic data.

References

    1. Cao K, Gong Q, Hong Y. et al. A unified computational framework for single-cell data integration with optimal transport. Nat Commun 2022;13:7419. - PMC - PubMed
    1. Cao Z-J, Gao G.. Multi-omics single-cell data integration and regulatory inference with graph-linked embedding. Nat Biotechnol 2022;40:1458–66. - PMC - PubMed
    1. Gao C, Liu J, Kriebel AR. et al. Iterative single-cell multi-omic integration using online learning. Nat Biotechnol 2021;39:1000–7. - PMC - PubMed
    1. Gayoso A, Steier Z, Lopez R. et al. Joint probabilistic modeling of single-cell multi-omic data with totalVI. Nat Methods 2021;18:272–82. - PMC - PubMed
    1. Gayoso A, Lopez R, Xing G. et al. A python library for probabilistic analysis of single-cell omics data. Nat Biotechnol 2022;40:163–6. - PubMed

Publication types