Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Oct;20(5):836-849.
doi: 10.1016/j.gpb.2022.11.013. Epub 2022 Dec 26.

Computational Methods for Single-cell Multi-omics Integration and Alignment

Affiliations
Review

Computational Methods for Single-cell Multi-omics Integration and Alignment

Stefan Stanojevic et al. Genomics Proteomics Bioinformatics. 2022 Oct.

Abstract

Recently developed technologies to generate single-cell genomic data have made a revolutionary impact in the field of biology. Multi-omics assays offer even greater opportunities to understand cellular states and biological processes. The problem of integrating different omics data with very different dimensionality and statistical properties remains, however, quite challenging. A growing body of computational tools is being developed for this task, leveraging ideas ranging from machine translation to the theory of networks, and represents another frontier on the interface of biology and data science. Our goal in this review is to provide a comprehensive, up-to-date survey of computational techniques for the integration of single-cell multi-omics data, while making the concepts behind each algorithm approachable to a non-expert audience.

Keywords: Integration; Machine learning; Multi-omics; Single-cell; Unsupervised learning.

PubMed Disclaimer

Conflict of interest statement

The authors have declared no competing interests.

Figures

Figure 1
Figure 1
Integration and alignment of multi-omics data Multi-omics data can sometimes be sequenced from the same set of single cells (left); at other times, only the data sequenced from the same/similar sample, but different single cells are available (right). In the former case, we have the task of integrating the different data modalities (left); in the latter case, we need to first identify similar cells across the samples (right). This is the computational task of alignment.
Figure 2
Figure 2
Single-cell multi-omics integration methods Illustration of some common integration approaches for single-cell multi-omics data: matrix factorization uncovering a representation of both cells and omics via factors (A), neural networks which combine different -omics into a single cell representation (B), and network-based approaches, which represent cells as nodes on the graphs connected to nearby cells (C).
Figure 3
Figure 3
Single-cell multi-omics alignment methods Illustration of some common approaches for alignment of multi-omics single-cell data: Bayesian methods, modeling the probability distribution of -omics measurements using a number of latent variables and updating such distributions using Bayes’ formula (A), manifold alignment methods uncovering a surface in the space of omics on which the alignment can be performed (B), and neural network-based models, creating latent representations of different -omics data, which can be more easily aligned (C).
Figure 4
Figure 4
Distance-based alignment Schematic overview of the distance-based alignment algorithm: cells are represented by nodes in two different graph representations, corresponding to two different omics assays. Cells with very similar omics measurements are connected to form graphs. Two graphs are then aligned in order to preserve a notion of distance on the graph.

References

    1. Macosko E.Z., Basu A., Satija R., Nemesh J., Shekhar K., Goldman M., et al. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell. 2015;161:1202–1214. - PMC - PubMed
    1. Klein A.M., Mazutis L., Akartuna I., Tallapragada N., Veres A., Li V., et al. Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells. Cell. 2015;161:1187–1201. - PMC - PubMed
    1. Zheng G.X.Y., Terry J.M., Belgrader P., Ryvkin P., Bent Z.W., Wilson R., et al. Massively parallel digital transcriptional profiling of single cells. Nat Commun. 2017;8:14049. - PMC - PubMed
    1. Stoeckius M., Hafemeister C., Stephenson W., Houck-Loomis B., Chattopadhyay P.K., Swerdlow H., et al. Simultaneous epitope and transcriptome measurement in single cells. Nat Methods. 2017;14:865–868. - PMC - PubMed
    1. Peterson V.M., Zhang K.X., Kumar N., Wong J., Li L., Wilson D.C., et al. Multiplexed quantification of proteins and transcripts in single cells. Nat Biotechnol. 2017;35:936–939. - PubMed

Publication types