Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan;29(1):19-22.
doi: 10.1089/cmb.2021.0477. Epub 2022 Jan 5.

Single-Cell Multiomics Integration by SCOT

Affiliations

Single-Cell Multiomics Integration by SCOT

Pinar Demetci et al. J Comput Biol. 2022 Jan.

Abstract

Although the availability of various sequencing technologies allows us to capture different genome properties at single-cell resolution, with the exception of a few co-assaying technologies, applying different sequencing assays on the same single cell is impossible. Single-cell alignment using optimal transport (SCOT) is an unsupervised algorithm that addresses this limitation by using optimal transport to align single-cell multiomics data. First, it preserves the local geometry by constructing a k-nearest neighbor (k-NN) graph for each data set (or domain) to capture the intra-domain distances. SCOT then finds a probabilistic coupling matrix that minimizes the discrepancy between the intra-domain distance matrices. Finally, it uses the coupling matrix to project one single-cell data set onto another through barycentric projection, thus aligning them. SCOT requires tuning only two hyperparameters and is robust to the choice of one. Furthermore, the Gromov-Wasserstein distance in the algorithm can guide SCOT's hyperparameter tuning in a fully unsupervised setting when no orthogonal alignment information is available. Thus, SCOT is a fast and accurate alignment method that provides a heuristic for hyperparameter selection in a real-world unsupervised single-cell data alignment scenario. We provide a tutorial for SCOT and make its source code publicly available on GitHub.

Keywords: data integration; manifold alignment; multiomics; optimal transport; single-cell genomics.

PubMed Disclaimer

Conflict of interest statement

The authors declare they have competing financial interests.

Figures

FIG. 1.
FIG. 1.
Overview of the SCOT algorithm: SCOT takes in two data sets of single-cell genomics measurements in the form of count matrices to align them. For each data set, it first constructs k-nearest neighbor graphs based on correlations between samples and calculates the distance matrices capturing the intra-domain distances. Given these, it solves the Gromov-Wasserstein optimal transport formulation to find an ideal coupling (also known as “correspondence”) matrix, describing the probability of alignment between the samples across the two data sets. Finally, it completes the alignment by projecting the first domain onto the second one based on correspondence probabilities using barycentric projection. SCOT, single-cell alignment using optimal transport.

References

    1. Amodio, M., and Krishnaswamy, S.. 2018. MAGAN: Aligning biological manifolds. In International Conference on Machine Learning (ICML). Proceedings of Machine Learning Research (PMLR) 80, 215–223. Stockholm, Sweden.
    1. Cao, K., Bai, X., Hong, Y., et al. . 2020. Unsupervised topological alignment for single-cell multiomics integration. Bioinformatics 36 (Supplement-1), i48–i56. - PMC - PubMed
    1. Flamary, R., and Courty, N.. 2021. POT: Python optimal transport. Journal of Machine Learning Research (JMLR) 22, 1–8.
    1. Liu, J., Huang, Y., Singh, R., et al. . 2019. Jointly embedding multiple single-cell omics measurements. In 19th International Workshop on Algorithms in Bioinformatics (WAB). LIPICS 43, 10:1–10:13. - PMC - PubMed
    1. Stuart, T., Butler, A., Hoffman, P., et al. . 2019. Comprehensive integration of single-cell data. Cell 77, 1888–1902. - PMC - PubMed

Publication types

MeSH terms

LinkOut - more resources