QOT: Quantized Optimal Transport for sample-level distance matrix in single-cell omics
- PMID: 39808114
- PMCID: PMC11962597
- DOI: 10.1093/bib/bbae713
QOT: Quantized Optimal Transport for sample-level distance matrix in single-cell omics
Abstract
Single-cell technologies have enabled the high-dimensional characterization of cell populations at an unprecedented scale. The innate complexity and increasing volume of data pose significant computational and analytical challenges, especially in comparative studies delineating cellular architectures across various biological conditions (i.e. generation of sample-level distance matrices). Optimal Transport is a mathematical tool that captures the intrinsic structure of data geometrically and has been applied to many bioinformatics tasks. In this paper, we propose QOT (Quantized Optimal Transport), a new method enabling efficient computation of sample-level distance matrix from large-scale single-cell omics data through a quantization step. We apply our algorithm to real-world single-cell genomics and pathomics datasets, aiming to extrapolate cell-level insights to inform sample-level categorizations. Our empirical study shows that QOT outperforms existing two OT-based algorithms in accuracy and robustness when obtaining a distance matrix from high throughput single-cell measures at the sample level. Moreover, the sample level distance matrix could be used in the downstream analysis (i.e. uncover the trajectory of disease progression), highlighting its usage in biomedical informatics and data science.
Keywords: Gaussian Mixture Model; Wasserstein distance; optimal transport; quantization; single-cell genomics.
© The Author(s) 2025. Published by Oxford University Press.
Figures







Update of
-
QOT: Efficient Computation of Sample Level Distance Matrix from Single-Cell Omics Data through Quantized Optimal Transport.bioRxiv [Preprint]. 2024 Feb 6:2024.02.06.578032. doi: 10.1101/2024.02.06.578032. bioRxiv. 2024. Update in: Brief Bioinform. 2024 Nov 22;26(1):bbae713. doi: 10.1093/bib/bbae713. PMID: 38370767 Free PMC article. Updated. Preprint.
Similar articles
-
QOT: Efficient Computation of Sample Level Distance Matrix from Single-Cell Omics Data through Quantized Optimal Transport.bioRxiv [Preprint]. 2024 Feb 6:2024.02.06.578032. doi: 10.1101/2024.02.06.578032. bioRxiv. 2024. Update in: Brief Bioinform. 2024 Nov 22;26(1):bbae713. doi: 10.1093/bib/bbae713. PMID: 38370767 Free PMC article. Updated. Preprint.
-
Detection of PatIent-Level distances from single cell genomics and pathomics data with Optimal Transport (PILOT).Mol Syst Biol. 2024 Feb;20(2):57-74. doi: 10.1038/s44320-023-00003-8. Epub 2023 Dec 19. Mol Syst Biol. 2024. PMID: 38177382 Free PMC article.
-
scMFG: a single-cell multi-omics integration method based on feature grouping.BMC Genomics. 2025 Feb 11;26(1):132. doi: 10.1186/s12864-025-11319-0. BMC Genomics. 2025. PMID: 39934664 Free PMC article.
-
Computational modelling in single-cell cancer genomics: methods and future directions.Phys Biol. 2020 Sep 19;17(6):061001. doi: 10.1088/1478-3975/abacfe. Phys Biol. 2020. PMID: 32759485 Review.
-
Single-cell omics: Overview, analysis, and application in biomedical science.J Cell Biochem. 2021 Nov;122(11):1571-1578. doi: 10.1002/jcb.30134. Epub 2021 Aug 30. J Cell Biochem. 2021. PMID: 34459502 Review.
References
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources