moSCminer: a cell subtype classification framework based on the attention neural network integrating the single-cell multi-omics dataset on the cloud
- PMID: 38426141
- PMCID: PMC10903350
- DOI: 10.7717/peerj.17006
moSCminer: a cell subtype classification framework based on the attention neural network integrating the single-cell multi-omics dataset on the cloud
Abstract
Single-cell omics sequencing has rapidly advanced, enabling the quantification of diverse omics profiles at a single-cell resolution. To facilitate comprehensive biological insights, such as cellular differentiation trajectories, precise annotation of cell subtypes is essential. Conventional methods involve clustering cells and manually assigning subtypes based on canonical markers, a labor-intensive and expert-dependent process. Hence, an automated computational prediction framework is crucial. While several classification frameworks for predicting cell subtypes from single-cell RNA sequencing datasets exist, these methods solely rely on single-omics data, offering insights at a single molecular level. They often miss inter-omic correlations and a holistic understanding of cellular processes. To address this, the integration of multi-omics datasets from individual cells is essential for accurate subtype annotation. This article introduces moSCminer, a novel framework for classifying cell subtypes that harnesses the power of single-cell multi-omics sequencing datasets through an attention-based neural network operating at the omics level. By integrating three distinct omics datasets-gene expression, DNA methylation, and DNA accessibility-while accounting for their biological relationships, moSCminer excels at learning the relative significance of each omics feature. It then transforms this knowledge into a novel representation for cell subtype classification. Comparative evaluations against standard machine learning-based classifiers demonstrate moSCminer's superior performance, consistently achieving the highest average performance on real datasets. The efficacy of multi-omics integration is further corroborated through an in-depth analysis of the omics-level attention module, which identifies potential markers for cell subtype annotation. To enhance accessibility and scalability, moSCminer is accessible as a user-friendly web-based platform seamlessly connected to a cloud system, publicly accessible at http://203.252.206.118:5568. Notably, this study marks the pioneering integration of three single-cell multi-omics datasets for cell subtype identification.
Keywords: Attention-based neural network; Cell subtype classification; Cloud system; Deep learning-based framework; Self attention; Single-cell multi-omics; Web platform.
© 2024 Choi et al.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures




Similar articles
-
moBRCA-net: a breast cancer subtype classification framework based on multi-omics attention neural networks.BMC Bioinformatics. 2023 Apr 26;24(1):169. doi: 10.1186/s12859-023-05273-5. BMC Bioinformatics. 2023. PMID: 37101124 Free PMC article.
-
A multimodal graph neural network framework for cancer molecular subtype classification.BMC Bioinformatics. 2024 Jan 15;25(1):27. doi: 10.1186/s12859-023-05622-4. BMC Bioinformatics. 2024. PMID: 38225583 Free PMC article.
-
Amogel: a multi-omics classification framework using associative graph neural networks with prior knowledge for biomarker identification.BMC Bioinformatics. 2025 Mar 28;26(1):94. doi: 10.1186/s12859-025-06111-6. BMC Bioinformatics. 2025. PMID: 40155814 Free PMC article.
-
Multimodal deep learning approaches for single-cell multi-omics data integration.Brief Bioinform. 2023 Sep 20;24(5):bbad313. doi: 10.1093/bib/bbad313. Brief Bioinform. 2023. PMID: 37651607 Free PMC article. Review.
-
A comprehensive review of machine learning techniques for multi-omics data integration: challenges and applications in precision oncology.Brief Funct Genomics. 2024 Sep 27;23(5):549-560. doi: 10.1093/bfgp/elae013. Brief Funct Genomics. 2024. PMID: 38600757 Review.
References
-
- Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado GS, Davis A, Dean J, Devin M, Ghemawat S, Goodfellow I, Harp A, Irving G, Isard M, Jia Y, Jozefowicz R, Kaiser L, Kudlur M, Levenberg J, Mané D, Monga R, Moore S, Murray D, Olah C, Schuster M, Shlens J, Steiner B, Sutskever I, Talwar K, Tucker P, Vanhoucke V, Vasudevan V, Viégas F, Vinyals O, Warden P, Wattenberg M, Wicke M, Yu Y, Zheng X. TensorFlow: large-scale machine learning on heterogeneous systems. 2015. https://www.tensorflow.org https://www.tensorflow.org
-
- Bian S, Wang Y, Zhou Y, Wang W, Guo L, Wen L, Fu W, Zhou X, Tang F. Integrative single-cell multiomics analyses dissect molecular signatures of intratumoral heterogeneities and differentiation states of human gastric cancer. National Science Review. 2023;10(6):nwad094. doi: 10.1093/nsr/nwad094. - DOI - PMC - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources