Deep structural clustering for single-cell RNA-seq data jointly through autoencoder and graph neural network
- PMID: 35172334
- DOI: 10.1093/bib/bbac018
Abstract
Single-cell RNA sequencing (scRNA-seq) permits researchers to study the complex mechanisms of cell heterogeneity and diversity. Unsupervised clustering is of central importance for the analysis of scRNA-seq data, as it can be used to identify putative cell types. However, owing to noise, high dimensionality and pervasive dropout events, clustering analysis of scRNA-seq data remains a computational challenge. Here, we propose a new deep structural clustering method for scRNA-seq data, named scDSC, which integrates structural information into the deep clustering of single cells. The proposed scDSC consists of a Zero-Inflated Negative Binomial (ZINB) model-based autoencoder, a graph neural network (GNN) module and a mutual-supervised module. To learn the data representation from the sparse and zero-inflated scRNA-seq data, we add a ZINB model to the basic autoencoder. The GNN module is introduced to capture the structural information among cells. By joining the ZINB-based autoencoder with the GNN module, the model transfers the data representation learned by the autoencoder to the corresponding GNN layer. Furthermore, we adopt a mutual-supervised strategy to unify these two different deep neural architectures and to guide the clustering task. Extensive experimental results on six real scRNA-seq datasets demonstrate that scDSC outperforms state-of-the-art methods in terms of clustering accuracy and scalability. Our method scDSC is implemented in Python using the PyTorch machine-learning library, and it is freely available at https://github.com/DHUDBlab/scDSC.
Keywords: ZINB model; autoencoder; deep clustering; graph neural network; scRNA-Seq.
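The abstract says a ZINB model is added to the autoencoder to handle sparse, zero-inflated counts. As a rough illustration of what such a model scores, the sketch below evaluates a ZINB log-probability and the corresponding mean negative log-likelihood in plain Python. It is not taken from the scDSC code: the function names, and the parametrization by mean `mu`, dispersion `theta` and dropout probability `pi`, are assumptions chosen for illustration.

```python
import math


def nb_log_pmf(x, mu, theta):
    """Log-pmf of a negative binomial with mean mu > 0 and dispersion theta > 0."""
    return (
        math.lgamma(x + theta) - math.lgamma(theta) - math.lgamma(x + 1)
        + theta * math.log(theta / (theta + mu))
        + x * math.log(mu / (theta + mu))
    )


def zinb_log_pmf(x, mu, theta, pi):
    """Log-pmf of a zero-inflated NB; pi in [0, 1) is the extra-zero probability."""
    nb = nb_log_pmf(x, mu, theta)
    if x == 0:
        # A zero is either a dropout (prob. pi) or a genuine NB zero.
        return math.log(pi + (1.0 - pi) * math.exp(nb))
    # A positive count can only come from the NB component.
    return math.log(1.0 - pi) + nb


def zinb_nll(counts, mus, thetas, pis):
    """Mean negative log-likelihood over paired counts and parameters."""
    total = sum(
        zinb_log_pmf(x, m, t, p)
        for x, m, t, p in zip(counts, mus, thetas, pis)
    )
    return -total / len(counts)


if __name__ == "__main__":
    # Hypothetical toy values, not data from the paper.
    counts = [0, 0, 3, 1, 0, 7]
    n = len(counts)
    print(zinb_nll(counts, [2.0] * n, [1.5] * n, [0.3] * n))
```

In a ZINB-based autoencoder, a quantity of this kind would typically serve as the reconstruction loss, with `mu`, `theta` and `pi` predicted per gene and per cell by the decoder; how scDSC wires this up is not stated in the abstract.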
© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Similar articles
- Soft graph clustering for single-cell RNA sequencing data. BMC Bioinformatics. 2025 Jul 25;26(1):195. doi: 10.1186/s12859-025-06231-z. PMID: 40713495. Free PMC article.
- scAMZI: attention-based deep autoencoder with zero-inflated layer for clustering scRNA-seq data. BMC Genomics. 2025 Apr 7;26(1):350. doi: 10.1186/s12864-025-11511-2. PMID: 40197174. Free PMC article.
- scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering. Bioinformatics. 2023 Feb 14;39(2):btad075. doi: 10.1093/bioinformatics/btad075. PMID: 36734596. Free PMC article.
- Machine learning and statistical methods for clustering single-cell RNA-sequencing data. Brief Bioinform. 2020 Jul 15;21(4):1209-1223. doi: 10.1093/bib/bbz063. PMID: 31243426. Review.
- Evaluating the performance of dropout imputation and clustering methods for single-cell RNA sequencing data. Comput Biol Med. 2022 Jul;146:105697. doi: 10.1016/j.compbiomed.2022.105697. Epub 2022 Jun 8. PMID: 35697529. Review.
Cited by
- scHeteroNet: A Heterophily-Aware Graph Neural Network for Accurate Cell Type Annotation and Novel Cell Detection. Adv Sci (Weinh). 2025 Apr;12(16):e2412095. doi: 10.1002/advs.202412095. Epub 2025 Mar 5. PMID: 40042052. Free PMC article.
- A hybrid adversarial autoencoder-graph network model with dynamic fusion for robust scRNA-seq clustering. BMC Genomics. 2025 Aug 18;26(1):749. doi: 10.1186/s12864-025-11941-y. PMID: 40826008. Free PMC article.
- scEGG: an exogenous gene-guided clustering method for single-cell transcriptomic data. Brief Bioinform. 2024 Sep 23;25(6):bbae483. doi: 10.1093/bib/bbae483. PMID: 39344711. Free PMC article.
- Deep learning powered single-cell clustering framework with enhanced accuracy and stability. Sci Rep. 2025 Feb 3;15(1):4107. doi: 10.1038/s41598-025-87672-7. PMID: 39900656. Free PMC article.
- nsDCC: dual-level contrastive clustering with nonuniform sampling for scRNA-seq data analysis. Brief Bioinform. 2024 Sep 23;25(6):bbae477. doi: 10.1093/bib/bbae477. PMID: 39327063. Free PMC article.