Dimensionality reduction and visualization of single-cell RNA-seq data with an improved deep variational autoencoder
- PMID: 37088976
- DOI: 10.1093/bib/bbad152
Dimensionality reduction and visualization of single-cell RNA-seq data with an improved deep variational autoencoder
Abstract
Single-cell RNA sequencing (scRNA-seq) is a revolutionary breakthrough that determines the precise gene expressions on individual cells and deciphers cell heterogeneity and subpopulations. However, scRNA-seq data are much noisier than traditional high-throughput RNA-seq data because of technical limitations, leading to many scRNA-seq data studies about dimensionality reduction and visualization remaining at the basic data-stacking stage. In this study, we propose an improved variational autoencoder model (termed DREAM) for dimensionality reduction and a visual analysis of scRNA-seq data. Here, DREAM combines the variational autoencoder and Gaussian mixture model for cell type identification, meanwhile explicitly solving 'dropout' events by introducing the zero-inflated layer to obtain the low-dimensional representation that describes the changes in the original scRNA-seq dataset. Benchmarking comparisons across nine scRNA-seq datasets show that DREAM outperforms four state-of-the-art methods on average. Moreover, we prove that DREAM can accurately capture the expression dynamics of human preimplantation embryonic development. DREAM is implemented in Python, freely available via the GitHub website, https://github.com/Crystal-JJ/DREAM.
Keywords: dimensionality reduction; dropout; single-cell RNA-seq; variational autoencoder; visualization.
© The Author(s) 2023. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Similar articles
-
Deep structural clustering for single-cell RNA-seq data jointly through autoencoder and graph neural network.Brief Bioinform. 2022 Mar 10;23(2):bbac018. doi: 10.1093/bib/bbac018. Brief Bioinform. 2022. PMID: 35172334
-
VASC: Dimension Reduction and Visualization of Single-cell RNA-seq Data by Deep Variational Autoencoder.Genomics Proteomics Bioinformatics. 2018 Oct;16(5):320-331. doi: 10.1016/j.gpb.2018.08.003. Epub 2018 Dec 18. Genomics Proteomics Bioinformatics. 2018. PMID: 30576740 Free PMC article.
-
A deep adversarial variational autoencoder model for dimensionality reduction in single-cell RNA sequencing analysis.BMC Bioinformatics. 2020 Feb 21;21(1):64. doi: 10.1186/s12859-020-3401-5. BMC Bioinformatics. 2020. PMID: 32085701 Free PMC article.
-
Supervised application of internal validation measures to benchmark dimensionality reduction methods in scRNA-seq data.Brief Bioinform. 2021 Nov 5;22(6):bbab304. doi: 10.1093/bib/bbab304. Brief Bioinform. 2021. PMID: 34374742 Review.
-
Machine learning and statistical methods for clustering single-cell RNA-sequencing data.Brief Bioinform. 2020 Jul 15;21(4):1209-1223. doi: 10.1093/bib/bbz063. Brief Bioinform. 2020. PMID: 31243426 Review.
Cited by
-
DCRELM: dual correlation reduction network-based extreme learning machine for single-cell RNA-seq data clustering.Sci Rep. 2024 Jun 12;14(1):13541. doi: 10.1038/s41598-024-64217-y. Sci Rep. 2024. PMID: 38866896 Free PMC article.
-
Advances in the Application of Single-Cell Transcriptomics in Plant Systems and Synthetic Biology.Biodes Res. 2024 Feb 29;6:0029. doi: 10.34133/bdr.0029. eCollection 2024. Biodes Res. 2024. PMID: 38435807 Free PMC article. Review.
-
scAMZI: attention-based deep autoencoder with zero-inflated layer for clustering scRNA-seq data.BMC Genomics. 2025 Apr 7;26(1):350. doi: 10.1186/s12864-025-11511-2. BMC Genomics. 2025. PMID: 40197174 Free PMC article.
-
SIGRN: Inferring Gene Regulatory Network with Soft Introspective Variational Autoencoders.Int J Mol Sci. 2024 Nov 27;25(23):12741. doi: 10.3390/ijms252312741. Int J Mol Sci. 2024. PMID: 39684451 Free PMC article.
-
scSID: A lightweight algorithm for identifying rare cell types by capturing differential expression from single-cell sequencing data.Comput Struct Biotechnol J. 2024 Jan 3;23:589-600. doi: 10.1016/j.csbj.2023.12.043. eCollection 2024 Dec. Comput Struct Biotechnol J. 2024. PMID: 38274993 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources