GE-Impute: graph embedding-based imputation for single-cell RNA-seq data
- PMID: 35901457
- DOI: 10.1093/bib/bbac313
GE-Impute: graph embedding-based imputation for single-cell RNA-seq data
Abstract
Single-cell RNA-sequencing (scRNA-seq) has been widely used to depict gene expression profiles at the single-cell resolution. However, its relatively high dropout rate often results in artificial zero expressions of genes and therefore compromised reliability of results. To overcome such unwanted sparsity of scRNA-seq data, several imputation algorithms have been developed to recover the single-cell expression profiles. Here, we propose a novel approach, GE-Impute, to impute the dropout zeros in scRNA-seq data with graph embedding-based neural network model. GE-Impute learns the neural graph representation for each cell and reconstructs the cell-cell similarity network accordingly, which enables better imputation of dropout zeros based on the more accurately allocated neighbors in the similarity network. Gene expression correlation analysis between true expression data and simulated dropout data suggests significantly better performance of GE-Impute on recovering dropout zeros for both droplet- and plated-based scRNA-seq data. GE-Impute also outperforms other imputation methods in identifying differentially expressed genes and improving the unsupervised clustering on datasets from various scRNA-seq techniques. Moreover, GE-Impute enhances the identification of marker genes, facilitating the cell type assignment of clusters. In trajectory analysis, GE-Impute improves time-course scRNA-seq data analysis and reconstructing differentiation trajectory. The above results together demonstrate that GE-Impute could be a useful method to recover the single-cell expression profiles, thus enabling better biological interpretation of scRNA-seq data. GE-Impute is implemented in Python and is freely available at https://github.com/wxbCaterpillar/GE-Impute.
Keywords: graph embedding; imputation; neural graph representation; similarity network; single-cell RNA-sequencing.
© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Similar articles
-
PbImpute: Precise Zero Discrimination and Balanced Imputation in Single-Cell RNA Sequencing Data.J Chem Inf Model. 2025 Mar 10;65(5):2670-2684. doi: 10.1021/acs.jcim.4c02125. Epub 2025 Feb 17. J Chem Inf Model. 2025. PMID: 39957720 Free PMC article.
-
CL-Impute: A contrastive learning-based imputation for dropout single-cell RNA-seq data.Comput Biol Med. 2023 Sep;164:107263. doi: 10.1016/j.compbiomed.2023.107263. Epub 2023 Jul 23. Comput Biol Med. 2023. PMID: 37531858
-
CDSImpute: An ensemble similarity imputation method for single-cell RNA sequence dropouts.Comput Biol Med. 2022 Jul;146:105658. doi: 10.1016/j.compbiomed.2022.105658. Epub 2022 May 21. Comput Biol Med. 2022. PMID: 35751187
-
Machine learning and statistical methods for clustering single-cell RNA-sequencing data.Brief Bioinform. 2020 Jul 15;21(4):1209-1223. doi: 10.1093/bib/bbz063. Brief Bioinform. 2020. PMID: 31243426 Review.
-
Evaluating the performance of dropout imputation and clustering methods for single-cell RNA sequencing data.Comput Biol Med. 2022 Jul;146:105697. doi: 10.1016/j.compbiomed.2022.105697. Epub 2022 Jun 8. Comput Biol Med. 2022. PMID: 35697529 Review.
Cited by
-
CPARI: a novel approach combining cell partitioning with absolute and relative imputation to address dropout in single-cell RNA-seq data.Brief Bioinform. 2024 Nov 22;26(1):bbae668. doi: 10.1093/bib/bbae668. Brief Bioinform. 2024. PMID: 39715686 Free PMC article.
-
PbImpute: Precise Zero Discrimination and Balanced Imputation in Single-Cell RNA Sequencing Data.J Chem Inf Model. 2025 Mar 10;65(5):2670-2684. doi: 10.1021/acs.jcim.4c02125. Epub 2025 Feb 17. J Chem Inf Model. 2025. PMID: 39957720 Free PMC article.
-
DNI-MDCAP: improvement of causal MiRNA-disease association prediction based on deep network imputation.BMC Bioinformatics. 2024 Jan 12;25(1):22. doi: 10.1186/s12859-024-05644-6. BMC Bioinformatics. 2024. PMID: 38216907 Free PMC article.
-
Improved downstream functional analysis of single-cell RNA-sequence data using DGAN.Sci Rep. 2023 Jan 28;13(1):1618. doi: 10.1038/s41598-023-28952-y. Sci Rep. 2023. PMID: 36709340 Free PMC article.
-
scDTL: enhancing single-cell RNA-seq imputation through deep transfer learning with bulk cell information.Brief Bioinform. 2024 Sep 23;25(6):bbae555. doi: 10.1093/bib/bbae555. Brief Bioinform. 2024. PMID: 39504481 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources