scTsI: an effective two-stage imputation method for single-cell RNA-seq data
- PMID: 40579791
- PMCID: PMC12204820
- DOI: 10.1093/bib/bbaf298
scTsI: an effective two-stage imputation method for single-cell RNA-seq data
Abstract
Single-cell RNA-seq facilitates the understanding of cell types and states and the revealing of the cellular heterogeneity in developmental processes and disease mechanisms. However, the dropout events in single-cell RNA-seq data, in which genes are not detected due to technical noise or limited sequencing depth, seriously affect downstream analyses. Imputation is an effective way to relieve the impact of dropout events. However, the current methods may introduce new noise or modify the high expression values in the imputation process and their performance may be lower than expected when dealing with data with a high dropout rate, facing with different types of data, and aiming at various downstream analyses. We propose a two-stage imputation algorithm, scTsI, for single-cell RNA-seq data. In the first stage, scTsI imputes the zero values using the information of neighboring cells and genes. In the second stage, scTsI transforms the expression matrix into a vector, performs row transformation, and adjusts the imputed values through ridge regression and leveraging bulk RNA-seq data as a constraint. scTsI ensures that the original highly expressed values are unchanged, avoids introducing new noise, and allows sparse matrix input to accelerate imputation. We conduct experiments on a variety of simulated and real data with different dropout rates and compare scTsI with the commonly used imputation methods. The results show that scTsI can restore gene expression and maintain cell-cell similarity across different data dimensions and dropout rates. scTsI can also improve the performance of data visualization, clustering, and cell trajectory inference.
Keywords: bulk RNA-seq data; imputation; ridge regression; single-cell gene expression; vector transformation.
© The Author(s) 2025. Published by Oxford University Press.
Figures






References
-
- Stevenson K, Uversky VN. Single-cell RNA-Seq: A next generation sequencing tool for a high-resolution view of the individual cell. J Biomol Struct Dyn 2020;38:3730–5. - PubMed
-
- Shapiro E, Biezuner T, Linnarsson S. Single-cell sequencing-based technologies will revolutionize whole-organism science. Nat Rev Genet 2013;14:618–30. - PubMed
-
- Olsen TK, Baryawno N. Introduction to single-cell RNA sequencing. Curr Protoc Mol Biol 2018;122:e57. - PubMed
-
- Papalexi E, Satija R. Single-cell RNA sequencing to explore immune cell heterogeneity. Nat Rev Immunol 2018;18:35–45. - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources