Inverse weighting method with jackknife variance estimator for differential expression analysis of single-cell RNA sequencing data
- PMID: 35926443
- DOI: 10.1016/j.compbiolchem.2022.107733
Inverse weighting method with jackknife variance estimator for differential expression analysis of single-cell RNA sequencing data
Abstract
Single-cell RNA sequencing (scRNA-seq) data exhibit an unusual abundance of zero counts with a considerable fraction due to the dropout events, which introduces challenges to differential expression analysis. To correct biases in differential expression due to the informative dropouts, an inverse non-dropout-probability weighting method is proposed given that the dropout rate is negatively dependent on the underlying gene expression magnitude in scRNA-seq data. The weights are estimated using the maximum likelihood method where dropout values are integrated out using the Gauss-Hermite quadrature. Linear, generalized linear and mixed regressions with the estimated weights are fitted on original or transformed scRNA-seq data. Variances of coefficient estimators from the weighted regressions are estimated using the jackknife method. Extensive simulation studies are carried out to compare the proposed method to five cutting-edge methods (Limma, edgeR, MAST, ZIAQ and scImpute), where the proposed method performs among the best under all scenarios in terms of AUC, sensitivity, specificity and FDR. Rate of detecting true positives is examined for the proposed method and five comparison methods using mouse embryonic stem cells and fibroblasts where differentially expressed (DE) genes detected in bulk RNA-seq data on the same set of genes under the same conditions from independent source serve as true positives. Specificity is compared for these methods on true negative data by random splitting of a real dataset. Furthermore, the proposed method is illustrated on a lineage study where cells in the same embryo are correlated and genes differentially expressed between cell division lineages are identified.
Keywords: Differential expression; Informative dropout; Inverse probability weighting; Jackknife; Single-cell RNA sequencing data.
Published by Elsevier Ltd.
Conflict of interest statement
Conflict of interest The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. Declaration of Competing Interests The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Similar articles
-
ZIAQ: a quantile regression method for differential expression analysis of single-cell RNA-seq data.Bioinformatics. 2020 May 1;36(10):3124-3130. doi: 10.1093/bioinformatics/btaa098. Bioinformatics. 2020. PMID: 32053182
-
A Comprehensive Survey of Statistical Approaches for Differential Expression Analysis in Single-Cell RNA Sequencing Studies.Genes (Basel). 2021 Dec 2;12(12):1947. doi: 10.3390/genes12121947. Genes (Basel). 2021. PMID: 34946896 Free PMC article.
-
Observation weights unlock bulk RNA-seq tools for zero inflation and single-cell applications.Genome Biol. 2018 Feb 26;19(1):24. doi: 10.1186/s13059-018-1406-4. Genome Biol. 2018. PMID: 29478411 Free PMC article.
-
Machine learning and statistical methods for clustering single-cell RNA-sequencing data.Brief Bioinform. 2020 Jul 15;21(4):1209-1223. doi: 10.1093/bib/bbz063. Brief Bioinform. 2020. PMID: 31243426 Review.
-
Optimized single-nucleus transcriptional profiling by combinatorial indexing.Nat Protoc. 2023 Jan;18(1):188-207. doi: 10.1038/s41596-022-00752-0. Epub 2022 Oct 19. Nat Protoc. 2023. PMID: 36261634 Free PMC article. Review.
Cited by
-
Cholecystitis may decrease the risk of sudden death: A 2-sample Mendelian randomization study.Medicine (Baltimore). 2024 May 24;103(21):e38240. doi: 10.1097/MD.0000000000038240. Medicine (Baltimore). 2024. PMID: 38787985 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources