Identifying Differentially Expressed Genes of Zero Inflated Single Cell RNA Sequencing Data Using Mixed Model Score Tests
- PMID: 33613638
- PMCID: PMC7894898
- DOI: 10.3389/fgene.2021.616686
Identifying Differentially Expressed Genes of Zero Inflated Single Cell RNA Sequencing Data Using Mixed Model Score Tests
Abstract
Single cell RNA sequencing (scRNA-seq) allows quantitative measurement and comparison of gene expression at the resolution of single cells. Ignoring the batch effects and zero inflation of scRNA-seq data, many proposed differentially expressed (DE) methods might generate bias. We propose a method, single cell mixed model score tests (scMMSTs), to efficiently identify DE genes of scRNA-seq data with batch effects using the generalized linear mixed model (GLMM). scMMSTs treat the batch effect as a random effect. For zero inflation, scMMSTs use a weighting strategy to calculate observational weights for counts independently under zero-inflated and zero-truncated distributions. Counts data with calculated weights were subsequently analyzed using weighted GLMMs. The theoretical null distributions of the score statistics were constructed by mixed Chi-square distributions. Intensive simulations and two real datasets were used to compare edgeR-zinbwave, DESeq2-zinbwave, and scMMSTs. Our study demonstrates that scMMSTs, as supplement to standard methods, are advantageous to define DE genes of zero-inflated scRNA-seq data with batch effects.
Keywords: differential expression analyses; generalized linear mixed model; observational weights; score test; single cell RNA sequencing; zero inflation.
Copyright © 2021 He, Pan, Shao and Wang.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures
References
-
- Benjamini Y., Hochberg Y. (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B-Methodol. 57 289–300. 10.1111/j.2517-6161.1995.tb02031.x - DOI
-
- Böhning D., Dietz E., Schlattmann P., Mendonça L., Kirchner U. (1999). The zero-inflated Poisson model and the decayed, missing and filled teeth index in dental epidemiology. J. R. Stat. Soc. Ser. A 162 195–209. 10.1111/1467-985X.00130 - DOI
-
- Breslow N. E., Clayton D. G. (1993). Approximate inference in generalized linear mixed models. J. Am. Stat. Assoc. 88 9–25. 10.2307/2290687 - DOI
LinkOut - more resources
Full Text Sources
Other Literature Sources
