PLNseq: a multivariate Poisson lognormal distribution for high-throughput matched RNA-sequencing read count data
- PMID: 25641202
- DOI: 10.1002/sim.6449
Abstract
High-throughput RNA-sequencing (RNA-seq) technology provides an attractive platform for gene expression analysis. In many experimental settings, RNA-seq read counts are measured from matched samples or taken from the same subject under multiple treatment conditions. The induced correlation should therefore be evaluated and taken into account when deriving tests of differential expression (DE). We propose a novel method, 'PLNseq', which uses a multivariate Poisson lognormal distribution to model matched read count data. The correlation is modeled directly through Gaussian random effects, and inferences are made by likelihood methods. A three-stage numerical algorithm is developed to estimate the unknown parameters and conduct DE analysis. Results on simulated data demonstrate that our method performs reasonably well in terms of parameter estimation, DE analysis power, and robustness. PLNseq also controls the false discovery rate (FDR) better than the benchmarks edgeR and DESeq2 in situations where the correlation differs across genes but can still be accurately estimated. Furthermore, direct evaluation of the correlation through PLNseq enables us to develop a new and more powerful test for DE analysis. An application to a lung cancer study illustrates the practical utility of our method. An R package implementing the method is also publicly available.
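To make the model in the abstract concrete, the sketch below simulates matched read counts for a single gene from a bivariate Poisson lognormal distribution: correlated Gaussian random effects enter the log of each Poisson rate, which induces correlation between the two matched counts. It illustrates the generative model only. It is not the PLNseq three-stage estimation algorithm and not the R package's interface. The function names, the parameters `mu`, `sigma`, and `rho`, and the omission of library-size normalization are all assumptions made for this illustration.

```python
import math
import random


def sample_poisson(rate, rng):
    """Draw one Poisson variate with Knuth's multiplication method.

    Adequate for the moderate rates used here; exp(-rate) would underflow
    for very large rates, so this is not a general-purpose sampler.
    """
    threshold = math.exp(-rate)
    k, prod = 0, rng.random()
    while prod > threshold:
        k += 1
        prod *= rng.random()
    return k


def simulate_matched_pair(mu, sigma, rho, rng):
    """Simulate one matched pair of counts for a single gene.

    mu    -- (mu1, mu2): log-scale mean expression under each condition
    sigma -- (s1, s2): standard deviations of the Gaussian random effects
    rho   -- correlation between the two random effects
    All three names are hypothetical and chosen for this sketch.
    """
    # Correlated standard normals via a 2x2 Cholesky factor.
    z1 = rng.gauss(0.0, 1.0)
    z2 = rng.gauss(0.0, 1.0)
    e1 = sigma[0] * z1
    e2 = sigma[1] * (rho * z1 + math.sqrt(1.0 - rho ** 2) * z2)
    # Lognormal Poisson rates, then conditionally independent Poisson counts.
    rate1 = math.exp(mu[0] + e1)
    rate2 = math.exp(mu[1] + e2)
    return sample_poisson(rate1, rng), sample_poisson(rate2, rng)


def log_count_correlation(pairs):
    """Pearson correlation of log(count + 1) across matched pairs."""
    xs = [math.log(a + 1) for a, _ in pairs]
    ys = [math.log(b + 1) for _, b in pairs]
    n = len(pairs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    rng = random.Random(0)
    pairs = [simulate_matched_pair((3.0, 3.0), (0.5, 0.5), 0.8, rng)
             for _ in range(2000)]
    print("correlation of log counts:", log_count_correlation(pairs))
```

The observed correlation of the log counts is weaker than `rho` because Poisson sampling noise is added on top of the random effects. This is one reason to model the correlation at the level of the latent Gaussian effects rather than read it off the raw counts.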
Keywords: Poisson lognormal model; RNA-seq; differential expression analysis; matched samples.
Copyright © 2015 John Wiley & Sons, Ltd.