A multivariate statistical test for differential expression analysis
- PMID: 35585166
- PMCID: PMC9117296
- DOI: 10.1038/s41598-022-12246-w
A multivariate statistical test for differential expression analysis
Abstract
Statistical tests of differential expression usually suffer from two problems. Firstly, their statistical power is often limited when applied to small and skewed data sets. Secondly, gene expression data are usually discretized by applying arbitrary criteria to limit the number of false positives. In this work, a new statistical test obtained from a convolution of multivariate hypergeometric distributions, the Hy-test, is proposed to address these issues. Hy-test has been carried out on transcriptomic data from breast and kidney cancer tissues, and it has been compared with other differential expression analysis methods. Hy-test allows implicit discretization of the expression profiles and is more selective in retrieving both differential expressed genes and terms of Gene Ontology. Hy-test can be adopted together with other tests to retrieve information that would remain hidden otherwise, e.g., terms of (1) cell cycle deregulation for breast cancer and (2) "programmed cell death" for kidney cancer.
© 2022. The Author(s).
Conflict of interest statement
The authors declare no competing interests.
Figures



Similar articles
-
Expression and methylation patterns partition luminal-A breast tumors into distinct prognostic subgroups.Breast Cancer Res. 2016 Jul 7;18(1):74. doi: 10.1186/s13058-016-0724-2. Breast Cancer Res. 2016. PMID: 27386846 Free PMC article.
-
A new statistical method for curve group analysis of longitudinal gene expression data illustrated for breast cancer in the NOWAC postgenome cohort as a proof of principle.BMC Med Res Methodol. 2016 Mar 5;16:28. doi: 10.1186/s12874-016-0129-z. BMC Med Res Methodol. 2016. PMID: 26944545 Free PMC article.
-
Multi-group cancer outlier differential gene expression detection.Comput Biol Chem. 2007 Apr;31(2):65-71. doi: 10.1016/j.compbiolchem.2007.02.004. Epub 2007 Feb 16. Comput Biol Chem. 2007. PMID: 17392030
-
Cross-platform comparison and visualisation of gene expression data using co-inertia analysis.BMC Bioinformatics. 2003 Nov 21;4:59. doi: 10.1186/1471-2105-4-59. BMC Bioinformatics. 2003. PMID: 14633289 Free PMC article.
-
Previously unidentified changes in renal cell carcinoma gene expression identified by parametric analysis of microarray data.BMC Cancer. 2003 Nov 27;3:31. doi: 10.1186/1471-2407-3-31. BMC Cancer. 2003. PMID: 14641932 Free PMC article.
Cited by
-
Systematic benchmarking of statistical methods to assess differential expression of circular RNAs.Brief Bioinform. 2023 Jan 19;24(1):bbac612. doi: 10.1093/bib/bbac612. Brief Bioinform. 2023. PMID: 36592056 Free PMC article.
-
Potential biomarkers uncovered by bioinformatics analysis in sotorasib resistant-pancreatic ductal adenocarcinoma.Front Med (Lausanne). 2023 Jun 15;10:1107128. doi: 10.3389/fmed.2023.1107128. eCollection 2023. Front Med (Lausanne). 2023. PMID: 37396909 Free PMC article.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical