Microarray data analysis: a hierarchical T-test to handle heteroscedasticity
- PMID: 15702953
Microarray data analysis: a hierarchical T-test to handle heteroscedasticity
Abstract
The analysis of differential gene expression in microarray experiments requires the development of adequate statistical tools. This article describes a simple statistical method for detecting differential expression between two conditions with a low number of replicates. When comparing two group means using a traditional t-test, gene-specific variance estimates are unstable and can lead to wrong conclusions. We construct a likelihood ratio test while modelling these variances hierarchically across all genes, and express it as a t-test statistic. By borrowing information across genes we can take advantage of their large numbers, and still yield a gene-specific test statistic. We show that this hierarchical t-test is more powerful than its traditional version and generates less false positives in a simulation study, especially with small sample sizes. This approach can be extended to cases where there are more than two groups.
Similar articles
-
Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data.BMC Bioinformatics. 2005 Feb 10;6:26. doi: 10.1186/1471-2105-6-26. BMC Bioinformatics. 2005. PMID: 15705192 Free PMC article.
-
Unequal group variances in microarray data analyses.Bioinformatics. 2008 May 1;24(9):1168-74. doi: 10.1093/bioinformatics/btn100. Epub 2008 Mar 14. Bioinformatics. 2008. PMID: 18344518
-
Sample size for FDR-control in microarray data analysis.Bioinformatics. 2005 Jul 15;21(14):3097-104. doi: 10.1093/bioinformatics/bti456. Epub 2005 Apr 21. Bioinformatics. 2005. PMID: 15845654
-
The analysis of microarray data.Pharmacogenomics. 2003 Jul;4(4):477-97. doi: 10.1517/phgs.4.4.477.22744. Pharmacogenomics. 2003. PMID: 12831325 Review.
-
Using ANOVA to analyze microarray data.Biotechniques. 2004 Aug;37(2):173-5, 177. doi: 10.2144/04372TE01. Biotechniques. 2004. PMID: 15335204 Review.
Cited by
-
Clustering, Pathway Enrichment, and Protein-Protein Interaction Analysis of Gene Expression in Neurodevelopmental Disorders.Adv Pharmacol Sci. 2018 Nov 27;2018:3632159. doi: 10.1155/2018/3632159. eCollection 2018. Adv Pharmacol Sci. 2018. PMID: 30598663 Free PMC article.
-
Gene expression variation between mouse inbred strains.BMC Genomics. 2004 Aug 18;5(1):57. doi: 10.1186/1471-2164-5-57. BMC Genomics. 2004. PMID: 15317656 Free PMC article.
-
Selection and classification of gene expression in autism disorder: Use of a combination of statistical filters and a GBPSO-SVM algorithm.PLoS One. 2017 Nov 2;12(11):e0187371. doi: 10.1371/journal.pone.0187371. eCollection 2017. PLoS One. 2017. PMID: 29095904 Free PMC article.
-
Comparison of multi-tissue aging between human and mouse.Sci Rep. 2019 Apr 17;9(1):6220. doi: 10.1038/s41598-019-42485-3. Sci Rep. 2019. PMID: 30996271 Free PMC article.
-
Importance of replication in analyzing time-series gene expression data: corticosteroid dynamics and circadian patterns in rat liver.BMC Bioinformatics. 2010 May 26;11:279. doi: 10.1186/1471-2105-11-279. BMC Bioinformatics. 2010. PMID: 20500897 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources