Statistical analysis of microarray data: a Bayesian approach
- PMID: 14557114
- DOI: 10.1093/biostatistics/4.4.597
Abstract
The potential of microarray data is enormous: it allows us to monitor the expression of thousands of genes simultaneously. A common task with microarray data is to determine which genes are differentially expressed between two samples obtained under two different conditions. Recently, several statistical methods have been proposed to perform this task when there are replicate samples under each condition. Two major problems arise with microarray data. The first is that the number of replicates is very small (usually 2-10), leading to noisy point estimates. As a consequence, traditional statistics based on means and standard deviations, e.g. the t-statistic, are not suitable. The second is that the number of genes is usually very large (approximately 10,000), so one is faced with an extreme multiple testing problem. Most multiple testing adjustments are relatively conservative, especially when the number of replicates is small. In this paper we present an empirical Bayes analysis that handles both problems very well. Using different parametrizations, we develop four statistics that can be used to test hypotheses about the means and/or variances of the gene expression levels in both one- and two-sample problems. The methods are illustrated using experimental data with prior knowledge. In addition, we present the results of a simulation comparing our methods to well-known statistics and multiple testing adjustments.
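The abstract does not state the four statistics, so the following is not the paper's method. It is a minimal sketch of the general idea behind the first problem: with only a few replicates, a gene's variance estimate is noisy, and pulling it toward a value shared across genes stabilizes the t-statistic. The function names, the `prior_df` weight, and the use of the average variance as the shared value are all assumptions made for illustration.

```python
import math
import random
import statistics


def pooled_variance(x, y):
    """Pooled sample variance of two replicate groups for one gene."""
    dx, dy = len(x) - 1, len(y) - 1
    return (dx * statistics.variance(x) + dy * statistics.variance(y)) / (dx + dy)


def t_statistics(group_a, group_b, prior_df=4.0):
    """Return (ordinary_t, shrunken_t), one entry per gene.

    prior_df is a hypothetical weight on the shared variance; it is an
    illustrative constant, not a value taken from the paper.
    """
    variances = [pooled_variance(a, b) for a, b in zip(group_a, group_b)]
    # Crude stand-in for a prior variance estimated from all genes.
    prior_var = statistics.mean(variances)
    ordinary, shrunken = [], []
    for a, b, s2 in zip(group_a, group_b, variances):
        df = len(a) + len(b) - 2
        diff = statistics.mean(a) - statistics.mean(b)
        scale = 1.0 / len(a) + 1.0 / len(b)
        # Weighted average of the gene's own variance and the shared variance.
        s2_shrunk = (prior_df * prior_var + df * s2) / (prior_df + df)
        ordinary.append(diff / math.sqrt(s2 * scale))
        shrunken.append(diff / math.sqrt(s2_shrunk * scale))
    return ordinary, shrunken


def simulate(n_genes=2000, n_reps=3, seed=0):
    """Simulate null data: no gene is differentially expressed."""
    rng = random.Random(seed)
    a = [[rng.gauss(0.0, 1.0) for _ in range(n_reps)] for _ in range(n_genes)]
    b = [[rng.gauss(0.0, 1.0) for _ in range(n_reps)] for _ in range(n_genes)]
    return a, b


group_a, group_b = simulate()
ordinary, shrunken = t_statistics(group_a, group_b)
# Largest absolute statistic under the null, with and without shrinkage.
print(max(abs(t) for t in ordinary), max(abs(t) for t in shrunken))
```

In this sketch, a gene whose variance happens to be estimated far too low gets a smaller shrunken statistic than its ordinary t-statistic, which is the kind of spurious extreme value that small replicate counts tend to produce.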
Similar articles
- Empirical Bayes screening of many p-values with applications to microarray studies. Bioinformatics. 2005 May 1;21(9):1987-94. doi: 10.1093/bioinformatics/bti301. Epub 2005 Feb 2. PMID: 15691856
- Significance testing for small microarray experiments. Stat Med. 2005 Aug 15;24(15):2281-98. doi: 10.1002/sim.2109. PMID: 15889452
- Microarray data analysis: a hierarchical T-test to handle heteroscedasticity. Appl Bioinformatics. 2004;3(4):229-35. PMID: 15702953
- Differential analysis of DNA microarray gene expression data. Mol Microbiol. 2003 Feb;47(4):871-7. doi: 10.1046/j.1365-2958.2003.03298.x. PMID: 12581345 Review.
- Clustering methods for microarray gene expression data. OMICS. 2006 Winter;10(4):507-31. doi: 10.1089/omi.2006.10.507. PMID: 17233561 Review.
Cited by
- Multivariate hierarchical Bayesian model for differential gene expression analysis in microarray experiments. BMC Bioinformatics. 2008;9 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-9-S1-S9. PMID: 18315862 Free PMC article.
- Use of genomic DNA control features and predicted operon structure in microarray data analysis: ArrayLeaRNA - a Bayesian approach. BMC Bioinformatics. 2007 Nov 19;8:455. doi: 10.1186/1471-2105-8-455. PMID: 18021437 Free PMC article.
- A two-sample Bayesian t-test for microarray data. BMC Bioinformatics. 2006 Mar 10;7:126. doi: 10.1186/1471-2105-7-126. PMID: 16529652 Free PMC article.
- Comparative Analysis of Shapley Values Enhances Transcriptomics Insights across Some Common Uterine Pathologies. Genes (Basel). 2024 Jun 1;15(6):723. doi: 10.3390/genes15060723. PMID: 38927658 Free PMC article.
- A DNA sequence directed mutual transcription regulation of HSF1 and NFIX involves novel heat sensitive protein interactions. PLoS One. 2009;4(4):e5050. doi: 10.1371/journal.pone.0005050. Epub 2009 Apr 1. PMID: 19337383 Free PMC article.