EVE (external variance estimation) increases statistical power for detecting differentially expressed genes
- PMID: 17680783
- DOI: 10.1111/j.1365-313X.2007.03227.x
EVE (external variance estimation) increases statistical power for detecting differentially expressed genes
Abstract
Accurately identifying differentially expressed genes from microarray data is not a trivial task, partly because of poor variance estimates of gene expression signals. Here, after analyzing 380 replicated microarray experiments, we found that probesets have typical, distinct variances that can be estimated based on a large number of microarray experiments. These probeset-specific variances depend at least in part on the function of the probed gene: genes for ribosomal or structural proteins often have a small variance, while genes implicated in stress responses often have large variances. We used these variance estimates to develop a statistical test for differentially expressed genes called EVE (external variance estimation). The EVE algorithm performs better than the t-test and LIMMA on some real-world data, where external information from appropriate databases is available. Thus, EVE helps to maximize the information gained from a typical microarray experiment. Nonetheless, only a large number of replicates will guarantee to identify nearly all truly differentially expressed genes. However, our simulation studies suggest that even limited numbers of replicates will usually result in good coverage of strongly differentially expressed genes.
Similar articles
-
Microarray data analysis: a hierarchical T-test to handle heteroscedasticity.Appl Bioinformatics. 2004;3(4):229-35. Appl Bioinformatics. 2004. PMID: 15702953
-
Variance component estimation for mixed model analysis of cDNA microarray data.Biom J. 2008 Dec;50(6):927-39. doi: 10.1002/bimj.200810476. Biom J. 2008. PMID: 19035549
-
Unequal group variances in microarray data analyses.Bioinformatics. 2008 May 1;24(9):1168-74. doi: 10.1093/bioinformatics/btn100. Epub 2008 Mar 14. Bioinformatics. 2008. PMID: 18344518
-
Microarray data quality control improves the detection of differentially expressed genes.Genomics. 2010 Mar;95(3):138-42. doi: 10.1016/j.ygeno.2010.01.003. Epub 2010 Jan 14. Genomics. 2010. PMID: 20079422 Review.
-
Normalization and quantification of differential expression in gene expression microarrays.Brief Bioinform. 2006 Jun;7(2):166-77. doi: 10.1093/bib/bbl002. Epub 2006 Mar 7. Brief Bioinform. 2006. PMID: 16772260 Review.
Cited by
-
Density based pruning for identification of differentially expressed genes from microarray data.BMC Genomics. 2010 Nov 2;11 Suppl 2(Suppl 2):S3. doi: 10.1186/1471-2164-11-S2-S3. BMC Genomics. 2010. PMID: 21047384 Free PMC article.
-
Literature aided determination of data quality and statistical significance threshold for gene expression studies.BMC Genomics. 2012;13 Suppl 8(Suppl 8):S23. doi: 10.1186/1471-2164-13-S8-S23. Epub 2012 Dec 17. BMC Genomics. 2012. PMID: 23282414 Free PMC article.
-
Using pre-existing microarray datasets to increase experimental power: application to insulin resistance.PLoS Comput Biol. 2010 Mar 26;6(3):e1000718. doi: 10.1371/journal.pcbi.1000718. PLoS Comput Biol. 2010. PMID: 20361040 Free PMC article.
-
A Population Proportion approach for ranking differentially expressed genes.BMC Bioinformatics. 2008 Sep 18;9:380. doi: 10.1186/1471-2105-9-380. BMC Bioinformatics. 2008. PMID: 18801167 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources