Analysis of a simulated microarray dataset: comparison of methods for data normalisation and detection of differential expression (open access publication)
- PMID: 18053575
- PMCID: PMC2682813
- DOI: 10.1186/1297-9686-39-6-669
Analysis of a simulated microarray dataset: comparison of methods for data normalisation and detection of differential expression (open access publication)
Abstract
Microarrays allow researchers to measure the expression of thousands of genes in a single experiment. Before statistical comparisons can be made, the data must be assessed for quality and normalisation procedures must be applied, of which many have been proposed. Methods of comparing the normalised data are also abundant, and no clear consensus has yet been reached. The purpose of this paper was to compare those methods used by the EADGENE network on a very noisy simulated data set. With the a priori knowledge of which genes are differentially expressed, it is possible to compare the success of each approach quantitatively. Use of an intensity-dependent normalisation procedure was common, as was correction for multiple testing. Most variety in performance resulted from differing approaches to data quality and the use of different statistical tests. Very few of the methods used any kind of background correction. A number of approaches achieved a success rate of 95% or above, with relatively small numbers of false positives and negatives. Applying stringent spot selection criteria and elimination of data did not improve the false positive rate and greatly increased the false negative rate. However, most approaches performed well, and it is encouraging that widely available techniques can achieve such good results on a very noisy data set.
Similar articles
-
Analysis of the real EADGENE data set: comparison of methods and guidelines for data normalisation and selection of differentially expressed genes (open access publication).Genet Sel Evol. 2007 Nov-Dec;39(6):633-50. doi: 10.1186/1297-9686-39-6-633. Epub 2007 Dec 6. Genet Sel Evol. 2007. PMID: 18053573 Free PMC article.
-
The EADGENE Microarray Data Analysis Workshop (open access publication).Genet Sel Evol. 2007 Nov-Dec;39(6):621-31. doi: 10.1186/1297-9686-39-6-621. Epub 2007 Dec 6. Genet Sel Evol. 2007. PMID: 18053572 Free PMC article.
-
Analysis of the real EADGENE data set: multivariate approaches and post analysis (open access publication).Genet Sel Evol. 2007 Nov-Dec;39(6):651-68. doi: 10.1186/1297-9686-39-6-651. Epub 2007 Dec 6. Genet Sel Evol. 2007. PMID: 18053574 Free PMC article.
-
The analysis of microarray data.Pharmacogenomics. 2003 Jul;4(4):477-97. doi: 10.1517/phgs.4.4.477.22744. Pharmacogenomics. 2003. PMID: 12831325 Review.
-
An assessment of recently published gene expression data analyses: reporting experimental design and statistical factors.BMC Med Inform Decis Mak. 2006 Jun 21;6:27. doi: 10.1186/1472-6947-6-27. BMC Med Inform Decis Mak. 2006. PMID: 16790051 Free PMC article. Review.
Cited by
-
Methods for interpreting lists of affected genes obtained in a DNA microarray experiment.BMC Proc. 2009 Jul 16;3 Suppl 4(Suppl 4):S5. doi: 10.1186/1753-6561-3-s4-s5. BMC Proc. 2009. PMID: 19615118 Free PMC article.
-
The EADGENE and SABRE post-analyses workshop.BMC Proc. 2009 Jul 16;3 Suppl 4(Suppl 4):I1. doi: 10.1186/1753-6561-3-S4-I1. BMC Proc. 2009. PMID: 19615108 Free PMC article. No abstract available.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources