Which is better for cDNA-microarray-based classification: ratios or direct intensities?
- PMID: 15454406
- DOI: 10.1093/bioinformatics/bth272
Abstract
Motivation: There are two general methods for making gene-expression microarrays: one is to hybridize a single test set of labeled targets to the probe, and measure the background-subtracted intensity at each probe site; the other is to hybridize both a test and a reference set of differentially labeled targets to a single detector array, and measure the ratio of the background-subtracted intensities at each probe site. Which method is better depends on the variability in the cell system and the random factors resulting from the microarray technology. It also depends on the purpose for which the microarray is being used. Classification is a fundamental application and it is the one considered here.
Results: This paper describes a model-based simulation paradigm that compares the classification accuracy provided by these methods over a variety of noise types and presents the results of a study modeled on noise typical of cDNA microarray data. The model consists of four parts: (1) the measurement equation for genes in the reference state; (2) the measurement equation for genes in the test state; (3) the ratio and normalization procedure for a dual-channel system; and (4) the intensity and normalization procedure for a single-channel system. In the reference state, the mean intensities are modeled as a shifted exponential distribution, and the intensity for a particular gene is modeled via a normal distribution, Normal(I, αI), about its mean intensity I, where α is the coefficient of variation of the cell system (so the standard deviation is αI). In the test state, some genes have their intensities up-regulated by a random factor. The model includes a number of random factors affecting intensity measurement: deposition gain d, labeling gain, and post-image-processing residual noise. The key conclusion of the study is that the coefficient of variation α governing the randomness of the intensities and the deposition gain d are the most important factors in determining whether a single-channel or dual-channel system provides superior classification, and that the decision region in the α-d plane is approximately linear.
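The measurement model described above can be illustrated with a minimal simulation sketch. This is not the authors' implementation: the function names (`draw_means`, `simulate_array`), the parameter `d_cv` (treated here as the spread of a per-spot multiplicative deposition gain), the fixed `fold` change, and all numeric values are assumptions chosen for illustration. Labeling gain, residual noise, and normalization are omitted for brevity.

```python
import random


def draw_means(n_genes, shift, scale, rng):
    """Mean intensities from a shifted exponential: shift + Exp(mean=scale)."""
    return [shift + rng.expovariate(1.0 / scale) for _ in range(n_genes)]


def simulate_array(means, alpha, d_cv, up_genes, fold, rng):
    """Simulate one array; return (single-channel intensities, dual-channel ratios).

    Assumptions (not taken from the paper): the deposition gain is a per-spot
    multiplicative factor drawn from Normal(1, d_cv), and up-regulation is a
    fixed fold change rather than a random factor.
    """
    single, ratio = [], []
    for g, mean in enumerate(means):
        # Cell-system variability: Normal(I, alpha * I) about the mean intensity I.
        ref = max(rng.gauss(mean, alpha * mean), 1e-9)
        test_mean = mean * fold if g in up_genes else mean
        test = max(rng.gauss(test_mean, alpha * test_mean), 1e-9)
        # One deposition gain per spot, shared by both channels on that spot.
        dep = max(rng.gauss(1.0, d_cv), 1e-9)
        single.append(dep * test)                # single-channel: gain stays in the signal
        ratio.append((dep * test) / (dep * ref)) # dual-channel: gain cancels in the ratio
    return single, ratio


if __name__ == "__main__":
    rng = random.Random(0)
    means = draw_means(5, shift=100.0, scale=1000.0, rng=rng)
    single, ratio = simulate_array(means, alpha=0.1, d_cv=0.3,
                                   up_genes={1}, fold=2.0, rng=rng)
    for g, (s, r) in enumerate(zip(single, ratio)):
        print(g, round(s, 1), round(r, 3))
```

Under these assumptions the sketch shows the trade-off the abstract points to: the ratio removes the deposition gain but adds the reference channel's biological variability, while the direct intensity avoids the second noise source but retains the deposition gain.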