Methodological study of affine transformations of gene expression data with proposed robust non-parametric multi-dimensional normalization method
- PMID: 16509971
- PMCID: PMC1534066
- DOI: 10.1186/1471-2105-7-100
Methodological study of affine transformations of gene expression data with proposed robust non-parametric multi-dimensional normalization method
Abstract
Background: Low-level processing and normalization of microarray data are most important steps in microarray analysis, which have profound impact on downstream analysis. Multiple methods have been suggested to date, but it is not clear which is the best. It is therefore important to further study the different normalization methods in detail and the nature of microarray data in general.
Results: A methodological study of affine models for gene expression data is carried out. Focus is on two-channel comparative studies, but the findings generalize also to single- and multi-channel data. The discussion applies to spotted as well as in-situ synthesized microarray data. Existing normalization methods such as curve-fit ("lowess") normalization, parallel and perpendicular translation normalization, and quantile normalization, but also dye-swap normalization are revisited in the light of the affine model and their strengths and weaknesses are investigated in this context. As a direct result from this study, we propose a robust non-parametric multi-dimensional affine normalization method, which can be applied to any number of microarrays with any number of channels either individually or all at once. A high-quality cDNA microarray data set with spike-in controls is used to demonstrate the power of the affine model and the proposed normalization method.
Conclusion: We find that an affine model can explain non-linear intensity-dependent systematic effects in observed log-ratios. Affine normalization removes such artifacts for non-differentially expressed genes and assures that symmetry between negative and positive log-ratios is obtained, which is fundamental when identifying differentially expressed genes. In addition, affine normalization makes the empirical distributions in different channels more equal, which is the purpose of quantile normalization, and may also explain why dye-swap normalization works or fails. All methods are made available in the aroma package, which is a platform-independent package for R.
Figures









Similar articles
-
A robust two-way semi-linear model for normalization of cDNA microarray data.BMC Bioinformatics. 2005 Jan 21;6:14. doi: 10.1186/1471-2105-6-14. BMC Bioinformatics. 2005. PMID: 15663789 Free PMC article.
-
Can Zipf's law be adapted to normalize microarrays?BMC Bioinformatics. 2005 Feb 23;6:37. doi: 10.1186/1471-2105-6-37. BMC Bioinformatics. 2005. PMID: 15727680 Free PMC article.
-
Use of normalization methods for analysis of microarrays containing a high degree of gene effects.BMC Bioinformatics. 2008 Nov 28;9:505. doi: 10.1186/1471-2105-9-505. BMC Bioinformatics. 2008. PMID: 19040742 Free PMC article.
-
Normalization and quantification of differential expression in gene expression microarrays.Brief Bioinform. 2006 Jun;7(2):166-77. doi: 10.1093/bib/bbl002. Epub 2006 Mar 7. Brief Bioinform. 2006. PMID: 16772260 Review.
-
Chondrocyte channel transcriptomics: do microarray data fit with expression and functional data?Channels (Austin). 2013 Nov-Dec;7(6):459-67. doi: 10.4161/chan.26071. Epub 2013 Aug 30. Channels (Austin). 2013. PMID: 23995703 Free PMC article. Review.
Cited by
-
Empirical estimation of sequencing error rates using smoothing splines.BMC Bioinformatics. 2016 Apr 22;17:177. doi: 10.1186/s12859-016-1052-3. BMC Bioinformatics. 2016. PMID: 27102907 Free PMC article.
-
Genomic islands as a marker to differentiate between clinical and environmental Burkholderia pseudomallei.PLoS One. 2012;7(6):e37762. doi: 10.1371/journal.pone.0037762. Epub 2012 Jun 1. PLoS One. 2012. PMID: 22675491 Free PMC article.
-
Microarray image analysis: background estimation using quantile and morphological filters.BMC Bioinformatics. 2006 Feb 28;7:96. doi: 10.1186/1471-2105-7-96. BMC Bioinformatics. 2006. PMID: 16504173 Free PMC article.
-
TumorBoost: normalization of allele-specific tumor copy numbers from a single pair of tumor-normal genotyping microarrays.BMC Bioinformatics. 2010 May 12;11:245. doi: 10.1186/1471-2105-11-245. BMC Bioinformatics. 2010. PMID: 20462408 Free PMC article.
-
MicroRNA expression as risk biomarker of breast cancer metastasis: a pilot retrospective case-cohort study.BMC Cancer. 2014 Oct 2;14:739. doi: 10.1186/1471-2407-14-739. BMC Cancer. 2014. PMID: 25277099 Free PMC article.
References
-
- Schena M, Shalon D, Davis RW, Brown PO. Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science. 1995;270(5235):467–470. - PubMed
-
- Bengtsson H. Identification and normalization of plate effects in cDNA microarray data. Preprints in Mathematical Sciences 2002:28, Mathematical Statistics, Centre for Mathematical Sciences, Lund University, Sweden; 2002.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials