A new non-linear normalization method for reducing variability in DNA microarray experiments
- PMID: 12225587
- PMCID: PMC126873
- DOI: 10.1186/gb-2002-3-9-research0048
Abstract
Background: Microarray data are subject to multiple sources of variation, of which biological sources are of interest whereas most others are only confounding. Recent work has identified systematic sources of variation that are intensity-dependent and non-linear in nature. Systematic sources of variation are not limited to the differing properties of the cyanine dyes Cy5 and Cy3 as observed in cDNA arrays, but are the general case for both oligonucleotide microarray (Affymetrix GeneChips) and cDNA microarray data. Current normalization techniques are most often linear and therefore not capable of fully correcting for these effects.
Results: We present here a simple and robust non-linear method for normalization using array signal distribution analysis and cubic splines. These methods compared favorably to normalization using robust local-linear regression (lowess). The application of these methods to oligonucleotide arrays reduced the relative error between replicates by 5-10% compared with a standard global normalization method. Application to cDNA arrays showed improvements over the standard method and over Cy3-Cy5 normalization based on dye-swap replication. In addition, a set of known differentially regulated genes was ranked higher by the t-test. In either cDNA or Affymetrix technology, signal-dependent bias was more than ten times greater than the observed print-tip or spatial effects.
Conclusions: Intensity-dependent normalization is important for both high-density oligonucleotide array and cDNA array data. Both the regression and spline-based methods described here performed better than existing linear methods when assessed on the variability of replicate arrays. Dye-swap normalization was less effective at Cy3-Cy5 normalization than either regression or spline-based methods alone.
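The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea of spline-based normalization on signal distributions, not the authors' implementation. It assumes log-scale intensities, takes matching quantiles of a target array and a baseline array, fits an interpolating natural cubic spline through the quantile pairs, and applies that curve to the target. The function names, the choice of 20 quantiles, the linear extrapolation in the tails, and the simulated data are all assumptions made for illustration. A median shift stands in for "standard global normalization".

```python
import bisect
import random


def quantiles(values, n):
    """n interior quantiles, by linear interpolation between order statistics."""
    s = sorted(values)
    out = []
    for k in range(1, n + 1):
        pos = k / (n + 1) * (len(s) - 1)
        lo = int(pos)
        hi = min(lo + 1, len(s) - 1)
        out.append(s[lo] + (pos - lo) * (s[hi] - s[lo]))
    return out


def natural_cubic_spline(xs, ys):
    """Return f(x) for the natural cubic spline through (xs, ys).

    xs must be strictly increasing. Outside the knots f continues linearly.
    """
    n = len(xs)
    h = [xs[i + 1] - xs[i] for i in range(n - 1)]
    slope = [(ys[i + 1] - ys[i]) / h[i] for i in range(n - 1)]
    # Second derivatives m at the knots; m[0] = m[n-1] = 0 (natural ends).
    m, diag, rhs = [0.0] * n, [0.0] * n, [0.0] * n
    for i in range(1, n - 1):
        diag[i] = 2.0 * (h[i - 1] + h[i])
        rhs[i] = 6.0 * (slope[i] - slope[i - 1])
    for i in range(2, n - 1):  # forward elimination (Thomas algorithm)
        w = h[i - 1] / diag[i - 1]
        diag[i] -= w * h[i - 1]
        rhs[i] -= w * rhs[i - 1]
    for i in range(n - 2, 0, -1):  # back substitution
        m[i] = (rhs[i] - h[i] * m[i + 1]) / diag[i]

    def f(x):
        if x <= xs[0]:  # linear extrapolation below the first knot
            return ys[0] + (slope[0] - h[0] * m[1] / 6.0) * (x - xs[0])
        if x >= xs[-1]:  # linear extrapolation above the last knot
            return ys[-1] + (slope[-1] + h[-1] * m[-2] / 6.0) * (x - xs[-1])
        i = bisect.bisect_right(xs, x) - 1
        a, b = xs[i + 1] - x, x - xs[i]
        return ((m[i] * a ** 3 + m[i + 1] * b ** 3) / (6.0 * h[i])
                + (ys[i] / h[i] - m[i] * h[i] / 6.0) * a
                + (ys[i + 1] / h[i] - m[i + 1] * h[i] / 6.0) * b)

    return f


def spline_normalize(target, baseline, n_quantiles=20):
    """Map target intensities onto the baseline scale via a quantile spline."""
    f = natural_cubic_spline(quantiles(target, n_quantiles),
                             quantiles(baseline, n_quantiles))
    return [f(v) for v in target]


def mean_abs_diff(a, b):
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


# Simulated replicate pair (hypothetical data): the target carries a smooth,
# intensity-dependent, non-linear bias relative to the baseline.
random.seed(0)
truth = [random.gauss(8.0, 2.0) for _ in range(2000)]
baseline = [t + random.gauss(0.0, 0.05) for t in truth]
target = [t + 0.5 + 0.05 * (t - 8.0) ** 2 + random.gauss(0.0, 0.05)
          for t in truth]

# Global normalization stand-in: shift the target so the medians agree.
shift = quantiles(baseline, 1)[0] - quantiles(target, 1)[0]
global_norm = [v + shift for v in target]
spline_norm = spline_normalize(target, baseline)

err_global = mean_abs_diff(global_norm, baseline)
err_spline = mean_abs_diff(spline_norm, baseline)
print("mean |replicate difference| after global shift:", round(err_global, 3))
print("mean |replicate difference| after spline fit:  ", round(err_spline, 3))
```

On this simulated pair the single global shift cannot remove a bias that changes with intensity, while the quantile spline can follow it. Real arrays would need handling for tied quantiles (the knots must be strictly increasing) and probably a smoothing rather than interpolating spline.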