Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006;2006(1):43056.
doi: 10.1155/BSB/2006/43056.

Normalization benefits microarray-based classification

Affiliations

Normalization benefits microarray-based classification

Jianping Hua et al. EURASIP J Bioinform Syst Biol. 2006.

Abstract

When using cDNA microarrays, normalization to correct labeling bias is a common preliminary step before further data analysis is applied, its objective being to reduce the variation between arrays. To date, assessment of the effectiveness of normalization has mainly been confined to the ability to detect differentially expressed genes. Since a major use of microarrays is the expression-based phenotype classification, it is important to evaluate microarray normalization procedures relative to classification. Using a model-based approach, we model the systemic-error process to generate synthetic gene-expression values with known ground truth. These synthetic expression values are subjected to typical normalization methods and passed through a set of classification rules, the objective being to carry out a systematic study of the effect of normalization on classification. Three normalization methods are considered: offset, linear regression, and Lowess regression. Seven classification rules are considered: 3-nearest neighbor, linear support vector machine, linear discriminant analysis, regular histogram, Gaussian kernel, perceptron, and multiple perceptron with majority voting. The results of the first three are presented in the paper, with the full results being given on a complementary website. The conclusion from the different experiment models considered in the study is that normalization can have a significant benefit for classification under difficult experimental conditions, with linear and Lowess regression slightly outperforming the offset method.

PubMed Disclaimer

References

    1. Quackenbush J. Microarray data normalization and transformation. Nature Genetics. 2002;32(5 supplement):496–501. - PubMed
    1. Bilban M, Buehler LK, Head S, Desoye G, Quaranta V. Normalizing DNA microarray data. Current Issues in Molecular Biology. 2002;4(2):57–64. - PubMed
    1. Attoor S, Dougherty ER, Chen Y, Bittner ML, Trent JM. Which is better for cDNA-microarray-based classification: ratios or direct intensities. Bioinformatics. 2004;20(16):2513–2520. doi: 10.1093/bioinformatics/bth272. - DOI - PubMed
    1. Chen Y, Kamat V, Dougherty ER, Bittner ML, Meltzer PS, Trent JM. Ratio statistics of gene expression levels and applications to microarray data analysis. Bioinformatics. 2002;18(9):1207–1215. doi: 10.1093/bioinformatics/18.9.1207. - DOI - PubMed
    1. Yang YH, Dudoit S, Luu P. et al.Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Research. 2002;30(4):e15. doi: 10.1093/nar/30.4.e15. - DOI - PMC - PubMed

LinkOut - more resources