Statistical challenges in preprocessing in microarray experiments in cancer
- PMID: 18829474
- PMCID: PMC3529914
- DOI: 10.1158/1078-0432.CCR-07-4532
Statistical challenges in preprocessing in microarray experiments in cancer
Abstract
Many clinical studies incorporate genomic experiments to investigate the potential associations between high-dimensional molecular data and clinical outcome. A critical first step in the statistical analyses of these experiments is that the molecular data are preprocessed. This article provides an overview of preprocessing methods, including summary algorithms and quality control metrics for microarrays. Some of the ramifications and effects that preprocessing methods have on the statistical results are illustrated. The discussions are centered around a microarray experiment based on lung cancer tumor samples with survival as the clinical outcome of interest. The procedures that are presented focus on the array platform used in this study. However, many of these issues are more general and are applicable to other instruments for genome-wide investigation. The discussions here will provide insight into the statistical challenges in preprocessing microarrays used in clinical studies of cancer. These challenges should not be viewed as inconsequential nuisances but rather as important issues that need to be addressed so that informed conclusions can be drawn.
Figures




Similar articles
-
A review of statistical methods for preprocessing oligonucleotide microarrays.Stat Methods Med Res. 2009 Dec;18(6):533-41. doi: 10.1177/0962280209351924. Stat Methods Med Res. 2009. PMID: 20048383 Free PMC article.
-
Micro-Analyzer: automatic preprocessing of Affymetrix microarray data.Comput Methods Programs Biomed. 2013 Aug;111(2):402-9. doi: 10.1016/j.cmpb.2013.04.006. Epub 2013 May 31. Comput Methods Programs Biomed. 2013. PMID: 23731720
-
Working with Oligonucleotide Arrays.Methods Mol Biol. 2016;1418:145-59. doi: 10.1007/978-1-4939-3578-9_7. Methods Mol Biol. 2016. PMID: 27008013
-
Classification algorithms for phenotype prediction in genomics and proteomics.Front Biosci. 2008 Jan 1;13:691-708. doi: 10.2741/2712. Front Biosci. 2008. PMID: 17981580 Free PMC article. Review.
-
Statistical considerations for analysis of microarray experiments.Clin Transl Sci. 2011 Dec;4(6):466-77. doi: 10.1111/j.1752-8062.2011.00309.x. Epub 2011 Nov 7. Clin Transl Sci. 2011. PMID: 22212230 Free PMC article. Review.
Cited by
-
Criteria for the use of omics-based predictors in clinical trials: explanation and elaboration.BMC Med. 2013 Oct 17;11:220. doi: 10.1186/1741-7015-11-220. BMC Med. 2013. PMID: 24228635 Free PMC article.
-
FAS Death Receptor: A Breast Cancer Subtype-Specific Radiation Response Biomarker and Potential Therapeutic Target.Radiat Res. 2015 Nov;184(5):456-69. doi: 10.1667/RR14089.1. Epub 2015 Oct 21. Radiat Res. 2015. PMID: 26488758 Free PMC article. Clinical Trial.
-
Identification of Key Genes in Gastric Cancer by Bioinformatics Analysis.Biomed Res Int. 2020 Sep 21;2020:7658230. doi: 10.1155/2020/7658230. eCollection 2020. Biomed Res Int. 2020. PMID: 33015179 Free PMC article.
-
Unraveling the Core Components and Critical Targets of Houttuynia cordata Thunb. in Treating Non-small Cell Lung Cancer through Network Pharmacology and Multi-omics Analysis.Curr Pharm Des. 2025;31(7):540-558. doi: 10.2174/0113816128330427241017110325. Curr Pharm Des. 2025. PMID: 39440769 Free PMC article.
-
A Python Clustering Analysis Protocol of Genes Expression Data Sets.Genes (Basel). 2022 Oct 12;13(10):1839. doi: 10.3390/genes13101839. Genes (Basel). 2022. PMID: 36292724 Free PMC article.
References
-
- Pollack JR, Perou CM, Alizadeh AA, Eisen MB, Pergamenschikov A, Williams CF, Jeffrey SS, Botstein D, Brown PO. Genome-wide analysis of DNA copy-number changes using cDNA microarrays. Nature Genetics. 1999;23(1):41–46. - PubMed
-
- Schena M, Shalon D, Davis RW, Brown PO. Quantitative monitoring of gene-expression patterns with a complementary-DNA microarray. Science. 1995;270(5235):467–470. - PubMed
-
- Barry WT, Nobel AB, Wright FA. Significance analysis of functional categories in gene expression studies: a structured permutation approach. Bioinformatics. 2005;21(9):1943–1949. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources