Exploration, normalization, and summaries of high density oligonucleotide array probe level data
- PMID: 12925520
- DOI: 10.1093/biostatistics/4.2.249
Abstract
In this paper we report exploratory analyses of high-density oligonucleotide array data from the Affymetrix GeneChip system, with the objective of improving upon currently used measures of gene expression. Our analyses make use of three data sets: a small experimental study consisting of five MGU74A mouse GeneChip arrays; part of the data from an extensive spike-in study conducted by Gene Logic and Wyeth's Genetics Institute involving 95 HG-U95A human GeneChip arrays; and part of a dilution study conducted by Gene Logic involving 75 HG-U95A GeneChip arrays. We display some familiar features of the perfect match (PM) and mismatch (MM) probe values of these data, and examine the variance-mean relationship using probe-level data from probes believed to be defective and therefore delivering noise only. We explain why we need to normalize the arrays to one another using probe-level intensities. We then examine the behavior of the PM and MM values using spike-in data and assess three commonly used summary measures: (i) Affymetrix's average difference (AvDiff), (ii) Affymetrix's MAS 5.0 signal, and (iii) the Li and Wong multiplicative model-based expression index (MBEI). The exploratory data analyses of the probe-level data motivate a new summary measure that is a robust multi-array average (RMA) of background-adjusted, normalized, and log-transformed PM values. We evaluate the four expression summary measures using the dilution study data, assessing their behavior in terms of bias, variance, and (for MBEI and RMA) model fit. Finally, we evaluate the algorithms in terms of their ability to detect known levels of differential expression using the spike-in data. We conclude that there is no obvious downside to using RMA and attaching a standard error (SE) to this quantity using a linear model that removes probe-specific affinities.
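The abstract describes RMA only as a robust multi-array average of background-adjusted, normalized, and log-transformed PM values; it does not spell out the individual steps. The following is a minimal sketch of how such a summary could be computed, under two assumptions that are not stated in the abstract: arrays are normalized by quantile normalization, and the robust fit of a log-scale additive model (probe affinity plus array expression) is done with Tukey's median polish. Background adjustment and the standard error are omitted, and all function names are hypothetical.

```python
import math
from statistics import median


def quantile_normalize(columns):
    """Give every array (one list per array) the same empirical distribution.

    Assumption: quantile normalization; ties are broken by original order.
    """
    n = len(columns[0])
    sorted_cols = [sorted(col) for col in columns]
    # Reference distribution: mean of the k-th smallest value across arrays.
    reference = [sum(sc[k] for sc in sorted_cols) / len(columns) for k in range(n)]
    normalized = []
    for col in columns:
        order = sorted(range(n), key=lambda i: col[i])
        out = [0.0] * n
        for rank, i in enumerate(order):
            out[i] = reference[rank]
        normalized.append(out)
    return normalized


def median_polish(matrix, n_iter=10, tol=1e-8):
    """Fit matrix[i][j] ~ overall + row[i] + col[j] by alternating median sweeps.

    Rows are probes, columns are arrays.
    Returns (overall, row_effects, col_effects).
    """
    resid = [list(row) for row in matrix]
    n_rows, n_cols = len(resid), len(resid[0])
    overall = 0.0
    row_eff = [0.0] * n_rows
    col_eff = [0.0] * n_cols
    for _ in range(n_iter):
        # Sweep out row (probe) medians.
        row_med = [median(row) for row in resid]
        for i in range(n_rows):
            resid[i] = [v - row_med[i] for v in resid[i]]
            row_eff[i] += row_med[i]
        shift = median(col_eff)
        col_eff = [c - shift for c in col_eff]
        overall += shift
        # Sweep out column (array) medians.
        col_med = [median(resid[i][j] for i in range(n_rows)) for j in range(n_cols)]
        for i in range(n_rows):
            resid[i] = [resid[i][j] - col_med[j] for j in range(n_cols)]
        col_eff = [c + m for c, m in zip(col_eff, col_med)]
        shift = median(row_eff)
        row_eff = [r - shift for r in row_eff]
        overall += shift
        # Stop once a full sweep no longer changes the residuals.
        if max(abs(m) for m in row_med + col_med) < tol:
            break
    return overall, row_eff, col_eff


def summarize_probe_set(pm):
    """Return one log2-scale expression value per array for a single probe set.

    `pm` holds positive PM intensities (assumed already background-adjusted
    and normalized), with rows as probes and columns as arrays.
    """
    log_pm = [[math.log2(v) for v in row] for row in pm]
    overall, _, col_eff = median_polish(log_pm)
    return [overall + c for c in col_eff]
```

Because the fit is done on the log scale with medians rather than means, a single aberrant probe has limited influence on the per-array value, which is the sense in which such a summary is robust. In this sketch the probe (row) effects play the role of the probe-specific affinities mentioned in the abstract.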