Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007;8(3):R44.
doi: 10.1186/gb-2007-8-3-r44.

Normalization of two-channel microarrays accounting for experimental design and intensity-dependent relationships

Affiliations

Normalization of two-channel microarrays accounting for experimental design and intensity-dependent relationships

Alan R Dabney et al. Genome Biol. 2007.

Abstract

In normalizing two-channel expression arrays, the ANOVA approach explicitly incorporates the experimental design in its model, and the MA plot-based approach accounts for intensity-dependent biases. However, both approaches can lead to inaccurate normalization in fairly common scenarios. We propose a method called efficient Common Array Dye Swap (eCADS) for normalizing two-channel microarrays that accounts for both experimental design and intensity-dependent biases. Under reasonable experimental designs, eCADS preserves differential expression relationships and requires only a single array per sample pair.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Overview of the eCADS model. The left panel summarizes the model of equation 2. The observed fluorescence intensity (yR or yG) for agene with x RNA is modeled as the sum of the average dye function d, the function corresponding to the dye used for labeling (δR or δG), and an array-specific function a. Since we do not know the true RNA amounts x, we warp the x-axis so that every x value is replaced with d(x); these 'warped RNA amounts' are essentially group means adjusted gene-by-gene for bias (see Model formulation). The curves in the right panel are analogously warped versions of the curves in the left panel, now representing deviations from the group mean (the straight line). The warping enables the estimation of the model without affecting the relationships of interest.
Figure 2
Figure 2
Functions used in simulation. Functions used in simulated example for dye (left) and array (right) effects. The actual dye 'effect' functions (the δs in equation 2) are the dye-specific deviations from the average curve. The array functions sum to zero at any point on the x-axis.
Figure 3
Figure 3
Summary of simulated t-statistics. Comparison of t-statistics (averaged across 100 simulations) and true mean differences after MA (left), ANOVA (middle), and eCADS (right) normalization. Black points represent genes for which the sign of differential expression has not been preserved. Plots for MA show systematic shift, indicating bias.
Figure 4
Figure 4
Summary of simulated null histograms. Histograms of null p-values after MA (left), ANOVA (middle), and eCADS (right) normalization in one simulation. Neither the MA nor ANOVA null p-values are uniformly distributed (KS significance approximately zero), while eCADS null p-values are (KS p = 0.86).
Figure 5
Figure 5
MA plots for mouse prostate development study. The three arrays from one dye configuration are in the top row, while those from the reversed configuration are in the second row. There is apparently asymmetric, intensity-dependent differential expression in this example.
Figure 6
Figure 6
Estimated group means and bias functions for mouse data. The left panel is an MA plot comparing the 'warped RNA' (group means adjusted gene-by-gene for bias) for the two comparison groups. The middle panel shows the estimated dye effect functions, and the right panel shows the estimated array effect functions.
Figure 7
Figure 7
Estimated group means and bias functions for MAQC data. The left panel is an MA plot comparing the 'warped RNA' (group means adjusted gene-by-gene for bias) for the two comparison groups. The middle panel shows the estimated dye effect functions, and the right panel shows the estimated site effect functions.
Figure 8
Figure 8
Estimated array functions for MAQC data. The estimated array effect functions by site. Note that site two has substantially more array-to-array variability than the other two sites.

References

    1. Patterson TA, Lobenhofer EK, Fulmer-Smentek SB, Collins PJ, Chu T, Bao W, Fang H, Kawasaki ES, Hager J, Tikhonova IR. et al.Performance comparison of one-color and two-color platforms within the Microarray Quality Control (MAQC) project. Nat Biotechnol. 2006;24:1140–1150. doi: 10.1038/nbt1242. - DOI - PubMed
    1. Tseng GC, Oh M, Rohlin L, Liao JC, Wong WH. Issues in cDNA microarray analysis: quality filtering, channel normalization, models of variations and assessment of gene effects. Nucleic Acids Res. 2001;29:2540–2557. doi: 10.1093/nar/29.12.2549. - DOI - PMC - PubMed
    1. Yang YH, Dudoit S, Luu P, Speed TP. In: Microarrays: Optical Technologies and Informatics. Bittner ML, Chen Y, Dorsel AN, Dougherty ER, editor. Vol. 4266. Bellingham, WA: Society of Photo-Optical Instrumentation Engineers; 2001. Normalization of cDNA microarrays. pp. 141–152. Proceedings of SPIE.
    1. Yang Y, Dudoit S, Luu P, Lin D, Peng V, Ngai J, Speed T. Normalization for cDNA microarray data: A robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Res. 2002;30:e15. doi: 10.1093/nar/30.4.e15. - DOI - PMC - PubMed
    1. Yang IV, Chen E, Hasseman JP, Liang W, Frank BC, Wang S, Sharov V, Saeed AI, White J, Li J. et al.Within the fold: Assessing differential expression measures and reproducibility in microarray assays. Genome Biol. 2002;3:research0062.1–0062.12. doi: 10.1186/gb-2002-3-11-research0062. - DOI - PMC - PubMed

Publication types

MeSH terms

LinkOut - more resources