Proc Natl Acad Sci U S A. 2000 Aug 29;97(18):9834-9.
doi: 10.1073/pnas.97.18.9834.

Importance of replication in microarray gene expression studies: statistical methods and evidence from repetitive cDNA hybridizations

M L Lee et al. Proc Natl Acad Sci U S A.

Abstract

We present statistical methods for analyzing replicated cDNA microarray expression data and report the results of a controlled experiment. The study was conducted to investigate inherent variability in gene expression data and the extent to which replication in an experiment produces more consistent and reliable findings. We introduce a statistical model to describe the probability that mRNA is contained in the target sample tissue, converted to probe, and ultimately detected on the slide. We also introduce a method to analyze the combined data from all replicates. Of the 288 genes considered in this controlled experiment, 32 would be expected to produce strong hybridization signals because of the known presence of repetitive sequences within them. Results based on individual replicates, however, show that there are 55, 36, and 58 highly expressed genes in replicates 1, 2, and 3, respectively. On the other hand, an analysis by using the combined data from all 3 replicates reveals that only 2 of the 288 genes are incorrectly classified as expressed. Our experiment shows that any single microarray output is subject to substantial variability. By pooling data from replicates, we can provide a more reliable analysis of gene expression data. Therefore, we conclude that designing experiments with replications will greatly reduce misclassification rates. We recommend that at least three replicates be used in designing experiments by using cDNA microarrays, particularly when gene expression data from single specimens are being analyzed.
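The benefit of pooling replicates described in the abstract can be illustrated with a small simulation. This is only a sketch on synthetic data, not the authors' statistical model: it swaps in plain averaging plus a fixed threshold for the paper's combined analysis. The gene count (288), the number of expressed genes (32), and the number of replicates (3) come from the abstract; the signal means, the noise level, the threshold, and all function names are hypothetical values chosen for illustration.

```python
import random
import statistics

N_GENES = 288        # from the abstract
N_EXPRESSED = 32     # from the abstract
N_REPLICATES = 3     # from the abstract

# Everything below is an assumption made for illustration only.
MU_EXPRESSED = 2.0   # hypothetical mean signal of an expressed gene
MU_BACKGROUND = 0.0  # hypothetical mean signal of an unexpressed gene
SIGMA = 1.0          # hypothetical per-replicate noise (standard deviation)
THRESHOLD = 1.0      # call a gene "expressed" above the midpoint


def simulate(seed=0):
    """Return (truth, replicates): truth[g] is True for expressed genes,
    replicates[r][g] is the synthetic signal of gene g in replicate r."""
    rng = random.Random(seed)
    truth = [g < N_EXPRESSED for g in range(N_GENES)]
    replicates = [
        [rng.gauss(MU_EXPRESSED if t else MU_BACKGROUND, SIGMA) for t in truth]
        for _ in range(N_REPLICATES)
    ]
    return truth, replicates


def misclassified(signal, truth):
    """Count genes whose thresholded call disagrees with the truth."""
    return sum((s > THRESHOLD) != t for s, t in zip(signal, truth))


def compare(seed=0):
    """Return (errors per single replicate, errors after averaging)."""
    truth, replicates = simulate(seed)
    single = [misclassified(rep, truth) for rep in replicates]
    # Pool the replicates by averaging each gene's signal across them.
    pooled_signal = [statistics.fmean(values) for values in zip(*replicates)]
    pooled = misclassified(pooled_signal, truth)
    return single, pooled


if __name__ == "__main__":
    single, pooled = compare()
    print("errors per single replicate:", single)
    print("errors after averaging replicates:", pooled)
```

Averaging three independent replicates cuts the noise standard deviation by a factor of about 1.7 (the square root of 3), so the pooled call is expected to misclassify fewer genes than a typical single replicate. The exact counts depend on the seed and on the assumed parameters, and they are not a reproduction of the paper's results.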

Figures

Figure 1
(a) Normal probability plot of main effect estimates for expressed genes. (b) Normal probability plot of main effect estimates for unexpressed genes.
Figure 2
Overlay of a histogram and mixed normal p.d.f. for gene expression main effect.
