Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2003 Jul;13(7):1775-85.
doi: 10.1101/gr.1048803. Epub 2003 Jun 12.

Spotted long oligonucleotide arrays for human gene expression analysis

Affiliations
Comparative Study

Spotted long oligonucleotide arrays for human gene expression analysis

Andrea Barczak et al. Genome Res. 2003 Jul.

Abstract

DNA microarrays produced by deposition (or 'spotting')of a single long oligonucleotide probe for each gene may be an attractive alternative to other types of arrays. We produced spotted oligonucleotide arrays using two large collections of approximately 70-mer probes, and used these arrays to analyze gene expression in two dissimilar human RNA samples. These samples were also analyzed using arrays produced by in situ synthesis of sets of multiple short (25-mer) oligonucleotides for each gene (Affymetrix GeneChips). We compared expression measurements for 7344 genes that were represented in both long oligonucleotide probe collections and the in situ-synthesized 25-mer arrays. We found strong correlations (r = 0.8-0.9) between relative gene expression measurements made with spotted long oligonucleotide probes and in situ-synthesized 25-mer probe sets. Spotted long oligonucleotide arrays were suitable for use with both unamplified cDNA and amplified RNA targets, and are a cost-effective alternative for many functional genomics applications. Most previously reported evaluations of microarray technologies have focused on expression measurements made on a relatively small number of genes. The approach described here involves far more gene expression measurements and provides a useful method for comparing existing and emerging techniques for genome-scale expression analysis.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Differential expression and signal intensity measurements for all three array types. (A) Version 1 spotted long oligonucleotide arrays (means from six replicate two-color hybridizations). (B) Version 2 spotted long oligonucleotide arrays (four replicate two-color hybridizations). (C) In situ-synthesized 25-mer arrays (two replicate K562 sample single-color hybridizations and three replicate pool sample single-color hybridizations). Each point represents data from a single long oligonucleotide probe (A, B) or 25-mer probe set (C). M is a measure of differential gene expression (log2 [K562 intensity / pool intensity]). A is a measure of signal intensity (0.5 log2 K562 intensity + 0.5 log2 pool intensity).
Figure 2
Figure 2
Representation of genes on the three array types. Figures indicate the numbers of genes (distinct UniGene clusters) represented by at least one probe (or probe set) on each of the array types employed in this study. A total of 7344 genes were represented on all three array types.
Figure 3
Figure 3
Comparison of gene expression measurements from Version 1 (v1) long oligonucleotide arrays and in situ-synthesized 25-mer arrays. (A) Differential gene expression measurements for all 7344 common genes. (B) Differential gene expression measurements for 2877 genes remaining after exclusion of probes or probes sets with low signal intensities (A values below the median for that array type). (C) Comparison of signal intensities for all 7344 common genes. (D) Comparison of M values obtained from the first three and the last three v1 oligonucleotide arrays. Each point represents data from a single UniGene cluster. Pearson correlation coefficients (r) are shown for each comparison. Dashed lines are lines of equality.
Figure 4
Figure 4
Comparison of gene expression measurements on Version 2 (v2) long oligonucleotide arrays. Differential expression (A, B) and signal intensity (C, D) measurements from these arrays were compared with measurements from in situ-synthesized 25-mer arrays (A, C) and Version 1 long oligonucleotide arrays (B, D). Comparisons involve the 7344 common genes. For M value comparisons, probes or probe sets with low signal intensity were excluded. After exclusion of those probes, a total of 3133 genes (A) or 3344 genes (B) remained for comparison.
Figure 5
Figure 5
Differential expression measurements for genes with the highest and lowest M values on 25-mer arrays. Genes were divided into subsets according to the M values determined on 25-mer arrays. (A) Genes with M values in the top 1%. (B) Genes with M values close to zero. (C) Genes with M values in the bottom 1%. Each panel shows M values for these subsets of genes as measured using 25-mer arrays (solid lines), Version 1 long oligonucleotide arrays (dotted lines), and Version 2 long oligonucleotide arrays (dashed lines).
Figure 6
Figure 6
Amplified targets used with Version 1 spotted long oligonucleotide arrays. (A) Expression data from six replicate arrays hybridized with cRNA targets produced using a single round of amplification. (B) Data from two replicate arrays with cRNA targets produced using two rounds of amplification. (C, D) Comparisons of M values obtained using unamplified and amplified targets, after exclusion of probes with low A values. After exclusion of low-intensity signals, 5746 probes (C) or 5771 probes (D) of 13,971 total probes remained for comparison. (E, F) Signal intensity values for unamplified and amplified targets. Each point represents data from a single probe.

References

    1. Bolstad, B.M., Irizarry, R.A., Astrand, M., and Speed, T.P. 2003. A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics 19: 185-193. - PubMed
    1. DeRisi, J.L., Iyer, V.R., and Brown, P.O. 1997. Exploring the metabolic and genetic control of gene expression on a genomic scale. Science 278: 680-686. - PubMed
    1. Eberwine, J., Yeh, H., Miyashiro, K., Cao, Y., Nair, S., Finnell, R., Zettel, M., and Coleman, P. 1992. Analysis of gene expression in single live neurons. Proc. Natl. Acad. Sci. 89: 3010-3014. - PMC - PubMed
    1. Hughes, T.R., Mao, M., Jones, A.R., Burchard, J., Marton, M.J., Shannon, K.W., Lefkowitz, S.M., Ziman, M., Schelter, J.M., Meyer, M.R., et al. 2001. Expression profiling using microarrays fabricated by an ink-jet oligonucleotide synthesizer. Nat. Biotechnol. 19: 342-347. - PubMed
    1. Ihaka, R. and Gentleman, R. 1996. R: A language for data analysis and graphics. J. Comp. Graph. Stat. 5: 299-314.

WEB SITE REFERENCES

    1. http://www.ncbi.nlm.nih.gov/geo; Gene Expression Omnibus, NCBI, a public repository for expression data, including the data reported here.
    1. http://www.microarrays.org; DeRisi laboratory, University of California, San Francisco, a public source for microarray protocols and software.
    1. http://www.r-project.org; The R Project for Statistical Computing, describes and provides access to R, used for data analyses presented here.
    1. http://source.stanford.edu; Stanford Online Universal Resource for Clones and ESTs, provided a tool for batch assignment of UniGene cluster identifiers to the probes from each of the arrays.
    1. http://www.cmis.csiro.au/IAP/Spot/spotmanual.htm; Buckley, M.J., CSIRO Mathematical and Information Sciences, user's guide for Spot microarray image analysis software.

Publication types

MeSH terms

Substances