Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Dec 15;24(24):2887-93.
doi: 10.1093/bioinformatics/btn571. Epub 2008 Nov 4.

Cross-hybridization modeling on Affymetrix exon arrays

Affiliations

Cross-hybridization modeling on Affymetrix exon arrays

Karen Kapur et al. Bioinformatics. .

Abstract

Motivation: Microarray designs have become increasingly probe-rich, enabling targeting of specific features, such as individual exons or single nucleotide polymorphisms. These arrays have the potential to achieve quantitative high-throughput estimates of transcript abundances, but currently these estimates are affected by biases due to cross-hybridization, in which probes hybridize to off-target transcripts.

Results: To study cross-hybridization, we map Affymetrix exon array probes to a set of annotated mRNA transcripts, allowing a small number of mismatches or insertion/deletions between the two sequences. Based on a systematic study of the degree to which probes with a given match type to a transcript are affected by cross-hybridization, we developed a strategy to correct for cross-hybridization biases of gene-level expression estimates. Comparison with Solexa ultra high-throughput sequencing data demonstrates that correction for cross-hybridization leads to a significant improvement of gene expression estimates.

Availability: We provide mappings between human and mouse exon array probes and off-target transcripts and provide software extending the GeneBASE program for generating gene-level expression estimates including the cross-hybridization correction http://biogibbs.stanford.edu/~kkapur/GeneBase/.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Shown here is an illustration of a probe (top sequence) matching a transcript (bottom sequence) with a 1-bp mismatch and a 1-bp insertion/deletion.
Fig. 2.
Fig. 2.
The empirical cdf of gene standardized residuals, separated by paralog classification.
Fig. 3.
Fig. 3.
R2 statistics modeling full probe intensities by corresponding matching off-target gene expression levels, separated by the number of mismatches and insertion/deletions.
Fig. 4.
Fig. 4.
Plots of probe intensities from genes Scd3 (transcript cluster 6869918) and Scd1 (transcript cluster 6873271), separated by those which uniquely match the target transcripts and those which match the corresponding off-target transcript. Unique probes from Scd3 show a different intensity patterns from those which match Scd1, suggesting a cross-hybridization bias.
Fig. 5.
Fig. 5.
Histogram of T-values to compare the concordance of the two sets of exon array expression estimates, GeneBASE and GeneBASE-xhyb, with sequencing in liver. Histogram of (a) all genes with altered estimates between GeneBASE and GeneBASE-xhyb and (b) genes with absolute log moderated fold-change between GeneBASE and GeneBASE-xhyb estimates above 1.
Fig. 6.
Fig. 6.
Histogram of T-values to compare the concordance of the two sets of exon array expression estimates, GeneBASE and GeneBASE-xhyb-Filter, with sequencing. Histogram of (a) all genes with altered estimates between GeneBASE and GeneBASE-xhyb-Filter and (b) genes with absolute log moderated fold-change between GeneBASE and GeneBASE-xhyb-Filter estimates above 1.

References

    1. Affymetrix . Affymetrix technical documentation for exon array probe annotations. 2005a. Available at http://www.affymetrix.com/support/technical/whitepapers/exon_probeset_tr....
    1. Affymetrix . Exon array design datasheet. 2005b. Available at www.affymetrix.com/support/technical/datasheets/exon_arraydesign_datashe....
    1. Boutz PL, et al. A post-transcriptional regulatory switch in polypyrimidine tract-binding proteins reprograms alternative splicing in developing neurons. Genes Dev. 2007;21:1636–1652. - PMC - PubMed
    1. Casneuf T, et al. In situ analysis of cross-hybridisation on microarrays and the inference of expression correlation. BMC Bioinformatics. 2007;8:461. - PMC - PubMed
    1. Clark TA, et al. Discovery of tissue-specific exons using comprehensive human exon microarrays. Genome Biol. 2007;8:R64. - PMC - PubMed

Publication types