Transcript-based redefinition of grouped oligonucleotide probe sets using AceView: high-resolution annotation for microarrays
- PMID: 17394657
- PMCID: PMC1853115
- DOI: 10.1186/1471-2105-8-108
Transcript-based redefinition of grouped oligonucleotide probe sets using AceView: high-resolution annotation for microarrays
Abstract
Background: Extracting biological information from high-density Affymetrix arrays is a multi-step process that begins with the accurate annotation of microarray probes. Shortfalls in the original Affymetrix probe annotation have been described; however, few studies have provided rigorous solutions for routine data analysis.
Results: Using AceView, a comprehensive human transcript database, we have reannotated the probes by matching them to RNA transcripts instead of genes. Based on this transcript-level annotation, a new probe set definition was created in which every probe in a probe set maps to a common set of AceView gene transcripts. In addition, using artificial data sets we identified that a minimal probe set size of 4 is necessary for reliable statistical summarization. We further demonstrate that applying the new probe set definition can detect specific transcript variants contributing to differential expression and it also improves cross-platform concordance.
Conclusion: We conclude that our transcript-level reannotation and redefinition of probe sets complement the original Affymetrix design. Redefinitions introduce probe sets whose sizes may not support reliable statistical summarization; therefore, we advocate using our transcript-level mapping redefinition in a secondary analysis step rather than as a replacement. Knowing which specific transcripts are differentially expressed is important to properly design probe/primer pairs for validation purposes. For convenience, we have created custom chip-description-files (CDFs) and annotation files for our new probe set definitions that are compatible with Bioconductor, Affymetrix Expression Console or third party software.
Figures






Similar articles
-
Integrating multiple genome annotation databases improves the interpretation of microarray gene expression data.BMC Genomics. 2010 Jan 20;11:50. doi: 10.1186/1471-2164-11-50. BMC Genomics. 2010. PMID: 20089164 Free PMC article.
-
Novel definition files for human GeneChips based on GeneAnnot.BMC Bioinformatics. 2007 Nov 15;8:446. doi: 10.1186/1471-2105-8-446. BMC Bioinformatics. 2007. PMID: 18005434 Free PMC article.
-
Calculation of reliable transcript levels of annotated genes on the basis of multiple probe-sets in Affymetrix microarrays.Acta Biochim Pol. 2009;56(2):271-7. Epub 2009 May 12. Acta Biochim Pol. 2009. PMID: 19436837
-
Transcript-level annotation of Affymetrix probesets improves the interpretation of gene expression data.BMC Bioinformatics. 2007 Jun 11;8:194. doi: 10.1186/1471-2105-8-194. BMC Bioinformatics. 2007. PMID: 17559689 Free PMC article.
-
[Transcriptomes for serial analysis of gene expression].J Soc Biol. 2002;196(4):303-7. J Soc Biol. 2002. PMID: 12645300 Review. French.
Cited by
-
SplicerAV: a tool for mining microarray expression data for changes in RNA processing.BMC Bioinformatics. 2010 Feb 25;11:108. doi: 10.1186/1471-2105-11-108. BMC Bioinformatics. 2010. PMID: 20184770 Free PMC article.
-
Integrating multiple genome annotation databases improves the interpretation of microarray gene expression data.BMC Genomics. 2010 Jan 20;11:50. doi: 10.1186/1471-2164-11-50. BMC Genomics. 2010. PMID: 20089164 Free PMC article.
-
GEO dataset mining analysis reveals novel Staphylococcus aureus virulence gene regulatory networks and diagnostic targets in mice.Front Mol Biosci. 2024 Mar 28;11:1381334. doi: 10.3389/fmolb.2024.1381334. eCollection 2024. Front Mol Biosci. 2024. PMID: 38606287 Free PMC article.
-
Analysis of DNA strand-specific differential expression with high density tiling microarrays.BMC Bioinformatics. 2010 Mar 17;11:136. doi: 10.1186/1471-2105-11-136. BMC Bioinformatics. 2010. PMID: 20233458 Free PMC article.
-
From hybridization theory to microarray data analysis: performance evaluation.BMC Bioinformatics. 2011 Dec 2;12:464. doi: 10.1186/1471-2105-12-464. BMC Bioinformatics. 2011. PMID: 22136743 Free PMC article.
References
-
- Affymetrix MAS5 algorithm. 2006. http://www.affymetrix.com/support/technical/manual/expression_manual.affx
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources