The effects of probe binding affinity differences on gene expression measurements and how to deal with them
- PMID: 19689957
- DOI: 10.1093/bioinformatics/btp492
Abstract
Motivation: When comparing gene expression levels between species or strains using microarrays, sequence differences between the groups can cause false identification of expression differences. Our simulated dataset shows that a sequence divergence of only 1% between species can lead to falsely reported expression differences for >50% of the transcripts. Similar levels of effect have been reported previously in comparisons of human and chimpanzee expression. We propose a method for identifying probes that cause such false readings, using only the microarray data, so that problematic probes can be excluded from analysis. We then test the power of the method to detect sequence differences and to correct for falsely reported expression differences. Our method can detect 70% of the probes with sequence differences using human and chimpanzee data, while removing only 18% of probes with no sequence differences. Although only 70% of the probes with sequence differences are detected, the effect of removing probes on falsely reported expression differences is more dramatic: the method can remove 98% of the falsely reported expression differences from a simulated dataset. We argue that the method should be used even when sequence data are available.
Contact: lachmann@eva.mpg.de
Supplementary information: Supplementary data are available at Bioinformatics online.
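A back-of-the-envelope calculation helps show why a divergence of only 1% can affect so many transcripts. The sketch below is not the simulation from the paper. It assumes 25-base probes and 11 probes per probe set (typical values for Affymetrix expression arrays, not stated in the abstract) and treats substitutions as independent across bases. The function name is hypothetical.

```python
def prob_at_least_one_difference(divergence: float, n_bases: int) -> float:
    """Probability that a stretch of n_bases contains at least one
    sequence difference, assuming independent per-base substitutions."""
    return 1.0 - (1.0 - divergence) ** n_bases


# Assumed array design (illustrative values, not taken from the abstract).
PROBE_LENGTH = 25       # bases per probe
PROBES_PER_SET = 11     # probes targeting one transcript
DIVERGENCE = 0.01       # 1% sequence divergence, as in the abstract

# Chance that a single probe overlaps at least one sequence difference.
p_probe = prob_at_least_one_difference(DIVERGENCE, PROBE_LENGTH)

# Chance that at least one probe in a probe set is affected.
p_probe_set = prob_at_least_one_difference(
    DIVERGENCE, PROBE_LENGTH * PROBES_PER_SET
)

print(f"single probe affected: {p_probe:.3f}")
print(f"probe set affected:    {p_probe_set:.3f}")
```

Under these assumptions roughly one probe in five carries a difference, and most probe sets contain at least one such probe. A probe set with an affected probe does not necessarily yield a falsely reported expression difference, so this is only an upper-bound intuition for the >50% figure, not a reproduction of it.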