A tractable probabilistic model for Affymetrix probe-level analysis across multiple chips
- PMID: 16020470
- DOI: 10.1093/bioinformatics/bti583
Abstract
Motivation: Affymetrix GeneChip arrays are currently the most widely used microarray technology. Many summarization methods have been developed to provide gene expression levels from Affymetrix probe-level data. Most of the currently popular methods do not provide a measure of uncertainty for the expression level of each gene. The use of probabilistic models can overcome this limitation. A full hierarchical Bayesian approach requires the use of computationally intensive MCMC methods that are impractical for large datasets. An alternative computationally efficient probabilistic model, mgMOS, uses Gamma distributions to model specific and non-specific binding with a latent variable to capture variations in probe affinity. Although promising, the main limitations of this model are that it does not use information from multiple chips and does not account for specific binding to the mismatch (MM) probes.
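The Gamma model with a latent affinity variable described above can be illustrated with a small generative sketch. The abstract does not state the parameterization, so this is an assumption-laden illustration: the names `alpha` (specific-binding shape), `a` (non-specific-binding shape), and `c`, `d` (prior on the latent per-pair rate) are hypothetical, and the MM probe is given no specific-binding term, which corresponds to the limitation noted for mgMOS.

```python
import random


def simulate_probe_set(n_pairs, alpha, a, c, d, seed=0):
    """Draw (PM, MM) intensities for one probe-set on one chip.

    Hypothetical parameter names, not taken from the paper:
      alpha: Gamma shape contribution from specific binding (signal)
      a:     Gamma shape contribution from non-specific binding (background)
      c, d:  shape and rate of the Gamma prior on the latent per-pair rate
    """
    rng = random.Random(seed)
    pairs = []
    for _ in range(n_pairs):
        # Latent rate shared by the PM and MM probe of this pair; it stands in
        # for the probe-affinity variation mentioned in the abstract.
        b = rng.gammavariate(c, 1.0 / d)
        # random.gammavariate takes (shape, scale), so scale = 1 / rate.
        pm = rng.gammavariate(a + alpha, 1.0 / b)  # specific + non-specific
        mm = rng.gammavariate(a, 1.0 / b)          # non-specific only (assumed)
        pairs.append((pm, mm))
    return pairs


if __name__ == "__main__":
    for pm, mm in simulate_probe_set(n_pairs=5, alpha=4.0, a=1.0, c=5.0, d=1.0):
        print(f"PM={pm:.3f}  MM={mm:.3f}")
```

Because both probes in a pair share the same latent rate, a high-affinity pair yields large PM and MM intensities together, which is the kind of structure a probe-level model can exploit.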
Results: We extend mgMOS to model the binding affinity of probe-pairs across multiple chips and to capture the effect of specific binding to MM probes. The new model, multi-mgMOS, provides improved accuracy, as demonstrated on some benchmark datasets and a real time-course dataset, and is much more computationally efficient than a competing hierarchical Bayesian approach that requires MCMC sampling. We demonstrate how the probabilistic model can be used to estimate credibility intervals for expression levels and their log-ratios between conditions.
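To make the idea of a credibility interval for a log-ratio concrete, the following is a minimal sketch, not the method used in the paper. It assumes, purely for illustration, that the log expression level in each condition has an independent Gaussian posterior with a known mean and standard deviation, and it estimates the interval by Monte Carlo sampling. The function name and all parameters are hypothetical.

```python
import random


def log_ratio_interval(mean_a, sd_a, mean_b, sd_b,
                       level=0.95, n_draws=20000, seed=0):
    """Monte Carlo credibility interval for a log-ratio between two conditions.

    Assumes (for illustration only) independent Gaussian posteriors on the
    log expression level in conditions A and B.
    """
    rng = random.Random(seed)
    # Each draw is one sample of log(A) - log(B), i.e. the log-ratio.
    draws = sorted(
        rng.gauss(mean_a, sd_a) - rng.gauss(mean_b, sd_b)
        for _ in range(n_draws)
    )
    tail = (1.0 - level) / 2.0
    lower = draws[int(tail * n_draws)]
    upper = draws[int((1.0 - tail) * n_draws) - 1]
    return lower, upper


if __name__ == "__main__":
    lower, upper = log_ratio_interval(8.0, 0.3, 7.0, 0.4)
    print(f"95% interval for the log-ratio: [{lower:.2f}, {upper:.2f}]")
```

Under the Gaussian assumption the interval could also be computed in closed form; the sampling version is shown because it carries over unchanged to posteriors that are not Gaussian. A gene whose interval excludes zero would be a candidate for differential expression under these assumptions.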
Availability: Both mgMOS and the new model multi-mgMOS have been implemented in an R package, which is available at http://www.bioinf.man.ac.uk/resources/puma.