puma 3.0: improved uncertainty propagation methods for gene and transcript expression analysis
- PMID: 23379655
- PMCID: PMC3626802
- DOI: 10.1186/1471-2105-14-39
Abstract
Background: Microarrays have been a popular tool for genome-scale gene expression profiling for over a decade owing to their low cost, short turnaround time, excellent quantitative accuracy and ease of data generation. The Bioconductor package puma incorporates a suite of analysis methods for determining uncertainties from Affymetrix GeneChip data and propagating these uncertainties to downstream analysis. As isoform-level expression profiling has attracted growing interest within genomics in recent years, exon microarray technology offers an important tool for quantifying the expression level of the majority of exons and makes it possible to measure isoform-level expression. However, puma does not include methods for the analysis of exon array data. Moreover, the current expression summarisation method for Affymetrix 3' GeneChip data is unstable for genes with low expression. For downstream analysis, the method for detecting differential expression is computationally intensive, and the original expression clustering method does not consider the variance across replicated technical and biological measurements. It is therefore necessary to develop improved uncertainty propagation methods for gene and transcript expression analysis.
Results: We extend the previously developed Bioconductor package puma with a new method especially designed for GeneChip Exon arrays and a set of improved downstream approaches. The improvements include: (i) a new gamma model for exon arrays which calculates isoform and gene expression measurements and a level of uncertainty associated with the estimates, using the multi-mappings between probes, isoforms and genes, (ii) a variant of the existing approach for the probe-level analysis of Affymetrix 3' GeneChip data to produce more stable gene expression estimates, (iii) an improved method for detecting differential expression which is computationally more efficient than the existing approach in the package and (iv) an improved method for robust model-based clustering of gene expression, which takes technical and biological replicate information into consideration.
Conclusions: With the extensions and improvements, the puma package is now applicable to the analysis of both Affymetrix 3' GeneChips and Exon arrays for gene and isoform expression estimation. It propagates the uncertainty of expression measurements into more efficient and comprehensive downstream analysis at both gene and isoform level. Downstream methods are also applicable to other expression quantification platforms, such as RNA-Seq, when uncertainty information is available from expression measurements. puma is available through Bioconductor and can be found at http://www.bioconductor.org.
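To make the idea of propagating measurement uncertainty into downstream analysis concrete, the following is a minimal sketch in Python. It is not the model implemented in puma, which the abstract does not specify in detail. It assumes each replicate supplies a log-expression estimate with a standard error, combines replicates by inverse-variance weighting, and compares two conditions with a simple z-like score. The function names `combine_replicates` and `diff_expression_z` are hypothetical and chosen only for illustration.

```python
import math


def combine_replicates(means, std_errs):
    """Combine replicate estimates by inverse-variance weighting.

    Illustrative simplification only; not the hierarchical model used in puma.
    Returns the precision-weighted mean and its standard error.
    """
    if not means or len(means) != len(std_errs):
        raise ValueError("means and std_errs must be non-empty and equal in length")
    if any(s <= 0 for s in std_errs):
        raise ValueError("standard errors must be positive")
    # Replicates with smaller uncertainty receive larger weight.
    weights = [1.0 / (s * s) for s in std_errs]
    total = sum(weights)
    mean = sum(w * m for w, m in zip(weights, means)) / total
    return mean, math.sqrt(1.0 / total)


def diff_expression_z(mean_a, se_a, mean_b, se_b):
    """Difference between two conditions scaled by its combined standard error."""
    return (mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)


# Hypothetical log2 expression values for one gene in two conditions.
control = combine_replicates([8.0, 10.0], [1.0, 2.0])
treated = combine_replicates([11.0, 11.5], [0.5, 0.5])
score = diff_expression_z(treated[0], treated[1], control[0], control[1])
```

Under these assumptions, a noisy replicate contributes less to the combined estimate, and the score shrinks when either condition is poorly measured, which is the qualitative behaviour that uncertainty propagation is meant to achieve.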
Similar articles
- puma: a Bioconductor package for propagating uncertainty in microarray analysis. BMC Bioinformatics. 2009 Jul 9;10:211. doi: 10.1186/1471-2105-10-211. PMID: 19589155.
- Gene expression and isoform variation analysis using Affymetrix Exon Arrays. BMC Genomics. 2008 Nov 7;9:529. doi: 10.1186/1471-2164-9-529. PMID: 18990248.
- SigFuge: single gene clustering of RNA-seq reveals differential isoform usage among cancer samples. Nucleic Acids Res. 2014 Aug;42(14):e113. doi: 10.1093/nar/gku521. Epub 2014 Jul 16. PMID: 25030904.
- Propagating uncertainty in microarray data analysis. Brief Bioinform. 2006 Mar;7(1):37-47. doi: 10.1093/bib/bbk003. PMID: 16761363. Review.
- An overview of image-processing methods for Affymetrix GeneChips. Brief Bioinform. 2008 Jan;9(1):25-33. doi: 10.1093/bib/bbm055. Epub 2007 Dec 5. PMID: 18057073. Review.
Cited by
- Comparative evaluation of isoform-level gene expression estimation algorithms for RNA-seq and exon-array platforms. Brief Bioinform. 2017 Mar 1;18(2):260-269. doi: 10.1093/bib/bbw016. PMID: 26944083.
- Improving RNA-Seq expression estimation by modeling isoform- and exon-specific read sequencing rate. BMC Bioinformatics. 2015 Oct 16;16:332. doi: 10.1186/s12859-015-0750-6. PMID: 26475308.
- Analysis of key genes and their functions in placental tissue of patients with gestational diabetes mellitus. Reprod Biol Endocrinol. 2019 Nov 29;17(1):104. doi: 10.1186/s12958-019-0546-z. PMID: 31783860.
- A data-driven approach links microglia to pathology and prognosis in amyotrophic lateral sclerosis. Acta Neuropathol Commun. 2017 Mar 16;5(1):23. doi: 10.1186/s40478-017-0424-x. PMID: 28302159.
- Pulsatile exposure to simulated reflux leads to changes in gene expression in a 3D model of oesophageal mucosa. Int J Exp Pathol. 2014 Jun;95(3):216-28. doi: 10.1111/iep.12083. Epub 2014 Apr 8. PMID: 24713057.