Significance analysis of functional categories in gene expression studies: a structured permutation approach
- PMID: 15647293
- DOI: 10.1093/bioinformatics/bti260
Significance analysis of functional categories in gene expression studies: a structured permutation approach
Abstract
Motivation: In high-throughput genomic and proteomic experiments, investigators monitor expression across a set of experimental conditions. To gain an understanding of broader biological phenomena, researchers have until recently been limited to post hoc analyses of significant gene lists.
Method: We describe a general framework, significance analysis of function and expression (SAFE), for conducting valid tests of gene categories ab initio. SAFE is a two-stage, permutation-based method that can be applied to various experimental designs, accounts for the unknown correlation among genes and enables permutation-based estimation of error rates.
Results: The utility and flexibility of SAFE is illustrated with a microarray dataset of human lung carcinomas and gene categories based on Gene Ontology and the Protein Family database. Significant gene categories were observed in comparisons of (1) tumor versus normal tissue, (2) multiple tumor subtypes and (3) survival times.
Availability: Code to implement SAFE in the statistical package R is available from the authors.
Supplementary information: http://www.bios.unc.edu/~fwright/SAFE.
Similar articles
-
Statistical assessment of functional categories of genes deregulated in pathological conditions by using microarray data.Bioinformatics. 2007 Aug 15;23(16):2063-72. doi: 10.1093/bioinformatics/btm289. Epub 2007 May 31. Bioinformatics. 2007. PMID: 17540679
-
Pathway recognition and augmentation by computational analysis of microarray expression data.Bioinformatics. 2006 Jan 15;22(2):233-41. doi: 10.1093/bioinformatics/bti764. Epub 2005 Nov 8. Bioinformatics. 2006. PMID: 16278238
-
Empirical Bayes screening of many p-values with applications to microarray studies.Bioinformatics. 2005 May 1;21(9):1987-94. doi: 10.1093/bioinformatics/bti301. Epub 2005 Feb 2. Bioinformatics. 2005. PMID: 15691856
-
Classification based upon gene expression data: bias and precision of error rates.Bioinformatics. 2007 Jun 1;23(11):1363-70. doi: 10.1093/bioinformatics/btm117. Epub 2007 Mar 28. Bioinformatics. 2007. PMID: 17392326 Review.
-
Gene-set analysis and reduction.Brief Bioinform. 2009 Jan;10(1):24-34. doi: 10.1093/bib/bbn042. Epub 2008 Oct 4. Brief Bioinform. 2009. PMID: 18836208 Free PMC article. Review.
Cited by
-
Investigating the concordance of Gene Ontology terms reveals the intra- and inter-platform reproducibility of enrichment analysis.BMC Bioinformatics. 2013 Apr 29;14:143. doi: 10.1186/1471-2105-14-143. BMC Bioinformatics. 2013. PMID: 23627640 Free PMC article.
-
Identification of pathway deregulation--gene expression based analysis of consistent signal transduction.PLoS One. 2012;7(7):e41541. doi: 10.1371/journal.pone.0041541. Epub 2012 Jul 25. PLoS One. 2012. PMID: 22848524 Free PMC article.
-
Sex-specific gene expression in the BXD mouse liver.Physiol Genomics. 2010 Aug;42(3):456-68. doi: 10.1152/physiolgenomics.00110.2009. Epub 2010 Jun 15. Physiol Genomics. 2010. PMID: 20551147 Free PMC article.
-
Chipster: user-friendly analysis software for microarray and other high-throughput data.BMC Genomics. 2011 Oct 14;12:507. doi: 10.1186/1471-2164-12-507. BMC Genomics. 2011. PMID: 21999641 Free PMC article.
-
Pathway analysis reveals functional convergence of gene expression profiles in breast cancer.BMC Med Genomics. 2008 Jun 27;1:28. doi: 10.1186/1755-8794-1-28. BMC Med Genomics. 2008. PMID: 18588682 Free PMC article.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical