A general modular framework for gene set enrichment analysis
- PMID: 19192285
- PMCID: PMC2661051
- DOI: 10.1186/1471-2105-10-47
A general modular framework for gene set enrichment analysis
Abstract
Background: Analysis of microarray and other high-throughput data on the basis of gene sets, rather than individual genes, is becoming more important in genomic studies. Correspondingly, a large number of statistical approaches for detecting gene set enrichment have been proposed, but both the interrelations and the relative performance of the various methods are still very much unclear.
Results: We conduct an extensive survey of statistical approaches for gene set analysis and identify a common modular structure underlying most published methods. Based on this finding we propose a general framework for detecting gene set enrichment. This framework provides a meta-theory of gene set analysis that not only helps to gain a better understanding of the relative merits of each embedded approach but also facilitates a principled comparison and offers insights into the relative interplay of the methods.
Conclusion: We use this framework to conduct a computer simulation comparing 261 different variants of gene set enrichment procedures and to analyze two experimental data sets. Based on the results we offer recommendations for best practices regarding the choice of effective procedures for gene set enrichment analysis.
Figures




Similar articles
-
Concordant integrative gene set enrichment analysis of multiple large-scale two-sample expression data sets.BMC Genomics. 2014;15 Suppl 1(Suppl 1):S6. doi: 10.1186/1471-2164-15-S1-S6. Epub 2014 Jan 24. BMC Genomics. 2014. PMID: 24564564 Free PMC article.
-
Comparative study of gene set enrichment methods.BMC Bioinformatics. 2009 Sep 2;10:275. doi: 10.1186/1471-2105-10-275. BMC Bioinformatics. 2009. PMID: 19725948 Free PMC article.
-
Gene set enrichment meta-learning analysis: next- generation sequencing versus microarrays.BMC Bioinformatics. 2010 Apr 8;11:176. doi: 10.1186/1471-2105-11-176. BMC Bioinformatics. 2010. PMID: 20377890 Free PMC article.
-
Statistical framework for gene expression data analysis.Methods Mol Biol. 2007;377:111-30. doi: 10.1007/978-1-59745-390-5_6. Methods Mol Biol. 2007. PMID: 17634612 Review.
-
Gene-set analysis and reduction.Brief Bioinform. 2009 Jan;10(1):24-34. doi: 10.1093/bib/bbn042. Epub 2008 Oct 4. Brief Bioinform. 2009. PMID: 18836208 Free PMC article. Review.
Cited by
-
Systems biology approaches for discovering biomarkers for traumatic brain injury.J Neurotrauma. 2013 Jul 1;30(13):1101-16. doi: 10.1089/neu.2012.2631. J Neurotrauma. 2013. PMID: 23510232 Free PMC article. Review.
-
DECODE: an integrated differential co-expression and differential expression analysis of gene expression data.BMC Bioinformatics. 2015 May 31;16:182. doi: 10.1186/s12859-015-0582-4. BMC Bioinformatics. 2015. PMID: 26026612 Free PMC article.
-
Blood and urine multi-omics analysis of the impact of e-vaping, smoking, and cessation: from exposome to molecular responses.Sci Rep. 2024 Feb 21;14(1):4286. doi: 10.1038/s41598-024-54474-2. Sci Rep. 2024. PMID: 38383592 Free PMC article.
-
Signatures of Positive Selection in the Genome of Apis mellifera carnica: A Subspecies of European Honeybees.Life (Basel). 2022 Oct 19;12(10):1642. doi: 10.3390/life12101642. Life (Basel). 2022. PMID: 36295077 Free PMC article.
-
Gene-set distance analysis (GSDA): a powerful tool for gene-set association analysis.BMC Bioinformatics. 2021 Apr 21;22(1):207. doi: 10.1186/s12859-021-04110-x. BMC Bioinformatics. 2021. PMID: 33882829 Free PMC article.
References
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources