Group testing for pathway analysis improves comparability of different microarray datasets
- PMID: 16895928
- DOI: 10.1093/bioinformatics/btl424
Group testing for pathway analysis improves comparability of different microarray datasets
Abstract
Motivation: The wide use of DNA microarrays for the investigation of the cell transcriptome triggered the invention of numerous methods for the processing of microarray data and lead to a growing number of microarray studies that examine the same biological conditions. However, comparisons made on the level of gene lists obtained by different statistical methods or from different datasets hardly converge. We aimed at examining such discrepancies on the level of apparently affected biologically related groups of genes, e.g. metabolic or signalling pathways. This can be achieved by group testing procedures, e.g. over-representation analysis, functional class scoring (FCS), or global tests.
Results: Three public prostate cancer datasets obtained with the same microarray platform (HGU95A/HGU95Av2) were analyzed. Each dataset was subjected to normalization by either variance stabilizing normalization (vsn) or mixed model normalization (MMN). Then, statistical analysis of microarrays was applied to the vsn-normalized data and mixed model analysis to the data normalized by MMN. For multiple testing adjustment the false discovery rate was calculated and the threshold was set to 0.05. Gene lists from the same method applied to different datasets showed overlaps between 42 and 52%, while lists from different methods applied to the same dataset had between 63 and 85% of genes in common. A number of six gene lists obtained by the two statistical methods applied to the three datasets was then subjected to group testing by Fisher's exact test. Group testing by GSEA and global test was applied to the three datasets, as well. Fisher's exact test followed by global test showed more consistent results with respect to the concordance between analyses on gene lists obtained by different methods and different datasets than the GSEA. However, all group testing methods identified pathways that had already been described to be involved in the pathogenesis of prostate cancer. Moreover, pathways recurrently identified in these analyses are more likely to be reliable than those from a single analysis on a single dataset.
Similar articles
-
Algebraic stability indicators for ranked lists in molecular profiling.Bioinformatics. 2008 Jan 15;24(2):258-64. doi: 10.1093/bioinformatics/btm550. Epub 2007 Nov 16. Bioinformatics. 2008. PMID: 18024475
-
A rapid method for microarray cross platform comparisons using gene expression signatures.Mol Cell Probes. 2007 Feb;21(1):35-46. doi: 10.1016/j.mcp.2006.07.004. Epub 2006 Aug 10. Mol Cell Probes. 2007. PMID: 16982174
-
A comparison of meta-analysis methods for detecting differentially expressed genes in microarray experiments.Bioinformatics. 2008 Feb 1;24(3):374-82. doi: 10.1093/bioinformatics/btm620. Epub 2008 Jan 18. Bioinformatics. 2008. PMID: 18204063
-
Optimized detection of differential expression in global profiling experiments: case studies in clinical transcriptomic and quantitative proteomic datasets.Brief Bioinform. 2009 Sep;10(5):547-55. doi: 10.1093/bib/bbp033. Epub 2009 Jun 23. Brief Bioinform. 2009. PMID: 19549804 Review.
-
The end of the microarray Tower of Babel: will universal standards lead the way?J Biomol Tech. 2006 Jul;17(3):200-6. J Biomol Tech. 2006. PMID: 16870711 Free PMC article. Review.
Cited by
-
Test on existence of histology subtype-specific prognostic signatures among early stage lung adenocarcinoma and squamous cell carcinoma patients using a Cox-model based filter.Biol Direct. 2015 Apr 7;10:15. doi: 10.1186/s13062-015-0051-z. Biol Direct. 2015. PMID: 25887039 Free PMC article.
-
Electroretinography and Gene Expression Measures Implicate Phototransduction and Metabolic Shifts in Chick Myopia and Hyperopia Models.Life (Basel). 2021 May 29;11(6):501. doi: 10.3390/life11060501. Life (Basel). 2021. PMID: 34072440 Free PMC article.
-
Salmonella induces prominent gene expression in the rat colon.BMC Microbiol. 2007 Sep 12;7:84. doi: 10.1186/1471-2180-7-84. BMC Microbiol. 2007. PMID: 17850650 Free PMC article.
-
Systems biology approach to identification of biomarkers for metastatic progression in cancer.BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S8. doi: 10.1186/1471-2105-9-S9-S8. BMC Bioinformatics. 2008. PMID: 18793472 Free PMC article.
-
Text-based over-representation analysis of microarray gene lists with annotation bias.Nucleic Acids Res. 2009 Jun;37(11):e79. doi: 10.1093/nar/gkp310. Epub 2009 May 8. Nucleic Acids Res. 2009. PMID: 19429895 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources