Meta-analysis for ranked discovery datasets: theoretical framework and empirical demonstration for microarrays
- PMID: 17988949
- DOI: 10.1016/j.compbiolchem.2007.09.003
Abstract
The combination of results from different large-scale datasets of multidimensional biological signals (such as gene expression profiling) presents a major challenge. Methodologies are needed that can efficiently combine diverse datasets, but can also test the extent of diversity (heterogeneity) across the combined studies. We developed METa-analysis of RAnked DISCovery datasets (METRADISC), a generalized meta-analysis method for combining information across discovery-oriented datasets and for testing between-study heterogeneity for each biological variable of interest. The method is based on non-parametric Monte Carlo permutation testing. The tested biological variables are ranked in each study according to the level of statistical significance. METRADISC tests for each biological variable of interest its average rank and the between-study heterogeneity of the study-specific ranks. After accounting for ties and differences in tested variables across studies, we randomly permute the ranks of each study and the simulated metrics of average rank and heterogeneity are calculated. The procedure is repeated to generate null distributions for the metrics. The use of METRADISC is demonstrated empirically using gene expression data from seven studies comparing prostate cancer cases and normal controls. We offer a new tool for combining complex datasets derived from massive testing, discovery-oriented research and for examining the diversity of results across the combined studies.
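The permutation procedure described in the abstract can be illustrated with a minimal sketch. This is not the METRADISC implementation. It assumes every study ranks the same set of variables with no ties (the abstract notes that the actual method accounts for ties and for differences in tested variables), that rank 1 is the most significant, and that heterogeneity is measured as the sum of squared deviations of the study-specific ranks from their average. The function names, the heterogeneity metric, and the toy data are all hypothetical choices made for illustration.

```python
import random

def average_rank(study_ranks, g):
    """Mean rank of variable g across studies."""
    return sum(r[g] for r in study_ranks) / len(study_ranks)

def heterogeneity(study_ranks, g):
    """Assumed heterogeneity metric: sum of squared deviations of the
    study-specific ranks of variable g from their average rank."""
    mean = average_rank(study_ranks, g)
    return sum((r[g] - mean) ** 2 for r in study_ranks)

def permutation_test(study_ranks, n_perm=1000, seed=0):
    """Monte Carlo permutation test for average rank and heterogeneity.

    study_ranks: one list of ranks per study, all of equal length,
    position g holding the rank of variable g (assumption: rank 1 is
    the most significant, no ties, same variables in every study).
    """
    rng = random.Random(seed)
    n_vars = len(study_ranks[0])
    obs_avg = [average_rank(study_ranks, g) for g in range(n_vars)]
    obs_het = [heterogeneity(study_ranks, g) for g in range(n_vars)]
    low_avg = [0] * n_vars   # simulated average rank at least as low as observed
    high_het = [0] * n_vars  # simulated heterogeneity at least as high as observed
    for _ in range(n_perm):
        # Randomly permute the ranks within each study independently.
        shuffled = [rng.sample(r, len(r)) for r in study_ranks]
        for g in range(n_vars):
            if average_rank(shuffled, g) <= obs_avg[g]:
                low_avg[g] += 1
            if heterogeneity(shuffled, g) >= obs_het[g]:
                high_het[g] += 1
    # Add-one correction keeps the Monte Carlo p-values strictly above zero.
    p_avg = [(c + 1) / (n_perm + 1) for c in low_avg]
    p_het = [(c + 1) / (n_perm + 1) for c in high_het]
    return obs_avg, obs_het, p_avg, p_het

# Toy data (made up): 3 studies, 6 variables; variable 0 is ranked first everywhere.
studies = [
    [1, 2, 3, 4, 5, 6],
    [1, 5, 2, 6, 3, 4],
    [1, 6, 4, 2, 5, 3],
]
obs_avg, obs_het, p_avg, p_het = permutation_test(studies, n_perm=1000, seed=0)
for g in range(len(obs_avg)):
    print(g, round(obs_avg[g], 2), round(obs_het[g], 2),
          round(p_avg[g], 4), round(p_het[g], 4))
```

In this sketch a small `p_avg` suggests a variable is ranked consistently near the top, while a small `p_het` suggests its ranks differ across studies more than expected by chance. Both p-values are one-sided by construction, which is a simplification chosen here rather than something stated in the abstract.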
Similar articles
- METRADISC-XL: a program for meta-analysis of multidimensional ranked discovery oriented datasets including microarrays. Comput Methods Programs Biomed. 2012 Dec;108(3):1243-6. doi: 10.1016/j.cmpb.2012.08.001. Epub 2012 Sep 5. PMID: 22959629
- Heterogeneity testing in meta-analysis of genome searches. Genet Epidemiol. 2005 Feb;28(2):123-37. doi: 10.1002/gepi.20048. PMID: 15593093
- Moderated effect size and P-value combinations for microarray meta-analyses. Bioinformatics. 2009 Oct 15;25(20):2692-9. doi: 10.1093/bioinformatics/btp444. Epub 2009 Jul 23. PMID: 19628502
- Meta-analysis methods. Adv Genet. 2008;60:311-34. doi: 10.1016/S0065-2660(07)00413-0. PMID: 18358326 Review.
- Optimized detection of differential expression in global profiling experiments: case studies in clinical transcriptomic and quantitative proteomic datasets. Brief Bioinform. 2009 Sep;10(5):547-55. doi: 10.1093/bib/bbp033. Epub 2009 Jun 23. PMID: 19549804 Review.
Cited by
- The contribution of genetic variants of SLC2A1 gene in T2DM and T2DM-nephropathy: association study and meta-analysis. Ren Fail. 2018 Nov;40(1):561-576. doi: 10.1080/0886022X.2018.1496931. PMID: 30353771 Free PMC article.
- Statistical genomics in rare cancer. Semin Cancer Biol. 2020 Apr;61:1-10. doi: 10.1016/j.semcancer.2019.08.021. Epub 2019 Aug 19. PMID: 31437624 Free PMC article. Review.
- The prognostic and clinical significance of IFI44L aberrant downregulation in patients with oral squamous cell carcinoma. BMC Cancer. 2021 Dec 13;21(1):1327. doi: 10.1186/s12885-021-09058-y. PMID: 34903206 Free PMC article.
- Meta-analysis of glioblastoma multiforme versus anaplastic astrocytoma identifies robust gene markers. Mol Cancer. 2009 Sep 4;8:71. doi: 10.1186/1476-4598-8-71. PMID: 19732454 Free PMC article.
- Meta-Analysis Based on Nonconvex Regularization. Sci Rep. 2020 Apr 1;10(1):5755. doi: 10.1038/s41598-020-62473-2. PMID: 32238826 Free PMC article.