Improved scoring of functional groups from gene expression data by decorrelating GO graph structure
- PMID: 16606683
- DOI: 10.1093/bioinformatics/btl140
Abstract
Motivation: The result of a typical microarray experiment is a long list of genes with corresponding expression measurements. This list is only the starting point for a meaningful biological interpretation. Modern methods identify relevant biological processes or functions from gene expression data by scoring the statistical significance of predefined functional gene groups, e.g. based on Gene Ontology (GO). We develop methods that increase the explanatory power of this approach by integrating knowledge about relationships between the GO terms into the calculation of the statistical significance.
Results: We present two novel algorithms that improve GO group scoring using the underlying GO graph topology. The algorithms are evaluated on real and simulated gene expression data. We show that both methods eliminate local dependencies between GO terms and point to relevant areas in the GO graph that remain undetected with state-of-the-art algorithms for scoring functional terms. A simulation study demonstrates that the new methods detect relevant biological terms at a higher rate than competing methods.
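
The abstract does not spell out the two algorithms, so the following is only a minimal sketch of the general idea, under stated assumptions: each GO term is scored with a one-sided hypergeometric test, terms are visited children-first, and genes of any term found significant are removed from all of its ancestors before those ancestors are scored. The function names (`hypergeom_pvalue`, `elim_scores`), the threshold `alpha`, and the toy data are hypothetical and are not taken from the paper.

```python
from collections import deque
from math import comb


def hypergeom_pvalue(k, n, K, N):
    """P(X >= k): n significant genes drawn from N, K of which are annotated to the term."""
    total = comb(N, n)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, min(n, K) + 1))
    return hits / total


def elim_scores(term_genes, parents, significant, universe, alpha=0.01):
    """Score terms children-first (assumed scheme, not the paper's exact method).

    Genes of a term with p < alpha are removed from every ancestor,
    so a parent is not scored as significant merely through its child.
    """
    # Number of not-yet-scored children per term (Kahn-style ordering).
    pending = {t: 0 for t in term_genes}
    for t, ps in parents.items():
        for p in ps:
            pending[p] += 1
    removed = {t: set() for t in term_genes}
    queue = deque(t for t, c in pending.items() if c == 0)
    pvalues = {}
    while queue:
        t = queue.popleft()
        genes = term_genes[t] - removed[t]
        k = len(genes & significant)
        pvalues[t] = hypergeom_pvalue(k, len(significant), len(genes), len(universe))
        if pvalues[t] < alpha:
            # Propagate the removal to all ancestors of t.
            stack = list(parents.get(t, ()))
            while stack:
                a = stack.pop()
                removed[a] |= genes
                stack.extend(parents.get(a, ()))
        for p in parents.get(t, ()):
            pending[p] -= 1
            if pending[p] == 0:
                queue.append(p)
    return pvalues


# Toy data (hypothetical): 20 genes, 4 of them significant, one child/parent pair.
universe = {f"g{i}" for i in range(20)}
significant = {"g0", "g1", "g2", "g3"}
term_genes = {
    "child": {"g0", "g1", "g2", "g3", "g4"},
    "parent": {f"g{i}" for i in range(10)},
}
parents = {"child": ["parent"], "parent": []}

pvalues = elim_scores(term_genes, parents, significant, universe)
plain_parent = hypergeom_pvalue(4, 4, 10, 20)  # parent scored independently
```

In this toy case the parent looks enriched when scored independently, but only because it inherits the child's genes; once those genes are removed, the parent no longer carries any of the significant genes. This illustrates the kind of local dependency between GO terms that the abstract says the new methods eliminate.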