Effects of functional bias on supervised learning of a gene network model
- PMID: 19381535
- DOI: 10.1007/978-1-59745-243-4_20
Effects of functional bias on supervised learning of a gene network model
Abstract
Gene networks have proven to be an effective approach for modeling cellular systems, capable of capturing some of the extreme complexity of cells in a formal theoretical framework. Not surprisingly, this complexity, combined with our still-limited amount of experimental data measuring the genes and their interactions, makes the reconstruction of gene networks difficult. One powerful strategy has been to analyze functional genomics data using supervised learning of network relationships based upon reference examples from our current knowledge. However, this reliance on the set of reference examples for the supervised learning can introduce major pitfalls, with misleading reference sets resulting in suboptimal learning. There are three requirements for an effective reference set: comprehensiveness, reliability, and freedom from bias. Perhaps not too surprisingly, our current knowledge about gene function is highly biased toward several specific biological functions, such as protein synthesis. This functional bias in the reference set, especially combined with the corresponding functional bias in data sets, induces biased learning that can, in turn, lead to false positive biological discoveries, as we show here for the yeast Saccharomyces cerevisiae. This suggests that careful use of current knowledge and genomics data is required for successful gene network modeling using the supervised learning approach. We provide guidance for better use of these data in learning gene networks.
Similar articles
-
A hybrid Bayesian network learning method for constructing gene networks.Comput Biol Chem. 2007 Oct;31(5-6):361-72. doi: 10.1016/j.compbiolchem.2007.08.005. Epub 2007 Aug 19. Comput Biol Chem. 2007. PMID: 17889617
-
Gene expression complex networks: synthesis, identification, and analysis.J Comput Biol. 2011 Oct;18(10):1353-67. doi: 10.1089/cmb.2010.0118. Epub 2011 May 6. J Comput Biol. 2011. PMID: 21548810
-
Refining current knowledge on the yeast FLR1 regulatory network by combined experimental and computational approaches.Mol Biosyst. 2010 Dec;6(12):2471-81. doi: 10.1039/c004881j. Epub 2010 Oct 11. Mol Biosyst. 2010. PMID: 20938527
-
Computational reconstruction of protein-protein interaction networks: algorithms and issues.Methods Mol Biol. 2009;541:89-100. doi: 10.1007/978-1-59745-243-4_5. Methods Mol Biol. 2009. PMID: 19381528 Review.
-
Prediction and integration of regulatory and protein-protein interactions.Methods Mol Biol. 2009;541:101-43. doi: 10.1007/978-1-59745-243-4_6. Methods Mol Biol. 2009. PMID: 19381527 Review.
Cited by
-
Algorithms for modeling global and context-specific functional relationship networks.Brief Bioinform. 2016 Jul;17(4):686-95. doi: 10.1093/bib/bbv065. Epub 2015 Aug 6. Brief Bioinform. 2016. PMID: 26254431 Free PMC article.
-
Integration of probabilistic functional networks without an external Gold Standard.BMC Bioinformatics. 2022 Jul 25;23(1):302. doi: 10.1186/s12859-022-04834-4. BMC Bioinformatics. 2022. PMID: 35879662 Free PMC article.
-
CommWalker: correctly evaluating modules in molecular networks in light of annotation bias.Bioinformatics. 2018 Mar 15;34(6):994-1000. doi: 10.1093/bioinformatics/btx706. Bioinformatics. 2018. PMID: 29112702 Free PMC article.
-
An improved, bias-reduced probabilistic functional gene network of baker's yeast, Saccharomyces cerevisiae.PLoS One. 2007 Oct 3;2(10):e988. doi: 10.1371/journal.pone.0000988. PLoS One. 2007. PMID: 17912365 Free PMC article.
-
The tapeworm interactome: inferring confidence scored protein-protein interactions from the proteome of Hymenolepis microstoma.BMC Genomics. 2020 May 7;21(1):346. doi: 10.1186/s12864-020-6710-1. BMC Genomics. 2020. PMID: 32380953 Free PMC article.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Molecular Biology Databases