Maximal extraction of biological information from genetic interaction data
- PMID: 19343223
- PMCID: PMC2659753
- DOI: 10.1371/journal.pcbi.1000347
Maximal extraction of biological information from genetic interaction data
Abstract
Extraction of all the biological information inherent in large-scale genetic interaction datasets remains a significant challenge for systems biology. The core problem is essentially that of classification of the relationships among phenotypes of mutant strains into biologically informative "rules" of gene interaction. Geneticists have determined such classifications based on insights from biological examples, but it is not clear that there is a systematic, unsupervised way to extract this information. In this paper we describe such a method that depends on maximizing a previously described context-dependent information measure to obtain maximally informative biological networks. We have successfully validated this method on two examples from yeast by demonstrating that more biological information is obtained when analysis is guided by this information measure. The context-dependent information measure is a function only of phenotype data and a set of interaction rules, involving no prior biological knowledge. Analysis of the resulting networks reveals that the most biologically informative networks are those with the greatest context-dependent information scores. We propose that these high-complexity networks reveal genetic architecture at a modular level, in contrast to classical genetic interaction rules that order genes in pathways. We suggest that our analysis represents a powerful, data-driven, and general approach to genetic interaction analysis, with particular potential in the study of mammalian systems in which interactions are complex and gene annotation data are sparse.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures







Similar articles
-
A hybrid graph-theoretic method for mining overlapping functional modules in large sparse protein interaction networks.Int J Data Min Bioinform. 2009;3(1):68-84. doi: 10.1504/ijdmb.2009.023885. Int J Data Min Bioinform. 2009. PMID: 19432377
-
Selective integration of multiple biological data for supervised network inference.Bioinformatics. 2005 May 15;21(10):2488-95. doi: 10.1093/bioinformatics/bti339. Epub 2005 Feb 22. Bioinformatics. 2005. PMID: 15728114
-
Estimating gene regulatory networks and protein-protein interactions of Saccharomyces cerevisiae from multiple genome-wide data.Bioinformatics. 2005 Sep 1;21 Suppl 2:ii206-12. doi: 10.1093/bioinformatics/bti1133. Bioinformatics. 2005. PMID: 16204105
-
A systems-biology approach to modular genetic complexity.Chaos. 2010 Jun;20(2):026102. doi: 10.1063/1.3455183. Chaos. 2010. PMID: 20590331 Free PMC article. Review.
-
Dissecting complex transcriptional responses using pathway-level scores based on prior information.BMC Bioinformatics. 2007 Sep 27;8 Suppl 6(Suppl 6):S6. doi: 10.1186/1471-2105-8-S6-S6. BMC Bioinformatics. 2007. PMID: 17903287 Free PMC article. Review.
Cited by
-
Genome-Wide Fitness and Genetic Interactions Determined by Tn-seq, a High-Throughput Massively Parallel Sequencing Method for Microorganisms.Curr Protoc Microbiol. 2015 Feb 2;36:1E.3.1-1E.3.24. doi: 10.1002/9780471729259.mc01e03s36. Curr Protoc Microbiol. 2015. PMID: 25641100 Free PMC article.
-
Multiple genetic interaction experiments provide complementary information useful for gene function prediction.PLoS Comput Biol. 2012;8(6):e1002559. doi: 10.1371/journal.pcbi.1002559. Epub 2012 Jun 21. PLoS Comput Biol. 2012. PMID: 22737063 Free PMC article.
-
Genome-wide fitness and genetic interactions determined by Tn-seq, a high-throughput massively parallel sequencing method for microorganisms.Curr Protoc Microbiol. 2010 Nov;Chapter 1:Unit1E.3. doi: 10.1002/9780471729259.mc01e03s19. Curr Protoc Microbiol. 2010. PMID: 21053251 Free PMC article.
-
Symmetries among Multivariate Information Measures Explored Using Möbius Operators.Entropy (Basel). 2019 Jan 18;21(1):88. doi: 10.3390/e21010088. Entropy (Basel). 2019. PMID: 33266804 Free PMC article.
-
Describing the complexity of systems: multivariable "set complexity" and the information basis of systems biology.J Comput Biol. 2014 Feb;21(2):118-40. doi: 10.1089/cmb.2013.0039. Epub 2013 Dec 30. J Comput Biol. 2014. PMID: 24377753 Free PMC article.
References
-
- Ideker T, Galitski T, Hood L. A new approach to decoding life: systems biology. Annu Rev Genomics Hum Genet. 2001;2:343–372. - PubMed
-
- Tong AH, Lesage G, Bader GD, Ding H, Xu H, et al. Global mapping of the yeast genetic interaction network. Science. 2004;303:808–813. - PubMed
-
- Schuldiner M, Collins SR, Thompson NJ, Denic V, Bhamidipati A, et al. Exploration of the function and organization of the yeast early secretory pathway through an epistatic miniarray profile. Cell. 2005;123:507–519. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Molecular Biology Databases