Revealing biological modules via graph summarization
- PMID: 19183002
- DOI: 10.1089/cmb.2008.11TT
Revealing biological modules via graph summarization
Abstract
The division of a protein interaction network into biologically meaningful modules can aid with automated detection of protein complexes and prediction of biological processes and can uncover the global organization of the cell. We propose the use of a graph summarization (GS) technique, based on graph compression, to cluster protein interaction graphs into biologically relevant modules. The method is motivated by defining a biological module as a set of proteins that have similar sets of interaction partners. We show this definition, put into practice by a GS algorithm, reveals modules that are more biologically enriched than those found by other methods. We also apply GS to predict complex memberships, biological processes, and co-complexed pairs and show that in most settings GS is preferable over existing methods of protein interaction graph clustering.
Similar articles
-
Evaluation of clustering algorithms for protein-protein interaction networks.BMC Bioinformatics. 2006 Nov 6;7:488. doi: 10.1186/1471-2105-7-488. BMC Bioinformatics. 2006. PMID: 17087821 Free PMC article.
-
A degree-distribution based hierarchical agglomerative clustering algorithm for protein complexes identification.Comput Biol Chem. 2011 Oct 12;35(5):298-307. doi: 10.1016/j.compbiolchem.2011.07.005. Epub 2011 Jul 20. Comput Biol Chem. 2011. PMID: 22000801
-
Detection of functional modules from protein interaction networks with an enhanced random walk based algorithm.Int J Comput Biol Drug Des. 2011;4(3):290-306. doi: 10.1504/IJCBDD.2011.041416. Epub 2011 Jul 21. Int J Comput Biol Drug Des. 2011. PMID: 21778561
-
Computational detection of protein complexes in AP-MS experiments.Proteomics. 2012 May;12(10):1663-8. doi: 10.1002/pmic.201100508. Proteomics. 2012. PMID: 22711593 Review.
-
Network integration and graph analysis in mammalian molecular systems biology.IET Syst Biol. 2008 Sep;2(5):206-21. doi: 10.1049/iet-syb:20070075. IET Syst Biol. 2008. PMID: 19045817 Free PMC article. Review.
Cited by
-
Chapter 5: Network biology approach to complex diseases.PLoS Comput Biol. 2012;8(12):e1002820. doi: 10.1371/journal.pcbi.1002820. Epub 2012 Dec 27. PLoS Comput Biol. 2012. PMID: 23300411 Free PMC article.
-
Inferring functional modules of protein families with probabilistic topic models.BMC Bioinformatics. 2011 May 9;12:141. doi: 10.1186/1471-2105-12-141. BMC Bioinformatics. 2011. PMID: 21554720 Free PMC article.
-
Metabolic network alignment in large scale by network compression.BMC Bioinformatics. 2012 Mar 21;13 Suppl 3(Suppl 3):S2. doi: 10.1186/1471-2105-13-S3-S2. BMC Bioinformatics. 2012. PMID: 22536900 Free PMC article.
-
Mining breast cancer genes with a network based noise-tolerant approach.BMC Syst Biol. 2013 Jun 25;7:49. doi: 10.1186/1752-0509-7-49. BMC Syst Biol. 2013. PMID: 23799982 Free PMC article.
-
Network archaeology: uncovering ancient networks from present-day interactions.PLoS Comput Biol. 2011 Apr;7(4):e1001119. doi: 10.1371/journal.pcbi.1001119. Epub 2011 Apr 14. PLoS Comput Biol. 2011. PMID: 21533211 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources