Review

. 2010 May-Jun;2(3):277-292.

doi: 10.1002/wsbm.61.

Algorithmic and analytical methods in network biology

Mehmet Koyutürk^{1

2}

Affiliations

¹ Department of Electrical Engineering & Computer Science, Case Western Reserve University, Cleveland, OH 44106, USA.
² Center for Proteomics and Bioinformatics, Case Western Reserve University, Cleveland, OH 44106, USA.

PMID: 20836029
PMCID: PMC3087298
DOI: 10.1002/wsbm.61

Review

Algorithmic and analytical methods in network biology

Mehmet Koyutürk. Wiley Interdiscip Rev Syst Biol Med. 2010 May-Jun.

. 2010 May-Jun;2(3):277-292.

doi: 10.1002/wsbm.61.

Author

Mehmet Koyutürk^{1

2}

Affiliations

¹ Department of Electrical Engineering & Computer Science, Case Western Reserve University, Cleveland, OH 44106, USA.
² Center for Proteomics and Bioinformatics, Case Western Reserve University, Cleveland, OH 44106, USA.

PMID: 20836029
PMCID: PMC3087298
DOI: 10.1002/wsbm.61

Abstract

During the genomic revolution, algorithmic and analytical methods for organizing, integrating, analyzing, and querying biological sequence data proved invaluable. Today, increasing availability of high-throughput data pertaining to functional states of biomolecules, as well as their interactions, enables genome-scale studies of the cell from a systems perspective. The past decade witnessed significant efforts on the development of computational infrastructure for large-scale modeling and analysis of biological systems, commonly using network models. Such efforts lead to novel insights into the complexity of living systems, through development of sophisticated abstractions, algorithms, and analytical techniques that address a broad range of problems, including the following: (1) inference and reconstruction of complex cellular networks; (2) identification of common and coherent patterns in cellular networks, with a view to understanding the organizing principles and building blocks of cellular signaling, regulation, and metabolism; and (3) characterization of cellular mechanisms that underlie the differences between living systems, in terms of evolutionary diversity, development and differentiation, and complex phenotypes, including human disease. These problems pose significant algorithmic and analytical challenges because of the inherent complexity of the systems being studied; limitations of data in terms of availability, scope, and scale; intractability of resulting computational problems; and limitations of reference models for reliable statistical inference. This article provides a broad overview of existing algorithmic and analytical approaches to these problems, highlights key biological insights provided by these approaches, and outlines emerging opportunities and challenges in computational systems biology.

PubMed Disclaimer

Figures

**FIGURE 1**
Description of omic data sets in the context of central dogma.

**FIGURE 2**
Illustration of the general principles of common computational methods for predicting protein–protein interactions. In the upper panel, black and white boxes, respectively, indicate existence and absence of a homolog in the corresponding genome. In the lower panel, the red and green shades of boxes, respectively, indicate the degree of up- and down-regulation of the coding gene with respect to the corresponding condition.

**FIGURE 3**
Inferring domain–domain interactions (DDIs) from protein–protein interactions (PPIs). Given the domain decomposition of proteins and a set of PPIs, DDI inference methods target identification of DDIs that mediate these interactions. Different formulations of the problem optimize different criteria, leading to different solutions for DDI inference problem.

**FIGURE 4**
Arp 2/3 complex, which plays a significant role in the regulation of actin cytoskeleton, is identified as a conserved subnetwork through mining of protein–protein interaction networks of multiple organisms, using a fast algorithm that relies on contraction of ortholog proteins. The conserved subnetwork is shown on the left with nodes annotated by cluisters of ortholog groups (COG) identifiers. The occurrence of the subnetwork in three eukaryotic organisms is shown on the right. Dashed links indicate indirect interactions. Such knowledge discovery based analyses are likely to lead to the construction of canonical module libraries.

**FIGURE 5**
Screenshots from a sample computational tool, NARADA, that enables identification and browsing of canonical network patterns in regulatory networks. NARADA takes gene regulatory networks and functional annotation of individual genes as input and processes queries on regulatory pathways that involve specific biological processes (e.g., what are the processes that regulate ciliary or flagellar motility in *E. coli*? Are these regulatory pathways mediated by other processes?). NARADA is available as an open source at http://www.cs.purdue.edu/~jpandey/narada/. With the availability of such sophisticated tools, browsing basic biological information becomes a visually rich and interactive activity, moving beyond basic text and database searches.

**FIGURE 6**
Overview of common approaches to network based functional annotation. In each hypothetical example, the proteins with known function are annotated by a symbol that represents their function. Proteins with unknown function are labeled with question marks. As seen on the left, connectivity/modularity based schemes transfer function based on direct interactions. As seen in the middle, proximity based schemes diffuse function through the network. Finally, as seen on the right, pattern based schemes derive templates of functional interactions and interpolate these patterns accordingly to infer novel functions for proteins.

**FIGURE 7**
Framework for the integration of omic data for the discovery of subnetworks implicated in complex phenotypes. Proteomic screening provides functional data for a limited set of proteins, transcriptomic screening provides genome-scale data on mRNA expression, and curated or high-throughput protein–protein interactions provide a framework for the integration of these two complementary, valuable sources of data. This framework also illustrates how researchers can couple specific data sets generated in their labs with public data to broaden the scope of their analyses.

See this image and copyright information in PMC

Cited by

Structure-based systems biology for analyzing off-target binding.
Xie L, Xie L, Bourne PE. Xie L, et al. Curr Opin Struct Biol. 2011 Apr;21(2):189-99. doi: 10.1016/j.sbi.2011.01.004. Epub 2011 Feb 1. Curr Opin Struct Biol. 2011. PMID: 21292475 Free PMC article. Review.
Augmentation of crop productivity through interventions of omics technologies in India: challenges and opportunities.
Pathak RK, Baunthiyal M, Pandey D, Kumar A. Pathak RK, et al. 3 Biotech. 2018 Nov;8(11):454. doi: 10.1007/s13205-018-1473-y. Epub 2018 Oct 19. 3 Biotech. 2018. PMID: 30370195 Free PMC article. Review.
A network-based approach to classify the three domains of life.
Mueller LA, Kugler KG, Netzer M, Graber A, Dehmer M. Mueller LA, et al. Biol Direct. 2011 Oct 13;6:53. doi: 10.1186/1745-6150-6-53. Biol Direct. 2011. PMID: 21995640 Free PMC article.
Identification of rifampin-regulated functional modules and related microRNAs in human hepatocytes based on the protein interaction network.
Li J, Wang Y, Wang L, Dai X, Cong W, Feng W, Xu C, Deng Y, Wang Y, Skaar TC, Liang H, Liu Y. Li J, et al. BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):517. doi: 10.1186/s12864-016-2909-6. BMC Genomics. 2016. PMID: 27557147 Free PMC article.
Evaluating the predictive accuracy of curated biological pathways in a public knowledgebase.
Wright AJ, Orlic-Milacic M, Rothfels K, Weiser J, Trinh QM, Jassal B, Haw RA, Stein LD. Wright AJ, et al. Database (Oxford). 2022 Mar 28;2022:baac009. doi: 10.1093/database/baac009. Database (Oxford). 2022. PMID: 35348650 Free PMC article.

See all "Cited by" articles

References

1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–410. - PubMed
1. Larkin MA, Blackshields G, Brown NP, Chenna R, Mcgettigan PA, Mcwilliam H, et al. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23:2947–2948. - PubMed
1. Kitano H. Systems biology: a brief overview. Science. 2002;295:1662–1664. - PubMed
1. Wang DG, Fan JB, Siao CJ, Berno A, Young P, et al. Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. Science. 1998;280:1077–1082. - PubMed
1. Pollack JR, Perou CM, Alizadeh AA, Eisen MB, Pergamenschikov A, et al. Genome-wide analysis of DNA copy-number changes using cDNA microarrays. Nat Genet. 1999;23:41–46. - PubMed

Publication types

Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Algorithmic and analytical methods in network biology

Affiliations

Algorithmic and analytical methods in network biology

Author

Affiliations

Abstract

Figures

Similar articles

Cited by

References

FURTHER READING

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Miscellaneous