Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Jun 15;28(12):i49-58.
doi: 10.1093/bioinformatics/bts212.

A single source k-shortest paths algorithm to infer regulatory pathways in a gene network

Affiliations

A single source k-shortest paths algorithm to infer regulatory pathways in a gene network

Yu-Keng Shih et al. Bioinformatics. .

Abstract

Motivation: Inferring the underlying regulatory pathways within a gene interaction network is a fundamental problem in Systems Biology to help understand the complex interactions and the regulation and flow of information within a system-of-interest. Given a weighted gene network and a gene in this network, the goal of an inference algorithm is to identify the potential regulatory pathways passing through this gene.

Results: In a departure from previous approaches that largely rely on the random walk model, we propose a novel single-source k-shortest paths based algorithm to address this inference problem. An important element of our approach is to explicitly account for and enhance the diversity of paths discovered by our algorithm. The intuition here is that diversity in paths can help enrich different functions and thereby better position one to understand the underlying system-of-interest. Results on the yeast gene network demonstrate the utility of the proposed approach over extant state-of-the-art inference algorithms. Beyond utility, our algorithm achieves a significant speedup over these baselines.

Availability: All data and codes are freely available upon request.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Two problems of the random walk model. (a) Normalization of edge weights is lossy. (b) The walks repeatedly go through the bidirectional edge
Fig. 2.
Fig. 2.
A graph and its corresponding pseudo tree. k=3. The node with a dash circle is a dummy node
Fig. 3.
Fig. 3.
An example with O(n2k) dummy nodes. (a) The graph. Each bold arrow represents k−1 paths. (b) The pseudo tree. The red box contains total (k−1)q dummy nodes in the paths from S to Aq−1
Fig. 4.
Fig. 4.
Overview of our inference framework
Fig. 5.
Fig. 5.
Evaluation using GOann: (a/b) The sorted e-values of the top GO term. (a) UnknownCausal using different k, (b) UnknownTarget using different k. The information model uses α=0.99. (c) The sorted maximal differences of e-values in each gold standard in UnknownTarget between the shortest paths and the diverse short paths using different λ. k=5
Fig. 6.
Fig. 6.
Evaluation using GOall. (a) UnknownCausal using different k, (b) UnknownTarget using different k and (c) Maximal differences. The description is the same as Figure 5
Fig. 7.
Fig. 7.
The comparison between our algorithm and Y-STATISTICAL on our yeast gene network. The starting node is GGC1. k = 5. (a) The average importance value vs. the average k-paths diversity and (b) Execution time vs. the average k-paths diversity
Fig. 8.
Fig. 8.
The top 10 genes when λ = 0 and their main pathways, given the starting node RTS1. The weight of each edge is reported by its thickness. Darker nodes have higher importance values when λ = 1.0. k=5

Similar articles

Cited by

References

    1. Ashburner M., et al. Gene ontology: tool for the unification of biology. Nat Genet. 2000;25:25–29. - PMC - PubMed
    1. Bader J.S., et al. Gaining confidence in high-throughput protein interaction networks. Nat Biotechnol. 2004;22:78–85. - PubMed
    1. Bebek G., Yang J. Pathfinder: mining signal transduction pathway segments from protein-protein interaction networks. BMC Bioinformatics. 2007;8:335. - PMC - PubMed
    1. Beyer A., et al. Integrated Assessment and Prediction of Transcription Factor Binding. PLoS Comput Biol. 2006;2:e70. - PMC - PubMed
    1. Chan L., Amon A. The protein phosphatase 2a functions in the spindle position checkpoint by regulating the checkpoint kinase kin4. Genes Dev. 2009;23:1639–1649. - PMC - PubMed

Publication types

MeSH terms