Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jul 20;22(4):bbaa277.
doi: 10.1093/bib/bbaa277.

IHP-PING-generating integrated human protein-protein interaction networks on-the-fly

Affiliations

IHP-PING-generating integrated human protein-protein interaction networks on-the-fly

Gaston K Mazandu et al. Brief Bioinform. .

Abstract

Advances in high-throughput sequencing technologies have resulted in an exponential growth of publicly accessible biological datasets. In the 'big data' driven 'post-genomic' context, much work is being done to explore human protein-protein interactions (PPIs) for a systems level based analysis to uncover useful signals and gain more insights to advance current knowledge and answer specific biological and health questions. These PPIs are experimentally or computationally predicted, stored in different online databases and some of PPI resources are updated regularly. As with many biological datasets, such regular updates continuously render older PPI datasets potentially outdated. Moreover, while many of these interactions are shared between these online resources, each resource includes its own identified PPIs and none of these databases exhaustively contains all existing human PPI maps. In this context, it is essential to enable the integration of or combining interaction datasets from different resources, to generate a PPI map with increased coverage and confidence. To allow researchers to produce an integrated human PPI datasets in real-time, we introduce the integrated human protein-protein interaction network generator (IHP-PING) tool. IHP-PING is a flexible python package which generates a human PPI network from freely available online resources. This tool extracts and integrates heterogeneous PPI datasets to generate a unified PPI network, which is stored locally for further applications.

Keywords: high-throughput technology; human proteome; network analysis; post-genomic analysis; protein–protein interaction.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Overall workflow of the IHP-PING tool. The scheme goes through three main steps from user input to a generated human PPI network: Input is parsed via a simple single command-line terminal, then the selected human PPI datasets are retrieved and network generated in tsv, csv or csv2 format.
Figure 2
Figure 2
The IHP-PING output structure. The first two columns are the two interacting proteins, following columns are interaction source confidence or reliability scores and the last column is the interaction combined score. Space between columns can be tabs, commas or semicolons depending on whether the user chooses tsv, csv or csv2.
Figure 3
Figure 3
Distribution of interactions obtained from different resources contributing to a unified human PPI network—All interactions per source.
Figure 4
Figure 4
Distribution of interactions obtained from different resources contributing to a unified human PPI network—All interactions per source in high-low-medium confidence level interaction frequencies.
Figure 5
Figure 5
Highlighting lower and upper score thresholds for low-medium-high confidence levels.
Figure 6
Figure 6
Venn diagrams showing the general distributions of shared PPIs between experimentally derived PPIs from online databases following IMEx curation guidelines (IntAct, MINT and DIP), as well as BioGRID, HPRD and MPPI-MIPS, and computationally predicted PPIs from STRING and Sequence.
Figure 7
Figure 7
The unified human PPI network topological property. Power-law property visualizing protein degree against connections frequency in the network.
Figure 8
Figure 8
The unified human PPI network topological property. Small-world property—the distribution of shortest path lengths within the interaction network.
Figure 9
Figure 9
Using igraph and tcltk R software libraries to plot the Folylpolyglutamate synthase, mitochondrial (Q05932) protein for illustration. Node or protein size is now proportional to the its degree in the protein–protein interaction network and each link is proportional to the number of its data sources provided in Table 2, together with final or unified confidence scores.

References

    1. Mazandu GK, Chimusa ER, Rutherford K, et al. . Large-scale data-driven integrative framework for extracting essential targets and processes from disease-associated gene data sets. Brief Bioinform 2018;19(6):1141–52. - PubMed
    1. Cusick ME, Klitgord N, Vidal M, et al. . Interactome: gateway into systems biology. Hum Mol Genet 2005;14(2):R171–81. - PubMed
    1. Mazandu GK, Mulder NJ. Using the underlying biological organization of the mycobacterium tuberculosis functional network for protein function prediction. Infect Genet Evol 2011;12(5):922–32. - PubMed
    1. Mazandu GK, Opap K, Mulder NJ. Contribution of microarray data to the advancement of knowledge on the mycobacterium tuberculosis interactome: use of the random partial least squares approach. Infect Genet Evol 2011;11(4):725–33. - PubMed
    1. Mazandu GK, Kyomugisha I, Geza E, et al. . Designing data-driven learning algorithms: A necessity to ensure effective post-genomic medicine and biomedical research. In: Artificial Intelligence - Applications in Medicine and Biology. 5 Princes Gate Court, London, UK: IntechOpen Publisher, 2019, 3–18.

Publication types