Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jun 6:9:26.
doi: 10.1186/s12918-015-0173-z.

Dynamics of the discovery process of protein-protein interactions from low content studies

Affiliations

Dynamics of the discovery process of protein-protein interactions from low content studies

Zichen Wang et al. BMC Syst Biol. .

Abstract

Background: Thousands of biological and biomedical investigators study of the functional role of single genes and their protein products in normal physiology and in disease. The findings from these studies are reported in research articles that stimulate new research. It is now established that a complex regulatory networks's is controlling human cellular fate, and this community of researchers are continually unraveling this network topology. Attempts to integrate results from such accumulated knowledge resulted in literature-based protein-protein interaction networks (PPINs) and pathway databases. These databases are widely used by the community to analyze new data collected from emerging genome-wide studies with the assumption that the data within these literature-based databases is the ground truth and contain no biases. While suspicion for research focus biases is growing, a concrete proof for it is still missing. It is difficult to prove because the real PPINs are mostly unknown.

Results: Here we analyzed the longitudinal discovery process of literature-based mammalian and yeast PPINs to observe that these networks are discovered non-uniformly. The pattern of discovery is related to a theoretical concept proposed by Kauffman called "expanding the adjacent possible". We introduce a network discovery model which explicitly includes the space of possibilities in the form of a true underlying PPIN.

Conclusions: Our model strongly suggests that research focus biases exist in the observed discovery dynamics of these networks. In summary, more care should be placed when using PPIN databases for analysis of newly acquired data, and when considering prior knowledge when designing new experiments.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Discovery of the mammalian and yeast LC-PPINs over time. Accumulation of discovered proteins (dotted line) and their interactions (solid line) and the discovery rate of interactions and proteins in the mammalian (a, b) and yeast (c, d) literature based PPINs. The accumulation of discovered proteins (red dots) and their interactions (blue dots) are plotted with respect to the ranking index of time for mammalian (e) and yeast (f) PPINs
Fig. 2
Fig. 2
The dynamics of individual proteins in the discovery of mammalian and yeast LC - and combined PPINs. a-b The distribution of growth exponents of the degrees of individual proteins; super-linear growth corresponds to an acceleration in the rate of discovery of PPIs involving the protein in question. c-d The normalized entropy plotted against the mean degree of the actual PPI discovery for the real network and also for reshuffled versions. e-h The distribution of time intervals between PPI discoveries involving each protein for the real PPI discovery process and also randomly reshuffled data in LC-PPINs (e-f) and combined PPINs made from both high-content and low-content studies (g-h)
Fig. 3
Fig. 3
Three model realizations with the scale-free (BA) clustered underlying artificial PPIN. a Distribution of degree growth exponents; (b) distribution of time intervals between PPI discoveries involving each protein; (c) the normalized entropy of PPI discoveries for each protein averaged over each degree
Fig. 4
Fig. 4
Relationship between community structure and PPI discovery rates in PPINs. a Connected components; (b) Communities; (c) modularity, which a quantity that measures the strength of community partition compared to random [30]. d Clusters with significant over-representation of proteins with accelerating or decelerating PPI discovery rates. e, f Subnetworks connecting proteins from two representative cold clusters where proteins are connected through their known interactions with other members of the cluster

References

    1. Cordeddu V, Di Schiavi E, Pennacchio LA, Ma’ayan A, Sarkozy A, Fodale V, Cecchetti S, Cardinale A, Martin J, Schackwitz W. Mutation of SHOC2 promotes aberrant protein N-myristoylation and causes Noonan-like syndrome with loose anagen hair. Nat Genet. 2009;41(9):1022–6. doi: 10.1038/ng.425. - DOI - PMC - PubMed
    1. Lim J, Hao T, Shaw C, Patel AJ, Szabó G, Rual J-F, Fisk CJ, Li N, Smolyar A, Hill DE, et al. A protein–protein interaction network for human inherited ataxias and disorders of Purkinje cell degeneration. Cell. 2006;125(4):801–14. doi: 10.1016/j.cell.2006.03.032. - DOI - PubMed
    1. Vidal M, Cusick Michael E, Barabási A-L. Interactome networks and human disease. Cell. 2011;144(6):986–98. doi: 10.1016/j.cell.2011.02.016. - DOI - PMC - PubMed
    1. Barabasi A-L, Gulbahce N, Loscalzo J. Network medicine: a network-based approach to human disease. Nat Rev Genet. 2011;12(1):56–68. doi: 10.1038/nrg2918. - DOI - PMC - PubMed
    1. Oti M, Snel B, Huynen MA, Brunner HG. Predicting disease genes using protein–protein interactions. J Med Genet. 2006;43(8):691–8. doi: 10.1136/jmg.2006.041376. - DOI - PMC - PubMed

Publication types

LinkOut - more resources