Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Oct 6;7(51):1411-9.
doi: 10.1098/rsif.2010.0044. Epub 2010 Apr 8.

Incomplete and noisy network data as a percolation process

Affiliations

Incomplete and noisy network data as a percolation process

Michael P H Stumpf et al. J R Soc Interface. .

Abstract

We discuss the ramifications of noisy and incomplete observations of network data on the existence of a giant connected component (GCC). The existence of a GCC in a random graph can be described in terms of a percolation process, and building on general results for classes of random graphs with specified degree distributions we derive percolation thresholds above which GCCs exist. We show that sampling and noise can have a profound effect on the perceived existence of a GCC and find that both processes can destroy it. We also show that the absence of a GCC puts a theoretical upper bound on the false-positive rate and relate our percolation analysis to experimental protein-protein interaction data.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Contour-plot of the fraction of nodes contained in the GCC for different fractions of true-positive and false-positive edges in a network (area labels refer to the whole area to the right and above of the corresponding curves). Only when the fraction of true and false positives are both very small, will the GCC disappear. On the left-hand side of the figure we show illustrative examples of cases which correspond (loosely) to areas in the (ρ, Ξ) plane indicated by the arrow-tips. Real edges are indicated in blue, whereas false-positive edges are drawn in green.
Figure 2.
Figure 2.
Average fraction of nodes in subnets of a network with 5000 nodes and 15 000 edges against sampling fraction/subnet size. Shown are results obtained for a scale-free network grown under the BA preferential attachment model (black), the corresponding BC graph (see text for details) (grey), and an Erdös–Rényi random graph. Averages were taken over 1000 independent simulated networks (black circles, scale-free RGG; crosses, scale-free BC; triangles, ER, random graph).
Figure 3.
Figure 3.
Representations of eight networks in the IntAct database. The two early data sets generated by mass spectrometry (Ho et al. 2002; Gavin et al. 2002) do not show evidence for a GCC, as may well be expected given that they focus on protein complexes. The details of the data sets are provided in table 1.
Figure 4.
Figure 4.
The coloured regions indicate combinations in (〈k2〉, ρ, Ξ)-space for which we will observe a GCC. (a) The corresponds to the case of in Ito et al. (2001), (b) to the case of in Gavin et al. (2002). Perhaps somewhat unexpectedly we also observe a pronounced effect of (ξ ≈ 14.5% in (a) and ≈ 12.1% in (b)) on the boundary of the parameter space allowing for a GCC with high probability.

References

    1. Arfken G., Weber H. 2005. Mathematical methods for physicists, 6th edn. New York, NY: Academic Press.
    1. Bader J. S., Chaudhuri A., Rothberg J. M., Chant J. 2004. Gaining confidence in high-throughput protein interaction networks. Nat. Biotechnol. 22, 78–85. (10.1038/nbt924) - DOI - PubMed
    1. Barabasi A., Albert R. 1999. Emergence of scaling in random networks. Science 286, 509–512. (10.1126/science.286.5439.509) - DOI - PubMed
    1. Barabasi A., Ravasz E., Vicsek T. 2001. Deterministic scale-free networks. Physica A 299, 559–564. (10.1016/S0378-4371(01)00369-7) - DOI
    1. Bender E., Canfield E. 1978. The asymptotic number of labeled graphs with given degree sequence. J. Combin. Theory A 24, 296–307. (10.1016/0097-3165(78)90059-6) - DOI

Publication types

MeSH terms

LinkOut - more resources