Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Apr 12:6:24115.
doi: 10.1038/srep24115.

Overlapping Community Detection based on Network Decomposition

Affiliations

Overlapping Community Detection based on Network Decomposition

Zhuanlian Ding et al. Sci Rep. .

Abstract

Community detection in complex network has become a vital step to understand the structure and dynamics of networks in various fields. However, traditional node clustering and relatively new proposed link clustering methods have inherent drawbacks to discover overlapping communities. Node clustering is inadequate to capture the pervasive overlaps, while link clustering is often criticized due to the high computational cost and ambiguous definition of communities. So, overlapping community detection is still a formidable challenge. In this work, we propose a new overlapping community detection algorithm based on network decomposition, called NDOCD. Specifically, NDOCD iteratively splits the network by removing all links in derived link communities, which are identified by utilizing node clustering technique. The network decomposition contributes to reducing the computation time and noise link elimination conduces to improving the quality of obtained communities. Besides, we employ node clustering technique rather than link similarity measure to discover link communities, thus NDOCD avoids an ambiguous definition of community and becomes less time-consuming. We test our approach on both synthetic and real-world networks. Results demonstrate the superior performance of our approach both in computation time and accuracy compared to state-of-the-art algorithms.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Comparison of computation time of different algorithms on synthetic networks with different sizes.
Plots show runtime (s) for networks with n = 100 ~ 1000, k = 10, kmax = 50, u = 0.1, τ1 = 2, τ2 = 1, cmax = 50, cmin = 10, om = 2, on/n = 10%.
Figure 2
Figure 2. The effects of mixing parameter u and network size n on synthetic networks.
(a) EQ for networks with n = 500, k = 25, kmax = 50, u = 0.1 ~ 0.6, τ1 = 2, τ2 = 1, cmax = 50, cmin = 10, om = 2, on/n = 10%, (b) ENMI for networks with n = 500, k = 25, kmax = 50, u = 0.1 ~ 0.6, τ1 = 2, τ2 = 1, cmax = 50, cmin = 10, om = 2, on/n = 10%, (c) EQ for networks with n = 100 ~ 1000, k = 25, kmax = 50, u = 0.1, τ1 = 2, τ2 = 1, cmax = 50, cmin = 10, om = 2, on/n = 10%, (d) ENMI for networks with n = 100 ~ 1000, k = 25, kmax = 50, u = 0.1, τ1 = 2, τ2 = 1, cmax = 50, cmin = 10, om = 2, on/n = 10%.
Figure 3
Figure 3. The effects of overlapping diversity om and overlapping density on/n on synthetic networks.
(a) EQ for networks with n = 500, k = 25, kmax = 50, u = 0.4, τ1 = 2, τ2 = 1, cmax = 50, cmin = 10, om = 2 ~ 8, on/n = 10%, (b) ENMI for networks with n = 500, k = 25, kmax = 50, u = 0.4, τ1 = 2, τ2 = 1, cmax = 50, cmin = 10, om = 2 ~ 8, on/n = 10%, (c) EQ for networks with n = 500, k = 25, kmax = 50, u = 0.4, τ1 = 2, τ2 = 1, cmax = 50, cmin = 10, om = 2, on/n = 10% ~ 60%, (d) ENMI for networks with n = 500, k = 25, kmax = 50, u = 0.4, τ1 = 2, τ2 = 1, cmax = 50, cmin = 10, om = 2, on/n = 10% ~ 60%.
Figure 4
Figure 4. Histogram of the detected community sizes on LFR benchmark.
(a) Comparison on networks with n = 500, k = 25, kmax = 50, u = 0.1, τ1 = 2, τ2 = 1, cmax = 50, cmin = 10, om = 2, on/n = 10%, (b) Comparison on networks with n = 500, k = 25, kmax = 50, u = 0.4, τ1 = 2, τ2 = 1, cmax = 50, cmin = 10, om = 2, on/n = 10%.
Figure 5
Figure 5. Precision, Recall and F-measure values for PPI-D1, PPI-D2 and Y2H.
(a) Precision values for PPI-D1, PPI-D2 and Y2H, (b) Recall values for PPI-D1, PPI-D2 and Y2H, (c) F-measure values for PPI-D1, PPI-D2 and Y2H.
Figure 6
Figure 6. Visualization of reference and predicted complexes in PPI-D1 for NDOCD, LC and OCG.
(a) Visualization of reference complexes, (b) Visualization of predicted complexes for NDOCD, (c) Visualization of predicted complexes for LC, (d) Visualization of predicted complexes for OCG.
Figure 7
Figure 7. An illustration of our main idea.
(a) Network decomposition procedure of our method, (b) Result of our method.
Figure 8
Figure 8. An example of bridge edge in a network.
The black line represents the bridge edge in the network.

Similar articles

Cited by

References

    1. Newman M. E. J. Communities, modules and large-scale structure in networks. Nat. Phys. 8, 25–31 (2012).
    1. Lancichinetti A. & Fortunato S. Community detection algorithms: A comparative analysis. Phys. Rev. E 80, 056117, 10.1103/PhysRevE.80.056117 (2009). - DOI - PubMed
    1. Fortunato S. Community detection in graphs. Phys. Rep. 486, 75–174 (2010).
    1. Girvan M. & Newman M. E. J. Community structure in social and biological networks. Proc. Natl. Acad. Sci. USA 99, 7821–7826 (2002). - PMC - PubMed
    1. Xie J., Kelley S. & Szymanski B. K. Overlapping community detection in networks: the state of the art and comparative study. ACM Comput. Surv. 45, 43, 10.1145/2501654.2501657 (2013). - DOI

Publication types

LinkOut - more resources