A DC programming approach for finding communities in networks
- PMID: 25248085
- DOI: 10.1162/NECO_a_00673
A DC programming approach for finding communities in networks
Abstract
Automatic discovery of community structures in complex networks is a fundamental task in many disciplines, including physics, biology, and the social sciences. The most used criterion for characterizing the existence of a community structure in a network is modularity, a quantitative measure proposed by Newman and Girvan (2004). The discovery community can be formulated as the so-called modularity maximization problem that consists of finding a partition of nodes of a network with the highest modularity. In this letter, we propose a fast and scalable algorithm called DCAM, based on DC (difference of convex function) programming and DCA (DC algorithms), an innovative approach in nonconvex programming framework for solving the modularity maximization problem. The special structure of the problem considered here has been well exploited to get an inexpensive DCA scheme that requires only a matrix-vector product at each iteration. Starting with a very large number of communities, DCAM furnishes, as output results, an optimal partition together with the optimal number of communities [Formula: see text]; that is, the number of communities is discovered automatically during DCAM's iterations. Numerical experiments are performed on a variety of real-world network data sets with up to 4,194,304 nodes and 30,359,198 edges. The comparative results with height reference algorithms show that the proposed approach outperforms them not only on quality and rapidity but also on scalability. Moreover, it realizes a very good trade-off between the quality of solutions and the run time.
Similar articles
-
Algorithm for parametric community detection in networks.Phys Rev E Stat Nonlin Soft Matter Phys. 2012 Jul;86(1 Pt 2):016107. doi: 10.1103/PhysRevE.86.016107. Epub 2012 Jul 13. Phys Rev E Stat Nonlin Soft Matter Phys. 2012. PMID: 23005491
-
Sparse Covariance Matrix Estimation by DCA-Based Algorithms.Neural Comput. 2017 Nov;29(11):3040-3077. doi: 10.1162/neco_a_01012. Epub 2017 Sep 28. Neural Comput. 2017. PMID: 28957024
-
Column generation algorithms for exact modularity maximization in networks.Phys Rev E Stat Nonlin Soft Matter Phys. 2010 Oct;82(4 Pt 2):046112. doi: 10.1103/PhysRevE.82.046112. Epub 2010 Oct 21. Phys Rev E Stat Nonlin Soft Matter Phys. 2010. PMID: 21230350
-
Edge ratio and community structure in networks.Phys Rev E Stat Nonlin Soft Matter Phys. 2010 Feb;81(2 Pt 2):026105. doi: 10.1103/PhysRevE.81.026105. Epub 2010 Feb 17. Phys Rev E Stat Nonlin Soft Matter Phys. 2010. PMID: 20365629
-
Z-Score-Based Modularity for Community Detection in Networks.PLoS One. 2016 Jan 25;11(1):e0147805. doi: 10.1371/journal.pone.0147805. eCollection 2016. PLoS One. 2016. PMID: 26808270 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources