Tolerating the community detection resolution limit with edge weighting
- PMID: 21728617
- DOI: 10.1103/PhysRevE.83.056119
Tolerating the community detection resolution limit with edge weighting
Abstract
Communities of vertices within a giant network such as the World Wide Web are likely to be vastly smaller than the network itself. However, Fortunato and Barthélemy have proved that modularity maximization algorithms for community detection may fail to resolve communities with fewer than √L/2 edges, where L is the number of edges in the entire network. This resolution limit leads modularity maximization algorithms to have notoriously poor accuracy on many real networks. Fortunato and Barthélemy's argument can be extended to networks with weighted edges as well, and we derive this corollary argument. We conclude that weighted modularity algorithms may fail to resolve communities with less than √Wε/2 total edge weight, where W is the total edge weight in the network and ε is the maximum weight of an intercommunity edge. If ε is small, then small communities can be resolved. Given a weighted or unweighted network, we describe how to derive new edge weights in order to achieve a low ε, we modify the Clauset, Newman, and Moore (CNM) community detection algorithm to maximize weighted modularity, and we show that the resulting algorithm has greatly improved accuracy. In experiments with an emerging community standard benchmark, we find that our simple CNM variant is competitive with the most accurate community detection methods yet proposed.
© 2011 American Physical Society
Similar articles
-
Algorithm for parametric community detection in networks.Phys Rev E Stat Nonlin Soft Matter Phys. 2012 Jul;86(1 Pt 2):016107. doi: 10.1103/PhysRevE.86.016107. Epub 2012 Jul 13. Phys Rev E Stat Nonlin Soft Matter Phys. 2012. PMID: 23005491
-
Network community-detection enhancement by proper weighting.Phys Rev E Stat Nonlin Soft Matter Phys. 2011 Apr;83(4 Pt 2):046104. doi: 10.1103/PhysRevE.83.046104. Epub 2011 Apr 11. Phys Rev E Stat Nonlin Soft Matter Phys. 2011. PMID: 21599237
-
Locally optimal heuristic for modularity maximization of networks.Phys Rev E Stat Nonlin Soft Matter Phys. 2011 May;83(5 Pt 2):056105. doi: 10.1103/PhysRevE.83.056105. Epub 2011 May 6. Phys Rev E Stat Nonlin Soft Matter Phys. 2011. PMID: 21728603
-
Edge ratio and community structure in networks.Phys Rev E Stat Nonlin Soft Matter Phys. 2010 Feb;81(2 Pt 2):026105. doi: 10.1103/PhysRevE.81.026105. Epub 2010 Feb 17. Phys Rev E Stat Nonlin Soft Matter Phys. 2010. PMID: 20365629
-
Detecting communities using asymptotical surprise.Phys Rev E Stat Nonlin Soft Matter Phys. 2015 Aug;92(2):022816. doi: 10.1103/PhysRevE.92.022816. Epub 2015 Aug 24. Phys Rev E Stat Nonlin Soft Matter Phys. 2015. PMID: 26382463
Cited by
-
Micro-blog user community discovery using generalized SimRank edge weighting method.PLoS One. 2018 May 7;13(5):e0196447. doi: 10.1371/journal.pone.0196447. eCollection 2018. PLoS One. 2018. PMID: 29734358 Free PMC article.
-
Global vs local modularity for network community detection.PLoS One. 2018 Oct 29;13(10):e0205284. doi: 10.1371/journal.pone.0205284. eCollection 2018. PLoS One. 2018. PMID: 30372429 Free PMC article.
-
Spike-train communities: finding groups of similar spike trains.J Neurosci. 2011 Feb 9;31(6):2321-36. doi: 10.1523/JNEUROSCI.2853-10.2011. J Neurosci. 2011. PMID: 21307268 Free PMC article.
-
Community landscapes: an integrative approach to determine overlapping network module hierarchy, identify key nodes and predict network dynamics.PLoS One. 2010 Sep 2;5(9):e12528. doi: 10.1371/journal.pone.0012528. PLoS One. 2010. PMID: 20824084 Free PMC article.
-
Clustering of 770,000 genomes reveals post-colonial population structure of North America.Nat Commun. 2017 Feb 7;8:14238. doi: 10.1038/ncomms14238. Nat Commun. 2017. PMID: 28169989 Free PMC article.
Publication types
LinkOut - more resources
Other Literature Sources