Finding community structure in very large networks
- PMID: 15697438
- DOI: 10.1103/PhysRevE.70.066111
Finding community structure in very large networks
Abstract
The discovery and analysis of community structure in networks is a topic of considerable recent interest within the physics community, but most methods proposed so far are unsuitable for very large networks because of their computational cost. Here we present a hierarchical agglomeration algorithm for detecting community structure which is faster than many competing algorithms: its running time on a network with n vertices and m edges is O (md log n) where d is the depth of the dendrogram describing the community structure. Many real-world networks are sparse and hierarchical, with m approximately n and d approximately log n, in which case our algorithm runs in essentially linear time, O (n log(2) n). As an example of the application of this algorithm we use it to analyze a network of items for sale on the web site of a large on-line retailer, items in the network being linked if they are frequently purchased by the same buyer. The network has more than 400 000 vertices and 2 x 10(6) edges. We show that our algorithm can extract meaningful communities from this network, revealing large-scale patterns present in the purchasing habits of customers.
Similar articles
-
Finding local community structure in networks.Phys Rev E Stat Nonlin Soft Matter Phys. 2005 Aug;72(2 Pt 2):026132. doi: 10.1103/PhysRevE.72.026132. Epub 2005 Aug 29. Phys Rev E Stat Nonlin Soft Matter Phys. 2005. PMID: 16196669
-
Unfolding communities in large complex networks: combining defensive and offensive label propagation for core extraction.Phys Rev E Stat Nonlin Soft Matter Phys. 2011 Mar;83(3 Pt 2):036103. doi: 10.1103/PhysRevE.83.036103. Epub 2011 Mar 8. Phys Rev E Stat Nonlin Soft Matter Phys. 2011. PMID: 21517554
-
Towards real-time community detection in large networks.Phys Rev E Stat Nonlin Soft Matter Phys. 2009 Jun;79(6 Pt 2):066107. doi: 10.1103/PhysRevE.79.066107. Epub 2009 Jun 16. Phys Rev E Stat Nonlin Soft Matter Phys. 2009. PMID: 19658564
-
Adaptive clustering algorithm for community detection in complex networks.Phys Rev E Stat Nonlin Soft Matter Phys. 2008 Oct;78(4 Pt 2):046115. doi: 10.1103/PhysRevE.78.046115. Epub 2008 Oct 30. Phys Rev E Stat Nonlin Soft Matter Phys. 2008. PMID: 18999501
-
Fast algorithm for detecting community structure in networks.Phys Rev E Stat Nonlin Soft Matter Phys. 2004 Jun;69(6 Pt 2):066133. doi: 10.1103/PhysRevE.69.066133. Epub 2004 Jun 18. Phys Rev E Stat Nonlin Soft Matter Phys. 2004. PMID: 15244693
Cited by
-
Community detection in networks by dynamical optimal transport formulation.Sci Rep. 2022 Oct 7;12(1):16811. doi: 10.1038/s41598-022-20986-y. Sci Rep. 2022. PMID: 36207412 Free PMC article.
-
Clustering Scientific Publications Based on Citation Relations: A Systematic Comparison of Different Methods.PLoS One. 2016 Apr 28;11(4):e0154404. doi: 10.1371/journal.pone.0154404. eCollection 2016. PLoS One. 2016. PMID: 27124610 Free PMC article.
-
Surprise maximization reveals the community structure of complex networks.Sci Rep. 2013;3:1060. doi: 10.1038/srep01060. Epub 2013 Jan 14. Sci Rep. 2013. PMID: 23320141 Free PMC article.
-
Opioid analgesic and benzodiazepine prescribing among Medicaid-enrollees with opioid use disorders: The influence of provider communities.J Addict Dis. 2017 Jan-Mar;36(1):14-22. doi: 10.1080/10550887.2016.1211784. Epub 2016 Jul 22. J Addict Dis. 2017. PMID: 27449904 Free PMC article.
-
A DIseAse MOdule Detection (DIAMOnD) algorithm derived from a systematic analysis of connectivity patterns of disease proteins in the human interactome.PLoS Comput Biol. 2015 Apr 8;11(4):e1004120. doi: 10.1371/journal.pcbi.1004120. eCollection 2015 Apr. PLoS Comput Biol. 2015. PMID: 25853560 Free PMC article.
LinkOut - more resources
Other Literature Sources
Research Materials