. 2017 Sep;10(3):93.

doi: 10.3390/a10030093. Epub 2017 Aug 19.

Post-Processing Partitions to Identify Domains of Modularity Optimization

William H Weir^{1

2}, Scott Emmons¹, Ryan Gibson¹, Dane Taylor¹, Peter J Mucha^{1

2}

Affiliations

¹ Carolina Center for Interdisciplinary Applied Mathematics, Department of Mathematics, University of North Carolina, Chapel Hill, NC 27599, USA.
² Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, NC 27599, USA.

PMID: 29046743
PMCID: PMC5642987
DOI: 10.3390/a10030093

Post-Processing Partitions to Identify Domains of Modularity Optimization

William H Weir et al. Algorithms. 2017 Sep.

. 2017 Sep;10(3):93.

doi: 10.3390/a10030093. Epub 2017 Aug 19.

Authors

William H Weir^{1

2}, Scott Emmons¹, Ryan Gibson¹, Dane Taylor¹, Peter J Mucha^{1

2}

Affiliations

¹ Carolina Center for Interdisciplinary Applied Mathematics, Department of Mathematics, University of North Carolina, Chapel Hill, NC 27599, USA.
² Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, NC 27599, USA.

PMID: 29046743
PMCID: PMC5642987
DOI: 10.3390/a10030093

Abstract

We introduce the Convex Hull of Admissible Modularity Partitions (CHAMP) algorithm to prune and prioritize different network community structures identified across multiple runs of possibly various computational heuristics. Given a set of partitions, CHAMP identifies the domain of modularity optimization for each partition-i.e., the parameter-space domain where it has the largest modularity relative to the input set-discarding partitions with empty domains to obtain the subset of partitions that are "admissible" candidate community structures that remain potentially optimal over indicated parameter domains. Importantly, CHAMP can be used for multi-dimensional parameter spaces, such as those for multilayer networks where one includes a resolution parameter and interlayer coupling. Using the results from CHAMP, a user can more appropriately select robust community structures by observing the sizes of domains of optimization and the pairwise comparisons between partitions in the admissible subset. We demonstrate the utility of CHAMP with several example networks. In these examples, CHAMP focuses attention onto pruned subsets of admissible partitions that are 20-to-1785 times smaller than the sets of unique partitions obtained by community detection heuristics that were input into CHAMP.

Keywords: community detection; modularity; multilayer networks; networks; resolution parameter.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: The authors declare no conflict of interest.

Figures

**Figure A1**
Visualizations of partitions labeled in white in Figure 6A, with Senators grouped according to their state. The listed AMI is the average over layers of the AMI in each layer (Congress) between the communities and political party affiliations for that Congress. Partitions are labeled “*X.Y*” with X the number of communities with ≥ 5 nodes and Y the rank of the domain area for that number of communities

**Figure A2**
Visualizations of partitions labeled in white in Figure 6A, with Senators sorted by their most frequent community label (with the labels sorted by last appearance in time), and within communities by first appearance. The listed AMI is the average over layers of the AMI in each layer (Congress) between the communities and political party affiliations in that Congress.

**Figure 1**
(A) Modularity Q(γ) given by Equation (1) versus resolution parameter γ for 50, 000 runs (10% of results displayed here) of the Louvain algorithm [42,47] at different γ on the unweighted NCAA Division I-A (2000) college football network [37,38]. Grey triangles indicate the number of communities that include ≥ 5 nodes in each run, while the green step function shows the number in the optimal partition in each domain; (B) Graphical depiction of CHAMP algorithm (see Section 2). Each line indicates *Q_σ*(γ) given by Equation (2) for a particular partition σ. Both panels show the convex hull of these lines as the dashed green piecewise-linear curve, with the transition values represented by downward triangles.

**Figure 2**
(A) ForceAtlas2 [52] layout, created with [53], of the unweighted NCAA Division I-A (2000) college football network. Nodes are colored according to the dominant 12-community partition with the widest γ-domain γ ∈ [1.45, 3.89], with node shapes and border indicating their conference labels; (B) Pairwise adjusted mutual information (N = AMI) between all partitions in the admissible subset identified by CHAMP, arranged by their corresponding γ-domains of optimality. Dashed lines indicate the transition values of γ identified by CHAMP.

**Figure 3**
(A) Modularity Q(γ) given by Equation (1) v. resolution parameter γ for 20, 000 runs (25% of results shown) of Louvain [42,47] on the Human Protein Reactome network [54]. Small, grey triangles indicate the number of communities that include ≥ 5 nodes in each run, while the dark green step function shows the number in the optimal partition in each domain. The dashed green curve is the piecewise-linear modularity function for the optimal partitions, with the transition values marked by blue triangles; (B) Pairwise AMI between all partitions in the admissible subset identified by CHAMP, arranged by their corresponding γ-domains of optimality.

**Figure 4**
ForceAtlas2 layout [52], created with [53], of the Human Reactome Network (edges downsampled to 50,000), colored according to the partitions with the two widest γ-domains of optimization identified by CHAMP from 20, 000 runs of Louvain.

**Figure 5**
(A) Modularity Q(γ) v. γ for 100, 000 runs (5% of results shown) of Louvain [42,47] on the Caltech Facebook network [31]. Orange triangles indicate the number of communities that include ≥ 5 nodes in each run, while the red step function shows the number in the optimal partition in each domain. The dashed green curve is the piecewise-linear modularity function for the optimal partitions, with the transition values marked by blue triangles. The condensed layout of communities (created with [53]) here visualizes the optimal partition found for γ ∈ [0.908, 1.09], with each pie-chart corresponding to a community, fractionally colored according to the House membership of the nodes in the community. The AMI between this partition and House labels (including the missing label) is 0.513; (B) Pairwise AMI between all partitions in the admissible subset identified by CHAMP, arranged by their corresponding γ-domains of optimality.

**Figure 6**
(A) Domains of optimization for the pruned set of partitions, colored by the number of communities within each partition. The set of partitions was generated from 240, 000 runs of GenLouvain [41] on a 600 × 400 uniform grid over [0.3, 2] × [0, 2] in (γ,ω). The largest partitions are labeled “*X.Y*” with X the number of communities with ≥ 5 nodes and Y the rank of the domain area (that is, in terms of size) for that given number of communities (e.g., “5.2” is the second-largest domain corresponding to 5-community partitions). The partitions of each labeled domain are visualized in Appendix A; (B) Weighted-average AMI of each partition with its neighboring domains’ partitions, weighted by the length of the borders between neighboring domains.

**Figure 7**
Time-varying community structure for the U.S. Senate from 1789 to 2008 according to the (A,B) 5-community and (C,D) 8-community partitions with widest domains of optimality (see labels 5.1 and 8.1 in Figure 6A); (A,C) The vertical axis indicates individual Senators, sorted by community label and time. The AMI reported here is the average over layers (Congresses) of the AMIs in each layer between the identified communities in that layer and political party labels. (This layer-averaged AMI is shown for all partitions in the convex hull over the originally searched parameter range in Figure 8.) (B,D) The vertical axis indicates the state of a Senator, sorted according to geographic region, and the horizontal axis represents time (two-year Congresses).

**Figure 8**
The domains of optimality for the time-varying U.S. Senate roll-call similarity network (as in Figure 6), colored by the layer-averaged AMI between the political-party affiliations of Senators and the community labels {*c_iσ*} for that layer.

See this image and copyright information in PMC

Cited by

Comparing Shared Patient Networks Across Payers.
Trogdon JG, Weir WH, Shai S, Mucha PJ, Kuo TM, Meyer AM, Stitzenberg KB. Trogdon JG, et al. J Gen Intern Med. 2019 Oct;34(10):2014-2020. doi: 10.1007/s11606-019-04978-9. Epub 2019 Apr 3. J Gen Intern Med. 2019. PMID: 30945065 Free PMC article.
Multiresolution Consensus Clustering in Networks.
Jeub LGS, Sporns O, Fortunato S. Jeub LGS, et al. Sci Rep. 2018 Feb 19;8(1):3259. doi: 10.1038/s41598-018-21352-7. Sci Rep. 2018. PMID: 29459635 Free PMC article.
The modulation of brain network integration and arousal during exploration.
Tardiff N, Medaglia JD, Bassett DS, Thompson-Schill SL. Tardiff N, et al. Neuroimage. 2021 Oct 15;240:118369. doi: 10.1016/j.neuroimage.2021.118369. Epub 2021 Jul 6. Neuroimage. 2021. PMID: 34242784 Free PMC article.
Finite-state parameter space maps for pruning partitions in modularity-based community detection.
Gibson RA, Mucha PJ. Gibson RA, et al. Sci Rep. 2022 Sep 23;12(1):15928. doi: 10.1038/s41598-022-20142-6. Sci Rep. 2022. PMID: 36151268 Free PMC article.
Metabolomic networks connect host-microbiome processes to human Clostridioides difficile infections.
Robinson JI, Weir WH, Crowley JR, Hink T, Reske KA, Kwon JH, Burnham CD, Dubberke ER, Mucha PJ, Henderson JP. Robinson JI, et al. J Clin Invest. 2019 Aug 12;129(9):3792-3806. doi: 10.1172/JCI126905. eCollection 2019 Aug 12. J Clin Invest. 2019. PMID: 31403473 Free PMC article.

See all "Cited by" articles

References

1. Porter MA, Onnela JP, Mucha PJ. Communities in networks. Not AMS. 2009;56:1082–1097. 1164–1166.
1. Fortunato S. Community detection in graphs. Phys Rep. 2010;486:75–174.
1. Fortunato S, Hric D. Community detection in networks: A user guide. Phys Rep. 2016;659:1–44.
1. Abbe E. Community detection and stochastic block models: Recent developments. 2017 arXiv. arXiv:1703.10146.
1. Schaub MT, Delvenne JC, Rosvall M, Lambiotte R. The many facets of community detection in complex networks. Appl Netw Sci. 2017;2 doi: 10.1007/s41109-017-0023-6. - DOI - PMC - PubMed

Grants and funding

LinkOut - more resources

Full Text Sources
- Europe PubMed Central
- PubMed Central
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Post-Processing Partitions to Identify Domains of Modularity Optimization

Affiliations

Post-Processing Partitions to Identify Domains of Modularity Optimization

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources