. 2019 Sep;16(9):843-852.

doi: 10.1038/s41592-019-0509-5. Epub 2019 Aug 30.

Assessment of network module identification across complex diseases

Sarvenaz Choobdar^#^{1

2}, Mehmet E Ahsen^#³, Jake Crawford^#⁴, Mattia Tomasoni^{1

2}, Tao Fang⁵, David Lamparter^{1

2

6}, Junyuan Lin⁷, Benjamin Hescott⁸, Xiaozhe Hu⁷, Johnathan Mercer^{9

10}, Ted Natoli¹¹, Rajiv Narayan¹¹; DREAM Module Identification Challenge Consortium; Aravind Subramanian¹¹, Jitao D Zhang⁵, Gustavo Stolovitzky^{3

12}, Zoltán Kutalik^{2

13}, Kasper Lage^{9

10

14}, Donna K Slonim^{4

15}, Julio Saez-Rodriguez^{16

17}, Lenore J Cowen^{4

7}, Sven Bergmann^{18

19

20}, Daniel Marbach^{21

22

23}

Collaborators, Affiliations

Collaborators

DREAM Module Identification Challenge Consortium:
Fabian Aicheler, Nicola Amoroso, Alex Arenas, Karthik Azhagesan, Aaron Baker, Michael Banf, Serafim Batzoglou, Anaïs Baudot, Roberto Bellotti, Sven Bergmann, Keith A Boroevich, Christine Brun, Stanley Cai, Michael Caldera, Alberto Calderone, Gianni Cesareni, Weiqi Chen, Christine Chichester, Sarvenaz Choobdar, Lenore Cowen, Jake Crawford, Hongzhu Cui, Phuong Dao, Manlio De Domenico, Andi Dhroso, Gilles Didier, Mathew Divine, Antonio Del Sol, Tao Fang, Xuyang Feng, Jose C Flores-Canales, Santo Fortunato, Anthony Gitter, Anna Gorska, Yuanfang Guan, Alain Guénoche, Sergio Gómez, Hatem Hamza, András Hartmann, Shan He, Anton Heijs, Julian Heinrich, Benjamin Hescott, Xiaozhe Hu, Ying Hu, Xiaoqing Huang, V Keith Hughitt, Minji Jeon, Lucas Jeub, Nathan T Johnson, Keehyoung Joo, InSuk Joung, Sascha Jung, Susana G Kalko, Piotr J Kamola, Jaewoo Kang, Benjapun Kaveelerdpotjana, Minjun Kim, Yoo-Ah Kim, Oliver Kohlbacher, Dmitry Korkin, Kiryluk Krzysztof, Khalid Kunji, Zoltàn Kutalik, Kasper Lage, David Lamparter, Sean Lang-Brown, Thuc Duy Le, Jooyoung Lee, Sunwon Lee, Juyong Lee, Dong Li, Jiuyong Li, Junyuan Lin, Lin Liu, Antonis Loizou, Zhenhua Luo, Artem Lysenko, Tianle Ma, Raghvendra Mall, Daniel Marbach, Tomasoni Mattia, Mario Medvedovic, Jörg Menche, Johnathan Mercer, Elisa Micarelli, Alfonso Monaco, Felix Müller, Rajiv Narayan, Oleksandr Narykov, Ted Natoli, Thea Norman, Sungjoon Park, Livia Perfetto, Dimitri Perrin, Stefano Pirrò, Teresa M Przytycka, Xiaoning Qian, Karthik Raman, Daniele Ramazzotti, Emilie Ramsahai, Balaraman Ravindran, Philip Rennert, Julio Saez-Rodriguez, Charlotta Schärfe, Roded Sharan, Ning Shi, Wonho Shin, Hai Shu, Himanshu Sinha, Donna K Slonim, Lionel Spinelli, Suhas Srinivasan, Aravind Subramanian, Christine Suver, Damian Szklarczyk, Sabina Tangaro, Suresh Thiagarajan, Laurent Tichit, Thorsten Tiede, Beethika Tripathi, Aviad Tsherniak, Tatsuhiko Tsunoda, Dénes Türei, Ehsan Ullah, Golnaz Vahedi, Alberto Valdeolivas, Jayaswal Vivek, Christian von Mering, Andra Waagmeester, Bo Wang, Yijie Wang, Barbara A Weir, Shana White, Sebastian Winkler, Ke Xu, Taosheng Xu, Chunhua Yan, Liuqing Yang, Kaixian Yu, Xiangtian Yu, Gaia Zaffaroni, Mikhail Zaslavskiy, Tao Zeng, Jitao D Zhang, Lu Zhang, Weijia Zhang, Lixia Zhang, Xinyu Zhang, Junpeng Zhang, Xin Zhou, Jiarui Zhou, Hongtu Zhu, Junjie Zhu, Guido Zuccon

Affiliations

¹ Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.
² Swiss Institute of Bioinformatics, Lausanne, Switzerland.
³ Icahn Institute for Genomics and Multiscale Biology and Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
⁴ Department of Computer Science, Tufts University, Medford, MA, USA.
⁵ Roche Pharma Research and Early Development, Pharmaceutical Sciences, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Basel, Switzerland.
⁶ Verge Genomics, San Francisco, CA, USA.
⁷ Department of Mathematics, Tufts University, Medford, MA, USA.
⁸ College of Computer and Information Science, Northeastern University, Boston, MA, USA.
⁹ Department of Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
¹⁰ Stanley Center at the Broad Institute of MIT and Harvard, Cambridge, MA, USA.
¹¹ Broad Institute of MIT and Harvard, Cambridge, MA, USA.
¹² IBM T.J. Watson Research Center, Yorktown Heights, NY, USA.
¹³ University Institute of Primary Care and Public Health, University of Lausanne, Lausanne, Switzerland.
¹⁴ Institute for Biological Psychiatry, Mental Health Center Sct. Hans, University of Copenhagen, Roskilde, Denmark.
¹⁵ Department of Immunology, Tufts University School of Medicine, Boston, MA, USA.
¹⁶ Institute for Computational Biomedicine, Faculty of Medicine, Heidelberg University, Bioquant, Heidelberg, Germany.
¹⁷ RWTH Aachen University, Faculty of Medicine, Joint Research Center for Computational Biomedicine, Aachen, Germany.
¹⁸ Department of Computational Biology, University of Lausanne, Lausanne, Switzerland. sven.bergmann@unil.ch.
¹⁹ Swiss Institute of Bioinformatics, Lausanne, Switzerland. sven.bergmann@unil.ch.
²⁰ Department of Integrative Biomedical Sciences, University of Cape Town, Cape Town, South Africa. sven.bergmann@unil.ch.
²¹ Department of Computational Biology, University of Lausanne, Lausanne, Switzerland. daniel.marbach.dm1@roche.com.
²² Swiss Institute of Bioinformatics, Lausanne, Switzerland. daniel.marbach.dm1@roche.com.
²³ Roche Pharma Research and Early Development, Pharmaceutical Sciences, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Basel, Switzerland. daniel.marbach.dm1@roche.com.

^# Contributed equally.

PMID: 31471613
PMCID: PMC6719725
DOI: 10.1038/s41592-019-0509-5

Assessment of network module identification across complex diseases

Sarvenaz Choobdar et al. Nat Methods. 2019 Sep.

. 2019 Sep;16(9):843-852.

doi: 10.1038/s41592-019-0509-5. Epub 2019 Aug 30.

Authors

Collaborators

DREAM Module Identification Challenge Consortium:
Fabian Aicheler, Nicola Amoroso, Alex Arenas, Karthik Azhagesan, Aaron Baker, Michael Banf, Serafim Batzoglou, Anaïs Baudot, Roberto Bellotti, Sven Bergmann, Keith A Boroevich, Christine Brun, Stanley Cai, Michael Caldera, Alberto Calderone, Gianni Cesareni, Weiqi Chen, Christine Chichester, Sarvenaz Choobdar, Lenore Cowen, Jake Crawford, Hongzhu Cui, Phuong Dao, Manlio De Domenico, Andi Dhroso, Gilles Didier, Mathew Divine, Antonio Del Sol, Tao Fang, Xuyang Feng, Jose C Flores-Canales, Santo Fortunato, Anthony Gitter, Anna Gorska, Yuanfang Guan, Alain Guénoche, Sergio Gómez, Hatem Hamza, András Hartmann, Shan He, Anton Heijs, Julian Heinrich, Benjamin Hescott, Xiaozhe Hu, Ying Hu, Xiaoqing Huang, V Keith Hughitt, Minji Jeon, Lucas Jeub, Nathan T Johnson, Keehyoung Joo, InSuk Joung, Sascha Jung, Susana G Kalko, Piotr J Kamola, Jaewoo Kang, Benjapun Kaveelerdpotjana, Minjun Kim, Yoo-Ah Kim, Oliver Kohlbacher, Dmitry Korkin, Kiryluk Krzysztof, Khalid Kunji, Zoltàn Kutalik, Kasper Lage, David Lamparter, Sean Lang-Brown, Thuc Duy Le, Jooyoung Lee, Sunwon Lee, Juyong Lee, Dong Li, Jiuyong Li, Junyuan Lin, Lin Liu, Antonis Loizou, Zhenhua Luo, Artem Lysenko, Tianle Ma, Raghvendra Mall, Daniel Marbach, Tomasoni Mattia, Mario Medvedovic, Jörg Menche, Johnathan Mercer, Elisa Micarelli, Alfonso Monaco, Felix Müller, Rajiv Narayan, Oleksandr Narykov, Ted Natoli, Thea Norman, Sungjoon Park, Livia Perfetto, Dimitri Perrin, Stefano Pirrò, Teresa M Przytycka, Xiaoning Qian, Karthik Raman, Daniele Ramazzotti, Emilie Ramsahai, Balaraman Ravindran, Philip Rennert, Julio Saez-Rodriguez, Charlotta Schärfe, Roded Sharan, Ning Shi, Wonho Shin, Hai Shu, Himanshu Sinha, Donna K Slonim, Lionel Spinelli, Suhas Srinivasan, Aravind Subramanian, Christine Suver, Damian Szklarczyk, Sabina Tangaro, Suresh Thiagarajan, Laurent Tichit, Thorsten Tiede, Beethika Tripathi, Aviad Tsherniak, Tatsuhiko Tsunoda, Dénes Türei, Ehsan Ullah, Golnaz Vahedi, Alberto Valdeolivas, Jayaswal Vivek, Christian von Mering, Andra Waagmeester, Bo Wang, Yijie Wang, Barbara A Weir, Shana White, Sebastian Winkler, Ke Xu, Taosheng Xu, Chunhua Yan, Liuqing Yang, Kaixian Yu, Xiangtian Yu, Gaia Zaffaroni, Mikhail Zaslavskiy, Tao Zeng, Jitao D Zhang, Lu Zhang, Weijia Zhang, Lixia Zhang, Xinyu Zhang, Junpeng Zhang, Xin Zhou, Jiarui Zhou, Hongtu Zhu, Junjie Zhu, Guido Zuccon

Affiliations

¹ Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.
² Swiss Institute of Bioinformatics, Lausanne, Switzerland.
³ Icahn Institute for Genomics and Multiscale Biology and Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
⁴ Department of Computer Science, Tufts University, Medford, MA, USA.
⁵ Roche Pharma Research and Early Development, Pharmaceutical Sciences, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Basel, Switzerland.
⁶ Verge Genomics, San Francisco, CA, USA.
⁷ Department of Mathematics, Tufts University, Medford, MA, USA.
⁸ College of Computer and Information Science, Northeastern University, Boston, MA, USA.
⁹ Department of Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
¹⁰ Stanley Center at the Broad Institute of MIT and Harvard, Cambridge, MA, USA.
¹¹ Broad Institute of MIT and Harvard, Cambridge, MA, USA.
¹² IBM T.J. Watson Research Center, Yorktown Heights, NY, USA.
¹³ University Institute of Primary Care and Public Health, University of Lausanne, Lausanne, Switzerland.
¹⁴ Institute for Biological Psychiatry, Mental Health Center Sct. Hans, University of Copenhagen, Roskilde, Denmark.
¹⁵ Department of Immunology, Tufts University School of Medicine, Boston, MA, USA.
¹⁶ Institute for Computational Biomedicine, Faculty of Medicine, Heidelberg University, Bioquant, Heidelberg, Germany.
¹⁷ RWTH Aachen University, Faculty of Medicine, Joint Research Center for Computational Biomedicine, Aachen, Germany.
¹⁸ Department of Computational Biology, University of Lausanne, Lausanne, Switzerland. sven.bergmann@unil.ch.
¹⁹ Swiss Institute of Bioinformatics, Lausanne, Switzerland. sven.bergmann@unil.ch.
²⁰ Department of Integrative Biomedical Sciences, University of Cape Town, Cape Town, South Africa. sven.bergmann@unil.ch.
²¹ Department of Computational Biology, University of Lausanne, Lausanne, Switzerland. daniel.marbach.dm1@roche.com.
²² Swiss Institute of Bioinformatics, Lausanne, Switzerland. daniel.marbach.dm1@roche.com.
²³ Roche Pharma Research and Early Development, Pharmaceutical Sciences, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Basel, Switzerland. daniel.marbach.dm1@roche.com.

^# Contributed equally.

PMID: 31471613
PMCID: PMC6719725
DOI: 10.1038/s41592-019-0509-5

Abstract

Many bioinformatics methods have been proposed for reducing the complexity of large gene or protein networks into relevant subnetworks or modules. Yet, how such methods compare to each other in terms of their ability to identify disease-relevant modules in different types of network remains poorly understood. We launched the 'Disease Module Identification DREAM Challenge', an open competition to comprehensively assess module identification methods across diverse protein-protein interaction, signaling, gene co-expression, homology and cancer-gene networks. Predicted network modules were tested for association with complex traits and diseases using a unique collection of 180 genome-wide association studies. Our robust assessment of 75 module identification methods reveals top-performing algorithms, which recover complementary trait-associated modules. We find that most of these modules correspond to core disease-relevant pathways, which often comprise therapeutic targets. This community challenge establishes biologically interpretable benchmarks, tools and guidelines for molecular network analysis to study human disease biology.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

**Fig. 1. The Disease Module Identification DREAM Challenge.**
a, Network types included in the challenge. Throughout the paper, boxplot center lines show the median, box limits show upper and lower quartiles, whiskers show 1.5× interquartile range and points show outliers. b, Outline of the challenge. c, Outline of the scoring.

**Fig. 2. Assessment of module identification methods.**
a, Main types of module identification approach used in the challenge. b, Final scores of the 42 module identification methods applied in Sub-challenge 1 for each of the six networks, as well as the overall score summarizing performance across networks (evaluated using the holdout GWAS set at 5% FDR; method IDs are defined in Supplementary Table 2). Ranks are indicated for the top ten methods. The last row shows the mean performance of 17 random modularizations of the networks (error bars show the standard deviation). c, Robustness of the overall ranking was evaluated by subsampling the GWAS set used for evaluation 1,000 times. For each method, the resulting distribution of ranks is shown as a boxplot. d, Number of trait-associated modules per network. Boxplots show the number of trait-associated modules across the 42 methods, normalized by the size of the respective network.

**Fig. 3. Complementarity of module predictions from different methods and networks.**
a, Similarity of module predictions from different methods (color) and networks (shape). The closer two points are in the plot, the more similar the corresponding module predictions (multidimensional scaling, see Methods). The top two methods are highlighted for each network. b, Total number of predicted modules versus average module size for each method (same color scheme as in a). The top five methods (numbered) produced modular decompositions of varying granularity. c, Challenge score (number of trait-associated modules) versus modularity is shown for each of the 42 methods (same color scheme as in a). Modularity is a topological quality metric for modules based on the fraction of within-module edges. d, Final scores of multi-network module identification methods in Sub-challenge 2 (evaluated using the holdout GWAS set at 5% FDR). For comparison, the overall best-performing method from Sub-challenge 1 is also shown (method K1, purple). Teams used different combinations of the six challenge networks for their multi-network predictions (shown on the left). The difference between the top single-network module predictions and the top multi-network module predictions is not significant when subsampling the GWASs (Bayes factor < 3, Supplementary Fig. 5). The last row shows the mean performance of 17 random modularizations of the networks (error bars show standard deviation).

**Fig. 4. Overlap between modules associated with different traits and diseases.**
a, Average number of trait-associated modules identified by challenge methods for each trait in Sub-challenge 1. For traits where multiple GWASs were available, results for the best-powered study are shown. HDL, high-density lipoprotein; LDL, low-density lipoprotein. b, Histograms showing the number of distinct traits per trait-associated module (brown) and gene (gray). c, Trait network showing similarity between GWAS traits based on overlap of associated modules (force-directed graph layout). Node size corresponds to the number of genes in trait-associated modules and edge width corresponds to the degree of overlap (Jaccard index, only edges for which the overlap is significant are shown (Bonferroni-corrected hypergeometric P < 0.05, see Methods)). Traits without any edges are not shown.

**Fig. 5. Support for trait-module genes in diverse datasets.**
a, Example module from the consensus analysis in the STRING protein–protein interaction network (force-directed graph layout). The module is associated to height (n = 25 genes, FDR-corrected Pascal P = 0.005, see Methods). Color indicates Pascal GWAS gene scores (Methods). The module includes genes that are genome-wide significant (magenta and pink) as well as genes that do not reach the genome-wide significance threshold, but are predicted to be involved in height due to their module membership (blue and gray). b, Member genes of the height-associated module are supported by independent datasets: 24% of module genes are implicated in monogenic skeletal growth disorders (red squares, enrichment P = 7.5 × 10⁻⁴ (one-sided Fisher’s exact test)) and 28% of module genes have coding variants associated to height in an ExomeChip study published after the challenge (black diamonds, enrichment P = 1.9 × 10⁻⁶). The form of this module follows its function: two submodules comprise proteins involved in collagen fibril (yellow) and elastic fiber formation (green), while the proteins that link these submodules (orange) indeed have the biological function of crosslinking collagen fibril and elastic fibers.

**Fig. 6. Example trait modules comprising therapeutically relevant pathways.**
a–c, The modules are from the STRING protein–protein interaction networks and were generated using the consensus method. Node colors correspond to Pascal gene scores in the respective GWAS (Methods). For the two inflammatory disorders (a,b), red squares indicate genes causing monogenic immunodeficiency disorders (enrichment P values of 4.1 × 10⁻⁸ and 1.2 × 10⁻⁶, respectively (one-sided Fisher’s exact test)). a, Module associated with rheumatoid arthritis (n = 25 genes, FDR-corrected Pascal P = 0.04) that is involved in T cell activation. A costimulatory pathway is highlighted green, T cell response is regulated by activating (*CD28*) and inhibitory (*CTLA4*) surface receptors, which bind B7 family ligands (*CD80* and *CD86*) expressed on the surface of activated antigen-presenting cells. The therapeutic agent CTLA4-Ig binds and blocks B7 ligands, thus inhibiting T cell response. b, Cytokine signaling module associated with inflammatory bowel disease (n = 42 genes, FDR-corrected Pascal P = 0.0006). The module includes the four known Janus kinases (*JAK1-3* and *TYK2*, highlighted green), which are engaged by cytokine receptors to mediate activation of specific transcription factors (*STATs*). Inhibitors of JAK–STAT signaling are being tested in clinical trials for both ulcerative colitis and Crohn’s disease. c, Module associated with myocardial infarction (n = 36 genes, FDR-corrected Pascal P = 0.0001) comprising two main components of the NO/cGMP signaling pathway (endothelial nitric oxide synthases (*NOS1-3*) and soluble guanylate cyclases (*GUCY1A2*, *GUCY1A3* and *GUCY1B3*), highlighted green), a key therapeutic target for cardiovascular disease.

See this image and copyright information in PMC

References

1. Schadt EE. Molecular networks as sensors and drivers of common human diseases. Nature. 2009;461:218–223. doi: 10.1038/nature08454. - DOI - PubMed
1. Marbach D, et al. Tissue-specific regulatory circuits reveal variable modular perturbations across complex diseases. Nat. Methods. 2016;13:366–370. doi: 10.1038/nmeth.3799. - DOI - PMC - PubMed
1. Bonder MJ, et al. Disease variants alter transcription factor levels and methylation of their binding sites. Nat. Genet. 2017;49:131–138. doi: 10.1038/ng.3721. - DOI - PubMed
1. Califano A, Butte AJ, Friend S, Ideker T, Schadt E. Leveraging models of cell regulation and GWAS data in integrative network-based association studies. Nat. Genet. 2012;44:841–847. doi: 10.1038/ng.2355. - DOI - PMC - PubMed
1. Hartwell LH, Hopfield JJ, Leibler S, Murray AW. From molecular to modular cell biology. Nature. 1999;402:C47–C52. doi: 10.1038/35011540. - DOI - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- H1 Connect - Access expert opinions and insights on biomedical research.
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Assessment of network module identification across complex diseases

Collaborators

Affiliations

Assessment of network module identification across complex diseases

Authors

Collaborators

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources