Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Feb 11;9(2):e86693.
doi: 10.1371/journal.pone.0086693. eCollection 2014.

DiME: a scalable disease module identification algorithm with application to glioma progression

Affiliations

DiME: a scalable disease module identification algorithm with application to glioma progression

Yunpeng Liu et al. PLoS One. .

Abstract

Disease module is a group of molecular components that interact intensively in the disease specific biological network. Since the connectivity and activity of disease modules may shed light on the molecular mechanisms of pathogenesis and disease progression, their identification becomes one of the most important challenges in network medicine, an emerging paradigm to study complex human disease. This paper proposes a novel algorithm, DiME (Disease Module Extraction), to identify putative disease modules from biological networks. We have developed novel heuristics to optimise Community Extraction, a module criterion originally proposed for social network analysis, to extract topological core modules from biological networks as putative disease modules. In addition, we have incorporated a statistical significance measure, B-score, to evaluate the quality of extracted modules. As an application to complex diseases, we have employed DiME to investigate the molecular mechanisms that underpin the progression of glioma, the most common type of brain tumour. We have built low (grade II)--and high (GBM)--grade glioma co-expression networks from three independent datasets and then applied DiME to extract potential disease modules from both networks for comparison. Examination of the interconnectivity of the identified modules have revealed changes in topology and module activity (expression) between low- and high- grade tumours, which are characteristic of the major shifts in the constitution and physiology of tumour cells during glioma progression. Our results suggest that transcription factors E2F4, AR and ETS1 are potential key regulators in tumour progression. Our DiME compiled software, R/C++ source code, sample data and a tutorial are available at http://www.cs.bham.ac.uk/~szh/DiME.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. General work flow for the DiME framework.
Figure 2
Figure 2. Correlation of scores with B-scores.
All modules with size larger than 2 and B-score formula image are included. A few modules whose B-score is 0 (indicating scores exceeding the lower limit of detection in the B-score algorithm) were excluded. Fitted lines of formula image versus formula image are shown. The fitted Pearson's correlation formula image values are 0.57 (grade II glioma, left panel) and 0.65 (GBM, right panel) respectively, with both correlation formula image values smaller than 0.0001 in Pearson's correlation tests.
Figure 3
Figure 3. DiME is robust to edge noise in co-epxression networks.
Shown in the plots are results for the grade II glioma networks (left panel) and GBM networks (right panel). The horizontal axes display the technique used, and vertical axes show average conservation scores. Only modules with size larger than 5 are taken into consideration. Asterisks denote statistical significance in Student's formula image-tests when comparing means with MCODE modules: “***” - formula image.
Figure 4
Figure 4. Visualisation of grade II glioma modules with B-score less than and their inter-module connectivity.
Nodes represent extracted modules, node size represents module size and node color represents (log-transformed) fold-change in average module gene expression level compared with normal patient samples (Red - increase in average expression, green - decrease in average expression, lavender - no change in average expression). Edge widths are proportional to connectivity (i.e., number of co-expression gene pairs) between module pairs.
Figure 5
Figure 5. Visualisation of GBM modules with B-score less than and their inter-module connectivity.
Nodes represent extracted modules, node size represents module size and node color represents (log-transformed) fold-change in average module gene expression level compared with normal patient samples (Red - increase in average expression, green - decrease in average expression, lavender - no change in average expression). Edge widths are proportional to connectivity (i.e., number of co-expression gene pairs) between module pairs.
Figure 6
Figure 6. Comparison of module reproducibility among different algorithms.
Shown are box plots of average reproducibility (Jaccard index) for each technique used. Asterisks denote statistical significance in Student's formula image-tests when comparing means with MCODE modules: “*” - formula image.
Figure 7
Figure 7. Heat map showing expression landscape of all genes in the 7 conserved common modules across grade II glioma and GBM samples.
Rows correspond to genes grouped by modules and columns correspond to samples grouped by tumour grade.

References

    1. Barabási AL, Oltvai ZN (2004) Network biology: understanding the cell's functional organization. Nature Reviews Genetics 5: 101–113. - PubMed
    1. Barabási AL, Gulbahce N, Loscalzo J (2011) Network medicine: a network-based approach to human disease. Nature Reviews Genetics 12: 56–68. - PMC - PubMed
    1. Newman ME (2006) Modularity and community structure in networks. Proceedings of the National Academy of Sciences 103: 8577–8582. - PMC - PubMed
    1. Cerami E, Demir E, Schultz N, Taylor BS, Sander C (2010) Automated network analysis identifies core pathways in glioblastoma. PloS one 5: e8918. - PMC - PubMed
    1. Ruan J, Dean AK, Zhang W (2010) A general co-expression network-based approach to gene expression analysis: comparison and applications. BMC systems biology 4: 8. - PMC - PubMed

Publication types