. 2011;6(7):e22670.

doi: 10.1371/journal.pone.0022670. Epub 2011 Jul 29.

Exploring and exploiting disease interactions from multi-relational gene and phenotype networks

Darcy A Davis¹, Nitesh V Chawla

Affiliations

Affiliation

¹ Interdisciplinary Center for Network Science and Applications, Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, Indiana, United States of America.

PMID: 21829475
PMCID: PMC3146471
DOI: 10.1371/journal.pone.0022670

Exploring and exploiting disease interactions from multi-relational gene and phenotype networks

Darcy A Davis et al. PLoS One. 2011.

. 2011;6(7):e22670.

doi: 10.1371/journal.pone.0022670. Epub 2011 Jul 29.

Authors

Darcy A Davis¹, Nitesh V Chawla

Affiliation

¹ Interdisciplinary Center for Network Science and Applications, Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, Indiana, United States of America.

PMID: 21829475
PMCID: PMC3146471
DOI: 10.1371/journal.pone.0022670

Abstract

The availability of electronic health care records is unlocking the potential for novel studies on understanding and modeling disease co-morbidities based on both phenotypic and genetic data. Moreover, the insurgence of increasingly reliable phenotypic data can aid further studies on investigating the potential genetic links among diseases. The goal is to create a feedback loop where computational tools guide and facilitate research, leading to improved biological knowledge and clinical standards, which in turn should generate better data. We build and analyze disease interaction networks based on data collected from previous genetic association studies and patient medical histories, spanning over 12 years, acquired from a regional hospital. By exploring both individual and combined interactions among these two levels of disease data, we provide novel insight into the interplay between genetics and clinical realities. Our results show a marked difference between the well defined structure of genetic relationships and the chaotic co-morbidity network, but also highlight clear interdependencies. We demonstrate the power of these dependencies by proposing a novel multi-relational link prediction method, showing that disease co-morbidity can enhance our currently limited knowledge of genetic association. Furthermore, our methods for integrated networks of diverse data are widely applicable and can provide novel advances for many problems in systems biology and personalized medicine.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

**Figure 1. Global network properties.**
(A) Degree distributions and (B) clustering spectrums of the phenotypic (PDN) and genetic (GDN) disease networks. The PDN has higher average degree and clustering coefficient due to very high edge density. Interestingly, the degree distribution of the GDN generally decreasing while the PDN is more uniform, indicating that many diseases are co-morbid with a large number of other diseases, often with few or no underlying shared genes.

**Figure 2. The Phenotypic and Genetic Disease Networks.**
(A) The phenotypic disease network (PDN) is constructed based on clinical history of 700,00 patients. Each node represents a unique disease, and two nodes are connected if the diseases co-morbid significantly more than randomly expected according to population prevalence. (B) The genetic disease network (GDN) is constructed on the same disease nodes, but edges instead indicate that the disease pair shares a significant number of gene associations. In both networks, black edges indicate hierarchically related diseases (is-a relationships). For each network, the accompanying table displays the most relevant Disease Ontology codes associated with each cluster. Purity corresponds to the percent of member nodes which are accurately described by the DO term, and completeness indicates the percentage of descendants of the DO term which belong to the cluster. For a detailed definition, see Materials and Methods. It is clear that the PDN and GDN are structurally different. Nonetheless, both networks form some easily defined clusters but also have some dense regions containing diverse DO terms.

**Figure 3. The Multi-Relational Disease Network.**
This network is created by overlaying the phenotypic (PDN) and genetic (GDN) networks, which contain the same disease nodes. Blue edges indicate phenotypic links, red edges are genetic, green edges are both genetic and phenotypic, and black edge are is-a relationships. The two-tone nodes indicate original cluster membership in the GDN (inner circle) and PDN (outer circle). Regions where multiple nodes share the same color pattern correspond to groups of diseases which cluster together in both the PDN and the GDN. These overlaps are common and in some cases quite large, such as the teal-and-green cluster containing the heart diseases. Still, none of the overlaps fully contain a PDN or GDN cluster. The overlapping regions are listed in the accompanying table, along with the most relevant Disease Ontology codes associated with the cluster.

**Figure 4. Genetic vs. phenotypic mutual information.**
Each data point represents a disease pair which is linked in both the PDN and the GDN. The plot illustrates the correlation between the mutual information edges weights in each respective network. There is some upward trend but the effect is far from linear. In aggregate, the values have a Pearson correlation of .473, a weak-to-moderate positive correlation.

**Figure 5. Finding edge probabilities given partial structures.**
This toy example demonstrates how to calculate the probability of a specific edge type closing an open triad pattern, based on the triad counts for the full network. This calculation corresponds the Equation 3. The numbers in this example do not represent the real network. The table of actual edge probabilities for the MRDN can be found in Table S1.

**Figure 6. Link prediction performance.**
(A) Receiver operating curves (ROC) and (B) precision-recall curves for the multi-relational link predictor (MRLP) and three traditional neighborhood-based link prediction methods: common neighbors, Jaccard coefficient, and the Adamic/Adar measure. MRLP is the best method with respect to area under the receiver operating curve (AUROC). The precision-recall curve, which is less biased, shows that MRLP is most accurate with the highest ranked predictions, but is not always optimal for lower prediction thresholds.

**Figure 7. Link predictor performance by individual disease.**
Area under the receiver operating curve (AUROC) comparison of link predictor performance for each unique disease. The experiments were hold-one-out, where all genetic associations of the testing disease were removed. The x axis shows the performance of Adamic/Adar on the phenotypic data only, and the y axis is the performance using the MRLP on the multi-relational network. Each point which falls above the diagonal indicates that multi-relational evidence improved link prediction performance for the corresponding disease.

See this image and copyright information in PMC

Cited by

Exploring the perceptions of patients with chronic respiratory diseases and their insights into pulmonary rehabilitation in Bangladesh.
Habib GMM, Uzzaman N, Rabinovich R, Akhter S, Ali M, Sultana M, Pinnock H; RESPIRE Collaboration. Habib GMM, et al. J Glob Health. 2024 Feb 2;14:04036. doi: 10.7189/jogh.14.04036. J Glob Health. 2024. PMID: 38299780 Free PMC article.
Network-based analysis of comorbidities risk during an infection: SARS and HIV case studies.
Moni MA, Liò P. Moni MA, et al. BMC Bioinformatics. 2014 Oct 24;15(1):333. doi: 10.1186/1471-2105-15-333. BMC Bioinformatics. 2014. PMID: 25344230 Free PMC article.
Network mirroring for drug repositioning.
Park S, Lee DG, Shin H. Park S, et al. BMC Med Inform Decis Mak. 2017 May 18;17(Suppl 1):55. doi: 10.1186/s12911-017-0449-x. BMC Med Inform Decis Mak. 2017. PMID: 28539121 Free PMC article.
Disease association study of Autoimmune and autoinflammatory diseases by integrating multi-modal data and hierarchical ontologies.
Liu A, Su Y, Zhu J, Li YY. Liu A, et al. Front Immunol. 2025 Jun 4;16:1575490. doi: 10.3389/fimmu.2025.1575490. eCollection 2025. Front Immunol. 2025. PMID: 40534874 Free PMC article.
Approaches to Integrating Metabolomics and Multi-Omics Data: A Primer.
Jendoubi T. Jendoubi T. Metabolites. 2021 Mar 21;11(3):184. doi: 10.3390/metabo11030184. Metabolites. 2021. PMID: 33801081 Free PMC article. Review.

See all "Cited by" articles

References

1. Baudot A, Gómez-López G, Valencia A. Translational disease interpretation with molecular networks. Genome Biology. 2009;10:221. - PMC - PubMed
1. Emilsson V, Thorleifsson G, Zhang B, Leonardson A, Zink F, et al. Genetics of gene expression and its effect on disease. Nature. 2008;452:423–428. - PubMed
1. Schadt E. Molecular networks as sensors and drivers of common human diseases. Nature. 2009;461:218–223. - PubMed
1. Goh KI, Cusick ME, Valle D, Childs B, Vidal M, et al. Proceedings of the National Academy of Sciences; 2007. The human disease network. pp. 8685–8690. - PMC - PubMed
1. Hidalgo C, Blumm N, Barabási A, Christakis N, Meyers L. A dynamic network approach for the study of human phenotypes. PLoS Comput Biol. 2009;5:e1000353. - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Exploring and exploiting disease interactions from multi-relational gene and phenotype networks

Affiliation

Exploring and exploiting disease interactions from multi-relational gene and phenotype networks

Authors

Affiliation

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Related information

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical