Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Sep 1;12(1):14914.
doi: 10.1038/s41598-022-19244-y.

Visualizing novel connections and genetic similarities across diseases using a network-medicine based approach

Affiliations

Visualizing novel connections and genetic similarities across diseases using a network-medicine based approach

Brian Ferolito et al. Sci Rep. .

Abstract

Understanding the genetic relationships between human disorders could lead to better treatment and prevention strategies, especially for individuals with multiple comorbidities. A common resource for studying genetic-disease relationships is the GWAS Catalog, a large and well curated repository of SNP-trait associations from various studies and populations. Some of these populations are contained within mega-biobanks such as the Million Veteran Program (MVP), which has enabled the genetic classification of several diseases in a large well-characterized and heterogeneous population. Here we aim to provide a network of the genetic relationships among diseases and to demonstrate the utility of quantifying the extent to which a given resource such as MVP has contributed to the discovery of such relations. We use a network-based approach to evaluate shared variants among thousands of traits in the GWAS Catalog repository. Our results indicate many more novel disease relationships that did not exist in early studies and demonstrate that the network can reveal clusters of diseases mechanistically related. Finally, we show novel disease connections that emerge when MVP data is included, highlighting methodology that can be used to indicate the contributions of a given biobank.

PubMed Disclaimer

Conflict of interest statement

A.L.B is co-scientific founder of Scipher Medicine, Inc., which applies network medicine strategies to biomarker development and personalized drug selection and Foodome, Inc. that apply data science to health, and DataPolis, that explores the implications of human mobility. All other authors declare no competing interests.

Figures

Figure 1
Figure 1
Phenotypic network assembled from GWAS catalog. Network in which nodes are traits that are connected with others to which they share genetic variants in the GWAS Catalog. The network communities detected are highlighted and labeled (A)–(H). Node colors represent disease categories and node size reflects connectivity in the network. The top high degree nodes are labeled 1–10 and their respective names are shown in Table 1. Only significant edges are shown (FDR < 0.05), the edge width indicates the overlap of variants between a pair of phenotypes (Jaccard Index), and lighter shade edges connect nodes in different communities.
Figure 2
Figure 2
Degree distribution of phenotypic network. Log-binned degree distribution of the phenotypic network using a log–log scale. A power-law distribution which is a feature of scale-free networks. K represents the average degree of the bin where bin has size 2n-1. pK is obtained from the number of nodes found in the bin divided by the width of the bin.
Figure 3
Figure 3
Degree distribution by trait category. Trait categories are defined by the EFO ontology system parent terms.
Figure 4
Figure 4
Network community ‘E’ characterized by immune-related disorders. Focused subgraph of community E from the Phenotypic Network. The most connected diseases in the community are Crohn’s disease (1), rheumatoid arthritis (2), ulcerative colitis (3), psoriasis (4), and lupus (5).
Figure 5
Figure 5
Network community ‘A’ characterized by vascular disorders. After removal of traits unrelated to diseases from the visualization, the most connected nodes in the community were coronary heart disease (1), stroke (2), coronary artery disease (3), metabolic syndrome (4), cardiovascular disease (5), hypertriglyceridemia (6), gout (7), chronic kidney disease (8), diabetes mellitus (9), and atrial fibrillation (10).
Figure 6
Figure 6
Disease connections that emerge from MVP data. Subgraph containing 196 traits and 297 edges that were formed only by the inclusion of genetic variant associations from the Million Veteran Program.
Figure 7
Figure 7
Community correlation between notes of different ancestry networks. Histogram of Pearson product-moment correlation coefficients for shared community members between the same trait in different ancestry networks. The count represents number of traits that have the given correlation coefficient.

References

    1. Dugoff EH, Canudas-Romo V, Buttorff C, Leff B, Anderson GF. Multiple chronic conditions and life expectancy: A life table analysis. Med. Care. 2014;52:688–694. doi: 10.1097/MLR.0000000000000166. - DOI - PubMed
    1. Cortaredona S, Ventelou B. The extra cost of comorbidity: Multiple illnesses and the economic burden of non-communicable diseases. BMC Med. 2017;15:1–11. doi: 10.1186/s12916-017-0978-2. - DOI - PMC - PubMed
    1. Gibson G. Decanalization and the origin of complex disease. Nat. Rev. Genet. 2009;10:134–140. doi: 10.1038/nrg2502. - DOI - PubMed
    1. Goh KI, et al. The human disease network. Proc. Natl. Acad. Sci. U.S.A. 2007 doi: 10.1073/pnas.0701361104. - DOI - PMC - PubMed
    1. Hidalgo CA, Blumm N, Barabasi AL, Christakis NA. A dynamic network approach for the study of human phenotypes. PLoS Comput. Biol. 2009;5:e1000353. doi: 10.1371/journal.pcbi.1000353. - DOI - PMC - PubMed

Publication types

Grants and funding