Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jul 15:9:35.
doi: 10.1186/s12918-015-0184-9.

Finding directionality and gene-disease predictions in disease associations

Affiliations

Finding directionality and gene-disease predictions in disease associations

Manuel Garcia-Albornoz et al. BMC Syst Biol. .

Abstract

Background: Understanding the underlying molecular mechanisms in human diseases is important for diagnosis and treatment of complex conditions and has traditionally been done by establishing associations between disorder-genes and their associated diseases. This kind of network analysis usually includes only the interaction of molecular components and shared genes. The present study offers a network and association analysis under a bioinformatics frame involving the integration of HUGO Gene Nomenclature Committee approved gene symbols, KEGG metabolic pathways and ICD-10-CM codes for the analysis of human diseases based on the level of inclusion and hypergeometric enrichment between genes and metabolic pathways shared by the different human disorders.

Methods: The present study offers the integration of HGNC approved gene symbols, KEGG metabolic pathways andICD-10-CM codes for the analysis of associations based on the level of inclusion and hypergeometricenrichment between genes and metabolic pathways shared by different diseases.

Results: 880 unique ICD-10-CM codes were mapped to the 4315 OMIM phenotypes and 3083 genes with phenotype-causing mutation. From this, a total of 705 ICD-10-CM codes were linked to 1587 genes with phenotype-causing mutations and 801 KEGG pathways creating a tripartite network composed by 15,455 code-gene-pathway interactions. These associations were further used for an inclusion analysis between diseases along with gene-disease predictions based on a hypergeometric enrichment methodology.

Conclusions: The results demonstrate that even though a large number of genes and metabolic pathways are shared between diseases of the same categories, inclusion levels between these genes and pathways are directional and independent of the disease classification. However, the gene-disease-pathway associations can be used for prediction of new gene-disease interactions that will be useful in drug discovery and therapeutic applications.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Disease-disease interactions. a ICD-10-CM code classification. b Bipartite disease-disease network for 4315 OMIM phenotypes classified into 880 unique ICD-10-CM codes. Nodes represent codes and edges represent shared genes. The size of the node denotes the number of edges involved in each code
Fig. 2
Fig. 2
Gene and disease network analysis. a Number of OMIM phenotypes and codes by ICD-10-CM category. b Number of unique disease-genes by ICD-10-CM category and the number of enzyme-producing genes. c 880 unique ICD-10-CM codes are linked to 3083 genes with phenotype-causing mutation creating 4241 disease-gene associations. Of the 880 codes a total of 705 codes and 1587 genes are linked to 801 metabolic pathways creating 15,455 code-gene-pathway interactions. A further analysis revealed a total of 6706 genes being involved in at least one KEGG metabolic pathway, and hereby 5119 genes with no known phenotype-causing mutation could be included in our analysis. These 5119 genes with no known phenotype-causing mutation are linked to 546 different KEGG pathways sharing 479 pathways with genes carrying phenotype-causing mutation and are only linked to 67 additional KEGG metabolic pathways
Fig. 3
Fig. 3
Inclusion analysis by disease category. a For two diseases sharing a certain number of elements (genes or pathways), the inclusion index (τ) will be low for a disease with a high number of total elements compared with the number of shared elements. When the number of shared elements increases compared with the total number of elements of the disease, the index level increases. Therefore, different index values can be calculated for two diseases sharing elements depending on the total number of elements of each disease. Consequently, this index allows obtaining not only the degree of interaction between diseases, but also the directionality of the interaction. A value of τ = 1 indicates that one disease is a subset of another. b Boxplot of calculated level of inclusion between disease-pairs belonging to the same ICD-10-CM category based on shared genes. c Boxplot of calculated level of inclusion between disease-pairs belonging to the same ICD-10-CM category based on shared pathways
Fig. 4
Fig. 4
Inclusion analysis for Neoplasm category. a Boxplot of calculated level of inclusion (τ) based on shared genes for neoplasms as disease X and Y. b Boxplot of calculated level of inclusion based on shared genes for neoplasms as disease X by ICD-10-CM category. c Boxplot of calculated level of inclusion based on shared genes for neoplasms as disease Y by ICD-10-CM category. d Boxplot of calculated level of inclusion based on shared pathways for neoplasms as disease X and Y. e Boxplot of calculated level of inclusion based on shared pathways for neoplasms as disease X by ICD-10-CM category. f Boxplot of calculated level of inclusion based on shared pathways for neoplasms as disease Y by ICD-10-CM category

Similar articles

Cited by

References

    1. Goh KI, Cusick ME, Valle D, Childs B, Vidal M, Barabási AL. The human disease network. Proc Natl Acad Sci. 2007;104:8685–90. doi: 10.1073/pnas.0701361104. - DOI - PMC - PubMed
    1. Barabási AL, Gulbahce N, Loscalzo J. Network medicine: a network-based approach to human disease. Nat Rev Genet. 2011;12:56–68. doi: 10.1038/nrg2918. - DOI - PMC - PubMed
    1. Rozenblatt-Rosen O, Deo RC, Padi M, Adelmant G, Calderwood MA, Rolland T, Grace M, Dricot A, Askenazi M, Tavares M, Pevzner SJ, Abderazzaq F, Byrdsong D, Carvunis AR, Chen AA, Cheng J, Correll M, Duarte M, Fan C, Feltkamp MC, Ficarro SB, Franchi R, Garg BK, Gulbahce N, Hao T, Holthaus AM, James R, Korkhin A, Litovchick L, Mar JC, Pak TR, Rabello S, Rubio R, Shen Y, Singh S, Spangle JM, Tasan M, Wanamaker S, Webber JT, Roecklein-Canfield J, Johannsen E, Barabási AL, Beroukhim R, Kieff E, Cusick ME, Hill DE, Münger K, Marto JA, Quackenbush J, Roth FP, DeCaprio JA, Vidal M. Interpreting cancer genomes using systematic host network perturbations by tumour virus proteins. Nature. 2012;487:491–5. doi: 10.1038/nature11288. - DOI - PMC - PubMed
    1. Sharma A, Gulbahce N, Pevzner SJ, Menche J, Ladenvall C, Folkersen L, Eriksson P, Orho-Melander M, Barabási AL. Network-based analysis of genome wide association data provides novel candidate genes for lipid and lipoprotein traits. Mol Cell Proteomics. 2013;12:3398–408. doi: 10.1074/mcp.M112.024851. - DOI - PMC - PubMed
    1. Ibañez K, Boullosa C, Tabares-Seisdedos R, Baudot A, Valencia A. Molecular evidence for the inverse comorbidity between central nervous system disorders and cancers detected by transcriptomic meta-analyses. PLoS Genet. 2014;10:e1004173. doi: 10.1371/journal.pgen.1004173. - DOI - PMC - PubMed

Publication types

LinkOut - more resources