Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Sep 17;15(1):304.
doi: 10.1186/1471-2105-15-304.

Predicting disease associations via biological network analysis

Affiliations

Predicting disease associations via biological network analysis

Kai Sun et al. BMC Bioinformatics. .

Abstract

Background: Understanding the relationship between diseases based on the underlying biological mechanisms is one of the greatest challenges in modern biology and medicine. Exploring disease-disease associations by using system-level biological data is expected to improve our current knowledge of disease relationships, which may lead to further improvements in disease diagnosis, prognosis and treatment.

Results: We took advantage of diverse biological data including disease-gene associations and a large-scale molecular network to gain novel insights into disease relationships. We analysed and compared four publicly available disease-gene association datasets, then applied three disease similarity measures, namely annotation-based measure, function-based measure and topology-based measure, to estimate the similarity scores between diseases. We systematically evaluated disease associations obtained by these measures against a statistical measure of comorbidity which was derived from a large number of medical patient records. Our results show that the correlation between our similarity measures and comorbidity scores is substantially higher than expected at random, confirming that our similarity measures are able to recover comorbidity associations. We also demonstrated that our predicted disease associations correlated with disease associations generated from genome-wide association studies significantly higher than expected at random. Furthermore, we evaluated our predicted disease associations via mining the literature on PubMed, and presented case studies to demonstrate how these novel disease associations can be used to enhance our current knowledge of disease relationships.

Conclusions: We present three similarity measures for predicting disease associations. The strong correlation between our predictions and known disease associations demonstrates the ability of our measures to provide novel insights into disease relationships.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The overlap of datasets. The overlap of diseases (denoted by ‘D’), genes (denoted by ‘G’) and their associations (denoted by ‘A’) between the four disease-gene association datasets we analysed. Boxes on the left list the sizes of the datasets. The size of the intersection of the datasets is marked in bold.
Figure 2
Figure 2
Evaluation against comorbidity. ROC curves obtained by evaluating the three disease similarity measures against comorbidity. Due to space limitations, only ROC curves of FunDO are shown here (see Additional file 1: Figure S5 for ROC curves of other datasets). The ϕ-correlation threshold was set to 0.06 (the same threshold was used in [47]). We evaluated diseases annotated with at least 1, 3, 5, 7, 10, 15 genes, shown by curves with different colours in each plot.

References

    1. Gatza ML, Lucas JE, Barry WT, Kim JW, Wang Q, Crawford MD, Datto MB, Kelley M, Mathey-Prevot B, Potti A, Nevins JR. A pathway-based classification of human breast cancer. Proc Nat Acad Sci. 2010;107(15):6994–6999. doi: 10.1073/pnas.0912708107. - DOI - PMC - PubMed
    1. Loscalzo J, Kohane I, Barabasi AL. Mol Syst Biol. 2007. Human disease classification in the postgenomic era: a complex systems approach to human pathobiology. - PMC - PubMed
    1. Hamosh A, Scott AF, Amberger JS, Bocchini CA, McKusick VA. Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res. 2005;33(suppl 1):D514–D517. - PMC - PubMed
    1. van Driel MA, Bruggeman J, Vriend G, Brunner HG, Leunissen JA. A text-mining analysis of the human phenome. Eur J Human Genet. 2006;14(5):535–542. doi: 10.1038/sj.ejhg.5201585. - DOI - PubMed
    1. Lage K, Karlberg EO, Størling ZM, Ólason PI, Pedersen AG, Rigina O, Hinsby AM, Tümer Z, Pociot F, Tommerup N, Moreau Y, Brunak S. A human phenome-interactome network of protein complexes implicated in genetic disorders. Nat Biotechnol. 2007;25(3):309–316. doi: 10.1038/nbt1295. - DOI - PubMed

Publication types

LinkOut - more resources