Rare disease knowledge enrichment through a data-driven approach
- PMID: 30764825
- PMCID: PMC6376651
- DOI: 10.1186/s12911-019-0752-9
Rare disease knowledge enrichment through a data-driven approach
Abstract
Background: Existing resources to assist the diagnosis of rare diseases are usually curated from the literature that can be limited for clinical use. It often takes substantial effort before the suspicion of a rare disease is even raised to utilize those resources. The primary goal of this study was to apply a data-driven approach to enrich existing rare disease resources by mining phenotype-disease associations from electronic medical record (EMR).
Methods: We first applied association rule mining algorithms on EMR to extract significant phenotype-disease associations and enriched existing rare disease resources (Human Phenotype Ontology and Orphanet (HPO-Orphanet)). We generated phenotype-disease bipartite graphs for HPO-Orphanet, EMR, and enriched knowledge base HPO-Orphanet + and conducted a case study on Hodgkin lymphoma to compare performance on differential diagnosis among these three graphs.
Results: We used disease-disease similarity generated by the eRAM, an existing rare disease encyclopedia, as a gold standard to compare the three graphs with sensitivity and specificity as (0.17, 0.36, 0.46) and (0.52, 0.47, 0.51) for three graphs respectively. We also compared the top 15 diseases generated by the HPO-Orphanet + graph with eRAM and another clinical diagnostic tool, the Phenomizer.
Conclusions: Per our evaluation results, our approach was able to enrich existing rare disease knowledge resources with phenotype-disease associations from EMR and thus support rare disease differential diagnosis.
Keywords: Data-driven approach; Differential diagnosis; Knowledge enrichment; Rare disease.
Conflict of interest statement
Ethics approval and consent to participate
This study used existing records to conduct a retrospective study. The study and a waiver of informed consent were approved by Mayo Clinic Institutional Review Board in accordance with 45 CFR 46.116 (Approval #17–003030).
Consent for publication
The author(s) declare(s) that the manuscript does not contain any individual person’s data. So this paper requires no consent to publish.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figures
References
-
- Boat TF, Field MJ. Rare diseases and orphan products: accelerating research and development. Washington, D.C.: National Academies Press; 2011. - PubMed
-
- Survey of the delay in diagnosis for 8 rare diseases in Europe. Available at: https://wwweurordisorg/sites/default/files/publicationsFact_Sheet_Eurordiscare2pdf.
-
- Rare Diseases Difficult to Diagnose, Cures Hard to Come By. Available at: https://news.aamc.org/research/article/rare-diseases-difficult-diagnose-....
-
- Phenotype Definition. Available at: http://medical-dictionary.thefreedictionary.com/phenotype.
-
- Hodgkin Lymphoma Differential Diagnosis. Available at: https://emedicine.medscape.com/article/201886-differential.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
