Deep Phenotyping on Electronic Health Records Facilitates Genetic Diagnosis by Clinical Exomes
- PMID: 29961570
- PMCID: PMC6035281
- DOI: 10.1016/j.ajhg.2018.05.010
Deep Phenotyping on Electronic Health Records Facilitates Genetic Diagnosis by Clinical Exomes
Abstract
Integration of detailed phenotype information with genetic data is well established to facilitate accurate diagnosis of hereditary disorders. As a rich source of phenotype information, electronic health records (EHRs) promise to empower diagnostic variant interpretation. However, how to accurately and efficiently extract phenotypes from heterogeneous EHR narratives remains a challenge. Here, we present EHR-Phenolyzer, a high-throughput EHR framework for extracting and analyzing phenotypes. EHR-Phenolyzer extracts and normalizes Human Phenotype Ontology (HPO) concepts from EHR narratives and then prioritizes genes with causal variants on the basis of the HPO-coded phenotype manifestations. We assessed EHR-Phenolyzer on 28 pediatric individuals with confirmed diagnoses of monogenic diseases and found that the genes with causal variants were ranked among the top 100 genes selected by EHR-Phenolyzer for 16/28 individuals (p < 2.2 × 10-16), supporting the value of phenotype-driven gene prioritization in diagnostic sequence interpretation. To assess the generalizability, we replicated this finding on an independent EHR dataset of ten individuals with a positive diagnosis from a different institution. We then assessed the broader utility by examining two additional EHR datasets, including 31 individuals who were suspected of having a Mendelian disease and underwent different types of genetic testing and 20 individuals with positive diagnoses of specific Mendelian etiologies of chronic kidney disease from exome sequencing. Finally, through several retrospective case studies, we demonstrated how combined analyses of genotype data and deep phenotype data from EHRs can expedite genetic diagnoses. In summary, EHR-Phenolyzer leverages EHR narratives to automate phenotype-driven analysis of clinical exomes or genomes, facilitating the broader implementation of genomic medicine.
Keywords: biomedical informatics; diagnosis; electronic health records; exome; genome; knowledge engineering; natural language processing; next-generation sequencing; phenotyping; precision medicine.
Copyright © 2018 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Figures






Similar articles
-
Whole genome sequencing of one complex pedigree illustrates challenges with genomic medicine.BMC Med Genomics. 2017 Feb 23;10(1):10. doi: 10.1186/s12920-017-0246-5. BMC Med Genomics. 2017. PMID: 28228131 Free PMC article.
-
Genome analysis and knowledge-driven variant interpretation with TGex.BMC Med Genomics. 2019 Dec 30;12(1):200. doi: 10.1186/s12920-019-0647-8. BMC Med Genomics. 2019. PMID: 31888639 Free PMC article.
-
Massively parallel sequencing and targeted exomes in familial kidney disease can diagnose underlying genetic disorders.Kidney Int. 2017 Dec;92(6):1493-1506. doi: 10.1016/j.kint.2017.06.013. Epub 2017 Aug 23. Kidney Int. 2017. PMID: 28844315
-
The use of electronic health records for psychiatric phenotyping and genomics.Am J Med Genet B Neuropsychiatr Genet. 2018 Oct;177(7):601-612. doi: 10.1002/ajmg.b.32548. Epub 2017 May 30. Am J Med Genet B Neuropsychiatr Genet. 2018. PMID: 28557243 Free PMC article. Review.
-
Personalized medicine in chronic kidney disease by detection of monogenic mutations.Nephrol Dial Transplant. 2020 Mar 1;35(3):390-397. doi: 10.1093/ndt/gfz028. Nephrol Dial Transplant. 2020. PMID: 30809662 Free PMC article. Review.
Cited by
-
Phenotype-aware prioritisation of rare Mendelian disease variants.Trends Genet. 2022 Dec;38(12):1271-1283. doi: 10.1016/j.tig.2022.07.002. Epub 2022 Aug 4. Trends Genet. 2022. PMID: 35934592 Free PMC article. Review.
-
Artificial intelligence enables comprehensive genome interpretation and nomination of candidate diagnoses for rare genetic diseases.Genome Med. 2021 Oct 14;13(1):153. doi: 10.1186/s13073-021-00965-0. Genome Med. 2021. PMID: 34645491 Free PMC article.
-
Genetic pleiotropy of ERCC6 loss-of-function and deleterious missense variants links retinal dystrophy, arrhythmia, and immunodeficiency in diverse ancestries.Hum Mutat. 2021 Aug;42(8):969-977. doi: 10.1002/humu.24220. Epub 2021 May 31. Hum Mutat. 2021. PMID: 34005834 Free PMC article.
-
Term-BLAST-like alignment tool for concept recognition in noisy clinical texts.Bioinformatics. 2023 Dec 1;39(12):btad716. doi: 10.1093/bioinformatics/btad716. Bioinformatics. 2023. PMID: 38001031 Free PMC article.
-
Phenotype-genotype comorbidity analysis of patients with rare disorders provides insight into their pathological and molecular bases.PLoS Genet. 2020 Oct 1;16(10):e1009054. doi: 10.1371/journal.pgen.1009054. eCollection 2020 Oct. PLoS Genet. 2020. PMID: 33001999 Free PMC article.
References
-
- van Nimwegen K.J., Schieving J.H., Willemsen M.A., Veltman J.A., van der Burg S., van der Wilt G.J., Grutters J.P. The diagnostic pathway in complex paediatric neurology: A cost analysis. Eur. J. Paediatr. Neurol. 2015;19:233–239. - PubMed
-
- Vissers L.E.L.M., van Nimwegen K.J.M., Schieving J.H., Kamsteeg E.J., Kleefstra T., Yntema H.G., Pfundt R., van der Wilt G.J., Krabbenborg L., Brunner H.G. A clinical utility study of exome sequencing versus conventional genetic testing in pediatric neurology. Genet. Med. 2017;19:1055–1063. - PMC - PubMed
-
- Graungaard A.H., Skov L. Why do we need a diagnosis? A qualitative study of parents’ experiences, coping and needs, when the newborn child is severely disabled. Child Care Health Dev. 2007;33:296–307. - PubMed
-
- Sawyer S.L., Hartley T., Dyment D.A., Beaulieu C.L., Schwartzentruber J., Smith A., Bedford H.M., Bernard G., Bernier F.P., Brais B., FORGE Canada Consortium. Care4Rare Canada Consortium Utility of whole-exome sequencing for those near the end of the diagnostic odyssey: Time to address gaps in care. Clin. Genet. 2016;89:275–284. - PMC - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials