Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Nov;22(6):1220-30.
doi: 10.1093/jamia/ocv112. Epub 2015 Sep 5.

Desiderata for computable representations of electronic health records-driven phenotype algorithms

Affiliations

Desiderata for computable representations of electronic health records-driven phenotype algorithms

Huan Mo et al. J Am Med Inform Assoc. 2015 Nov.

Abstract

Background: Electronic health records (EHRs) are increasingly used for clinical and translational research through the creation of phenotype algorithms. Currently, phenotype algorithms are most commonly represented as noncomputable descriptive documents and knowledge artifacts that detail the protocols for querying diagnoses, symptoms, procedures, medications, and/or text-driven medical concepts, and are primarily meant for human comprehension. We present desiderata for developing a computable phenotype representation model (PheRM).

Methods: A team of clinicians and informaticians reviewed common features for multisite phenotype algorithms published in PheKB.org and existing phenotype representation platforms. We also evaluated well-known diagnostic criteria and clinical decision-making guidelines to encompass a broader category of algorithms.

Results: We propose 10 desired characteristics for a flexible, computable PheRM: (1) structure clinical data into queryable forms; (2) recommend use of a common data model, but also support customization for the variability and availability of EHR data among sites; (3) support both human-readable and computable representations of phenotype algorithms; (4) implement set operations and relational algebra for modeling phenotype algorithms; (5) represent phenotype criteria with structured rules; (6) support defining temporal relations between events; (7) use standardized terminologies and ontologies, and facilitate reuse of value sets; (8) define representations for text searching and natural language processing; (9) provide interfaces for external software algorithms; and (10) maintain backward compatibility.

Conclusion: A computable PheRM is needed for true phenotype portability and reliability across different EHR products and healthcare systems. These desiderata are a guide to inform the establishment and evolution of EHR phenotype algorithm authoring platforms and languages.

Keywords: computable representation; data models; electronic health records; phenotype algorithms; phenotype standardization.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
Phenotype algorithm for identifying type 2 diabetes mellitus (T2DM) from electronic medical records (EMR or EHR). T1DM: type 1 diabetes mellitus; Dx: diagnoses, defined as recorded using International Classification of Diseases, 9th Revision (ICD-9) codes; med: medication; physcn: physicians; Rx: prescriptions. More details can be found in the appendix and on PheKB.org.
Figure 2:
Figure 2:
Schematic of desiderata for computable phenotype electronic health record-driven phenotyping. Numerals 1–9 in the figure correspond to Desiderata 1–9 (Desideratum 10 is not depicted in this Figure).

Similar articles

Cited by

References

    1. Mosley JD, Van Driest SL, Larkin EK, et al. Mechanistic phenotypes: an aggregative phenotyping strategy to identify disease mechanisms using GWAS data. PLoS ONE. 2013;8:e81503. - PMC - PubMed
    1. Namjou B, Keddache M, Marsolo K, et al. EMR-linked GWAS study: investigation of variation landscape of loci for body mass index in children. Front Genet. 2013;4:268. - PMC - PubMed
    1. Pathak J, Kiefer RC, Bielinski SJ, et al. Mining the human phenome using semantic web technologies: a case study for Type 2 Diabetes. AMIA Annu Symp Proc AMIA Symp AMIA Symp. 2012;2012:699–708. - PMC - PubMed
    1. Li L, Ruau D, Chen R, et al. Systematic identification of risk factors for Alzheimer’s disease through shared genetic architecture and electronic medical records. Pac Symp Biocomput Pac Symp Biocomput. 2013;2013:224–235. - PubMed
    1. Denny JC, Crawford DC, Ritchie MD, et al. Variants near FOXE1 are associated with hypothyroidism and other thyroid conditions: using electronic medical records for genome- and phenome-wide studies. Am J Hum Genet. 2011;89:529–542. - PMC - PubMed

Publication types