Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan 28;22(1):23.
doi: 10.1186/s12911-022-01759-z.

Under-specification as the source of ambiguity and vagueness in narrative phenotype algorithm definitions

Affiliations

Under-specification as the source of ambiguity and vagueness in narrative phenotype algorithm definitions

Jingzhi Yu et al. BMC Med Inform Decis Mak. .

Abstract

Introduction: Currently, one of the commonly used methods for disseminating electronic health record (EHR)-based phenotype algorithms is providing a narrative description of the algorithm logic, often accompanied by flowcharts. A challenge with this mode of dissemination is the potential for under-specification in the algorithm definition, which leads to ambiguity and vagueness.

Methods: This study examines incidents of under-specification that occurred during the implementation of 34 narrative phenotyping algorithms in the electronic Medical Record and Genomics (eMERGE) network. We reviewed the online communication history between algorithm developers and implementers within the Phenotype Knowledge Base (PheKB) platform, where questions could be raised and answered regarding the intended implementation of a phenotype algorithm.

Results: We developed a taxonomy of under-specification categories via an iterative review process between two groups of annotators. Under-specifications that lead to ambiguity and vagueness were consistently found across narrative phenotype algorithms developed by all involved eMERGE sites.

Discussion and conclusion: Our findings highlight that under-specification is an impediment to the accuracy and efficiency of the implementation of current narrative phenotyping algorithms, and we propose approaches for mitigating these issues and improved methods for disseminating EHR phenotyping algorithms.

Keywords: Algorithm: Natural Language; Ambiguity; Electronic Health Records (EHR); Phenotyping; Under-Specification; Vagueness.

PubMed Disclaimer

Conflict of interest statement

Not applicable.

Figures

Fig. 1
Fig. 1
Example of raising issues of vagueness and under-specification in the PheKB database, from the Chronic Kidney Disease phenotype. https://phekb.org/phenotype/chronic-kidney-disease
Fig. 2
Fig. 2
Categories of under-specification and other common issues identified in narrative phenotype algorithms

References

    1. Pathak J, Kho AN, Denny JC. Electronic health records-driven phenotyping: challenges, recent advances, and perspectives. J Am Med Inform Assoc JAMIA. 2013;20(e2):e206–e211. doi: 10.1136/amiajnl-2013-002428. - DOI - PMC - PubMed
    1. Wei W-Q, Denny JC. Extracting research-quality phenotypes from electronic health records to support precision medicine. Genome Med [Internet]. 2015 Apr 30 [cited 2020 Sep 9];7(1). Available from: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4416392/ - PMC - PubMed
    1. Gottesman O, Kuivaniemi H, Tromp G, Faucett WA, Li R, Manolio TA, et al. The electronic medical records and genomics (eMERGE) network: past, present, and future. Genet Med Off J Am Coll Med Genet. 2013;15(10):761–771. - PMC - PubMed
    1. McCarty CA, Chisholm RL, Chute CG, Kullo IJ, Jarvik GP, Larson EB, et al. The eMERGE Network: a consortium of biorepositories linked to electronic medical records data for conducting genomic studies. BMC Med Genomics. 2011;4:13. doi: 10.1186/1755-8794-4-13. - DOI - PMC - PubMed
    1. Califf RM. The Patient-Centered Outcomes Research Network: a national infrastructure for comparative effectiveness research. N C Med J. 2014;75(3):204–210. - PubMed

Publication types

LinkOut - more resources