Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Jun;20(e1):e147-54.
doi: 10.1136/amiajnl-2012-000896. Epub 2013 Mar 26.

Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network

Affiliations

Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network

Katherine M Newton et al. J Am Med Inform Assoc. 2013 Jun.

Abstract

Background: Genetic studies require precise phenotype definitions, but electronic medical record (EMR) phenotype data are recorded inconsistently and in a variety of formats.

Objective: To present lessons learned about validation of EMR-based phenotypes from the Electronic Medical Records and Genomics (eMERGE) studies.

Materials and methods: The eMERGE network created and validated 13 EMR-derived phenotype algorithms. Network sites are Group Health, Marshfield Clinic, Mayo Clinic, Northwestern University, and Vanderbilt University.

Results: By validating EMR-derived phenotypes we learned that: (1) multisite validation improves phenotype algorithm accuracy; (2) targets for validation should be carefully considered and defined; (3) specifying time frames for review of variables eases validation time and improves accuracy; (4) using repeated measures requires defining the relevant time period and specifying the most meaningful value to be studied; (5) patient movement in and out of the health plan (transience) can result in incomplete or fragmented data; (6) the review scope should be defined carefully; (7) particular care is required in combining EMR and research data; (8) medication data can be assessed using claims, medications dispensed, or medications prescribed; (9) algorithm development and validation work best as an iterative process; and (10) validation by content experts or structured chart review can provide accurate results.

Conclusions: Despite the diverse structure of the five EMRs of the eMERGE sites, we developed, validated, and successfully deployed 13 electronic phenotype algorithms. Validation is a worthwhile process that not only measures phenotype performance but also strengthens phenotype algorithm definitions and enhances their inter-institutional sharing.

Keywords: electronic health record; electronic medical record; genomics; phenotype; validation studies.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Phenotype development and validation. In this two-stage process a primary site first develops and executes the phenotype (boxes), and then secondary sites execute the phenotype (ovals). At each step feedback to primary and secondary sites may lead to revisions in the methods (arrows).

References

    1. Office of the National Coordinator for Health Information Technology. Electronic Health Records and Meaningful Use 2011; http://healthit.hhs.gov/portal/server.pt?open=512&objID=1325&par.... (accessed 31 May 2011)
    1. Walker JM. Electronic medical records and health care transformation. Health Aff (Millwood). 2005;24:1118–20. - PubMed
    1. Walker J, Pan E, Johnston D, Adler-Milstein J, et al. The value of health care information exchange and interoperability. Health Affairs, no. (2005): doi:10.1377/hlthaff.w5.10 (accessed 18 May 2013)
    1. Edwards BJ, Haynes C, Levenstien MA, et al. Power and sample size calculations in the presence of phenotype errors for case/control genetic association studies. BMC Genetics 2005;6:18 doi:10.1186/1471-2156-6-18 (accessed 18 May 2013). - PMC - PubMed
    1. Rice JP, Saccone NL, Rasmussen E. Definition of the phenotype. Adv Genet 2001;42:69–76 - PubMed

Publication types