Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network
- PMID: 23531748
- PMCID: PMC3715338
- DOI: 10.1136/amiajnl-2012-000896
Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network
Abstract
Background: Genetic studies require precise phenotype definitions, but electronic medical record (EMR) phenotype data are recorded inconsistently and in a variety of formats.
Objective: To present lessons learned about validation of EMR-based phenotypes from the Electronic Medical Records and Genomics (eMERGE) studies.
Materials and methods: The eMERGE network created and validated 13 EMR-derived phenotype algorithms. Network sites are Group Health, Marshfield Clinic, Mayo Clinic, Northwestern University, and Vanderbilt University.
Results: By validating EMR-derived phenotypes we learned that: (1) multisite validation improves phenotype algorithm accuracy; (2) targets for validation should be carefully considered and defined; (3) specifying time frames for review of variables eases validation time and improves accuracy; (4) using repeated measures requires defining the relevant time period and specifying the most meaningful value to be studied; (5) patient movement in and out of the health plan (transience) can result in incomplete or fragmented data; (6) the review scope should be defined carefully; (7) particular care is required in combining EMR and research data; (8) medication data can be assessed using claims, medications dispensed, or medications prescribed; (9) algorithm development and validation work best as an iterative process; and (10) validation by content experts or structured chart review can provide accurate results.
Conclusions: Despite the diverse structure of the five EMRs of the eMERGE sites, we developed, validated, and successfully deployed 13 electronic phenotype algorithms. Validation is a worthwhile process that not only measures phenotype performance but also strengthens phenotype algorithm definitions and enhances their inter-institutional sharing.
Keywords: electronic health record; electronic medical record; genomics; phenotype; validation studies.
Figures
References
-
- Office of the National Coordinator for Health Information Technology. Electronic Health Records and Meaningful Use 2011; http://healthit.hhs.gov/portal/server.pt?open=512&objID=1325&par.... (accessed 31 May 2011)
-
- Walker JM. Electronic medical records and health care transformation. Health Aff (Millwood). 2005;24:1118–20. - PubMed
-
- Walker J, Pan E, Johnston D, Adler-Milstein J, et al. The value of health care information exchange and interoperability. Health Affairs, no. (2005): doi:10.1377/hlthaff.w5.10 (accessed 18 May 2013)
-
- Rice JP, Saccone NL, Rasmussen E. Definition of the phenotype. Adv Genet 2001;42:69–76 - PubMed
Publication types
MeSH terms
Grants and funding
- U01-HG-004610/HG/NHGRI NIH HHS/United States
- U01 HG006375/HG/NHGRI NIH HHS/United States
- UL1 TR000150/TR/NCATS NIH HHS/United States
- U19 HL065962/HL/NHLBI NIH HHS/United States
- U01 HG004603/HG/NHGRI NIH HHS/United States
- U01HG004609/HG/NHGRI NIH HHS/United States
- U01 HG004609/HG/NHGRI NIH HHS/United States
- U01 HG004599/HG/NHGRI NIH HHS/United States
- U01 HG006378/HG/NHGRI NIH HHS/United States
- R01 GM105688/GM/NIGMS NIH HHS/United States
- U01 HG004610/HG/NHGRI NIH HHS/United States
- U01-HG-004608/HG/NHGRI NIH HHS/United States
- U01 HG004608/HG/NHGRI NIH HHS/United States
- U01-HG-04599/HG/NHGRI NIH HHS/United States
- K08 AG019180/AG/NIA NIH HHS/United States
- U01-HG-04603/HG/NHGRI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources