Facilitating phenotype transfer using a common data model

George Hripcsak¹, Ning Shang², Peggy L Peissig³, Luke V Rasmussen⁴, Cong Liu², Barbara Benoit⁵, Robert J Carroll⁶, David S Carrell⁷, Joshua C Denny⁸, Ozan Dikilitas⁹, Vivian S Gainer⁵, Kayla Marie Howell¹⁰, Jeffrey G Klann⁵, Iftikhar J Kullo⁹, Todd Lingren¹¹, Frank D Mentch¹², Shawn N Murphy⁵, Karthik Natarajan¹³, Jennifer A Pacheco⁴, Wei-Qi Wei⁶, Ken Wiley¹⁴, Chunhua Weng²

Affiliations

¹ Department of Biomedical Informatics, Columbia University, New York, NY, United States; Medical Informatics Services, NewYork-Presbyterian Hospital, New York, NY, United States. Electronic address: hripcsak@columbia.edu.
² Department of Biomedical Informatics, Columbia University, New York, NY, United States.
³ Center for Precision Medicine Research, Marshfield Clinic Research Institute, Marshfield, WI, United States.
⁴ Northwestern University Feinberg School of Medicine, Chicago, IL, United States.
⁵ Research Information Science and Computing, Partners Healthcare, Boston, MA, United States.
⁶ Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, United States.
⁷ Kaiser Permanente Washington Health Research Institute, Seattle, WA, United States.
⁸ Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, United States; Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, United States.
⁹ Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN, United States.
¹⁰ Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, TN, United States.
¹¹ Cincinnati Children's Hospital Medical Center, Cincinnati, OH, United States.
¹² Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA, United States.
¹³ Department of Biomedical Informatics, Columbia University, New York, NY, United States; Medical Informatics Services, NewYork-Presbyterian Hospital, New York, NY, United States.
¹⁴ National Human Genome Research Institute, NIH, Bethesda, MD, United States.

PMID: 31325501
PMCID: PMC6697565
DOI: 10.1016/j.jbi.2019.103253

George Hripcsak et al. J Biomed Inform. 2019 Aug;96:103253. doi: 10.1016/j.jbi.2019.103253. Epub 2019 Jul 17.

Abstract

Background: Implementing clinical phenotypes across a network is labor intensive and potentially error prone. Use of a common data model may facilitate the process.

Methods: Electronic Medical Records and Genomics (eMERGE) sites implemented the Observational Health Data Sciences and Informatics (OHDSI) Observational Medical Outcomes Partnership (OMOP) Common Data Model across their electronic health record (EHR)-linked DNA biobanks. Two previously implemented eMERGE phenotypes were converted to OMOP and implemented across the network.

Results: It was feasible to implement the common data model across sites, with laboratory data posing the greatest challenge because of local encoding. Sites were then able to execute the OMOP phenotype in less than one day, as opposed to the weeks of effort needed to manually implement an eMERGE phenotype in their bespoke research EHR databases. Among the sites that could compare the current OMOP phenotype implementation with the original eMERGE phenotype implementation, specific agreement ranged from 100% to 43%, with disagreements due to the original phenotype, the OMOP phenotype, changes in data, and issues in the databases. Using the OMOP query as a standard comparison revealed differences in the original implementations despite starting from the same definitions, code lists, flowcharts, and pseudocode.
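The abstract does not spell out how specific agreement was calculated. As a minimal sketch, assuming it refers to positive specific agreement in its usual form, 2a / (2a + b + c), where a is the number of patients selected by both implementations and b and c are the numbers selected by only one of them, the comparison between an original and an OMOP cohort could look like the following. The function name and the patient identifiers are illustrative, not taken from the study.

```python
def positive_specific_agreement(original_cases, omop_cases):
    """Positive specific agreement between two sets of patient identifiers.

    Assumed formula: 2a / (2a + b + c), where
      a = patients flagged by both implementations,
      b = patients flagged only by the original implementation,
      c = patients flagged only by the OMOP implementation.
    """
    original = set(original_cases)
    omop = set(omop_cases)

    both = len(original & omop)
    only_original = len(original - omop)
    only_omop = len(omop - original)

    denominator = 2 * both + only_original + only_omop
    if denominator == 0:
        raise ValueError("Both cohorts are empty; agreement is undefined.")
    return 2 * both / denominator


if __name__ == "__main__":
    # Hypothetical patient identifiers, for illustration only.
    original = {"p1", "p2", "p3", "p4"}
    omop = {"p3", "p4", "p5", "p6"}
    print(positive_specific_agreement(original, omop))
```

Under this definition, patients excluded by both implementations do not enter the calculation, so a large pool of non-cases cannot inflate the agreement.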

Conclusion: Using a common data model can dramatically speed phenotype implementation at the cost of having to populate that data model, though this will produce a net benefit as the number of phenotype implementations increases. Inconsistencies among the implementations of the original queries point to a potential benefit of using a common data model so that actual phenotype code and logic can be shared, mitigating human error in reinterpretation of a narrative phenotype definition.

Keywords: Common data model; Electronic health records; Phenotyping.

Conflict of interest statement

Conflicts of interest: None reported.

Figures

**Figure 1. Phenotyping flowchart and issues reported.**
Phenotypes were based on the published eMERGE phenotype definition, which included a narrative definition, high-level concept code lists, a flowchart, and pseudocode. This definition was taken as the gold standard and was therefore treated as having no issues. In the original eMERGE implementation, each local site wrote software to query its local database based on the eMERGE definition. In this study, each site converted its data to the OHDSI OMOP database using extract-transform-load (ETL) software and OMOP vocabulary mappings. The eMERGE definition was encoded as a single OMOP phenotype using the OHDSI Atlas tool. A site could run the phenotype using a local copy of Atlas running on its OMOP database, or use SQL that Atlas generated automatically for five database management systems. Where possible, the OMOP result of the phenotype query was compared to the original result. Sites reported issues that they encountered, which are shown in grey squares adjacent to the most relevant step. Issues that caused a significant difference between the original and OMOP query are marked with a star in bold. Those that caused a moderate difference are marked with a star in non-bold. Issues that caused little or no change are marked with a smaller round bullet in a smaller font.
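To illustrate why locally encoded laboratory data complicates the conversion step, the sketch below maps site-specific lab codes to LOINC codes and sets aside rows that have no mapping. It is a simplified assumption of how one part of an ETL might work, not the sites' actual software: the local codes, the row layout, and the function name are hypothetical, and a real OMOP load would also resolve each code to a standard concept identifier, which is omitted here.

```python
# Hypothetical site-specific lab codes mapped to LOINC codes.
LOCAL_TO_LOINC = {
    "LAB_GLU_SER": "2345-7",  # Glucose [Mass/volume] in Serum or Plasma
    "LAB_HBA1C": "4548-4",    # Hemoglobin A1c/Hemoglobin.total in Blood
}


def map_lab_rows(rows, mapping):
    """Split lab rows into those with a LOINC mapping and those without.

    Each row is a dict with at least a 'local_code' key (assumed layout).
    Mapped rows are copied with an added 'loinc_code' key; unmapped rows
    are returned unchanged so they can be reviewed by hand.
    """
    mapped, unmapped = [], []
    for row in rows:
        loinc = mapping.get(row["local_code"])
        if loinc is None:
            unmapped.append(row)
        else:
            mapped.append({**row, "loinc_code": loinc})
    return mapped, unmapped


if __name__ == "__main__":
    # Illustrative rows only; not data from the study.
    rows = [
        {"patient_id": "p1", "local_code": "LAB_GLU_SER", "value": 98.0},
        {"patient_id": "p2", "local_code": "LAB_XYZ_OLD", "value": 1.2},
    ]
    mapped, unmapped = map_lab_rows(rows, LOCAL_TO_LOINC)
    print(len(mapped), "mapped;", len(unmapped), "need manual review")
```

Keeping unmapped rows in a separate list, rather than dropping them, makes gaps in the local-to-standard mapping visible before any phenotype query is run.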
