Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 May 6:8:15.
doi: 10.1186/s13040-015-0048-2. eCollection 2015.

The effects of electronic medical record phenotyping details on genetic association studies: HDL-C as a case study

Affiliations

The effects of electronic medical record phenotyping details on genetic association studies: HDL-C as a case study

Logan Dumitrescu et al. BioData Min. .

Abstract

Background: Biorepositories linked to de-identified electronic medical records (EMRs) have the potential to complement traditional epidemiologic studies in genotype-phenotype studies of complex human diseases and traits. A major challenge in meeting this potential is the use of EMR-derived data to extract phenotypes and covariates for genetic association studies. Unlike traditional epidemiologic data, EMR-derived data are collected for clinical care and are therefore highly variable across patients. The variability of clinical data coupled with the challenges associated with searching unstructured clinical notes requires the development of algorithms to extract phenotypes for analysis. Given the number of possible algorithms that could be developed for any one EMR-derived phenotype, we explored here the impact algorithm decision logic has on genetic association study results for a single quantitative trait, high density lipoprotein cholesterol (HDL-C).

Results: We used five different algorithms to extract HDL-C from African American subjects genotyped on the Illumina Metabochip (n = 11,519) as part of Epidemiologic Architecture for Genes Linked to Environment (EAGLE). Tests of association between HDL-C and genetic risk scores for HDL-C associated variants suggest that the genetic effect size does not vary substantially across the five HDL-C definitions.

Conclusions: These data collectively suggest that, at least for this quantitative trait, algorithm decision logic and phenotyping details do not appreciably impact genetic association study test statistics.

Keywords: Electronic medical record; Genetic risk score; HDL-C; PAGE I study; eMERGE network.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Distribution of HDL-C risk alleles in EAGLE BioVU African Americans.
Figure 2
Figure 2
Additive effects of HDL-C risk alleles on various HDL-C measurements. Effect sizes (betas) from the linear regression analysis with the unweighted GRS, adjusted for sex, are shown as expected HDL-C levels (in mg/dl; black diamonds), along with their 95% confidence intervals.

References

    1. Manolio TA. Biorepositories–at the bleeding edge. Int J Epidemiol. 2008;37:231–3. doi: 10.1093/ije/dym282. - DOI - PubMed
    1. McCarty C, Chisholm R, Chute C, Kullo I, Jarvik G, Larson E, et al. The eMERGE network: a consortium of biorepositories linked to electronic medical records data for conducting genomic studies. BMC Med Genet. 2011;4:13. - PMC - PubMed
    1. Gottesman O, Kuivaniemi H, Tromp G, Faucett WA, Li R, Manolio TA, et al. The Electronic Medical Records and Genomics (eMERGE) network: past, present, and future. Genet Med. 2013;15:761–71. doi: 10.1038/gim.2013.72. - DOI - PMC - PubMed
    1. Kho AN, Pacheco JA, Peissig PL, Rasmussen L, Newton KM, Weston N, et al. Electronic Medical Records for Genetic Research: Results of the eMERGE Consortium. Sci Transl Med. 2011;3:79re1. doi: 10.1126/scitranslmed.3001807. - DOI - PMC - PubMed
    1. Ritchie MD, Denny JC, Crawford DC, Ramirez AH, Weiner JB, Pulley JM, et al. Robust replication of genotype-phenotype associations across multiple diseases in an electronic medical record. Am J Hum Genet. 2010;86:560–72. doi: 10.1016/j.ajhg.2010.03.003. - DOI - PMC - PubMed