Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Mar-Apr;19(2):212-8.
doi: 10.1136/amiajnl-2011-000439. Epub 2011 Nov 19.

Use of diverse electronic medical record systems to identify genetic risk for type 2 diabetes within a genome-wide association study

Affiliations

Use of diverse electronic medical record systems to identify genetic risk for type 2 diabetes within a genome-wide association study

Abel N Kho et al. J Am Med Inform Assoc. 2012 Mar-Apr.

Abstract

Objective: Genome-wide association studies (GWAS) require high specificity and large numbers of subjects to identify genotype-phenotype correlations accurately. The aim of this study was to identify type 2 diabetes (T2D) cases and controls for a GWAS, using data captured through routine clinical care across five institutions using different electronic medical record (EMR) systems.

Materials and methods: An algorithm was developed to identify T2D cases and controls based on a combination of diagnoses, medications, and laboratory results. The performance of the algorithm was validated at three of the five participating institutions compared against clinician review. A GWAS was subsequently performed using cases and controls identified by the algorithm, with samples pooled across all five institutions.

Results: The algorithm achieved 98% and 100% positive predictive values for the identification of diabetic cases and controls, respectively, as compared against clinician review. By standardizing and applying the algorithm across institutions, 3353 cases and 3352 controls were identified. Subsequent GWAS using data from five institutions replicated the TCF7L2 gene variant (rs7903146) previously associated with T2D.

Discussion: By applying stringent criteria to EMR data collected through routine clinical care, cases and controls for a GWAS were identified that subsequently replicated a known genetic variant. The use of standard terminologies to define data elements enabled pooling of subjects and data across five different institutions to achieve the robust numbers required for GWAS.

Conclusions: An algorithm using commonly available data from five different EMR can accurately identify T2D cases and controls for genetic study across multiple institutions.

PubMed Disclaimer

Conflict of interest statement

Competing interests: None.

Figures

Figure 1
Figure 1
Algorithm for the identification of subjects with type 2 diabetes. *Random glucose >200 mg/dl, fasting glucose >125 mg/dl, HbA1c ≥6.5%. HbA1c, hemoglobin A1c; ICD-9, International Classification of Diseases, version 9; T1DM, type 1 diabetes mellitus; T2DM, type 2 diabetes mellitus.
Figure 2
Figure 2
Algorithm for the identification of type 2 diabetes controls. *Glucose ≥110 mg/dl, HbA1c ≥6.0%. HbA1c, hemoglobin A1c; ICD-9, International Classification of Diseases, version 9; T1DM, type 1 diabetes mellitus; T2DM, type 2 diabetes mellitus.

References

    1. Fox CS, Pencina MJ, Meigs JB, et al. Trends in the incidence of type 2 diabetes mellitus from the 1970s to the 1990s: the Framingham Heart Study. Circulation 2006;113:2914–18 - PubMed
    1. Sloan FA, Bethel MA, Ruiz D, Jr, et al. The growing burden of diabetes mellitus in the US elderly population. Arch Intern Med 2008;168:192–9; discussion 199. - PubMed
    1. Scott LJ, Mohlke KL, Bonnycastle LL, et al. A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants. Science 2007;316:1341–5 - PMC - PubMed
    1. Sladek R, Rocheleau G, Rung J, et al. A genome-wide association study identifies novel risk loci for type 2 diabetes. Nature 2007;445:881–5 - PubMed
    1. Steinthorsdottir V, Thorleifsson G, Reynisdottir I, et al. A variant in CDKAL1 influences insulin response and risk of type 2 diabetes. Nat Genet 2007;39:770–5 - PubMed

Publication types