Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Nov 5:2015:824-32.
eCollection 2015.

Contrasting Association Results between Existing PheWAS Phenotype Definition Methods and Five Validated Electronic Phenotypes

Affiliations

Contrasting Association Results between Existing PheWAS Phenotype Definition Methods and Five Validated Electronic Phenotypes

Joseph B Leader et al. AMIA Annu Symp Proc. .

Abstract

Phenome-Wide Association Studies (PheWAS) comprehensively investigate the association between genetic variation and a wide array of outcome traits. Electronic health record (EHR) based PheWAS uses various abstractions of International Classification of Diseases, Ninth Revision (ICD-9) codes to identify case/control status for diagnoses that are used as the phenotypic variables. However, there have not been comparisons within a PheWAS between results from high quality derived phenotypes and high-throughput but potentially inaccurate use of ICD-9 codes for case/control definition. For this study we first developed a group of high quality algorithms for five phenotypes. Next we evaluated the association of these "gold standard" phenotypes and 4,636,178 genetic variants with minor allele frequency > 0.01 and compared the results from high-throughput associations at the 3 digit, 5 digit, and PheWAS codes for defining case/control status. We found that certain diseases contained similar patient populations across phenotyping methods but had differences in PheWAS.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Venn diagram demonstrating the overlap of cases by phenotype method for T2DM
Figure 2.
Figure 2.
Venn diagram demonstrating the overlap of cases by phenotype method for Obesity
Figure 3.
Figure 3.
Venn diagram demonstrating the overlap of cases by phenotype method for ACS
Figure 4.
Figure 4.
Manhattan-plots of all results. These are the results for the 5-digit ICD-9 code defined case/control diagnoses. Supplementary figures show the results for the 3-digit and PheWAS Code defined case/control status and are available online. The points in gold show the significance of the association result for the ICD-9 based diagnoses most similar to the gold-standard defined phenotype, for example the ICD-9 code 250.00 is the ICD-9 code for T2DM.
Figure 5.
Figure 5.
Sun Plot of association results for SNP rs7127254, coded allele T, present across the three methods of identifying case/control status used for these analyses. This SNP was associated with the gold standard phenotype of obesity in our study with p-value 6.19×10−7, and the array of metabolic syndrome comorbidities also associated with SNP notable. The most significant association for this SNP was with the 5 digit ICD-9 of T2DM.
Figure 6.
Figure 6.
Sun Plot of results for SNP rs2277251, coded allele T. This SNP was associated with the gold standard phenotype of T2DM with p-value 7.08×10−7. SNP-phenotype associations (p < 0.01) that were also present for this SNP across the 3 methods of identifying case/control status. The spectrum of other comorbidities related to T2DM also associated with this SNP is notable.
Figure 7.
Figure 7.
Sun Plot of results for SNP rs10009355, coded allele T. This SNP was associated with the gold standard phenotype of obstructive CAD with p-value 8.24×10−7. SNP-phenotype associations (p < 0.01) that were also present for this SNP across the 3 methods of identifying case/control status. Additional comorbidities related to CAD are associated with this SNP.

References

    1. Pendergrass SA, Brown-Gentry K, Dudek SM, Torstenson ES, Ambite JL, Avery CL, et al. The use of phenome-wide association studies (PheWAS) for exploration of novel genotype-phenotype relationships and pleiotropy discovery. Genet Epidemiol. 2011 Jul;35(5):410–22. - PMC - PubMed
    1. Crawford DC, Crosslin DR, Tromp G, Kullo IJ, Kuivaniemi H, Hayes MG, et al. eMERGEing progress in genomics-the first seven years. Front Genet. 2014;5:184. - PMC - PubMed
    1. Denny JC, Ritchie MD, Basford MA, Pulley JM, Bastarache L, Brown-Gentry K, et al. PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations. Bioinformatics. 2010 May 1;26(9):1205–10. - PMC - PubMed
    1. Hebbring SJ, Schrodi SJ, Ye Z, Zhou Z, Page D, Brilliant MH. A PheWAS approach in studying HLA-DRB1*1501. Genes Immun. 2013 Apr;14(3):187–91. - PMC - PubMed
    1. Namjou B, Marsolo K, Caroll RJ, Denny JC, Ritchie MD, Verma SS, et al. Phenome-wide association study (PheWAS) in EMR-linked pediatric cohorts, genetically links PLCL1 to speech language development and IL5-IL13 to Eosinophilic Esophagitis. Front Genet. 2014;5:401. - PMC - PubMed

LinkOut - more resources