Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 May 1;26(9):1205-10.
doi: 10.1093/bioinformatics/btq126. Epub 2010 Mar 24.

PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations

Affiliations

PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations

Joshua C Denny et al. Bioinformatics. .

Abstract

Motivation: Emergence of genetic data coupled to longitudinal electronic medical records (EMRs) offers the possibility of phenome-wide association scans (PheWAS) for disease-gene associations. We propose a novel method to scan phenomic data for genetic associations using International Classification of Disease (ICD9) billing codes, which are available in most EMR systems. We have developed a code translation table to automatically define 776 different disease populations and their controls using prevalent ICD9 codes derived from EMR data. As a proof of concept of this algorithm, we genotyped the first 6005 European-Americans accrued into BioVU, Vanderbilt's DNA biobank, at five single nucleotide polymorphisms (SNPs) with previously reported disease associations: atrial fibrillation, Crohn's disease, carotid artery stenosis, coronary artery disease, multiple sclerosis, systemic lupus erythematosus and rheumatoid arthritis. The PheWAS software generated cases and control populations across all ICD9 code groups for each of these five SNPs, and disease-SNP associations were analyzed. The primary outcome of this study was replication of seven previously known SNP-disease associations for these SNPs.

Results: Four of seven known SNP-disease associations using the PheWAS algorithm were replicated with P-values between 2.8 x 10(-6) and 0.011. The PheWAS algorithm also identified 19 previously unknown statistical associations between these SNPs and diseases at P < 0.01. This study indicates that PheWAS analysis is a feasible method to investigate SNP-disease associations. Further evaluation is needed to determine the validity of these associations and the appropriate statistical thresholds for clinical significance.

Availability: The PheWAS software and code translation table are freely available at http://knowledgemap.mc.vanderbilt.edu/research.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Phenome-wide scan for association with rs3135388. MS is replicated from prior analyses. The dashed line represents the P = 0.05; the dotted line represents the Bonferroni correction.
Fig. 2.
Fig. 2.
Phenome-wide scan for association for four additional SNPs with known disease-SNP associations. The boxed diseases represent associations replicated from prior GWAS analyses. The dashed line represents the P = 0.05; the dotted line represents the Bonferroni correction.

Similar articles

Cited by

References

    1. Benjamin EJ, et al. Genome-wide association with select biomarker traits in the Framingham Heart Study. BMC Med. Genet. 2007;8(Suppl. 1):S11. - PMC - PubMed
    1. Benson L, et al. Trends in the diagnosis of overweight and obesity in children and adolescents: 1999–2007. Pediatrics. 2009;123:e153–158. - PubMed
    1. Bilder RM, et al. Phenomics: the systematic study of phenotypes on a genome-wide scale. Neuroscience. 2009;164:30–42. - PMC - PubMed
    1. Denny JC, et al. Increased hospital mortality in patients with bedside hippus. Am. J. Med. 2008;121:239–245. - PubMed
    1. Denny JC, et al. Identifying UMLS concepts from ECG impressions using knowledgemap. AMIA Annu. Symp. Proc. 2005:196–200. - PMC - PubMed

Publication types