Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Mar 1;9(1):41-66.
doi: 10.2174/187569211794728805.

Statistical Optimization of Pharmacogenomics Association Studies: Key Considerations from Study Design to Analysis

Affiliations

Statistical Optimization of Pharmacogenomics Association Studies: Key Considerations from Study Design to Analysis

Benjamin J Grady et al. Curr Pharmacogenomics Person Med. .

Abstract

Research in human genetics and genetic epidemiology has grown significantly over the previous decade, particularly in the field of pharmacogenomics. Pharmacogenomics presents an opportunity for rapid translation of associated genetic polymorphisms into diagnostic measures or tests to guide therapy as part of a move towards personalized medicine. Expansion in genotyping technology has cleared the way for widespread use of whole-genome genotyping in the effort to identify novel biology and new genetic markers associated with pharmacokinetic and pharmacodynamic endpoints. With new technology and methodology regularly becoming available for use in genetic studies, a discussion on the application of such tools becomes necessary. In particular, quality control criteria have evolved with the use of GWAS as we have come to understand potential systematic errors which can be introduced into the data during genotyping. There have been several replicated pharmacogenomic associations, some of which have moved to the clinic to enact change in treatment decisions. These examples of translation illustrate the strength of evidence necessary to successfully and effectively translate a genetic discovery. In this review, the design of pharmacogenomic association studies is examined with the goal of optimizing the impact and utility of this research. Issues of ascertainment, genotyping, quality control, analysis and interpretation are considered.

PubMed Disclaimer

Conflict of interest statement

CONFLICT OF INTERESTS

None declared/applicable

Figures

Figure 1
Figure 1
Example of treatment dependent and treatment differential study designs answering a harmacogenomic question related to antiepileptic drug efficacy.
Figure 2
Figure 2
The plots of the top two principal component vectors resulting from principal components analysis before and after removal of outliers and with true race coded by color. A) Prior to exclusion of outliers, there are several individuals who do not cluster cleanly. B) After outlier removal, the presence of two general clusters can be visualized. Open Diamond = African American; Open Circle = European American; Boxed Plus Sign = Asian; Boxed X = Hispanic; Plus Sign = Other; Filled Triangle = Unknown.
Figure 2
Figure 2
The plots of the top two principal component vectors resulting from principal components analysis before and after removal of outliers and with true race coded by color. A) Prior to exclusion of outliers, there are several individuals who do not cluster cleanly. B) After outlier removal, the presence of two general clusters can be visualized. Open Diamond = African American; Open Circle = European American; Boxed Plus Sign = Asian; Boxed X = Hispanic; Plus Sign = Other; Filled Triangle = Unknown.
Figure 3
Figure 3
Q-Q plots showing the negative log-transformed p-values for all SNPs in an association study. A) Before quality control, many association results have low p-values. B) Subsequent to quality control, it can be seen that the presence of a departure from expected p-values was likely the result of systematic bias from low quality data. The line demonstrates what would be expected if the null hypothesis of no association holds.
Figure 3
Figure 3
Q-Q plots showing the negative log-transformed p-values for all SNPs in an association study. A) Before quality control, many association results have low p-values. B) Subsequent to quality control, it can be seen that the presence of a departure from expected p-values was likely the result of systematic bias from low quality data. The line demonstrates what would be expected if the null hypothesis of no association holds.
Figure 4
Figure 4
A flow chart of potential analysis designs to pursue based on the form of the study question.
Figure 5
Figure 5
Regression equations used in genetic epidemiology. A) The identity link function for linear regression predicts the mean trait value, Y. B) The logit link for logistic regression predicts the odds, [p/(1-p)], of the outcome. C) Poisson regression uses the log function to predict the rate, λ, of events. D) Cox proportional hazards regression also uses the log function but predicts the hazard, h(t), at time t. Each regression function assumes an intercept B0 - the value of the mean, odds, rate or hazard when all independent variables are coded to 0 - and fits a coefficient Bn to describe the effect of each independent variable xn.
Figure 6
Figure 6
Coding conventions for genetic markers with two alleles when performing regression.
Figure 7
Figure 7
Analysis methods used to explore gene-gene interactions.

Similar articles

Cited by

References

    1. Lazarou J, Pomeranz BH, Corey PN. Incidence of adverse drug reactions in hospitalized patients: a meta-analysis of prospective studies. JAMA. 1998;279(15):1200–1205. - PubMed
    1. Botstein D, White RL, Skolnick M, et al. Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am J Hum Genet. 1980;32(3):314–331. - PMC - PubMed
    1. Weber JL, May PE. Abundant class of human DNA polymorphisms which can be typed using the polymerase chain reaction. Am J Hum Genet. 1989;44:388–396. - PMC - PubMed
    1. Collins FS, Guyer MS, Charkravarti A. Variations on a theme: cataloging human DNA sequence variation. Science. 1997;278(5343):1580–1581. - PubMed
    1. Durbin RM, Abecasis GR, Altshuler DL, et al. A map of human genome variation from population-scale sequencing. Nature. 2010;467(7319):1061–1073. - PMC - PubMed

LinkOut - more resources