Comparative Study
Genet Epidemiol. 2008 Sep;32(6):560-6.
doi: 10.1002/gepi.20330.

Analysis of multiple SNPs in a candidate gene or region

Juliet Chapman et al. Genet Epidemiol. 2008 Sep.

Abstract

We consider the analysis of multiple single nucleotide polymorphisms (SNPs) within a gene or region. The simplest analysis of such data is based on a series of single SNP hypothesis tests, followed by correction for multiple testing, but it is intuitively plausible that a joint analysis of the SNPs will have higher power, particularly when the causal locus may not have been observed. However, standard tests, such as a likelihood ratio test based on an unrestricted alternative hypothesis, tend to have large numbers of degrees of freedom and hence low power. This has motivated a number of alternative test statistics. Here we compare several of the competing methods, including the multivariate score test (Hotelling's test) of Chapman et al. ([2003] Hum. Hered. 56:18-31), Fisher's method for combining P-values, the minimum P-value approach, a Fourier-transform-based approach recently suggested by Wang and Elston ([2007] Am. J. Human Genet. 80:353-360) and a Bayesian score statistic proposed for microarray data by Goeman et al. ([2005] J. R. Stat. Soc. B 68:477-493). Some relationships between these methods are pointed out, and simulation results given to show that the minimum P-value and the Goeman et al. ([2005] J. R. Stat. Soc. B 68:477-493) approaches work well over a range of scenarios. The Wang and Elston approach often performs poorly; we explain why, and show how its performance can be substantially improved.
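Two of the methods named in the abstract, Fisher's method for combining P-values and the minimum P-value approach, can be illustrated with a minimal sketch. This is not the authors' implementation. Both reference distributions used here (chi-square with 2k degrees of freedom for Fisher's statistic, and the Sidak adjustment for the minimum P-value) assume the k single-SNP tests are independent. SNPs within a gene or region are usually correlated through linkage disequilibrium, so in practice significance would typically be assessed by permutation instead. The function names and the example P-values are hypothetical.

```python
import math


def chi2_sf_even_df(x, df):
    """Upper-tail probability of a chi-square variable with even df.

    For df = 2k the survival function has the closed form
    exp(-x/2) * sum_{i=0}^{k-1} (x/2)^i / i!, so no external library is needed.
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("df must be a positive even integer")
    half = x / 2.0
    term = 1.0   # (x/2)^i / i! for i = 0
    total = 1.0
    for i in range(1, df // 2):
        term *= half / i
        total += term
    return math.exp(-half) * total


def fisher_combined(p_values):
    """Fisher's method: statistic -2 * sum(ln p_i), chi-square with 2k df.

    Assumption: the k P-values come from independent tests.
    """
    if not p_values or any(not (0.0 < p <= 1.0) for p in p_values):
        raise ValueError("P-values must lie in (0, 1]")
    stat = -2.0 * sum(math.log(p) for p in p_values)
    return stat, chi2_sf_even_df(stat, 2 * len(p_values))


def min_p_sidak(p_values):
    """Minimum P-value with a Sidak adjustment for k independent tests."""
    if not p_values or any(not (0.0 <= p <= 1.0) for p in p_values):
        raise ValueError("P-values must lie in [0, 1]")
    return 1.0 - (1.0 - min(p_values)) ** len(p_values)


if __name__ == "__main__":
    # Hypothetical single-SNP P-values for one candidate region.
    snp_p = [0.04, 0.20, 0.01, 0.65]
    stat, p_fisher = fisher_combined(snp_p)
    print(f"Fisher statistic = {stat:.3f}, combined P = {p_fisher:.4f}")
    print(f"Sidak-adjusted minimum P = {min_p_sidak(snp_p):.4f}")
```

Fisher's method pools evidence across all SNPs, whereas the minimum P-value approach depends only on the strongest single signal. That difference is one reason the two can rank regions differently.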

Figures

Figure 1
QQ-plot comparing the distribution of true test statistics, based upon the CTLA4 region, with the distribution of the simulated asymptotic values.

References

    1. Byng MC, Whittaker JC, Cuthbert AP, Mathew CG, Lewis CM. SNP subset selection for genetic association studies. Ann Hum Genet. 2003;67:543–556.
    2. Chapman JM, Cooper JD, Todd JA, Clayton DG. Detecting disease associations due to linkage disequilibrium using haplotype tags: A class of tests and the determinants of statistical power. Hum Hered. 2003;56:18–31.
    3. Clayton D. SNPHAP, a program for estimating frequencies of haplotypes of large numbers of diallelic markers from unphased genotype data from unrelated subjects. 2003. http://www-gene.cimr.cam.ac.uk/clayton/software/
    4. Fan R, Knapp M. Genome association studies of complex diseases by case-control designs. Am J Hum Genet. 2003;72:850–868.
    5. Fisher RA. Statistical methods for research workers. 4th edition. London: Oliver and Boyd; 1932.
