Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Oct;3(10):1827-37.
doi: 10.1371/journal.pgen.0030170. Epub 2007 Aug 22.

Power to detect risk alleles using genome-wide tag SNP panels

Affiliations

Power to detect risk alleles using genome-wide tag SNP panels

Michael A Eberle et al. PLoS Genet. 2007 Oct.

Abstract

Advances in high-throughput genotyping and the International HapMap Project have enabled association studies at the whole-genome level. We have constructed whole-genome genotyping panels of over 550,000 (HumanHap550) and 650,000 (HumanHap650Y) SNP loci by choosing tag SNPs from all populations genotyped by the International HapMap Project. These panels also contain additional SNP content in regions that have historically been overrepresented in diseases, such as nonsynonymous sites, the MHC region, copy number variant regions and mitochondrial DNA. We estimate that the tag SNP loci in these panels cover the majority of all common variation in the genome as measured by coverage of both all common HapMap SNPs and an independent set of SNPs derived from complete resequencing of genes obtained from SeattleSNPs. We also estimate that, given a sample size of 1,000 cases and 1,000 controls, these panels have the power to detect single disease loci of moderate risk (lambda approximately 1.8-2.0). Relative risks as low as lambda approximately 1.1-1.3 can be detected using 10,000 cases and 10,000 controls depending on the sample population and disease model. If multiple loci are involved, the power increases significantly to detect at least one locus such that relative risks 20%-35% lower can be detected with 80% power if between two and four independent loci are involved. Although our SNP selection was based on HapMap data, which is a subset of all common SNPs, these panels effectively capture the majority of all common variation and provide high power to detect risk alleles that are not represented in the HapMap data.

PubMed Disclaimer

Conflict of interest statement

Competing interests. MAE, KK, DAP, KLG, RS, LG, KAVM, and CTL are employed by the company that makes the array chips analyzed. PCN, LZ, and SSM are former employees.

Figures

Figure 1
Figure 1. Coverage of the HapMap (A) or SeattleSNPs (B) Datasets As Measured by the Proportion of Common Variation in Each Dataset That Is Captured by a SNP in Either HumanHap550 (CEU; CHB + JPT) or HumanHap650Y (YRI) at Various r 2 Thresholds
Lines show coverage for the CEU (red), CHB + JPT (green), and YRI (blue) populations.
Figure 2
Figure 2. Total Power to Detect a Single Common Risk Allele in the HapMap Data Using HumanHap550 for CEU (A) and CHB + JPT (B) and HumanHap650Y for YRI (C) in a Study Size of 1,000 cases and 1,000 Controls under a Multiplicative Disease Model
Power is calculated for a 5% false discovery rate after a Bonferroni correction for multiple testing (red line). The solid black line represents the power to detect a risk allele if all the common HapMap SNPs in each respective population are genotyped using the same significance cutoff. Also shown are the powers for different frequency ranges as dashed lines, where the lower line indicates the power for risk allele frequencies from 5%–10% and the upper line indicates the power for risk allele frequencies from 40%–50%.
Figure 3
Figure 3. The Minimum Risk Detectable at 80% Power with p ≤ 0.05 after a Bonferroni Correction for Multiple Testing (χ2 ≥ 28.5687) for Various Sample Sizes under Multiplicative (Red) and Additive (Blue) Models
Power is calculated as the ability to detect a single common risk allele in the HapMap data in a whole genome association study within CEU samples.
Figure 4
Figure 4. Total Power to Detect a Single Common Risk Allele outside of the HapMap Data Using 1,000 cases and 1,000 Controls under a Multiplicative Model
The power is estimated using the common SNPs in 68 resequenced SeatleSNPs genes that were not characterized in the HapMap Project for (A) 23 CEU samples using HumanHap550 and (B) 24 YRI samples using HumanHap650Y. Power is calculated for a 5% false discovery rate after a Bonferroni correction for multiple testing (red line). The solid black line shows the power to detect a common risk allele in the HapMap data (i.e., solid red lines in Figure 2).
Figure 5
Figure 5. Power to Detect At Least One Risk Allele in CEU (HumanHap550) and YRI (HumanHap650Y) in 1,000 Cases and 1,000 Controls under a Multiplicative Model
Curves represent the cases where there is a single risk locus (red), two independent loci (blue), or four independent loci (green). The red lines are the single risk loci as shown in Figure 4. Under the multiple loci cases, the relative risk represents the minimum risk for all risk alleles and the corresponding power represents a lower bound on the power.

Similar articles

Cited by

References

    1. Botstein D, Risch N. Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease. Nat Genet. 2003;33(Suppl):228–237. - PubMed
    1. Collins FS, Guyer MS, Charkravarti A. Variations on a theme: cataloging human DNA sequence variation. Science. 1997;278:1580–1581. - PubMed
    1. Kruglyak L. Prospects for whole-genome LD mapping of common disease genes. Nat Genet. 1999;22:139–144. - PubMed
    1. de Bakker PI, Yelensky R, Pe'er I, Gabriel SB, Daly MJ, et al. Efficiency and power in genetic association studies. Nat Genet. 2005;37:1217–1223. - PubMed
    1. Hirschhorn JN, Daly MJ. Genome-wide association studies for common diseases and complex traits. Nat Rev Genet. 2005;6:95–108. - PubMed