Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007;2(10):2492-501.
doi: 10.1038/nprot.2007.366.

Designing candidate gene and genome-wide case-control association studies

Affiliations

Designing candidate gene and genome-wide case-control association studies

Krina T Zondervan et al. Nat Protoc. 2007.

Abstract

This protocol describes how to appropriately design a genetic association case-control study, either focusing on a candidate gene (CG) or region or implementing a genome-wide approach. The steps described involve: (i) defining the case phenotype in adequate detail; (ii) checking the heritability of the disease in question; (iii) considering whether a population-based study is the appropriate design for the research question; (iv) the appropriate selection of controls; (v) sample size calculations and (vi) giving due consideration to whether it is a de novo or replication study. General guidelines are given, as well as specific examples of a CG and a genome-wide association study into type 2 diabetes. Software and websites used in this protocol include the International HapMap Consortium website, Genetic Power Calculator, CaT, and SNPSpD. Running each of the programs takes only a few seconds; the rate-limiting steps involve thinking through the designs and parameters in the disease models.

PubMed Disclaimer

Figures

Fig 1
Fig 1
Required number of cases (=number of controls) to detect varying disease allele frequencies and GRRs with 80% power in a) a CG scenario with indirect association assuming either 18 independent tagSNPs (solid lines; per-SNP type I error rate = 0.0028) or 11 independent tagSNPs (dashed lines; per-SNP type I error rate = 0.0046) and b) a GWA scenario assuming either 500,000 independent tagSNPs (solid lines; per-SNP type I error rate= 1×10−7) or 300,000 independent tag SNPs (dashed lines; per-SNP type I error rate= 1.67×10−7). A multiplicative model was assumed (GRRAA = (GRRAa2)) and numbers were adjusted for a mean r2 of 0.97 (Caucasians) between a common tagSNP and common disease allele.

Similar articles

Cited by

References

    1. Gilliam TC, et al. Localization of the Huntington’s disease gene to a small segment of chromosome 4 flanked by D4S10 and the telomere. Cell. 1987;50:565–71. - PubMed
    1. Kerem B, et al. Identification of the cystic fibrosis gene: genetic analysis. Science. 1989;245:1073–80. - PubMed
    1. The International HapMap Consortium A haplotype map of the human genome. Nature. 2005;437:1299–320. - PMC - PubMed
    1. Palmer LJ, Cardon LR. Shaking the tree: mapping complex disease genes with linkage disequilibrium. Lancet. 2005;366:1223–34. - PubMed
    1. Zondervan KT, Cardon LR. The complex interplay among factors that influence allelic association. Nat Rev Genet. 2004;5:89–100. - PubMed

Publication types