Designing candidate gene and genome-wide case-control association studies

Krina T Zondervan¹, Lon R Cardon

PMID: 17947991
PMCID: PMC4180089
DOI: 10.1038/nprot.2007.366

Nat Protoc. 2007;2(10):2492-501.

Affiliation

¹ Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK. krina.zondervan@well.ox.ac.uk

Abstract

This protocol describes how to appropriately design a genetic association case-control study, either focusing on a candidate gene (CG) or region or implementing a genome-wide approach. The steps described involve: (i) defining the case phenotype in adequate detail; (ii) checking the heritability of the disease in question; (iii) considering whether a population-based study is the appropriate design for the research question; (iv) the appropriate selection of controls; (v) sample size calculations and (vi) giving due consideration to whether it is a de novo or replication study. General guidelines are given, as well as specific examples of a CG and a genome-wide association study into type 2 diabetes. Software and websites used in this protocol include the International HapMap Consortium website, Genetic Power Calculator, CaT, and SNPSpD. Running each of the programs takes only a few seconds; the rate-limiting steps involve thinking through the designs and parameters in the disease models.
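Step (v), the sample size calculation, can be illustrated with a rough sketch. The code below is not the algorithm used by Genetic Power Calculator or CaT; it is a textbook normal-approximation for comparing allele frequencies between cases and controls, under assumptions that are ours: a multiplicative model, Hardy-Weinberg equilibrium, a rare disease (so the control allele frequency is taken to equal the population frequency), and equal numbers of cases and controls. The function names and the `r2` inflation (dividing by r² to allow for indirect association through a tagSNP) are illustrative choices.

```python
from math import ceil, sqrt
from statistics import NormalDist


def case_allele_freq(p, grr):
    """Risk-allele frequency in cases under a multiplicative model.

    p   : population frequency of the risk allele (assumed = control frequency)
    grr : per-allele (heterozygote) genotype relative risk
    """
    return p * grr / (p * grr + (1.0 - p))


def cases_needed(p, grr, alpha, power=0.80, r2=1.0):
    """Approximate number of cases (= number of controls) for an allelic test.

    Uses the standard two-proportion normal approximation on allele counts,
    then divides by r2 to allow for imperfect LD between tagSNP and
    disease allele. This is a sketch, not a replacement for dedicated software.
    """
    if grr == 1.0:
        raise ValueError("grr must differ from 1; no effect means no finite sample size")
    p0 = p
    p1 = case_allele_freq(p, grr)
    p_bar = (p0 + p1) / 2.0
    z_alpha = NormalDist().inv_cdf(1.0 - alpha / 2.0)  # two-sided test
    z_beta = NormalDist().inv_cdf(power)
    numerator = (
        z_alpha * sqrt(2.0 * p_bar * (1.0 - p_bar))
        + z_beta * sqrt(p0 * (1.0 - p0) + p1 * (1.0 - p1))
    ) ** 2
    alleles_per_group = numerator / (p1 - p0) ** 2
    individuals = alleles_per_group / 2.0  # two alleles per person
    return ceil(individuals / r2)


# Hypothetical inputs chosen only to show how the pieces interact.
for alpha in (0.0028, 1e-7):
    n = cases_needed(p=0.2, grr=1.3, alpha=alpha, r2=0.97)
    print(f"alpha={alpha:g}: about {n} cases and {n} controls")
```

The point of the sketch is qualitative: a stricter per-SNP error rate, a smaller relative risk, or a lower r² each push the required sample size up. For an actual study design, the dedicated tools named in the abstract should be used.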

Figures

**Fig 1**
Required number of cases (= number of controls) to detect varying disease allele frequencies and genotype relative risks (GRRs) with 80% power in (a) a CG scenario with indirect association, assuming either 18 independent tagSNPs (solid lines; per-SNP type I error rate = 0.0028) or 11 independent tagSNPs (dashed lines; per-SNP type I error rate = 0.0046), and (b) a GWA scenario, assuming either 500,000 independent tagSNPs (solid lines; per-SNP type I error rate = 1×10⁻⁷) or 300,000 independent tagSNPs (dashed lines; per-SNP type I error rate = 1.67×10⁻⁷). A multiplicative model was assumed (GRR_AA = GRR_Aa²), and numbers were adjusted for a mean r² of 0.97 (Caucasians) between a common tagSNP and a common disease allele.
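The per-SNP type I error rates in the caption are close to what one gets by dividing a study-wide error rate across the number of independent tagSNPs. The sketch below assumes a study-wide rate of 0.05, which the caption does not state, and shows two common corrections (Bonferroni and Šidák); the caption does not say which one was used, and the function names are illustrative.

```python
from math import expm1, log1p


def bonferroni_alpha(n_tests, family_alpha=0.05):
    """Per-test threshold: study-wide error rate divided by the number of tests."""
    return family_alpha / n_tests


def sidak_alpha(n_tests, family_alpha=0.05):
    """Per-test threshold 1 - (1 - alpha)^(1/n), computed stably for large n."""
    return -expm1(log1p(-family_alpha) / n_tests)


# Numbers of independent tagSNPs taken from the figure caption.
scenarios = {
    "CG, 18 tagSNPs": 18,
    "CG, 11 tagSNPs": 11,
    "GWA, 500,000 tagSNPs": 500_000,
    "GWA, 300,000 tagSNPs": 300_000,
}

for label, n_tests in scenarios.items():
    print(
        f"{label}: Bonferroni {bonferroni_alpha(n_tests):.3g}, "
        f"Sidak {sidak_alpha(n_tests):.3g}"
    )
```

The two corrections give very similar thresholds at these scales, so the choice between them matters far less than the estimate of how many tests are effectively independent.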

Grants and funding

076113/WT_/Wellcome Trust/United Kingdom
