Power in the phenotypic extremes: a simulation study of power in discovery and replication of rare variants

Lin T Guey¹, Jasmina Kravic, Olle Melander, Noël P Burtt, Jason M Laramie, Valeriya Lyssenko, Anna Jonsson, Eero Lindholm, Tiinamaija Tuomi, Bo Isomaa, Peter Nilsson, Peter Almgren, Sekar Kathiresan, Leif Groop, Albert B Seymour, David Altshuler, Benjamin F Voight

Affiliations

PMID: 21308769
DOI: 10.1002/gepi.20572

Power in the phenotypic extremes: a simulation study of power in discovery and replication of rare variants

Lin T Guey et al. Genet Epidemiol. 2011 May.

. 2011 May;35(4):236-46.

doi: 10.1002/gepi.20572.

Authors

Affiliation

¹ Applied Quantitative Genotherapeutics, Pfizer Biotherapeutics, Cambridge, MA 02144, USA.

PMID: 21308769
DOI: 10.1002/gepi.20572

Abstract

Next-generation sequencing technologies are making it possible to study the role of rare variants in human disease. Many studies balance statistical power with cost-effectiveness by (a) sampling from phenotypic extremes and (b) utilizing a two-stage design. Two-stage designs include a broad-based discovery phase and selection of a subset of potential causal genes/variants to be further examined in independent samples. We evaluate three parameters: first, the gain in statistical power due to extreme sampling to discover causal variants; second, the informativeness of initial (Phase I) association statistics to select genes/variants for follow-up; third, the impact of extreme and random sampling in (Phase 2) replication. We present a quantitative method to select individuals from the phenotypic extremes of a binary trait, and simulate disease association studies under a variety of sample sizes and sampling schemes. First, we find that while studies sampling from extremes have excellent power to discover rare variants, they have limited power to associate them to phenotype—suggesting high false-negative rates for upcoming studies. Second, consistent with previous studies, we find that the effect sizes estimated in these studies are expected to be systematically larger compared with the overall population effect size; in a well-cited lipids study, we estimate the reported effect to be twofold larger. Third, replication studies require large samples from the general population to have sufficient power; extreme sampling could reduce the required sample size as much as fourfold. Our observations offer practical guidance for the design and interpretation of studies that utilize extreme sampling.

PubMed Disclaimer

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Wiley

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Power in the phenotypic extremes: a simulation study of power in discovery and replication of rare variants

Affiliation

Power in the phenotypic extremes: a simulation study of power in discovery and replication of rare variants

Authors

Affiliation

Abstract

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources