Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2003 Sep;25(2):149-57.
doi: 10.1002/gepi.10260.

Optimal two-stage genotyping in population-based association studies

Affiliations

Optimal two-stage genotyping in population-based association studies

Jaya M Satagopan et al. Genet Epidemiol. 2003 Sep.

Abstract

We propose a cost-effective two-stage approach to investigate gene-disease associations when testing a large number of candidate markers using a case-control design. Under this approach, all the markers are genotyped and tested at stage 1 using a subset of affected cases and unaffected controls, and the most promising markers are genotyped on the remaining individuals and tested using all the individuals at stage 2. The sample size at stage 1 is chosen such that the power to detect the true markers of association is 1-beta(1) at significance level alpha(1). The most promising markers are tested at significance level alpha(2) at stage 2. In contrast, a one-stage approach would evaluate and test all the markers on all the cases and controls to identify the markers significantly associated with the disease. The goal is to determine the two-stage parameters (alpha(1), beta(1), alpha(2)) that minimize the cost of the study such that the desired overall significance is alpha and the desired power is close to 1-beta, the power of the one-stage approach. We provide analytic formulae to estimate the two-stage parameters. The properties of the two-stage approach are evaluated under various parametric configurations and compared with those of the corresponding one-stage approach. The optimal two-stage procedure does not depend on the signal of the markers associated with the study. Further, when there is a large number of markers, the optimal procedure is not substantially influenced by the total number of markers associated with the disease. The results show that, compared to a one-stage approach, a two-stage procedure typically halves the cost of the study.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Power of two-stage approach for D=1, α=0.05, 1−β=0.80, and ϵ=0.01. Note that 1−β is power of one-stage approach. Power is shown for values of m ranging from 25–1,000. Horizontal axis shows cost fraction (T2nR)/(T1nR). Vertical axis shows maximum power attained by two-stage procedure for a given cost fraction. Two horizontal lines correspond to powers of 79% and 80%. Two vertical lines correspond to cost fractions of 50% and 55%.

Comment in

References

    1. Boehnke M 1994. Limits of resolution of genetic linkage studies: implications for the positional cloning of human genetic diseases. Am J Hum Genet 55:379–390. - PMC - PubMed
    1. Cardon LR, Bell JI. 2001. Association study designs for complex diseases. Nat Rev Genet 2:91–99. - PubMed
    1. Cardon LR, Palmer LJ. 2003. Population stratification and spurious allelic association. Lancet 361:598–604. - PubMed
    1. Elston RC. 1994. P values, power, and pitfalls in the linkage analysis of psychiatric disorders. In: Gershon ES, Clonings CR, editors. Genetic approaches to mental disorders. Proceedings of the Annual Meeting of the American Psychopathological Association. Washington, DC: American Psychiatric Press. p 3–21.
    1. Elston RC, Guo X, Williams LV. 1996. Two-stage global search designs for linkage analysis using pairs of affected relatives. Genet Epidemiol 13:535–558. - PubMed

Publication types

Substances

LinkOut - more resources