Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Jun 2;4(6):e5639.
doi: 10.1371/journal.pone.0005639.

Failure to replicate a genetic association may provide important clues about genetic architecture

Affiliations

Failure to replicate a genetic association may provide important clues about genetic architecture

Casey S Greene et al. PLoS One. .

Abstract

Replication has become the gold standard for assessing statistical results from genome-wide association studies. Unfortunately this replication requirement may cause real genetic effects to be missed. A real result can fail to replicate for numerous reasons including inadequate sample size or variability in phenotype definitions across independent samples. In genome-wide association studies the allele frequencies of polymorphisms may differ due to sampling error or population differences. We hypothesize that some statistically significant independent genetic effects may fail to replicate in an independent dataset when allele frequencies differ and the functional polymorphism interacts with one or more other functional polymorphisms. To test this hypothesis, we designed a simulation study in which case-control status was determined by two interacting polymorphisms with heritabilities ranging from 0.025 to 0.4 with replication sample sizes ranging from 400 to 1600 individuals. We show that the power to replicate the statistically significant independent main effect of one polymorphism can drop dramatically with a change of allele frequency of less than 0.1 at a second interacting polymorphism. We also show that differences in allele frequency can result in a reversal of allelic effects where a protective allele becomes a risk factor in replication studies. These results suggest that failure to replicate an independent genetic effect may provide important clues about the complexity of the underlying genetic architecture. We recommend that polymorphisms that fail to replicate be checked for interactions with other polymorphisms, particularly when samples are collected from groups with distinct ethnic backgrounds or different geographic regions.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. A. This is an example of power results and marginal penetrance tables for an epistatic model with a heritability of 0.1.
Part A shows power results. As the minor allele frequency approaches the epistatic minor allele frequency, the power to detect the main effect in a replication sample is reduced. A change of 0.07 in minor allele frequency (formula image) is enough to drop the power to replicate formula image from 80% to 20% for this model. It is apparent in B that as the minor allele frequency, formula image, of formula image in the sampled population moves from 0.3 to 0.5 the marginal penetrances of the alleles for formula image (formula image) become equal and the main effect is lost. When the replication sample is performed at an allele frequency of 0.3 the power to detect a main effect is near 100%, at an allele frequency of 0.4 the power to detect a main effect is near 60%, and at an allele frequency of 0.5 the marginal penetrances are equivalent and no main effect remains.
Figure 2
Figure 2. This figure summarizes power for many models and heritabilities.
The effect described in Figure 1 is consistent across very large to very small heritability models (0.4 to 0.025). In most cases a change in allele frequency of less than 0.1 is enough to reduce the power to replicate a main effect from 80% to 20%. Results shown are for a sample including 800 cases and 800 controls. Results with datasets containing 400 and 800 individuals are similar and can be found in supplementary figures S1 and S2.
Figure 3
Figure 3. This figure shows marginal penetrances under a number of possible replication scenarios.
In part A the marginal penetrances with the allele frequencies found in the discovery phase of a hypothetical genome-wide association study. In part B The marginal penetrances found in the replication phase of a genome-wide association study under a situation where the result would replicate. In part C the marginal penetrances found in the replication phase of a genome-wide association study under a situation where the result would not replicate. In part D the marginal penetrances found in the replication phase of a genome-wide association study under a situation where the allele first discovered as a risk factor would now appear protective.
Figure 4
Figure 4. This flowchart represents a method by which candidate SNPs can be divided into tiers for later evaluation based on statistical results and biological information.
Tier 1 markers are likely to provide the easiest gene-function studies but provide the least new information. Tier 2 markers have the potential to implicate previously unknown genes in known pathways and are also likely to lead to feasible gene-function studies. Tier 3 markers have the greatest potential to implicate new genes and pathways in the disease process, but gene-function confirmation is likely to be the most difficult for these markers, particularly if clues about role are limited.

References

    1. Hirschhorn JN, Daly MJ. Genome-wide association studies for common diseases and complex traits. Nat Rev Genet. 2005;6:95–108. - PubMed
    1. Wang WYS, Barratt BJ, Clayton DG, Todd JA. Genome-wide association studies: theoretical and practical concerns. Nat Rev Genet. 2005;6:109–118. - PubMed
    1. Pe'er I, Yelensky R, Altshuler D, Daly MJ. Estimation of the multiple testing burden for genomewide association studies of nearly all common variants. Genetic Epidemiology. 2008;32:381–385. - PubMed
    1. Chanock SJ, Manolio T, Boehnke M, Boerwinkle E, Hunter DJ, et al. Replicating genotypephenotype associations. Nature. 2007;447:655–60. - PubMed
    1. Shriner D, Vaughan LK, Padilla MA, Tiwari HK. Problems with Genome-Wide association studies. Science. 2007;316:1840–1841. - PubMed

Publication types