Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2005 Oct;171(2):813-23.
doi: 10.1534/genetics.105.044206. Epub 2005 Jul 14.

Ranks of genuine associations in whole-genome scans

Affiliations
Comparative Study

Ranks of genuine associations in whole-genome scans

Dmitri V Zaykin et al. Genetics. 2005 Oct.

Abstract

With the recent advances in high-throughput genotyping techniques, it is now possible to perform whole-genome association studies to fine map causal polymorphisms underlying important traits that influence susceptibility to human diseases and efficacy of drugs. Once a genome scan is completed the results can be sorted by the association statistic value. What is the probability that true positives will be encountered among the first most associated markers? When a particular polymorphism is found associated with the trait, there is a chance that it represents either a "true" or a "false" association (TA vs. FA). Setting appropriate significance thresholds has been considered to provide assurance of sufficient odds that the associations found to be significant are genuine. However, the problem with genome scans involving thousands of markers is that the statistic values of FAs can reach quite extreme magnitudes. In such situations, the distributions corresponding to TAs and the most extreme FAs become comparable and significance thresholds tend to penalize TAs and FAs in a similar fashion. When sorting between true and false associations, the "typical" place (i.e., rank) of TAs among the most significant outcomes becomes important, ordered by the association statistic value. The distribution of ranks that we study here allows calculation of several useful quantities. In particular, it gives the number of most significant markers needed for a follow-up study to guarantee that a true association is included with certain probability. This can be calculated conditionally on having applied a multiple-testing correction. Effects of multilocus (e.g., haplotype association) tests and impact of linkage disequilibrium on the distribution of ranks associated with TAs are evaluated and can be taken into account.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Diffusion-generated P-value correlation decay. The distance between two neighboring markers (1 unit on the x-axis) is 15 kb.
Figure 2.
Figure 2.
P-value correlation decay within (left) and between (right) LD blocks. The distance between two neighboring markers (1 unit on the x-axis) is 15 kb.
Figure 3.
Figure 3.
Expected power associated with individual effects (bottom line) given overall power for three effects (m = 3) (top line).
Figure 4.
Figure 4.
Plot of degrees of freedom vs. significance level (α) for 80% power tests at the 70% quantile.
Figure 5.
Figure 5.
Plot of degrees of freedom vs. significance level (α) for 80% power tests at the 5% quantile.

References

    1. Benjamini, Y., and Y. Hochberg,, 1995. Controlling the false discovery rate—a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B 57: 289–300.
    1. Goldstein, D. B., K. R. Ahmadi, M. E. Weale and N. W. Wood, 2003. Genome scans and candidate gene approaches in the study of common diseases and variable drug responses. Trends Genet. 19: 615–622. - PubMed
    1. Hamilton, D. C., and D. E. Cole, 2004. Standardizing a composite measure of linkage disequilibrium. Ann. Hum. Genet. 68: 234–239. - PubMed
    1. Ioannidis, J. P., E. E. Ntzani, T. A. Trikalinos and D. G. Contopoulos-Ioannidis, 2001. Replication validity of genetic association studies. Nat. Genet. 29: 306–309. - PubMed
    1. Kammerer, S., R. B. Roth, R. Reneland, G. Marnellos, C. R. Hoyal et al., 2004. Large-scale association study identifies ICAM gene region as breast and prostate cancer susceptibility locus. Cancer Res. 64: 8906–8910. - PubMed

Publication types

Substances