Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Jun;21(6):680-6.
doi: 10.1038/ejhg.2012.220. Epub 2012 Oct 24.

Statistical tests for detecting associations with groups of genetic variants: generalization, evaluation, and implementation

Affiliations

Statistical tests for detecting associations with groups of genetic variants: generalization, evaluation, and implementation

John Ferguson et al. Eur J Hum Genet. 2013 Jun.

Abstract

With recent advances in sequencing, genotyping arrays, and imputation, GWAS now aim to identify associations with rare and uncommon genetic variants. Here, we describe and evaluate a class of statistics, generalized score statistics (GSS), that can test for an association between a group of genetic variants and a phenotype. GSS are a simple weighted sum of single-variant statistics and their cross-products. We show that the majority of statistics currently used to detect associations with rare variants are equivalent to choosing a specific set of weights within this framework. We then evaluate the power of various weighting schemes as a function of variant characteristics, such as MAF, the proportion associated with the phenotype, and the direction of effect. Ultimately, we find that two classical tests are robust and powerful, but details are provided as to when other GSS may perform favorably. The software package CRaVe is available at our website (http://dceg.cancer.gov/bb/tools/crave).

PubMed Disclaimer

Figures

Figure 1
Figure 1
The power (y-axis) to detect an association between a gene with 40-independent SNPs and a disease is illustrated for four different test statistics (ROV, HOI, MDF, SUM), as a function of the number (x-axis) of those SNPs that increase disease risk.
Figure 2
Figure 2
The power (y-axis) to detect an association between a gene with 40-independent SNPs and a disease is illustrated for four different test statistics (MDF+, MDF, STZ+, STZ), as a function of the proportion (x-axis) of the 10-associated SNPs that reduce disease risk. The remaining associated SNPs increase risk.
Figure 3
Figure 3
The lines illustrate the number of SNVs that need to be influential in order for the SUM test to have higher power than the MDF test, as a function of the total number of independent SNVs in the tested region. Results are based on simulations where SNVs have MAF equally spaced between 0.005 and 0.02, and the influential SNVs are randomly distributed, have equal effect size, and increase risk. The effect size (β in equation (15)) was chosen so the MDF test had a specified power: 0.2 (brown), 0.5 (red), or 0.8 (orange).

Similar articles

References

    1. Ahituv N, Kavaslar N, Schackwitz W, et al. Medical sequencing at the extremes of human body mass. Am J Hum Genet. 2007;80:779–791. - PMC - PubMed
    1. Cohen JC, Boerwinkle E, Mosley TH, Hobbs HH. Sequence variations in pcsk9, low ldl, and protection against coronary heart disease. N Engl J Med. 2006;354:1264–1272. - PubMed
    1. Cohen JC, Kiss RS, Pertsemlidis A, Marcel YL, McPherson R, Hobbs HH. Multiple rare alleles contribute to low plasma levels of hdl cholesterol. Science. 2004;5685:869–872. - PubMed
    1. Nejentsev S, Walker N, Riches D, Egholm M, Todd JA. Rare variants of ifih1, a gene implicated in antiviral responses, protect against type 1 diabetes. Science. 2009;5925:387–389. - PMC - PubMed
    1. Romeo S, Pennacchio LA, Fu Y, et al. Population based resequencing of angptl4 uncovers variations that reduce triglycerides and increase hdl. Nat Genet. 2007;4:513–516. - PMC - PubMed

Publication types

MeSH terms

Substances