Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Nov;81(5):927-38.
doi: 10.1086/521558. Epub 2007 Oct 3.

Haplotype-based association analysis via variance-components score test

Affiliations

Haplotype-based association analysis via variance-components score test

Jung-Ying Tzeng et al. Am J Hum Genet. 2007 Nov.

Abstract

Haplotypes provide a more informative format of polymorphisms for genetic association analysis than do individual single-nucleotide polymorphisms. However, the practical efficacy of haplotype-based association analysis is challenged by a trade-off between the benefits of modeling abundant variation and the cost of the extra degrees of freedom. To reduce the degrees of freedom, several strategies have been considered in the literature. They include (1) clustering evolutionarily close haplotypes, (2) modeling the level of haplotype sharing, and (3) smoothing haplotype effects by introducing a correlation structure for haplotype effects and studying the variance components (VC) for association. Although the first two strategies enjoy a fair extent of power gain, empirical evidence showed that VC methods may exhibit only similar or less power than the standard haplotype regression method, even in cases of many haplotypes. In this study, we report possible reasons that cause the underpowered phenomenon and show how the power of the VC strategy can be improved. We construct a score test based on the restricted maximum likelihood or the marginal likelihood function of the VC and identify its nontypical limiting distribution. Through simulation, we demonstrate the validity of the test and investigate the power performance of the VC approach and that of the standard haplotype regression approach. With suitable choices for the correlation structure, the proposed method can be directly applied to unphased genotypic data. Our method is applicable to a wide-ranging class of models and is computationally efficient and easy to implement. The broad coverage and the fast and easy implementation of this method make the VC strategy an effective tool for haplotype analysis, even in modern genomewide association studies.

PubMed Disclaimer

Figures

Figure  1.
Figure 1.
The 10 largest eigenvalues of matrix W-1/2P0SP0W-1/2. The eigenvalues are dominated by the first few and decrease rapidly to 0. The eigenvalues are averages across the 1,000 replications in the simulation scenario of high haplotype diversity and an allele frequency of 0.1 for binary traits.
Figure  2.
Figure 2.
QQ-plots of the test statistic Tτ. We defined the standardized Tτ as formula image and the scaled Tτ,i as Tτ,i/bi, where bi is the scale parameter of the Gamma approximation. The left panels show the standardized Tτ against the expected quantiles from Normal(0,1). The right panels show the scaled Tτ against the expected quantiles from the Gamma(formula image,1). The upper panels are for quantitative traits, and the lower panels are for binary traits. These QQ-plots suggest that, for both trait types, Tτ values deviate from the normal distribution, as predicted by the usual asymptotic theory, and the empirical distribution of Tτ values can be approximated by a Gamma (ai,bi) distribution appropriately.
Figure  3.
Figure 3.
P values from the ALS data analysis around the most promising SNP reported by Schymick et al. (i.e., SNP rs4363506, with location indicated by the arrows). The P values are presented on the scale of negative logarithm of base 10. A, P values of the three-SNP haplotype test by use of the VC-score method (solid line) and P values of a single-SNP genotypic test (dotted dashed line). B, P values of the three-SNP haplotype test by use of the VC-score method (solid line) and P values of the three-SNP haplotype test by use of the fixed-effect method (dashed line). C, P values of the four-SNP haplotype test by use of the VC-score method (solid line) and P values of the four-SNP haplotype test by use of the fixed-effect method (dashed line).

Similar articles

Cited by

References

Web Resources

    1. Authors’ Web site, http://www4.stat.ncsu.edu/∼tzeng/Softwares/Hap-VC/R/ (for R code for implementing the VC-score test)
    1. NINDS Human Genetics DNA and Cell Line Repository, http://ccr.coriell.org/ninds/ (for the ALS study by Schymick et al.)
    1. Online Mendelian Inheritance in Man (OMIM), http://www.ncbi.nlm.nih.gov/Omim/ (for DOCK1) - PubMed

References

    1. The International HapMap Consortium (2003) The International HapMap Project. Nature 426:789–79610.1038/nature02168 - DOI - PubMed
    1. Akey J, Jin L, Xiong M (2001) Haplotypes vs single marker linkage disequilibrium tests: what do we gain? Eur J Hum Genet 9:291–30010.1038/sj.ejhg.5200619 - DOI - PubMed
    1. Pe’er I, de Bakker PI, Maller J, Yelensky R, Altshuler D, Daly MJ (2006) Evaluating and improving power in whole-genome association studies using fixed marker sets. Nat Genet 38:663–66710.1038/ng1816 - DOI - PubMed
    1. Zaitlen N, Kang HM, Eskin E, Halperin E (2007) Leveraging the HapMap correlation structure in association studies. Am J Hum Genet 80:683–691 - PMC - PubMed
    1. Seltman H, Roeder K, Devlin B (2003) Evolutionary-based association analysis using haplotype data. Genet Epidemiol 25:48–5810.1002/gepi.10246 - DOI - PubMed

Publication types

LinkOut - more resources