Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011;71(3):148-60.
doi: 10.1159/000324841. Epub 2011 Jul 20.

A Bayesian hierarchical model for detecting haplotype-haplotype and haplotype-environment interactions in genetic association studies

Affiliations

A Bayesian hierarchical model for detecting haplotype-haplotype and haplotype-environment interactions in genetic association studies

Jun Li et al. Hum Hered. 2011.

Abstract

Objective: Genetic association studies based on haplotypes are powerful in the discovery and characterization of the genetic basis of complex human diseases. However, statistical methods for detecting haplotype-haplotype and haplotype-environment interactions have not yet been fully developed owing to the difficulties encountered: large numbers of potential haplotypes and unknown haplotype pairs. Furthermore, methods for detecting the association between rare haplotypes and disease have not kept pace with their counterpart of common haplotypes.

Methods/results: We herein propose an efficient and robust method to tackle these problems based on a Bayesian hierarchical generalized linear model. Our model simultaneously fits environmental effects, main effects of numerous common and rare haplotypes, and haplotype-haplotype and haplotype-environment interactions. The key to the approach is the use of a continuous prior distribution on coefficients that favors sparseness in the fitted model and facilitates computation. We develop a fast expectation-maximization algorithm to fit models by estimating posterior modes of coefficients. We incorporate our algorithm into the iteratively weighted least squares for classical generalized linear models as implemented in the R package glm. We evaluate the proposed method and compare its performance to existing methods on extensive simulated data.

Conclusion: The results show that the proposed method performs well under all situations and is more powerful than existing approaches.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Main effect model: estimated 68 and 95% coverages of the ‘true’ values (indicated by bold and thin horizontal lines in the left column, respectively) and empirical powers or type I error rates [empirical powers or type I error rates for α = 0.001 (×), α = 0.01 (o), and α = 0.05 (+)] for each of 4 haplotypes based on the four methods with sample sizes of 250 (top), 500 (middle), and 1,000 (bottom). B = BayesGLM; C = GLM; R = rGLM; S = ScoreGLM.
Fig. 2
Fig. 2
Main and interacting effect model: estimated 68 and 95% coverages of the ‘true’ values (indicated by bold and thin horizontal lines in the left column, respectively) and empirical powers or type I error rates [empirical powers or type I error rates for α = 0.001 (×), α = 0.01 (o), and α = 0.05 (+)] for each of 8 predictors based on the four methods with sample sizes of 250 (top), 500 (middle), and 1,000 (bottom). B = BayesGLM; C = GLM; R = rGLM; S = ScoreGLM.
Fig. 3
Fig. 3
Full model: estimated 68 and 95% coverages of the ‘true’ values (indicated by bold and thin horizontal lines in the 1st, 3rd, and 5th columns, respectively) and empirical powers or type I error rates [empirical powers or type I error rates for α = 0.001 (×), α = 0.01 (o), and α = 0.05 (+)] for each of 81 predictors based on BayesGLM with sample sizes of 250 (first 2 columns), 500 (3rd and 4th columns), and 1,000 (last 2 columns). The black labels on the vertical axis stand for the disease-associated predictors, while the gray labels stand for the non-disease-associated predictors.

References

    1. Botstein D, Risch N. Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease. Nat Genet. 2003;33(suppl):228–237. - PubMed
    1. Risch N, Merikangas K. The future of genetic studies of complex human diseases. Science. 1996;273:1616–1617. - PubMed
    1. Chapman NH, Wijsman EM. Genome screens using linkage disequilibrium tests: optimal marker characteristics and feasibility. Am J Hum Genet. 1998;63:1872–1885. - PMC - PubMed
    1. Clark AG. The role of haplotypes in candidate gene studies. Genet Epidemiol. 2004;27:321–333. - PubMed
    1. Davidson S. Research suggests importance of haplotypes over SNPs. Nat Biotechnol. 2000;18:1134–1135. - PubMed

Publication types