Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Feb 28;39(5):617-638.
doi: 10.1002/sim.8434. Epub 2019 Dec 21.

Semiparametric Bayesian variable selection for gene-environment interactions

Affiliations

Semiparametric Bayesian variable selection for gene-environment interactions

Jie Ren et al. Stat Med. .

Abstract

Many complex diseases are known to be affected by the interactions between genetic variants and environmental exposures beyond the main genetic and environmental effects. Study of gene-environment (G×E) interactions is important for elucidating the disease etiology. Existing Bayesian methods for G×E interaction studies are challenged by the high-dimensional nature of the study and the complexity of environmental influences. Many studies have shown the advantages of penalization methods in detecting G×E interactions in "large p, small n" settings. However, Bayesian variable selection, which can provide fresh insight into G×E study, has not been widely examined. We propose a novel and powerful semiparametric Bayesian variable selection model that can investigate linear and nonlinear G×E interactions simultaneously. Furthermore, the proposed method can conduct structural identification by distinguishing nonlinear interactions from main-effects-only case within the Bayesian framework. Spike-and-slab priors are incorporated on both individual and group levels to identify the sparse main and interaction effects. The proposed method conducts Bayesian variable selection more efficiently than existing methods. Simulation shows that the proposed model outperforms competing alternatives in terms of both identification and prediction. The proposed Bayesian method leads to the identification of main and interaction effects with important implications in a high-throughput profiling study with high-dimensional SNP data.

Keywords: Bayesian variable selection; MCMC; gene-environment interactions; high-dimensional genomic data; semiparametric modeling.

PubMed Disclaimer

Conflict of interest statement

CONFLICT OF INTEREST

The authors declare no potential conflict of interests.

Figures

FIGURE 1
FIGURE 1
Nonlinear G×E effect of SNP rs1106380 from the Nurses’ Health Study (NHS) data. The blue dashed lines represent the 95% credible region

Similar articles

Cited by

References

    1. Hunter DJ. Gene–environment interactions in human diseases. Nat Rev Genet. 2005;6(4):23–36. 10.1038/nrg1578 - DOI - PubMed
    1. Hutter CM, Mechanic LE, Chatterjee N, Kraft P, Gillanders EM. Gene-environment interactions in cancer epidemiology: a national cancer institute think tank report. Genetic Epidemiology. 2013;37(7):643–657. 10.1002/gepi.21756 - DOI - PMC - PubMed
    1. Mukherjee B, Ahn J, Gruber SB, Chatterjee N. Testing gene-environment interaction in large-scale case-control association studies: possible choices and comparisons. Am J Epidemiol. 2011;175(3):177–190. 10.1093/aje/kwr367 - DOI - PMC - PubMed
    1. Centers for Disease Control and Prevention. National Diabetes Statistics Report. Atlanta, GA: Centers for Disease Control and Prevention: 2017.
    1. Ma S, Yang L, Romero R, Cui Y. Varying coefficient model for gene–environment interaction: a non-linear look. Bioinformatics. 2011;27(15):2119–2126. 10.1093/bioinformatics/btr318 - DOI - PMC - PubMed

Publication types

LinkOut - more resources