Using principal components of genetic variation for robust and powerful detection of gene-gene interactions in case-control and case-only studies
- PMID: 20206333
- PMCID: PMC2833365
- DOI: 10.1016/j.ajhg.2010.01.026
Using principal components of genetic variation for robust and powerful detection of gene-gene interactions in case-control and case-only studies
Abstract
Many popular methods for exploring gene-gene interactions, including the case-only approach, rely on the key assumption that physically distant loci are in linkage equilibrium in the underlying population. These methods utilize the presence of correlation between unlinked loci in a disease-enriched sample as evidence of interactions among the loci in the etiology of the disease. We use data from the CGEMS case-control genome-wide association study of breast cancer to demonstrate empirically that the case-only and related methods have the potential to create large-scale false positives because of the presence of population stratification (PS) that creates long-range linkage disequilibrium in the genome. We show that the bias can be removed by considering parametric and nonparametric methods that assume gene-gene independence between unlinked loci, not in the entire population, but only conditional on population substructure that can be uncovered based on the principal components of a suitably large panel of PS markers. Applications in the CGEMS study as well as simulated data show that the proposed methods are robust to the presence of population stratification and are yet much more powerful, relative to standard logistic regression methods that are also commonly used as robust alternatives to the case-only type methods.
Copyright 2010 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Figures
 
              
              
              
              
                
                
                 
              
              
              
              
                
                
                 
              
              
              
              
                
                
                 
              
              
              
              
                
                
                 
              
              
              
              
                
                
                References
- 
    - Piegorsch W.W., Weinberg C.R., Taylor J.A. Non-hierarchical logistic models and case-only designs for assessing susceptibility in population-based case-control studies. Stat. Med. 1994;13:153–162. - PubMed
 
- 
    - Khoury M.J., Flanders W.D. Nontraditional epidemiologic approaches in the analysis of gene-environment interaction: Case-control studies with no controls! Am. J. Epidemiol. 1996;144:207–213. - PubMed
 
- 
    - Zhang Y., Liu J.S. Bayesian inference of epistatic interactions in case-control studies. Nat. Genet. 2007;39:1167–1173. - PubMed
 
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
- Full Text Sources
 
        