Powerful multi-marker association tests: unifying genomic distance-based regression and logistic regression
- PMID: 20976795
- PMCID: PMC3345567
- DOI: 10.1002/gepi.20529
Powerful multi-marker association tests: unifying genomic distance-based regression and logistic regression
Abstract
To detect genetic association with common and complex diseases, many statistical tests have been proposed for candidate gene or genome-wide association studies with the case-control design. Due to linkage disequilibrium (LD), multi-marker association tests can gain power over single-marker tests with a Bonferroni multiple testing adjustment. Among many existing multi-marker association tests, most target to detect only one of many possible aspects in distributional differences between the genotypes of cases and controls, such as allele frequency differences, while a few new ones aim to target two or three aspects, all of which can be implemented in logistic regression. In contrast to logistic regression, a genomic distance-based regression (GDBR) approach aims to detect some high-order genotypic differences between cases and controls. A recent study has confirmed the high power of GDBR tests. At this moment, the popular logistic regression and the emerging GDBR approaches are completely unrelated; for example, one has to choose between the two. In this article, we reformulate GDBR as logistic regression, opening a venue to constructing other powerful tests while overcoming some limitations of GDBR. For example, asymptotic distributions can replace time-consuming permutations for deriving P-values and covariates, including gene-gene interactions, can be easily incorporated. Importantly, this reformulation facilitates combining GDBR with other existing methods in a unified framework of logistic regression. In particular, we show that Fisher's P-value combining method can boost statistical power by incorporating information from allele frequencies, Hardy-Weinberg disequilibrium, LD patterns, and other higher-order interactions among multi-markers as captured by GDBR.
© 2010 Wiley-Liss, Inc.
Figures
Similar articles
-
Relationship between genomic distance-based regression and kernel machine regression for multi-marker association testing.Genet Epidemiol. 2011 May;35(4):211-6. doi: 10.1002/gepi.20567. Genet Epidemiol. 2011. PMID: 21308765 Free PMC article.
-
Single-marker and two-marker association tests for unphased case-control genotype data, with a power comparison.Genet Epidemiol. 2010 Jan;34(1):67-77. doi: 10.1002/gepi.20436. Genet Epidemiol. 2010. PMID: 19557751 Free PMC article.
-
A unified framework for detecting genetic association with multiple SNPs in a candidate gene or region: contrasting genotype scores and LD patterns between cases and controls.Hum Hered. 2010;69(1):1-13. doi: 10.1159/000243149. Epub 2009 Oct 2. Hum Hered. 2010. PMID: 19797904 Free PMC article.
-
OPATs: Omnibus P-value association tests.Brief Bioinform. 2019 Jan 18;20(1):1-14. doi: 10.1093/bib/bbx068. Brief Bioinform. 2019. PMID: 28981573 Free PMC article. Review.
-
On selecting markers for association studies: patterns of linkage disequilibrium between two and three diallelic loci.Genet Epidemiol. 2003 Jan;24(1):57-67. doi: 10.1002/gepi.10217. Genet Epidemiol. 2003. PMID: 12508256 Review.
Cited by
-
GEE-based SNP set association test for continuous and discrete traits in family-based association studies.Genet Epidemiol. 2013 Dec;37(8):778-86. doi: 10.1002/gepi.21763. Epub 2013 Oct 25. Genet Epidemiol. 2013. PMID: 24166731 Free PMC article.
-
Relationship between genomic distance-based regression and kernel machine regression for multi-marker association testing.Genet Epidemiol. 2011 May;35(4):211-6. doi: 10.1002/gepi.20567. Genet Epidemiol. 2011. PMID: 21308765 Free PMC article.
-
Using the gene ontology to scan multilevel gene sets for associations in genome wide association studies.Genet Epidemiol. 2012 Jan;36(1):3-16. doi: 10.1002/gepi.20632. Epub 2011 Dec 7. Genet Epidemiol. 2012. PMID: 22161999 Free PMC article. Clinical Trial.
-
Adaptive tests for detecting gene-gene and gene-environment interactions.Hum Hered. 2011;72(2):98-109. doi: 10.1159/000330632. Epub 2011 Sep 16. Hum Hered. 2011. PMID: 21934325 Free PMC article.
-
A multi-SNP association test for complex diseases incorporating an optimal P-value threshold algorithm in nuclear families.BMC Genomics. 2015 May 15;16(1):381. doi: 10.1186/s12864-015-1620-3. BMC Genomics. 2015. PMID: 25975968 Free PMC article.
References
-
- Chapman JM, Cooper JD, Todd JA, Clayton DG. Detecting disease associations due to linkage disequilibrium using haplotype tags: a class of tests and the determinants of statistical power. Hum Hered. 2003;56:18–31. - PubMed
-
- Chen J, Chatterjee N. Exploiting Hardy-Weinberg equilibrium for efficient screening of single SNP associations from case-control studies. Human Heredity. 2007;63:196–204. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials