Fast and robust association tests for untyped SNPs in case-control studies
- PMID: 20689309
- PMCID: PMC2952185
- DOI: 10.1159/000308456
Fast and robust association tests for untyped SNPs in case-control studies
Abstract
Genome-wide association studies (GWASs) aim to genotype enough single nucleotide polymorphisms (SNPs) to effectively capture common genetic variants across the genome. Even though the number of SNPs genotyped in such studies can exceed a million, there is still interest in testing association with SNPs that were not genotyped in the study sample. Analyses of such untyped SNPs can assist in signal localization, permit cross-platform integration of samples from separate studies, and can improve power - especially for rarer SNPs. External information on a larger collection of SNPs from an appropriate reference panel, comprising both SNPs typed in the sample and the untyped SNPs we wish to test for association, is necessary for an untyped variant analysis to proceed. Linkage disequilibrium patterns observed in the reference panel are then used to infer the likely genotype at the untyped SNPs in the study sample. We propose here a novel statistical approach for testing untyped SNPs in case-control GWAS, based on an efficient score function derived from a prospective likelihood, that automatically accounts for the variability in the process of estimating the untyped variant. Computationally efficient methods of phasing can be used without affecting the validity of the test, and simple measures of haplotype sharing can be used to infer genotypes at the untyped SNPs, making our approach computationally much faster than existing approaches for untyped analysis. At the same time, we show, using simulated data, that our approach often has performance nearly equivalent to hidden Markov methods of untyped analysis. The software package 'untyped' is available to implement our approach.
Copyright © 2010 S. Karger AG, Basel.
Figures

Similar articles
-
ATRIUM: testing untyped SNPs in case-control association studies with related individuals.Am J Hum Genet. 2009 Nov;85(5):667-78. doi: 10.1016/j.ajhg.2009.10.006. Am J Hum Genet. 2009. PMID: 19913122 Free PMC article.
-
Analysis of untyped SNPs: maximum likelihood and imputation methods.Genet Epidemiol. 2010 Dec;34(8):803-15. doi: 10.1002/gepi.20527. Genet Epidemiol. 2010. PMID: 21104886 Free PMC article.
-
Accuracy of genome-wide imputation of untyped markers and impacts on statistical power for association studies.BMC Genet. 2009 Jun 16;10:27. doi: 10.1186/1471-2156-10-27. BMC Genet. 2009. PMID: 19531258 Free PMC article.
-
Accurate Imputation of Untyped Variants from Deep Sequencing Data.Methods Mol Biol. 2021;2243:271-281. doi: 10.1007/978-1-0716-1103-6_13. Methods Mol Biol. 2021. PMID: 33606262 Review.
-
Molecular genetic studies of complex phenotypes.Transl Res. 2012 Feb;159(2):64-79. doi: 10.1016/j.trsl.2011.08.001. Epub 2011 Aug 31. Transl Res. 2012. PMID: 22243791 Free PMC article. Review.
Cited by
-
An efficient multi-locus mixed-model approach for genome-wide association studies in structured populations.Nat Genet. 2012 Jun 17;44(7):825-30. doi: 10.1038/ng.2314. Nat Genet. 2012. PMID: 22706313 Free PMC article.
-
Imputation without doing imputation: a new method for the detection of non-genotyped causal variants.Genet Epidemiol. 2014 Apr;38(3):173-90. doi: 10.1002/gepi.21792. Epub 2014 Feb 17. Genet Epidemiol. 2014. PMID: 24535679 Free PMC article.
-
Family-based association tests using genotype data with uncertainty.Biostatistics. 2012 Apr;13(2):228-40. doi: 10.1093/biostatistics/kxr045. Epub 2011 Dec 8. Biostatistics. 2012. PMID: 22156512 Free PMC article.
References
-
- McCarthy MI, Abecasis GR, Cardon LR, Goldstein DB, Little J, Ioannidis JPA, Hirschhorn J. Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nat Rev Genet. 2008;9:356–369. - PubMed
-
- Marchini J, Howie B, Myers S, McVean G, Donnelly P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat Genet. 2007;39:906–913. - PubMed
-
- Li Y, Willer CJ, Ding J, Scheet P, Abecasis GR: Markov model for rapid haplotyping and genotype imputation in genome wide studies. Submitted.
-
- Nicolae DL. Quantifying the amount of missing information in genetic association studies. Genet Epidemiol. 2006;30:703–717. - PubMed
-
- Nicolae DL. Testing untyped alleles (TUNA) – applications to genome-wide association studies. Genet Epidemiol. 2006;30:718–727. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources