A flexible and accurate genotype imputation method for the next generation of genome-wide association studies
- PMID: 19543373
- PMCID: PMC2689936
- DOI: 10.1371/journal.pgen.1000529
A flexible and accurate genotype imputation method for the next generation of genome-wide association studies
Abstract
Genotype imputation methods are now being widely used in the analysis of genome-wide association studies. Most imputation analyses to date have used the HapMap as a reference dataset, but new reference panels (such as controls genotyped on multiple SNP chips and densely typed samples from the 1,000 Genomes Project) will soon allow a broader range of SNPs to be imputed with higher accuracy, thereby increasing power. We describe a genotype imputation method (IMPUTE version 2) that is designed to address the challenges presented by these new datasets. The main innovation of our approach is a flexible modelling framework that increases accuracy and combines information across multiple reference panels while remaining computationally feasible. We find that IMPUTE v2 attains higher accuracy than other methods when the HapMap provides the sole reference panel, but that the size of the panel constrains the improvements that can be made. We also find that imputation accuracy can be greatly enhanced by expanding the reference panel to contain thousands of chromosomes and that IMPUTE v2 outperforms other methods in this setting at both rare and common SNPs, with overall error rates that are 15%-20% lower than those of the closest competing method. One particularly challenging aspect of next-generation association studies is to integrate information across multiple reference panels genotyped on different sets of SNPs; we show that our approach to this problem has practical advantages over other suggested solutions.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures





Similar articles
-
Comprehensive evaluation of imputation performance in African Americans.J Hum Genet. 2012 Jul;57(7):411-21. doi: 10.1038/jhg.2012.43. Epub 2012 May 31. J Hum Genet. 2012. PMID: 22648186 Free PMC article.
-
Accuracy of genome-wide imputation of untyped markers and impacts on statistical power for association studies.BMC Genet. 2009 Jun 16;10:27. doi: 10.1186/1471-2156-10-27. BMC Genet. 2009. PMID: 19531258 Free PMC article.
-
Effect of genome-wide genotyping and reference panels on rare variants imputation.J Genet Genomics. 2012 Oct 20;39(10):545-50. doi: 10.1016/j.jgg.2012.07.002. Epub 2012 Jul 24. J Genet Genomics. 2012. PMID: 23089364
-
Genotype Imputation from Large Reference Panels.Annu Rev Genomics Hum Genet. 2018 Aug 31;19:73-96. doi: 10.1146/annurev-genom-083117-021602. Epub 2018 May 23. Annu Rev Genomics Hum Genet. 2018. PMID: 29799802 Review.
-
Accurate Imputation of Untyped Variants from Deep Sequencing Data.Methods Mol Biol. 2021;2243:271-281. doi: 10.1007/978-1-0716-1103-6_13. Methods Mol Biol. 2021. PMID: 33606262 Review.
Cited by
-
Mendel-GPU: haplotyping and genotype imputation on graphics processing units.Bioinformatics. 2012 Nov 15;28(22):2979-80. doi: 10.1093/bioinformatics/bts536. Epub 2012 Sep 5. Bioinformatics. 2012. PMID: 22954633 Free PMC article.
-
Associations of ATR and CHEK1 single nucleotide polymorphisms with breast cancer.PLoS One. 2013 Jul 3;8(7):e68578. doi: 10.1371/journal.pone.0068578. Print 2013. PLoS One. 2013. PMID: 23844225 Free PMC article.
-
How imputation can mitigate SNP ascertainment Bias.BMC Genomics. 2021 May 12;22(1):340. doi: 10.1186/s12864-021-07663-6. BMC Genomics. 2021. PMID: 33980139 Free PMC article.
-
Common genetic variation in ETV6 is associated with colorectal cancer susceptibility.Nat Commun. 2016 May 5;7:11478. doi: 10.1038/ncomms11478. Nat Commun. 2016. PMID: 27145994 Free PMC article.
-
Using object oriented bayesian networks to model linkage, linkage disequilibrium and mutations between STR markers.PLoS One. 2012;7(9):e43873. doi: 10.1371/journal.pone.0043873. Epub 2012 Sep 11. PLoS One. 2012. PMID: 22984448 Free PMC article.
References
-
- Gudmundsson J, Sulem P, Manolescu A, Amundadottir LT, Gudbjartsson D, et al. Genome-wide association study identifies a second prostate cancer susceptibility variant at 8q24. Nat Genet. 2007;39:631–637. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Molecular Biology Databases