Genotype imputation of Metabochip SNPs using a study-specific reference panel of ~4,000 haplotypes in African Americans from the Women's Health Initiative
- PMID: 22851474
- PMCID: PMC3410659
- DOI: 10.1002/gepi.21603
Genotype imputation of Metabochip SNPs using a study-specific reference panel of ~4,000 haplotypes in African Americans from the Women's Health Initiative
Abstract
Genetic imputation has become standard practice in modern genetic studies. However, several important issues have not been adequately addressed including the utility of study-specific reference, performance in admixed populations, and quality for less common (minor allele frequency [MAF] 0.005-0.05) and rare (MAF < 0.005) variants. These issues only recently became addressable with genome-wide association studies (GWAS) follow-up studies using dense genotyping or sequencing in large samples of non-European individuals. In this work, we constructed a study-specific reference panel of 3,924 haplotypes using African Americans in the Women's Health Initiative (WHI) genotyped on both the Metabochip and the Affymetrix 6.0 GWAS platform. We used this reference panel to impute into 6,459 WHI SNP Health Association Resource (SHARe) study subjects with only GWAS genotypes. Our analysis confirmed the imputation quality metric Rsq (estimated r(2) , specific to each SNP) as an effective post-imputation filter. We recommend different Rsq thresholds for different MAF categories such that the average (across SNPs) Rsq is above the desired dosage r(2) (squared Pearson correlation between imputed and experimental genotypes). With a desired dosage r(2) of 80%, 99.9% (97.5%, 83.6%, 52.0%, 20.5%) of SNPs with MAF > 0.05 (0.03-0.05, 0.01-0.03, 0.005-0.01, and 0.001-0.005) passed the post-imputation filter. The average dosage r(2) for these SNPs is 94.7%, 92.1%, 89.0%, 83.1%, and 79.7%, respectively. These results suggest that for African Americans imputation of Metabochip SNPs from GWAS data, including low frequency SNPs with MAF 0.005-0.05, is feasible and worthwhile for power increase in downstream association analysis provided a sizable reference panel is available.
© 2012 Wiley Periodicals, Inc.
Figures
References
-
- Anderson G, Cummings S, Freedman LS, Furberg C, Henderson M, Johnson SR, Kuller L, Manson J, Oberman A, Prentice RL, Rossouw JE. Design of the women’s health initiative clinical trial and observational study. Controlled Clinical Trials. 1998;19(1):61–109. - PubMed
-
- Buyske S, Wu Y, Carty CL, Cheng I, Assimes TL, Dumitrescu L, Hindorff LA, Mitchell S, Ambite JL, Boerwinkle E, Buzkova P, Carlson CS, Cochran B, Duggan D, Eaton CB, Fesinmeyer MD, Franceschini N, Haessler J, Jenny N, Hyun Min Kang, Lin Y, Le Marchand L, Matise TC, Robinson JG, Rodriguez C, Schumacher FR, Voight BF, Young A, Manolio TA, Mohlke KL, Haiman CA, Peters U, Crawford DC, North KE. Evaluation of theMetabochip genotyping array in African Americans and implications for fine mapping of GWAS-ldentified loci: the PAGE Study. 2011 In preparation. - PMC - PubMed
Publication types
MeSH terms
Grants and funding
- R01 CA082659/CA/NCI NIH HHS/United States
- R01 HG006292/HG/NHGRI NIH HHS/United States
- R01 HG006703/HG/NHGRI NIH HHS/United States
- U01 HG004802/HG/NHGRI NIH HHS/United States
- N01 WH022110/WH/WHI NIH HHS/United States
- U01HG004803/HG/NHGRI NIH HHS/United States
- U01HG004802/HG/NHGRI NIH HHS/United States
- U01 HG004803/HG/NHGRI NIH HHS/United States
- U01HG004798/HG/NHGRI NIH HHS/United States
- 3-R01-CA082659-11S1/CA/NCI NIH HHS/United States
- UC2 HL102924/HL/NHLBI NIH HHS/United States
- U01HG004790/HG/NHGRI NIH HHS/United States
- U01 HG004801/HG/NHGRI NIH HHS/United States
- U01 HG004790/HG/NHGRI NIH HHS/United States
- P30 CA015704/CA/NCI NIH HHS/United States
- RC2 HL102924/HL/NHLBI NIH HHS/United States
- R01HG006292/HG/NHGRI NIH HHS/United States
- 5 UC2 HL102924-02/HL/NHLBI NIH HHS/United States
- U01HG004801/HG/NHGRI NIH HHS/United States
- U01 HG004798/HG/NHGRI NIH HHS/United States
- U01HG004801-01/HG/NHGRI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Molecular Biology Databases
