Genotype imputation performance of three reference panels using African ancestry individuals
- PMID: 29637265
- PMCID: PMC6209094
- DOI: 10.1007/s00439-018-1881-4
Abstract
Genotype imputation estimates unobserved genotypes from genome-wide markers to increase genome coverage and power for genome-wide association studies. Imputation has been successful for European ancestry populations, for which very large reference panels are available. Smaller subsets of African descent populations are available in 1000 Genomes (1000G), the Consortium on Asthma among African ancestry Populations in the Americas (CAAPA) and the Haplotype Reference Consortium (HRC). We compared the performance of these reference panels when imputing variation in 3747 African Americans (AA) from two cohorts (HCV and COPDGene) genotyped using Illumina Omni microarrays. The haplotypes of 2504 (1000G), 883 (CAAPA) and 32,470 (HRC) individuals were used as reference. We compared the number of variants, imputation quality, imputation accuracy and coverage between panels. In both cohorts, 1000G imputed 1.5-1.6× more variants than CAAPA and 1.2× more than HRC. Similar findings were observed for variants with imputation R² > 0.5 and for rare, low-frequency, and common variants. When the imputed variants of the three panels were merged, the total number was 62-63 M, with 20 M overlapping variants imputed by all three panels and 5-15 M variants imputed exclusively by one of them. For overlapping variants, imputation quality was highest for HRC, followed by 1000G, then CAAPA, and improved as the minor allele frequency increased. 1000G, HRC and CAAPA provided high performance and accuracy for imputation of African American individuals, increasing the number of variants available for subsequent analyses. These panels are complementary, and imputation in these populations would benefit from the development of an integrated African reference panel.
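The panel comparison described above (filtering on imputation R² > 0.5, then counting variants shared by all panels versus imputed by only one) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names `filter_by_r2` and `panel_overlap`, the variant key of chromosome, position, reference allele and alternate allele, and the toy R² values are all assumptions made for illustration.

```python
from typing import Dict, Set, Tuple

# Assumption: a variant is keyed by (chromosome, position, ref allele, alt allele).
Variant = Tuple[str, int, str, str]


def filter_by_r2(r2_by_variant: Dict[Variant, float],
                 threshold: float = 0.5) -> Set[Variant]:
    """Keep variants whose imputation R^2 is strictly above the threshold."""
    return {v for v, r2 in r2_by_variant.items() if r2 > threshold}


def panel_overlap(panels: Dict[str, Set[Variant]]) -> Dict[str, int]:
    """Count the merged total, variants shared by all panels, and panel-exclusive variants."""
    union = set().union(*panels.values())
    shared = set.intersection(*panels.values())
    summary = {"union": len(union), "shared_by_all": len(shared)}
    for name, variants in panels.items():
        # Everything imputed by any other panel.
        others = set().union(*(v for n, v in panels.items() if n != name))
        summary[f"only_{name}"] = len(variants - others)
    return summary


# Toy data (hypothetical values, not results from the study).
toy = {
    "1000G": {("1", 100, "A", "G"): 0.90,
              ("1", 200, "C", "T"): 0.40,
              ("1", 300, "G", "A"): 0.80},
    "CAAPA": {("1", 100, "A", "G"): 0.70,
              ("1", 400, "T", "C"): 0.60},
    "HRC": {("1", 100, "A", "G"): 0.95,
            ("1", 300, "G", "A"): 0.85},
}

passing = {name: filter_by_r2(r2) for name, r2 in toy.items()}
summary = panel_overlap(passing)
print(summary)
```

In this toy input, one variant passes the filter in all three panels and one passes only in CAAPA. At the scale reported in the abstract (tens of millions of variants), the same set logic would more likely be applied per chromosome or with a dedicated variant-file tool rather than with in-memory Python sets.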