Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Dec 23;15(12):e1008500.
doi: 10.1371/journal.pgen.1008500. eCollection 2019 Dec.

Use of >100,000 NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium whole genome sequences improves imputation quality and detection of rare variant associations in admixed African and Hispanic/Latino populations

Madeline H Kowalski  1 Huijun Qian  2 Ziyi Hou  3 Jonathan D Rosen  1 Amanda L Tapia  1 Yue Shan  1 Deepti Jain  4 Maria Argos  5 Donna K Arnett  6 Christy Avery  7 Kathleen C Barnes  8 Lewis C Becker  9 Stephanie A Bien  10 Joshua C Bis  11 John Blangero  12 Eric Boerwinkle  13   14 Donald W Bowden  15 Steve Buyske  16 Jianwen Cai  17 Michael H Cho  18   19 Seung Hoan Choi  20 Hélène Choquet  21 L Adrienne Cupples  22   23 Mary Cushman  24 Michelle Daya  8 Paul S de Vries  14 Patrick T Ellinor  20   25 Nauder Faraday  9 Myriam Fornage  26 Stacey Gabriel  27 Santhi K Ganesh  28   29 Misa Graff  7 Namrata Gupta  27 Jiang He  30 Susan R Heckbert  31   32 Bertha Hidalgo  33 Chani J Hodonsky  7 Marguerite R Irvin  33 Andrew D Johnson  23   34 Eric Jorgenson  21 Robert Kaplan  35 Sharon L R Kardia  36 Tanika N Kelly  30 Charles Kooperberg  10 Jessica A Lasky-Su  18   19 Ruth J F Loos  37   38 Steven A Lubitz  20   25 Rasika A Mathias  9 Caitlin P McHugh  4 Courtney Montgomery  39 Jee-Young Moon  35 Alanna C Morrison  14 Nicholette D Palmer  15 Nathan Pankratz  40 George J Papanicolaou  41 Juan M Peralta  12 Patricia A Peyser  36 Stephen S Rich  42 Jerome I Rotter  43 Edwin K Silverman  18   19 Jennifer A Smith  36 Nicholas L Smith  31   32   44 Kent D Taylor  43 Timothy A Thornton  4 Hemant K Tiwari  45 Russell P Tracy  46 Tao Wang  47 Scott T Weiss  18   19 Lu-Chen Weng  20 Kerri L Wiggins  11 James G Wilson  48 Lisa R Yanek  9 Sebastian Zöllner  49   50 Kari E North  7   51 Paul L Auer  52 NHLBI Trans-Omics for Precision Medicine (TOPMed) ConsortiumTOPMed Hematology & Hemostasis Working GroupLaura M Raffield  53 Alexander P Reiner  31 Yun Li  1   53   54
Affiliations

Use of >100,000 NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium whole genome sequences improves imputation quality and detection of rare variant associations in admixed African and Hispanic/Latino populations

Madeline H Kowalski et al. PLoS Genet. .

Abstract

Most genome-wide association and fine-mapping studies to date have been conducted in individuals of European descent, and genetic studies of populations of Hispanic/Latino and African ancestry are limited. In addition, these populations have more complex linkage disequilibrium structure. In order to better define the genetic architecture of these understudied populations, we leveraged >100,000 phased sequences available from deep-coverage whole genome sequencing through the multi-ethnic NHLBI Trans-Omics for Precision Medicine (TOPMed) program to impute genotypes into admixed African and Hispanic/Latino samples with genome-wide genotyping array data. We demonstrated that using TOPMed sequencing data as the imputation reference panel improves genotype imputation quality in these populations, which subsequently enhanced gene-mapping power for complex traits. For rare variants with minor allele frequency (MAF) < 0.5%, we observed a 2.3- to 6.1-fold increase in the number of well-imputed variants, with 11-34% improvement in average imputation quality, compared to the state-of-the-art 1000 Genomes Project Phase 3 and Haplotype Reference Consortium reference panels. Impressively, even for extremely rare variants with minor allele count <10 (including singletons) in the imputation target samples, average information content rescued was >86%. Subsequent association analyses of TOPMed reference panel-imputed genotype data with hematological traits (hemoglobin (HGB), hematocrit (HCT), and white blood cell count (WBC)) in ~21,600 African-ancestry and ~21,700 Hispanic/Latino individuals identified associations with two rare variants in the HBB gene (rs33930165 with higher WBC [p = 8.8x10-15] in African populations, rs11549407 with lower HGB [p = 1.5x10-12] and HCT [p = 8.8x10-10] in Hispanics/Latinos). By comparison, neither variant would have been genome-wide significant if either 1000 Genomes Project Phase 3 or Haplotype Reference Consortium reference panels had been used for imputation. Our findings highlight the utility of the TOPMed imputation reference panel for identification of novel rare variant associations not previously detected in similarly sized genome-wide studies of under-represented African and Hispanic/Latino populations.

PubMed Disclaimer

Conflict of interest statement

Edwin K Silverman and Michael H Cho have received grant support from GSK, MHC has received consulting fees from Genentech. Scott T. Weiss and Kathleen C. Barnes received royalties from UpToDate. Patrick T. Ellinor is supported by a grant from Bayer AG to the Broad Institute focused on the genetics and therapeutics of cardiovascular diseases, and has also served on advisory boards or consulted for Bayer AG, Quest Diagnostics and Novartis. Steven A Lubitz receives sponsored research support from Bristol Myers Squibb / Pfizer, Bayer HealthCare, and Boehringer Ingelheim, and has consulted for Abbott, Quest Diagnostics, Bristol Myers Squibb / Pfizer. Other authors declared no conflicts of interest.

Figures

Fig 1
Fig 1. Comparison of imputation reference panels, for variants with MAF < 1%.
Imputation quality (measured by true R2 [Y-axis]) is plotted with progressively more stringent post-imputation filtering from left to right, with filtering according to estimated R2 (X-axis), for variants with MAF < 1%. Top panels are for the JHS cohort and bottom panels for the HCHS/SOL cohort. Three reference panels are shown: TOPMed (TOPMed freeze 5b), 1000G (the 1000 Genomes Phase 3), and HRC (the Haplotype Reference Consortium).

References

    1. Auer PL, Johnsen JM, Johnson AD, Logsdon BA, Lange LA, Nalls MA, et al. Imputation of exome sequence variants into population- based samples and blood-cell-trait-associated loci in African Americans: NHLBI GO Exome Sequencing Project. Am J Hum Genet. 2012;91(5):794–808. Epub 2012/10/30. 10.1016/j.ajhg.2012.08.031 . - DOI - PMC - PubMed
    1. Duan Q, Liu EY, Auer PL, Zhang G, Lange EM, Jun G, et al. Imputation of coding variants in African Americans: better performance using data from the exome sequencing project. Bioinformatics. 2013;29(21):2744–9. Epub 2013/08/21. 10.1093/bioinformatics/btt477 . - DOI - PMC - PubMed
    1. Lu F-P, Lin K-P, Kuo H-K. Diabetes and the risk of multi-system aging phenotypes: a systematic review and meta-analysis. PloS one. 2009;4(1):e4144 10.1371/journal.pone.0004144 - DOI - PMC - PubMed
    1. Liu EY, Buyske S, Aragaki AK, Peters U, Boerwinkle E, Carlson C, et al. Genotype Imputation of MetabochipSNPs Using a Study-Specific Reference Panel of ~4,000 Haplotypes in African Americans From the Women’s Health Initiative. Genet Epidemiol. 2012;36(2):107–17. 10.1002/gepi.21603 - DOI - PMC - PubMed
    1. Liu EY, Li M, Wang W, Li Y. MaCH-admix: genotype imputation for admixed populations. Genetic epidemiology. 2013;37(1):25–37. Epub 2012/10/18. 10.1002/gepi.21690 . - DOI - PMC - PubMed

Publication types

MeSH terms

Grants and funding