Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Aug 9;13(1):4664.
doi: 10.1038/s41467-022-32095-5.

Transferability of genetic loci and polygenic scores for cardiometabolic traits in British Pakistani and Bangladeshi individuals

Affiliations

Transferability of genetic loci and polygenic scores for cardiometabolic traits in British Pakistani and Bangladeshi individuals

Qin Qin Huang et al. Nat Commun. .

Abstract

Individuals with South Asian ancestry have a higher risk of heart disease than other groups but have been largely excluded from genetic research. Using data from 22,000 British Pakistani and Bangladeshi individuals with linked electronic health records from the Genes & Health cohort, we conducted genome-wide association studies of coronary artery disease and its key risk factors. Using power-adjusted transferability ratios, we found evidence for transferability for the majority of cardiometabolic loci powered to replicate. The performance of polygenic scores was high for lipids and blood pressure, but lower for BMI and coronary artery disease. Adding a polygenic score for coronary artery disease to clinical risk factors showed significant improvement in reclassification. In Mendelian randomisation using transferable loci as instruments, our findings were consistent with results in European-ancestry individuals. Taken together, trait-specific transferability of trait loci between populations is an important consideration with implications for risk prediction and causal inference.

PubMed Disclaimer

Conflict of interest statement

N.S. is now employed by GlaxoSmithKline. All other authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Summary of study design, research questions and analyses conducted.
The coloured boxes indicate input data. Within the white boxes, the black text indicates the analyses we used to address the questions in blue. BPB British Pakistanis and Bangladeshi ancestry, EUR European ancestry, SAS South Asian ancestry, CAD coronary artery disease, BMI body mass index, SNP single nucleotide polymorphism, GWAS genome-wide association study, MR Mendelian randomisation, PGS polygenic score, UKBB UK Biobank. Datasets and discovery GWAS that were used in each analysis are provided in Supplementary Data 4.
Fig. 2
Fig. 2. SNP heritability and trans-ancestry genetic correlations for cardiometabolic traits.
a SNP heritability was estimated using GCTA in G&H (orange) and eMERGE (grey) for cardiometabolic traits, namely coronary artery disease (CAD; n = 17,348 and 32,816 unrelated samples from G&H and eMERGE, respectively), body-mass index (BMI; n = 13,926 and 37,160), high-density lipoprotein cholesterol (HDL-C; n = 11,316 and 16,049), low-density lipoprotein cholesterol (LDL-C; n = 12,856 and 15,856), triglycerides (TG; n = 11,125 and 14,384), systolic blood pressure (SBP), and diastolic blood pressure (DBP; n = 15,908 and 11,864 for blood pressure). Medication data are not available in eMERGE so the highest measurements for LDL-C, SBP, and DBP are used. Error bars represent 95% confidence intervals in both plots. b Genetic correlations were estimated using Popcorn based on GWAS summary statistics generated from G&H and European-ancestry individuals from UK Biobank. Red indicates that the genetic correlation is nominally significantly lower than 1 (p-value = 0.02 for BMI; two-sided and not adjusted for multiple comparisons). Medication-adjusted lipid and blood pressure levels are used. For rg estimates of 1 (TG and DBP), the method cannot derive confidence intervals. Sample sizes of GWAS for cardiometabolic traits in G&H are in Table 1.
Fig. 3
Fig. 3. Comparison of the predictive accuracy of polygenic scores in people of British Pakistani and Bangladeshi versus European ancestry.
Incremental AUC is shown for coronary artery disease (CAD; n = 17,348 and 32,816 unrelated samples from G&H and eMERGE, respectively) and Incremental R2 is shown for its continuous risk factors, namely body-mass index (BMI; n = 13,926 and 37,160), high-density lipoprotein cholesterol (HDL-C; n = 11,316 and 16,049), low-density lipoprotein cholesterol (LDL-C; n = 12,856 and 15,856), triglycerides (TG; n = 11,125 and 14,384), systolic blood pressure (SBP), and diastolic blood pressure (DBP; n = 15,908 and 11,864 for blood pressure). Grey indicates European-ancestry (EUR) individuals from eMERGE and orange British Pakistani and Bangladeshi (BPB) individuals from G&H. Error bars represent 95% confidence intervals estimated by bootstrap resampling of samples. The highest measurements for LDL-C, SBP, and DBP are compared between eMERGE and G&H.
Fig. 4
Fig. 4. Net reclassification index (NRI) for coronary artery disease with the addition of a polygenic score to QRISK3.
Estimates for categorical NRI for the integrated score compared to QRISK3 in all samples (n = 420 unrelated cases and 7702 unrelated non-cases) as well as in age-by-sex subgroups (n = 207 and 2779 in males aged 25–54; n = 51 and 4187 in females aged 25–54; n = 114 and 344 in males aged 55–84; n = 48 and 392 in females aged 55–84) are shown. Red indicates NRI in cases and blue in controls. The error bars indicate 95% confidence intervals estimated using the bootstrap method.
Fig. 5
Fig. 5. Mendelian randomisation estimates of risk factors on coronary artery disease in European (eMERGE) and British South Asian (G&H) ancestry individuals.
Two-sample Mendelian randomisation (MR) estimates for the causal effects are presented based on genetic instrument variables identified from EUR discovery GWAS for each risk factor. All independent genome-wide significant loci were used as instruments for eMERGE and only the transferable loci for G&H. Effect estimates are presented as odds ratios with 95% confidence intervals per standard deviation increase in the reported unit of the trait: triglycerides (TG), systolic blood pressure (SBP), low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol (HDL-C), diastolic blood pressure (DBP), body mass index (BMI). The two-sided p-value (P; not adjusted for multiple comparisons) and the number of single nucleotide polymorphism instruments (N SNPs) included in the MR analysis are shown for each exposure. GWAS for CAD was performed in n = 22,008 (1110 cases) samples from G&H, and n = 32,816 (6815 cases) unrelated samples from eMERGE.

Similar articles

Cited by

References

    1. Barnett AH, et al. Type 2 diabetes and cardiovascular risk in the UK south Asian community. Diabetologia. 2006;49:2234–2246. doi: 10.1007/s00125-006-0325-1. - DOI - PubMed
    1. Gurdasani D, Barroso I, Zeggini E, Sandhu MS. Genomics of disease risk in globally diverse populations. Nat. Rev. Genet. 2019;20:520–535. doi: 10.1038/s41576-019-0144-0. - DOI - PubMed
    1. Morales J, et al. A standardized framework for representation of ancestry data in genomics studies, with application to the NHGRI-EBI GWAS Catalog. Genome Biol. 2018;19:21. doi: 10.1186/s13059-018-1396-2. - DOI - PMC - PubMed
    1. Martin AR, et al. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat. Genet. 2019;51:584–591. doi: 10.1038/s41588-019-0379-x. - DOI - PMC - PubMed
    1. Duncan L, et al. Analysis of polygenic risk score usage and performance in diverse human populations. Nat. Commun. 2019;10:3328. doi: 10.1038/s41467-019-11112-0. - DOI - PMC - PubMed

Publication types