Analysis of polygenic risk score usage and performance in diverse human populations
- PMID: 31346163
- PMCID: PMC6658471
- DOI: 10.1038/s41467-019-11112-0
Abstract
A historical tendency to use European ancestry samples hinders medical genetics research, including the use of polygenic scores, which are individual-level metrics of genetic risk. We analyze the first decade of polygenic scoring studies (2008-2017, inclusive), and find that 67% of studies included exclusively European ancestry participants and another 19% included only East Asian ancestry participants. Only 3.8% of studies were among cohorts of African, Hispanic, or Indigenous peoples. We find that predictive performance of European ancestry-derived polygenic scores is lower in non-European ancestry samples (e.g. African ancestry samples: t = -5.97, df = 24, p = 3.7 × 10⁻⁶), and we demonstrate the effects of methodological choices in polygenic score distributions for worldwide populations. These findings highlight the need for improved treatment of linkage disequilibrium and variant frequencies when applying polygenic scoring to cohorts of non-European ancestry, and bolster the rationale for large-scale GWAS in diverse human populations.
Conflict of interest statement
The authors declare no competing interests.