Bayesian analysis of genetic association across tree-structured routine healthcare data in the UK Biobank
- PMID: 28759005
- PMCID: PMC5580804
- DOI: 10.1038/ng.3926
Bayesian analysis of genetic association across tree-structured routine healthcare data in the UK Biobank
Abstract
Genetic discovery from the multitude of phenotypes extractable from routine healthcare data can transform understanding of the human phenome and accelerate progress toward precision medicine. However, a critical question when analyzing high-dimensional and heterogeneous data is how best to interrogate increasingly specific subphenotypes while retaining statistical power to detect genetic associations. Here we develop and employ a new Bayesian analysis framework that exploits the hierarchical structure of diagnosis classifications to analyze genetic variants against UK Biobank disease phenotypes derived from self-reporting and hospital episode statistics. Our method displays a more than 20% increase in power to detect genetic effects over other approaches and identifies new associations between classical human leukocyte antigen (HLA) alleles and common immune-mediated diseases (IMDs). By applying the approach to genetic risk scores (GRSs), we show the extent of genetic sharing among IMDs and expose differences in disease perception or diagnosis with potential clinical implications.
Conflict of interest statement
G.M. and P.D. are cofounders of, holder of shares in, and consultants to Genomics PLC. G.M., P.D. and S.L. are partners in Peptide Groove LLP. Peptide Groove has licensed HLA typing technology to Affymetrix Ltd. The other authors declare no competing financial interests.
Figures





Similar articles
-
Genetically determined serum urate levels and cardiovascular and other diseases in UK Biobank cohort: A phenome-wide mendelian randomization study.PLoS Med. 2019 Oct 18;16(10):e1002937. doi: 10.1371/journal.pmed.1002937. eCollection 2019 Oct. PLoS Med. 2019. PMID: 31626644 Free PMC article.
-
HLA allele-calling using multi-ancestry whole-exome sequencing from the UK Biobank identifies 129 novel associations in 11 autoimmune diseases.Commun Biol. 2023 Nov 3;6(1):1113. doi: 10.1038/s42003-023-05496-5. Commun Biol. 2023. PMID: 37923823 Free PMC article.
-
Genome-wide association study identifies HLA 8.1 ancestral haplotype alleles as major genetic risk factors for myositis phenotypes.Genes Immun. 2015 Oct;16(7):470-80. doi: 10.1038/gene.2015.28. Epub 2015 Aug 20. Genes Immun. 2015. PMID: 26291516 Free PMC article.
-
Association analyses based on false discovery rate implicate new loci for coronary artery disease.Nat Genet. 2017 Sep;49(9):1385-1391. doi: 10.1038/ng.3913. Epub 2017 Jul 17. Nat Genet. 2017. PMID: 28714975
-
Host genetic factors affecting hepatitis B infection outcomes: Insights from genome-wide association studies.World J Gastroenterol. 2018 Aug 14;24(30):3347-3360. doi: 10.3748/wjg.v24.i30.3347. World J Gastroenterol. 2018. PMID: 30122875 Free PMC article. Review.
Cited by
-
Reverse GWAS: Using genetics to identify and model phenotypic subtypes.PLoS Genet. 2019 Apr 5;15(4):e1008009. doi: 10.1371/journal.pgen.1008009. eCollection 2019 Apr. PLoS Genet. 2019. PMID: 30951530 Free PMC article.
-
The impact of age on genetic risk for common diseases.PLoS Genet. 2021 Aug 26;17(8):e1009723. doi: 10.1371/journal.pgen.1009723. eCollection 2021 Aug. PLoS Genet. 2021. PMID: 34437535 Free PMC article.
-
Molecular characteristics of Staphylococcus aureus associated prosthetic joint infections after hip fractures treated with hemiarthroplasty: a retrospective genome-wide association study.Sci Rep. 2020 Oct 6;10(1):16553. doi: 10.1038/s41598-020-73736-3. Sci Rep. 2020. PMID: 33024212 Free PMC article.
-
Genetic architecture of common non-Alzheimer's disease dementias.Neurobiol Dis. 2020 Aug;142:104946. doi: 10.1016/j.nbd.2020.104946. Epub 2020 May 19. Neurobiol Dis. 2020. PMID: 32439597 Free PMC article. Review.
-
Phenome-wide association study (PheWAS) of colorectal cancer risk SNP effects on health outcomes in UK Biobank.Br J Cancer. 2022 Mar;126(5):822-830. doi: 10.1038/s41416-021-01655-9. Epub 2021 Dec 15. Br J Cancer. 2022. PMID: 34912076 Free PMC article.
References
-
- Cohen JC, Boerwinkle E, Mosley TH, Jr, Hobbs HH. Sequence variations in PCSK9, low LDL, and protection against coronary heart disease. N Engl J Med. 2006;354:1264–72. - PubMed
-
- Mallal S, et al. HLA-B*5701 screening for hypersensitivity to abacavir. N Engl J Med. 2008;358:568–79. - PubMed
-
- Manolio TA. Bringing genome-wide association findings into clinical use. Nat Rev Genet. 2013;14:549–58. - PubMed
-
- Nelson MR, et al. The support of human genetic evidence for approved drug indications. Nat Genet. 2015;47:856–60. - PubMed
-
- Sanseau P, et al. Use of genome-wide association studies for drug repositioning. Nat Biotechnol. 2012;30:317–20. - PubMed
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Research Materials