Leveraging large-scale biobanks for therapeutic target discovery
- PMID: 41376171
- PMCID: PMC12799792
- DOI: 10.1016/j.xhgg.2025.100556
Leveraging large-scale biobanks for therapeutic target discovery
Abstract
Large biobanks, including the Million Veteran Program (MVP), the UK Biobank, and FinnGen, provide genetic association results for more than 1 million individuals for hundreds of phenotypes. To select targets for pharmaceutical development, as well as to improve the understanding of existing targets, we harmonized these studies and performed two-sample Mendelian randomization (MR) on 2,003 phenotypes using genetic variants associated with gene expression (derived from GTEx and eQTLGen) and plasma protein levels (derived from ARIC, Fenland, and deCODE) as proxies of target modulation. We found 69,669 gene-trait pairs with evidence (p ≤ 1.6 × 10-9) for causal effects. From the selected gene-trait pairs, we observed 6,447 genes with strong causal evidence for at least one of 2,003 investigated traits. As expected, being identified as a gene-trait pair in our approach was significantly associated with higher odds of being an approved drug target and indication. We were able to rediscover 9% of approved drug targets in ChEMBL 34. Moreover, identified gene-traits were significantly associated with higher odds of being previously described as a gene-trait pair in OMIM, ClinVar, mouse knockout data, and rare variant burden studies. To enhance the translational potential of the resource, we developed a predictive ranking model trained using approved drug targets described in ChEMBL 34 as well as several different biological annotations. This model was able to accurately predict the odds of a particular significant MR result being developed into an approved drug and its clinical indication (precision-recall area under the receiver operating characteristic curve 0.79). We make our results publicly available in CIPHER.
Keywords: Mendelian randomization; genomics; target identification.
Published by Elsevier Inc.
Conflict of interest statement
Declaration of interests A.S.B. reports grants outside of this work from AstraZeneca, Bayer, Biogen, BioMarin, and Sanofi. M.G. is a full-time employee of Regeneron Genetics Center; her main contributions occurred while she was an employee of Open Targets. J.P.C. is a full-time employee at Novartis Institutes for Biomedical Research; his main contributions to the project occurred while employed at the VA Boston Healthcare System. M.A.K. is a full-time employee at Variant Bio; his main contributions occurred while he was an employee of Open Targets.
Figures
References
-
- Gaziano J.M., Concato J., Brophy M., Fiore L., Pyarajan S., Breeling J., Whitbourne S., Deen J., Shannon C., Humphries D., et al. Million Veteran Program: A mega-biobank to study genetic influences on health and disease. J. Clin. Epidemiol. 2016;70:214–223. doi: 10.1016/j.jclinepi.2015.09.016. - DOI - PubMed
-
- Sudlow C., Gallacher J., Allen N., Beral V., Burton P., Danesh J., Downey P., Elliott P., Green J., Landray M., et al. UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age. PLoS Med. 2015;12 doi: 10.1371/journal.pmed.1001779. - DOI - PMC - PubMed
-
- Pan-UKB team. 2020. https://pan.ukbb.broadinstitute.org
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous
