Large-scale integration of omics and electronic health records to identify potential risk protein biomarkers and therapeutic drugs for cancer prevention
- PMID: 41338217
- PMCID: PMC12824608
- DOI: 10.1016/j.ajhg.2025.11.008
Large-scale integration of omics and electronic health records to identify potential risk protein biomarkers and therapeutic drugs for cancer prevention
Abstract
Identifying risk protein targets and their therapeutic drugs is crucial for effective cancer prevention. Here, we conduct integrative and fine-mapping analyses of large genome-wide association studies data for breast, colorectal, lung, ovarian, pancreatic, and prostate cancers and characterize 710 lead variants independently associated with cancer risk. Through mapping protein quantitative trait loci (pQTLs) for these variants using plasma proteomics data from over 75,000 participants, we identify 365 proteins associated with cancer risk. Subsequent colocalization analysis identifies 101 proteins, including 74 not reported in previous studies. We further characterize 36 potential druggable proteins for cancers or other disease indications. Analyzing >3.5 million electronic health records, we conducted analyses of emulated trials for 11 drugs across 290 comparisons and identified three drugs significantly associated with reduced colorectal cancer risk: caffeine vs. paroxetine (hazard ratio [HR], 0.51; 95% confidence interval [CI], 0.41-0.64), haloperidol vs. prochlorperazine (HR, 0.47; 95% CI, 0.33-0.68), and trazodone hydrochloride vs. paroxetine (HR, 0.49; 95% CI, 0.38-0.63). Conversely, caffeine was associated with increased cancer risk in comparison with finasteride (colorectal cancer) and fluoxetine (breast cancer). Meta-analysis identified six drugs significantly associated with cancer risk, including acetazolamide, which was associated with reduced colorectal cancer risk (HR, 0.79; 95% CI, 0.72-0.87). This study identifies previously unreported protein biomarkers and candidate drug targets across six major cancer types and highlights several approved drugs with potential chemopreventive effects.
Keywords: EHR; GWAS; acetazolamide; caffeine; drug trial emulation; human cancers; pQTL; proteomics; risk variants; therapeutic drug.
Copyright © 2025 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Conflict of interest statement
Declaration of interests The authors declare no competing interests.
Update of
-
Large-scale integration of omics and electronic health records to identify potential risk protein biomarkers and therapeutic drugs for cancer prevention.medRxiv [Preprint]. 2025 Jul 11:2024.05.29.24308170. doi: 10.1101/2024.05.29.24308170. medRxiv. 2025. Update in: Am J Hum Genet. 2026 Jan 8;113(1):41-56. doi: 10.1016/j.ajhg.2025.11.008. PMID: 38853880 Free PMC article. Updated. Preprint.
References
-
- Nelson M.R., Tipney H., Painter J.L., Shen J., Nicoletti P., Shen Y., Floratos A., Sham P.C., Li M.J., Wang J., et al. The support of human genetic evidence for approved drug indications. Nat. Genet. 2015;47:856–860. - PubMed
-
- Peters U., Tomlinson I. Utilizing Human Genetics to Develop Chemoprevention for Cancer-Too Good an Opportunity to be Missed. Cancer Prev. Res. 2024;17:7–12. - PubMed
-
- Jia G., Ping J., Shu X., Yang Y., Cai Q., Kweon S.S., Choi J.Y., Kubo M., Park S.K., Bolla M.K., et al. Genome- and transcriptome-wide association studies of 386,000 Asian and European-ancestry women provide new insights into breast cancer genetics. Am. J. Hum. Genet. 2022;109:2185–2195. - PMC - PubMed
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Medical
