. 2024 Dec;30(12):3601-3613.

doi: 10.1038/s41591-024-03233-x. Epub 2024 Sep 25.

A foundation model for clinician-centered drug repurposing

Kexin Huang^#^{1

2}, Payal Chandak^#³, Qianwen Wang¹, Shreyas Havaldar⁴, Akhil Vaid^{4

5}, Jure Leskovec², Girish N Nadkarni⁵, Benjamin S Glicksberg^{4

5}, Nils Gehlenborg¹, Marinka Zitnik^{6

7

8

9}

Affiliations

¹ Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
² Department of Computer Science, Stanford University, Stanford, CA, USA.
³ Harvard-MIT Program in Health Sciences and Technology, Cambridge, MA, USA.
⁴ Hasso Plattner Institute for Digital Health, Icahn School of Medicine at Mount Sinai, Mount Sinai, NY, USA.
⁵ Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, Mount Sinai, NY, USA.
⁶ Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA. marinka@hms.harvard.edu.
⁷ Broad Institute of MIT and Harvard, Cambridge, MA, USA. marinka@hms.harvard.edu.
⁸ Harvard Data Science Initiative, Cambridge, MA, USA. marinka@hms.harvard.edu.
⁹ Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Cambridge, MA, USA. marinka@hms.harvard.edu.

^# Contributed equally.

PMID: 39322717
PMCID: PMC11645266
DOI: 10.1038/s41591-024-03233-x

A foundation model for clinician-centered drug repurposing

Kexin Huang et al. Nat Med. 2024 Dec.

. 2024 Dec;30(12):3601-3613.

doi: 10.1038/s41591-024-03233-x. Epub 2024 Sep 25.

Authors

Affiliations

¹ Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
² Department of Computer Science, Stanford University, Stanford, CA, USA.
³ Harvard-MIT Program in Health Sciences and Technology, Cambridge, MA, USA.
⁴ Hasso Plattner Institute for Digital Health, Icahn School of Medicine at Mount Sinai, Mount Sinai, NY, USA.
⁵ Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, Mount Sinai, NY, USA.
⁶ Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA. marinka@hms.harvard.edu.
⁷ Broad Institute of MIT and Harvard, Cambridge, MA, USA. marinka@hms.harvard.edu.
⁸ Harvard Data Science Initiative, Cambridge, MA, USA. marinka@hms.harvard.edu.
⁹ Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Cambridge, MA, USA. marinka@hms.harvard.edu.

^# Contributed equally.

PMID: 39322717
PMCID: PMC11645266
DOI: 10.1038/s41591-024-03233-x

Abstract

Drug repurposing-identifying new therapeutic uses for approved drugs-is often a serendipitous and opportunistic endeavour to expand the use of drugs for new diseases. The clinical utility of drug-repurposing artificial intelligence (AI) models remains limited because these models focus narrowly on diseases for which some drugs already exist. Here we introduce TxGNN, a graph foundation model for zero-shot drug repurposing, identifying therapeutic candidates even for diseases with limited treatment options or no existing drugs. Trained on a medical knowledge graph, TxGNN uses a graph neural network and metric learning module to rank drugs as potential indications and contraindications for 17,080 diseases. When benchmarked against 8 methods, TxGNN improves prediction accuracy for indications by 49.2% and contraindications by 35.1% under stringent zero-shot evaluation. To facilitate model interpretation, TxGNN's Explainer module offers transparent insights into multi-hop medical knowledge paths that form TxGNN's predictive rationales. Human evaluation of TxGNN's Explainer showed that TxGNN's predictions and explanations perform encouragingly on multiple axes of performance beyond accuracy. Many of TxGNN's new predictions align well with off-label prescriptions that clinicians previously made in a large healthcare system. TxGNN's drug-repurposing predictions are accurate, consistent with off-label drug use, and can be investigated by human experts through multi-hop interpretable rationales.

PubMed Disclaimer

Conflict of interest statement

Competing interests: The authors declare no competing interests.

Figures

**Fig. 1. TxGNN is a graph foundation model for drug repurposing, identifying candidate drugs for diseases with limited treatment options and limited molecular data.**
a, Drug repurposing involves the exploration of new therapeutic applications for existing drugs to treat diseases. Leveraging existing safety and efficacy data can dramatically cut costs and time to deliver life-saving therapeutics. b, Computational drug repurposing considered for diseases with already available treatments and well-understood molecular mechanisms. However, many diseases lack treatments and a complete understanding of disease mechanisms. These inherent constraints pose challenges for AI models. TxGNN addresses this challenge by formulating drug repurposing as a zero-shot prediction problem. c, TxGNN presents an AI framework that generates actionable predictions for zero-shot drug repurposing. The TxGNN geometric deep-learning model incorporates a vast and comprehensive biological KG to accurately predict the likelihood of indication or contraindication for any given disease–drug pair. In addition, TxGNN generates explainable multi-hop paths, facilitating human understanding of how the prediction is grounded in medical knowledge. d, The TxGNN model trained on a medical KG of disease mechanisms across 17,080 diseases and drug mechanisms of action for 7,957 drugs.

**Fig. 2. TxGNN accurately predicts drug indications and contraindications.**
a, TxGNN: a deep-learning model that learns to reason over a large-scale KG to predict the relationship between drugs and disease. In zero-shot repurposing, limited indication and mechanism information are available for the queried disease. Our key insight revolves around the interconnectedness of biological systems. We recognize that diseases, despite their distinctiveness, can exhibit partial similarities and share multiple underlying mechanisms. Based on this motivation, we have developed a specialized module known as disease pooling, which harnesses the power of network medicine principles. This module identifies mechanistically similar diseases and employs them to enhance the information available for the queried disease. The disease pooling module has substantially improved the prioritization of repurposing candidates within zero-shot settings. b, The TxGNN disease similarity score provides a nuanced and meaningful measure of the relationship between diseases. This metric empowers TxGNN to discover similar diseases that can inform and enrich the mechanistic understanding of queried diseases lacking treatment information. c, The conventional AI-based repurposing evaluating indication predictions on diseases where the model may have seen other approved drugs during training. In this scenario, we show that TxGNN achieves good performance along with existing methods. d, Provision of a more realistic evaluation, by introducing a new setup for assessing zero-shot repurposing, where the model is evaluated on diseases that have no approved drugs available during training. In this challenging setting, we observed a notable degradation in performance for baseline methods. In contrast, TxGNN consistently exhibits robust performance, surpassing the best baseline by up to 19% for indications and 23.9% for contraindications. These results highlight TxGNN’s reasoning capabilities when confronted with queried diseases lacking treatment options. For both c and d, the evaluation uses the AUPRC and is conducted with five random data splits (n = 5). The average performance is shown and the 95% confidence intervals (CIs) are represented by error bars.

**Fig. 3. TxGNN predicts drug indications and contraindications across challenging disease areas with small molecular datasets.**
a, Nine ‘disease area’ splits constructed to evaluate how well each model can generalize to new diseases when using only a limited amount of disease-associated molecular data and no information about its treatments. The diseases in the holdout set: (1) have no approved drugs in training, (2) have limited overlap with the training disease set because we excluded similar diseases and (3) lack molecular data because we deliberately removed their biological neighbors from the training set. These data splits constitute challenging but realistic evaluation scenarios that mimic zero-shot drug-repurposing settings. b–f, Holdout folds evaluating diseases related to adrenal glands (b), autoimmune diseases (c), neurodegenerative diseases (d), metabolic disorders (e) and cardiovascular diseases (f). The results for four disease areas—anemia, diabetes, cancer and mental health—are provided in Supplementary Fig. 7. Raw scores are provided in Supplementary Tables 1 and 2. TxGNN shows up to 59.3% improvement over the next best baseline in ranking therapeutic candidates, measured by AUPRC. Each method under each split is conducted with five random data splits (n = 5). The average performance is shown and the 95% CIs are represented by error bars.

**Fig. 4. Development, visualization and evaluation of multi-hop interpretable paths in TxGNN Explainer.**
a, Predictions alone are often insufficient for trustworthy machine learning model deployment. We developed TxGNN Explainer to aid human experts using graph AI explainability techniques. TxGNN Explainer identifies a sparse, interpretable subgraph underlying the model’s predictions. For each drug candidate, it generates a multi-hop path of biomedical concepts linking the disease to the drug. A visualization module then transforms these subgraphs into multi-hop paths that align with human cognitive processes. b, An interactive tool designed to help experts explore TxGNN predictions and explanations. The ‘Control panel’ lets users select a disease and view top-ranked predictions. The ‘Edge threshold’ module adjusts the sparsity of explanations, controlling the density of displayed multi-hop paths. The ‘Drug embedding’ panel compares a selected drug’s position with the entire repurposing candidate library. The ‘Path explanation’ panel shows crucial biological relationships for TxGNN’s therapeutic predictions. c, Evaluating the usefulness of TxGNN explanations by conducting a user study involving five clinicians, five clinical researchers and two pharmacists. These participants were shown 16 drug–disease pairs with TxGNN’s predictions, where 12 predictions were accurate. For each pairing, participants indicated whether they agreed or disagreed with TxGNN’s predictions using the explanations provided. d, Comparison of the performance of TxGNN Explainer with a no-explanation baseline regarding user answer accuracy, task completion time and user confidence. The results are aggregated on 192 trials (12 participants × 16 tasks) and reveal a significant improvement in accuracy (P = 0.044), confidence (P = 0.004) and time to think (P = 0.0013) when explanations were provided. Error bars represent 95% CIs and the center of the error bar is the average performance. The statistics are computed using a two-sided Tukey’s HSD test without multiple-test adjustments. e, The qualitative usability questions for participants after the user study. Human experts agreed that the explanations provided by TxGNN helped assess drug-repurposing candidates and instilled greater trust in the TxGNN’s predictions than using predictions alone.

**Fig. 5. Drug-repurposing predictions and multi-hop interpretable paths produced by TxGNN align with medical evidence.**
a, We assessed the alignment of drug-repurposing candidates identified by TxGNN with established medical reasoning across three rare diseases. The process begins with the TxGNN Predictor, which selects potential drugs for repurposing based on a queried disease, and continues with the TxGNN Explorer, which provides interpretable paths explaining the selection. Our case studies conclude with independent verification of the TxGNN’s predictions against clinical knowledge, showcasing the congruence between the TxGNN’s recommendations and medical insights. b, TxGNN predicts zolpidem, typically used as a sedative, as a repurposing candidate for Kleefstra’s syndrome, characterized by developmental delays and neurological symptoms. Despite zolpidem’s conventional inhibitory effects on the brain, TxGNN Explainer suggests its potential to enhance prefrontal cortex activity and improve cognitive functions in those with Kleefstra’s syndrome. TxGNN’s counterintuitive recommendation aligns with emerging clinical evidence of zolpidem’s ability to awaken dormant neurons, potentially aiding speech, motor skills and alertness in patients with neurodevelopmental disorders. c, TxGNN identifies tretinoin as the top candidate for treating Ehlers–Danlos syndrome. TxGNN’s predictive rationale is rooted in the drug’s interactions with albumin (*ALB*) and *ALDH1A2*, which aligns with medical insights about Ehlers–Danlos syndrome with regard to collagen loss and inflammation mitigation. d, TxGNN identifying amyl nitrite as a therapeutic option for NSIAD. In NSIAD, an *AVPR2* mutation leads to water and sodium imbalances. TxGNN Explorer points out the connection between NSIAD and amyl nitrite through congestive heart failure, a condition with similar fluid retention issues, by exploring gene interactions (*AVPR2* and *NPR1*) that regulate electrolyte balance.

**Fig. 6. Evaluating TxGNN’s predictions in a large healthcare system.**
a, The steps for evaluating TxGNN’s novel indication predictions using EMRs. First, we matched the drugs and diseases in the TxGNN KG to the EMR database, resulting in a curated cohort of 1.27 million patients spanning 478 diseases and 1,290 drugs. Next, we calculated the log(OR) for each drug–disease pair to indicate drug usage for specific diseases. We validated the log(OR) metric as a proxy for clinical usage by comparing drug–disease pairs against FDA-approved indications. Finally, we evaluated TxGNN’s novel predictions to determine if their log(OR) values exhibited enrichment within the medical records. b, The racial diversity within the patient cohort. c, The sex distribution of the patient cohort. d, The medical records encompass a diverse range of diseases spanning major disease areas. e, A substantial enrichment of log(OR) values for FDA-approved drugs in validating log(OR) values as a proxy metric for clinical prescription, although most drug–disease pairs exhibited low log(OR) values. In addition, we noted that contraindications displayed similar log(OR) values to the general nonindicated drug–disease pairs, minimizing potential confounders such as adverse drug effects. f, Evaluation of log(OR) values for the novel indications proposed by TxGNN. The y axis represents the log(OR) values of the disease–drug pairs, serving as a proxy for clinical usage. We ranked TxGNN’s predictions for each disease and extracted the average log(OR) values for the top predicted drug (n = 470), top five predicted drugs (n = 2,314), top 5% predicted drugs (n = 27,618) and bottom 50% predicted drugs (n = 123,718). The red line represents the average log(OR) for FDA-approved indications, whereas the green line represents the average log(OR) for contraindications. Predicted drugs are consistent with off-label prescription decisions made by clinicians. The error bar is a 95% CI. g, Provision of a case study of TxGNN’s predicted scores plotted against the log(OR) for Wilson’s disease. Each point on the plot represents a therapeutic candidate. The top most likely drug identified by TxGNN is highlighted, indicating its associated TxGNN and log(OR) scores.

See this image and copyright information in PMC

Update of

A foundation model for clinician-centered drug repurposing.
Huang K, Chandak P, Wang Q, Havaldar S, Vaid A, Leskovec J, Nadkarni G, Glicksberg BS, Gehlenborg N, Zitnik M. Huang K, et al. medRxiv [Preprint]. 2024 Aug 7:2023.03.19.23287458. doi: 10.1101/2023.03.19.23287458. medRxiv. 2024. Update in: Nat Med. 2024 Dec;30(12):3601-3613. doi: 10.1038/s41591-024-03233-x. PMID: 39148855 Free PMC article. Updated. Preprint.

References

1. Feigin, V. L. et al. Burden of neurological disorders across the us from 1990-2017: a global burden of disease study. JAMA Neurol.78, 165–176 (2021). - PMC - PubMed
1. O’Connell, D. Neglected diseases. Nature449, 157–157 (2007).
1. Rare Disease Day 2021. US Food and Drug Administrationfda.gov/news-events/fda-voices/rare-disease-day-2021-fda-shows-sustained... (2023).
1. Pushpakom, S. et al. Drug repurposing: progress, challenges and recommendations. Nat. Rev. Drug Discov.18, 41–58 (2019). - PubMed
1. Abdelsayed, M., Kort, E. J., Jovinge, S. & Mercola, M. Repurposing drugs to treat cardiovascular disease in the era of precision medicine. Nat. Rev. Cardiol.19, 751–764 (2022). - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions

Grants and funding

R01 HD108794/HD/NICHD NIH HHS/United States

LinkOut - more resources

Full Text Sources
- Nature Publishing Group
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A foundation model for clinician-centered drug repurposing

Affiliations

A foundation model for clinician-centered drug repurposing

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Update of

References

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources