This is a preprint.
Integrating Interpretable Machine Learning and Multi-omics Systems Biology for Personalized Biomarker Discovery and Drug Repurposing in Alzheimer's Disease
- PMID: 40196631
- PMCID: PMC11974764
- DOI: 10.1101/2025.03.24.644676
Integrating Interpretable Machine Learning and Multi-omics Systems Biology for Personalized Biomarker Discovery and Drug Repurposing in Alzheimer's Disease
Abstract
Background: Alzheimer's disease (AD) is a complex neurodegenerative disorder with substantial molecular variability across different brain regions and individuals, hindering therapeutic development. This study introduces PRISM-ML, an interpretable machine learning (ML) framework integrating multiomics data to uncover patient-specific biomarkers, subtissue-level pathology, and drug repurposing opportunities.
Methods: We harmonized transcriptomic and genomic data of three independent brain studies containing 2105 post-mortem brain samples (1363 AD, 742 controls) across nine tissues. A Random Forest classifier with SHapley Additive exPlanations (SHAP) identified patient-level biomarkers. Clustering further delineated each tissue into subtissues, and network analysis revealed critical "bottleneck" (hub) genes. Finally, a knowledge graph-based screening identified multi-target drug candidates, and a real-world pharmacoepidemiologic study evaluated their clinical relevance.
Results: We uncovered 36 molecularly distinct subtissues, each defined by a set of associated unique biomarkers and genetic drivers. Through network analysis of gene-gene interactions networks, we highlighted 262 bottleneck genes enriched in synaptic, cytoskeletal, and membrane-associated processes. Knowledge graph queries identified six FDA-approved drugs predicted to target multiple bottleneck genes and AD-relevant pathways simultaneously. One candidate, promethazine, demonstrated an association with reduced AD incidence in a large healthcare dataset of over 364000 individuals (hazard ratios ≤ 0.43; p < 0.001). These findings underscore the potential for multi-target approaches, reveal connections between AD and cardiovascular pathways, and offer novel insights into the heterogeneous biology of AD.
Conclusions: PRISM-ML bridges interpretable ML with multi-omics and systems biology to decode AD heterogeneity, revealing region-specific mechanisms and repurposable therapeutics. The validation of promethazine in real-world data underscores the clinical relevance of multi-target strategies, paving the way for more personalized treatments in AD and other complex disorders.
Keywords: Biological Network; Computational Biology; Drug Repurposing; GWAS; Personalized Medicine; Transcriptomics.
Conflict of interest statement
Competing interests The authors declare that they have no competing interests.
Figures




References
-
- Monteiro AR, Barbosa DJ, Remião F, Silva R. Alzheimer’s disease: Insights and new prospects in disease pathophysiology, biomarkers and disease-modifying drugs. Biochem Pharmacol. 2023. May 1;211:115522. - PubMed
-
- De A, Mishra TK, Saraf S, Tripathy B, Reddy SS. A Review on the Use of Modern Computational Methods in Alzheimer’s Disease-Detection and Prediction. Curr Alzheimer Res [Internet]. 2024. Mar 12 [cited 2025 Jan 12];20(12):845–61. Available from: https://pubmed.ncbi.nlm.nih.gov/38468529/ - PubMed
-
- Young AL, Oxtoby NP, Garbarino S, Fox NC, Barkhof F, Schott JM, et al. Data-driven modelling of neurodegenerative disease progression: thinking outside the black box. Nat Rev Neurosci [Internet]. 2024. Feb 1 [cited 2025 Jan 12];25(2):111–30. Available from: https://pubmed.ncbi.nlm.nih.gov/38191721/ - PubMed
-
- Lundberg SM, Lee SI. A Unified Approach to Interpreting Model Predictions. Adv Neural Inf Process Syst [Internet]. 2017. May 22 [cited 2025 Jan 12];2017-December:4766–75. Available from: https://arxiv.org/abs/1705.07874v2
Publication types
Grants and funding
LinkOut - more resources
Full Text Sources