Machine learning-based drug-drug interaction prediction: a critical review of models, limitations, and data challenges
- PMID: 40808680
- PMCID: PMC12344460
- DOI: 10.3389/fphar.2025.1632775
Machine learning-based drug-drug interaction prediction: a critical review of models, limitations, and data challenges
Abstract
Background/objectives: New computational methods, based on statistical, machine learning, and deep learning techniques using drug-related entities (e.g., genes, protein bindings, etc.), help reduce the costs of in-vitro experiments through drug-drug interaction prediction (DDIp). This review examines recent advances in DDIp. It presents an in-depth review of the state-of-the-art studies relating to semi-supervised, supervised, self-supervised learning, and other techniques such as graph-based learning and matrix factorization methods for predicting DDIs. All possible interactions between drugs are not known, and accurately predicting interactions is even more difficult due to the complex nature of drug-drug interactions (DDI).
Methods: Of the 49 papers published in Web of Science in the last 6 years, 24 papers were considered relevant based on information presented in their titles and abstracts. The included articles focus specifically on predicting DDIs using a type of machine learning algorithm. Excluded articles focused on drug discovery, drug repurposing, molecular representation, or the extraction of biomedical interactions. The methodology, results limitations, and future research directions were studied for each paper. Common challenges, limitations, and future research directions were analyzed.
Results and conclusion: The main limitations are class imbalance, poor performance on new drugs, limited explainability, and the need for additional data sources.
Keywords: adverse drug reactions; drug-drug interaction; graph-based learning; healthcare; machine learning techniques; semi-supervised learning; supervised learning.
Copyright © 2025 Gheorghita, Bocanet and Iantovics.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures








References
-
- Cao J., Lin X., Guo S., Liu L., Liu T., Wang B. (2021). “Bipartite graph embedding via mutual information maximization,” in Proceedings of the 14th ACM international conference on web search and data mining (New York, NY, USA: Association for Computing Machinery; ), 635–643. 10.1145/3437963.3441783 - DOI
-
- Chandra Umakantham O., Srinivasan S., Pathak V. (2024). Detecting side effects of adverse drug reactions through drug-drug interactions using graph neural networks and self-supervised learning. IEEE Access 12, 93823–93840. 10.1109/ACCESS.2024.3407877 - DOI
-
- Chen T., Guestrin C. (2016). “XGBoost: a scalable tree boosting system,” in Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (New York, NY, USA: Association for Computing Machinery; ), 785–794. 10.1145/2939672.2939785 - DOI
-
- Chu X., Lin Y., Wang Y., Wang L., Wang J., Gao J. (2019). MLRDA: a multi-task semi-supervised learning framework for drug-drug interaction prediction, 4518–4524.