. 2023 Apr 13;23(8):3962.

doi: 10.3390/s23083962.

Development and Validation of an Explainable Machine Learning-Based Prediction Model for Drug-Food Interactions from Chemical Structures

Quang-Hien Kha^{1

2}, Viet-Huan Le^{1

2

3}, Truong Nguyen Khanh Hung⁴, Ngan Thi Kim Nguyen⁵, Nguyen Quoc Khanh Le^{2

6

7

8}

Affiliations

¹ International Ph.D. Program in Medicine, College of Medicine, Taipei Medical University, Taipei 110, Taiwan.
² AIBioMed Research Group, Taipei Medical University, Taipei 110, Taiwan.
³ Department of Thoracic Surgery, Khanh Hoa General Hospital, Nha Trang City 65000, Vietnam.
⁴ Department of Orthopedic and Trauma, Cho Ray Hospital, Ho Chi Minh City 70000, Vietnam.
⁵ Undergraduate Program of Nutrition Science, National Taiwan Normal University, Taipei 106, Taiwan.
⁶ Professional Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei 110, Taiwan.
⁷ Research Center for Artificial Intelligence in Medicine, Taipei Medical University, Taipei 110, Taiwan.
⁸ Translational Imaging Research Center, Taipei Medical University Hospital, Taipei 110, Taiwan.

PMID: 37112302
PMCID: PMC10143839
DOI: 10.3390/s23083962

Development and Validation of an Explainable Machine Learning-Based Prediction Model for Drug-Food Interactions from Chemical Structures

Quang-Hien Kha et al. Sensors (Basel). 2023.

. 2023 Apr 13;23(8):3962.

doi: 10.3390/s23083962.

Authors

Quang-Hien Kha^{1

2}, Viet-Huan Le^{1

2

3}, Truong Nguyen Khanh Hung⁴, Ngan Thi Kim Nguyen⁵, Nguyen Quoc Khanh Le^{2

6

7

8}

Affiliations

¹ International Ph.D. Program in Medicine, College of Medicine, Taipei Medical University, Taipei 110, Taiwan.
² AIBioMed Research Group, Taipei Medical University, Taipei 110, Taiwan.
³ Department of Thoracic Surgery, Khanh Hoa General Hospital, Nha Trang City 65000, Vietnam.
⁴ Department of Orthopedic and Trauma, Cho Ray Hospital, Ho Chi Minh City 70000, Vietnam.
⁵ Undergraduate Program of Nutrition Science, National Taiwan Normal University, Taipei 106, Taiwan.
⁶ Professional Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei 110, Taiwan.
⁷ Research Center for Artificial Intelligence in Medicine, Taipei Medical University, Taipei 110, Taiwan.
⁸ Translational Imaging Research Center, Taipei Medical University Hospital, Taipei 110, Taiwan.

PMID: 37112302
PMCID: PMC10143839
DOI: 10.3390/s23083962

Abstract

Possible drug-food constituent interactions (DFIs) could change the intended efficiency of particular therapeutics in medical practice. The increasing number of multiple-drug prescriptions leads to the rise of drug-drug interactions (DDIs) and DFIs. These adverse interactions lead to other implications, e.g., the decline in medicament's effect, the withdrawals of various medications, and harmful impacts on the patients' health. However, the importance of DFIs remains underestimated, as the number of studies on these topics is constrained. Recently, scientists have applied artificial intelligence-based models to study DFIs. However, there were still some limitations in data mining, input, and detailed annotations. This study proposed a novel prediction model to address the limitations of previous studies. In detail, we extracted 70,477 food compounds from the FooDB database and 13,580 drugs from the DrugBank database. We extracted 3780 features from each drug-food compound pair. The optimal model was eXtreme Gradient Boosting (XGBoost). We also validated the performance of our model on one external test set from a previous study which contained 1922 DFIs. Finally, we applied our model to recommend whether a drug should or should not be taken with some food compounds based on their interactions. The model can provide highly accurate and clinically relevant recommendations, especially for DFIs that may cause severe adverse events and even death. Our proposed model can contribute to developing more robust predictive models to help patients, under the supervision and consultants of physicians, avoid DFI adverse effects in combining drugs and foods for therapy.

Keywords: DrugBank; FooDB; adverse food reaction; chemical informatics; drug–food interactions; drug–nutrient interactions; explainable artificial intelligence; machine learning; precision medicine; simplified molecular-input line-entry system.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

**Figure 1**
The workflow of our study. First, we obtained the SMILES notations of drug and food constituents from DrugBank and FooDB databases. After pre-processing, we filtered out 1133 drugs and 4341 food compounds, making 2,382,903 drug–food pairs in the benchmark dataset. We subsequently used *PyBioMed* and *RDKit* packages in Python to extract 3780 features of each interacting drug–food pair. We applied a four-step feature selection process to the training set to find the 18 most important features. Five classification algorithms were applied to the training data via five-fold cross-validation. As XGBoost gave the best prediction outcome, we fine-tuned it using the validation set. Finally, we tested our optimum XGBoost model on the internal test set and one external test set containing 1922 drug-food pairs. Finally, we used the model to recommend some common drug–food compound combinations.

**Figure 2**
Confusion matrix of our optimal XGBoost model on the testing and the external test sets. On the testing set (**left** plot): The model most accurately detected positive and non-significant DFIs (recall 0.99 in both classes) while only recognizing 87% of negative DFIs. Likewise, on the external test set (**right** plot), the model recognized all positive DFIs and 99% of non-significant DFIs. Negative DFIs were recognized as acceptable, with 94% of those discriminated against.

**Figure 3**
The SHAP (SHapley Additive exPlanations) plot of eighteen optimal features. The red dots of *MRVSA0, EstateVSA2, MRVSA9, MRVSA8* and blue dots of *PEOEVSA5, MTPSA+MTPSA, VSAEstate10+VSAEstate10* gather on the right side of the x-axis, indicating that the high values and low values of these features, respectively, direct the model in recognizing the non-significant DFIs. High *PEOEVSA5, EstateVSA7, slogPVSA9, MTPSA+MTPSA*, and low values of *MRVSA0, MRVSA9* help detect the negative DFIs. The positive DFIs are identified by the increasing values of *PEOEVSA5, EstateVSA0*LabuteASA, EstateVSA1*VSAEstate8* and the decline of *PEOEVSA9, EstateVSA7, EstateVSA2, slogPVSA9, MRVSA2, VSAEstate7+VSAEstate7, slogPVSA0, PEOEVSA12*.

See this image and copyright information in PMC

Cited by

Multi-task localization of the hemidiaphragms and lung segmentation in portable chest X-ray images of COVID-19 patients.
Morís DI, de Moura J, Aslani S, Jacob J, Novo J, Ortega M. Morís DI, et al. Digit Health. 2024 Feb 1;10:20552076231225853. doi: 10.1177/20552076231225853. eCollection 2024 Jan-Dec. Digit Health. 2024. PMID: 38313365 Free PMC article.
Artificial intelligence, medications, pharmacogenomics, and ethics.
Haga SB. Haga SB. Pharmacogenomics. 2024;25(14-15):611-622. doi: 10.1080/14622416.2024.2428587. Epub 2024 Nov 15. Pharmacogenomics. 2024. PMID: 39545629
Explainable artificial intelligence for stroke prediction through comparison of deep learning and machine learning models.
Moulaei K, Afshari L, Moulaei R, Sabet B, Mousavi SM, Afrash MR. Moulaei K, et al. Sci Rep. 2024 Dec 28;14(1):31392. doi: 10.1038/s41598-024-82931-5. Sci Rep. 2024. PMID: 39733046 Free PMC article.
The Impact of Artificial Intelligence on Healthcare: A Comprehensive Review of Advancements in Diagnostics, Treatment, and Operational Efficiency.
Faiyazuddin M, Rahman SJQ, Anand G, Siddiqui RK, Mehta R, Khatib MN, Gaidhane S, Zahiruddin QS, Hussain A, Sah R. Faiyazuddin M, et al. Health Sci Rep. 2025 Jan 5;8(1):e70312. doi: 10.1002/hsr2.70312. eCollection 2025 Jan. Health Sci Rep. 2025. PMID: 39763580 Free PMC article.
Predicting Natural Product-Drug Interactions with Knowledge Graph Embeddings.
Taneja SB, Dilán-Pantojas IO, Boyce RD. Taneja SB, et al. AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:556-565. eCollection 2025. AMIA Jt Summits Transl Sci Proc. 2025. PMID: 40502231 Free PMC article.

See all "Cited by" articles

References

1. Bushra R., Aslam N., Khan A.Y. Food-drug interactions. Oman Med. J. 2011;26:77. doi: 10.5001/omj.2011.21. - DOI - PMC - PubMed
1. Edwards I.R., Aronson J.K. Adverse drug reactions: Definitions, diagnosis, and management. Lancet. 2000;356:1255–1259. doi: 10.1016/S0140-6736(00)02799-9. - DOI - PubMed
1. Kantor E.D., Rehm C.D., Haas J.S., Chan A.T., Giovannucci E.L. Trends in prescription drug use among adults in the United States from 1999–2012. JAMA. 2015;314:1818–1830. doi: 10.1001/jama.2015.13766. - DOI - PMC - PubMed
1. Sutherland J.J., Daly T.M., Liu X., Goldstein K., Johnston J.A., Ryan T.P. Co-prescription trends in a large cohort of subjects predict substantial drug-drug interactions. PLoS ONE. 2015;10:e0118991. doi: 10.1371/journal.pone.0118991. - DOI - PMC - PubMed
1. Ryu J.Y., Kim H.U., Lee S.Y. Deep learning improves prediction of drug–drug and drug–food interactions. Proc. Natl. Acad. Sci. USA. 2018;115:E4304–E4311. doi: 10.1073/pnas.1803294115. - DOI - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Development and Validation of an Explainable Machine Learning-Based Prediction Model for Drug-Food Interactions from Chemical Structures

Affiliations

Development and Validation of an Explainable Machine Learning-Based Prediction Model for Drug-Food Interactions from Chemical Structures

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

MeSH terms

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Medical