Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr 13;23(8):3962.
doi: 10.3390/s23083962.

Development and Validation of an Explainable Machine Learning-Based Prediction Model for Drug-Food Interactions from Chemical Structures

Affiliations

Development and Validation of an Explainable Machine Learning-Based Prediction Model for Drug-Food Interactions from Chemical Structures

Quang-Hien Kha et al. Sensors (Basel). .

Abstract

Possible drug-food constituent interactions (DFIs) could change the intended efficiency of particular therapeutics in medical practice. The increasing number of multiple-drug prescriptions leads to the rise of drug-drug interactions (DDIs) and DFIs. These adverse interactions lead to other implications, e.g., the decline in medicament's effect, the withdrawals of various medications, and harmful impacts on the patients' health. However, the importance of DFIs remains underestimated, as the number of studies on these topics is constrained. Recently, scientists have applied artificial intelligence-based models to study DFIs. However, there were still some limitations in data mining, input, and detailed annotations. This study proposed a novel prediction model to address the limitations of previous studies. In detail, we extracted 70,477 food compounds from the FooDB database and 13,580 drugs from the DrugBank database. We extracted 3780 features from each drug-food compound pair. The optimal model was eXtreme Gradient Boosting (XGBoost). We also validated the performance of our model on one external test set from a previous study which contained 1922 DFIs. Finally, we applied our model to recommend whether a drug should or should not be taken with some food compounds based on their interactions. The model can provide highly accurate and clinically relevant recommendations, especially for DFIs that may cause severe adverse events and even death. Our proposed model can contribute to developing more robust predictive models to help patients, under the supervision and consultants of physicians, avoid DFI adverse effects in combining drugs and foods for therapy.

Keywords: DrugBank; FooDB; adverse food reaction; chemical informatics; drug–food interactions; drug–nutrient interactions; explainable artificial intelligence; machine learning; precision medicine; simplified molecular-input line-entry system.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
The workflow of our study. First, we obtained the SMILES notations of drug and food constituents from DrugBank and FooDB databases. After pre-processing, we filtered out 1133 drugs and 4341 food compounds, making 2,382,903 drug–food pairs in the benchmark dataset. We subsequently used PyBioMed and RDKit packages in Python to extract 3780 features of each interacting drug–food pair. We applied a four-step feature selection process to the training set to find the 18 most important features. Five classification algorithms were applied to the training data via five-fold cross-validation. As XGBoost gave the best prediction outcome, we fine-tuned it using the validation set. Finally, we tested our optimum XGBoost model on the internal test set and one external test set containing 1922 drug-food pairs. Finally, we used the model to recommend some common drug–food compound combinations.
Figure 2
Figure 2
Confusion matrix of our optimal XGBoost model on the testing and the external test sets. On the testing set (left plot): The model most accurately detected positive and non-significant DFIs (recall 0.99 in both classes) while only recognizing 87% of negative DFIs. Likewise, on the external test set (right plot), the model recognized all positive DFIs and 99% of non-significant DFIs. Negative DFIs were recognized as acceptable, with 94% of those discriminated against.
Figure 3
Figure 3
The SHAP (SHapley Additive exPlanations) plot of eighteen optimal features. The red dots of MRVSA0, EstateVSA2, MRVSA9, MRVSA8 and blue dots of PEOEVSA5, MTPSA+MTPSA, VSAEstate10+VSAEstate10 gather on the right side of the x-axis, indicating that the high values and low values of these features, respectively, direct the model in recognizing the non-significant DFIs. High PEOEVSA5, EstateVSA7, slogPVSA9, MTPSA+MTPSA, and low values of MRVSA0, MRVSA9 help detect the negative DFIs. The positive DFIs are identified by the increasing values of PEOEVSA5, EstateVSA0*LabuteASA, EstateVSA1*VSAEstate8 and the decline of PEOEVSA9, EstateVSA7, EstateVSA2, slogPVSA9, MRVSA2, VSAEstate7+VSAEstate7, slogPVSA0, PEOEVSA12.

Similar articles

Cited by

References

    1. Bushra R., Aslam N., Khan A.Y. Food-drug interactions. Oman Med. J. 2011;26:77. doi: 10.5001/omj.2011.21. - DOI - PMC - PubMed
    1. Edwards I.R., Aronson J.K. Adverse drug reactions: Definitions, diagnosis, and management. Lancet. 2000;356:1255–1259. doi: 10.1016/S0140-6736(00)02799-9. - DOI - PubMed
    1. Kantor E.D., Rehm C.D., Haas J.S., Chan A.T., Giovannucci E.L. Trends in prescription drug use among adults in the United States from 1999–2012. JAMA. 2015;314:1818–1830. doi: 10.1001/jama.2015.13766. - DOI - PMC - PubMed
    1. Sutherland J.J., Daly T.M., Liu X., Goldstein K., Johnston J.A., Ryan T.P. Co-prescription trends in a large cohort of subjects predict substantial drug-drug interactions. PLoS ONE. 2015;10:e0118991. doi: 10.1371/journal.pone.0118991. - DOI - PMC - PubMed
    1. Ryu J.Y., Kim H.U., Lee S.Y. Deep learning improves prediction of drug–drug and drug–food interactions. Proc. Natl. Acad. Sci. USA. 2018;115:E4304–E4311. doi: 10.1073/pnas.1803294115. - DOI - PMC - PubMed