A systematic mapping review on the capability of large language models in drug-drug interaction analysis

Himel Mondal¹, Ipsita Dash², Shaikat Mondal³, Seshadri Reddy Varikasuvu⁴, Rintu Kumar Gayen⁵, Shreya Sharma⁶, Sairavi Kiran Biri⁷

Affiliations

¹ Department of Physiology, All India Institute of Medical Sciences, Deoghar, India.
² Department of Biochemistry, Saheed Rendo Majhi Medical College, Kalahandi, India.
³ Department of Physiology, Raiganj Government Medical College, Raiganj, India.
⁴ Department of Biochemistry, All India Institute of Medical Sciences, Deoghar, India.
⁵ Department of Electronics and Communication Engineering, Institute of Engineering and Management, Kolkata, India.
⁶ Neuromodulation Laboratory, All India Institute of Medical Sciences, Deoghar, India.
⁷ Department of Biochemistry, Phoulo Jhano Medical College, Dumka, India.

PMID: 40999995
DOI: 10.1080/17512433.2025.2568090

Review

Himel Mondal et al. Expert Rev Clin Pharmacol. 2025 Sep;18(9):683-690. doi: 10.1080/17512433.2025.2568090. Epub 2025 Sep 29.

Abstract

Background: Drug-drug interactions (DDIs) are a global health concern affecting patient safety and treatment outcomes. Large language models (LLMs), such as ChatGPT, offer an accessible alternative to the costly tools and expert knowledge on which DDI detection often relies; however, their effectiveness in DDI analysis remains unclear. This review evaluates the current evidence on the performance of LLM-based chatbots in identifying DDIs.

Methods: A PRISMA-compliant systematic review (PROSPERO: CRD420251020360) was conducted using PubMed, Scopus, and Web of Science, covering studies published between 1 January 2015 and 31 March 2025. Eligible studies were those that used publicly accessible LLM chatbots for DDI detection.

Results: Nine studies (2023-2025) evaluated publicly accessible LLM chatbots, including ChatGPT, Bing AI, and Google Bard, for DDI identification. Methods ranged from patient-level polypharmacy screening to single-drug checks and case vignettes. Chatbot performance was inconsistent: ChatGPT identified many potential DDIs, and ChatGPT-4.0 generally identified more, but accuracy was variable; Bing AI and Google Bard were less reliable.

Conclusion: Publicly accessible LLM chatbots demonstrate variable and partial effectiveness in detecting DDIs. There is a clear need to develop dedicated, freely available chatbots designed specifically for DDI identification. Future research should focus on standardizing evaluation methods and expanding access to improve medication safety in clinical practice.

PROSPERO: CRD420251020360.

Keywords: Large language models; artificial intelligence; ChatGPT; chatbot; drug–drug interactions.

Plain language summary

Taking many medicines at once (polypharmacy) can lead to drug-drug interactions (DDIs), where one drug affects how another works, causing side effects or reducing treatment success. Detecting DDIs is important, but it often relies on costly tools or expert knowledge, which may not be readily available in all settings. This review looked at how well public AI chatbots such as ChatGPT, Bing AI, and Google Bard identify DDIs. Their performance varied from one chatbot to another and was not reliable enough for medical use. Further research is needed to establish their safety and accuracy.
