Health Inf Sci Syst. 2025 Aug 17;13(1):52.
doi: 10.1007/s13755-025-00368-0. eCollection 2025 Dec.

Analyses of different prescriptions for health using artificial intelligence: a critical approach based on the international guidelines of health institutions


Vítor Marcelo Soares Campos et al. Health Inf Sci Syst.

Abstract

Purpose: Large language models (LLMs) are increasingly used for health advice, but their alignment with evidence-based guidelines and their sensitivity to question phrasing remain unclear.

Methods: In May 2025, we evaluated ChatGPT 4.0, ChatGPT 4.5, and DeepSeek V3 using four clinical vignettes: major depression with polysubstance use, irritable bowel syndrome flare, new-onset hypertension requiring exercise counseling, and chronic low back pain. Each scenario was tested with clinician- and patient-style prompts, generating 24 responses. Outputs were benchmarked against 89 guideline-derived recommendations from three authoritative sources per domain. Two blinded reviewers scored concordance (1 = actionable detail, 0.5 = generic mention, 0 = absent), with adjudication by a third reviewer. Inter-rater reliability was measured using Cronbach's α.
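The concordance rubric and reliability statistic described above can be sketched in a few lines. This is an illustrative reconstruction, not the authors' actual analysis code: the function names, example scores, and the treatment of the two reviewers as "items" in Cronbach's α are all assumptions for demonstration.

```python
# Hedged sketch of the scoring scheme described in Methods.
# Each guideline recommendation is scored 1 (actionable detail),
# 0.5 (generic mention), or 0 (absent); concordance is the share of
# the maximum possible score. All data below are illustrative.

def concordance(scores):
    """Percent of the maximum possible score across recommendations."""
    return 100.0 * sum(scores) / len(scores)

def cronbach_alpha(ratings):
    """Cronbach's alpha for a raters-by-items matrix (list of lists).

    Treats each rater as an 'item' measuring the same set of
    recommendations, a common way to express inter-rater reliability.
    """
    k = len(ratings)        # number of raters
    n = len(ratings[0])     # number of recommendations scored

    def var(xs):            # sample variance (n - 1 denominator)
        m = sum(xs) / len(xs)
        return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)

    totals = [sum(r[i] for r in ratings) for i in range(n)]
    return k / (k - 1) * (1 - sum(var(r) for r in ratings) / var(totals))

# Example: two blinded reviewers scoring six recommendations
r1 = [1, 0.5, 0, 1, 1, 0.5]
r2 = [1, 0.5, 0.5, 1, 1, 0.5]
print(round(concordance(r1), 1))
print(round(cronbach_alpha([r1, r2]), 2))
```

In this toy example, near-identical reviewer scores yield a high α, mirroring the strong agreement (α = 0.97) reported in the Results.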

Results: ChatGPT 4.5 achieved the highest guideline concordance (61.9%), followed by DeepSeek V3 (60.7%) and ChatGPT 4.0 (53.7%). Performance varied by domain, exceeding 67% in mental health but dropping below 45% in nutrition. Prompt phrasing influenced capture rates, with clinician-style prompts improving scores in exercise and pain domains, while patient-style prompts outperformed in nutrition. Reviewer agreement was high (α = 0.97 for chatbot scoring; 0.80 for matrix coding).

Conclusion: LLMs can rapidly generate draft care plans that reflect clinical guidelines, though they favor generic over individualized advice. By introducing a unique, domain-agnostic scoring rubric that aligns AI-generated 30-day care plans with gold-standard guidelines, and by applying it in parallel to mental health, nutrition, exercise, and physical therapy scenarios, our study delivers the first prompt-sensitive audit showing where current LLMs exceed, match, or fall short of multidisciplinary best practices.

Supplementary information: The online version contains supplementary material available at 10.1007/s13755-025-00368-0.

Keywords: Artificial intelligence; Chatbots in healthcare; Digital health; Machine learning; Personalized medicine.


Conflict of interest statement

The authors declare no conflicts of interest.

