Evaluation of the integration of retrieval-augmented generation in large language model for breast cancer nursing care responses
- PMID: 39730573
- PMCID: PMC11680762
- DOI: 10.1038/s41598-024-81052-3
Evaluation of the integration of retrieval-augmented generation in large language model for breast cancer nursing care responses
Abstract
Breast cancer is one of the most common malignant tumors in women worldwide. Although large language models (LLMs) can provide breast cancer nursing care consultation, inherent hallucinations can lead to inaccurate responses. Retrieval-augmented generation (RAG) technology can improve LLM performance, offering a new approach for clinical applications. In the present study, we evaluated the performance of a LLM in breast cancer nursing care using RAG technology. In the control group (GPT-4), questions were answered directly using the GPT-4 model, whereas the experimental group (RAG-GPT) used the GPT-4 model combined with RAG. A knowledge base for breast cancer nursing was created for the RAG-GPT group, and 15 of 200 real-world clinical care questions were answered randomly. The primary endpoint was overall satisfaction, and the secondary endpoints were accuracy and empathy. RAG-GPT included a curated knowledge base related to breast cancer nursing care, including textbooks, guidelines, and traditional Chinese therapy. The RAG-GPT group showed significantly higher overall satisfaction than that of the GPT-4 group (8.4 ± 0.84 vs. 5.4 ± 1.27, p < 0.01) as well as an improved accuracy of responses (8.6 ± 0.69 vs. 5.6 ± 0.96, p < 0.01). However, there was no inter-group difference in empathy (8.4 ± 0.85 vs. 7.8 ± 1.22, p > 0.05). Overall, this study revealed that RAG technology could improve LLM performance significantly, likely because of the increased accuracy of the answers without diminishing empathy. These findings provide a theoretical basis for applying RAG technology to LLMs in clinical nursing practice and education.
Keywords: Breast cancer nursing care; ChatGPT; GPT-4; Large language models; Nurse; Retrieval-augmented generation.
© 2024. The Author(s).
Conflict of interest statement
Declarations. Competing interests: The authors declare no competing interests. Approval for human experiments: As this study did not involve human or animal research and the ChatGPT API is freely accessible online, no ethical committee approval was required.
Figures



Similar articles
-
Application of NotebookLM, a large language model with retrieval-augmented generation, for lung cancer staging.Jpn J Radiol. 2025 Apr;43(4):706-712. doi: 10.1007/s11604-024-01705-1. Epub 2024 Nov 25. Jpn J Radiol. 2025. PMID: 39585559
-
Improving accuracy of GPT-3/4 results on biomedical data using a retrieval-augmented language model.PLOS Digit Health. 2024 Aug 21;3(8):e0000568. doi: 10.1371/journal.pdig.0000568. eCollection 2024 Aug. PLOS Digit Health. 2024. PMID: 39167594 Free PMC article.
-
Custom Large Language Models Improve Accuracy: Comparing Retrieval Augmented Generation and Artificial Intelligence Agents to Noncustom Models for Evidence-Based Medicine.Arthroscopy. 2025 Mar;41(3):565-573.e6. doi: 10.1016/j.arthro.2024.10.042. Epub 2024 Nov 7. Arthroscopy. 2025. PMID: 39521391
-
Enhancing medical AI with retrieval-augmented generation: A mini narrative review.Digit Health. 2025 Apr 21;11:20552076251337177. doi: 10.1177/20552076251337177. eCollection 2025 Jan-Dec. Digit Health. 2025. PMID: 40343063 Free PMC article. Review.
-
Integrating Retrieval-Augmented Generation with Large Language Models in Nephrology: Advancing Practical Applications.Medicina (Kaunas). 2024 Mar 8;60(3):445. doi: 10.3390/medicina60030445. Medicina (Kaunas). 2024. PMID: 38541171 Free PMC article. Review.
Cited by
-
Evaluating Large Language Models for Automated CPT Code Prediction in Endovascular Neurosurgery.J Med Syst. 2025 Jan 24;49(1):15. doi: 10.1007/s10916-025-02149-4. J Med Syst. 2025. PMID: 39853605
-
Enhancing Patient Outcomes in Head and Neck Cancer Radiotherapy: Integration of Electronic Patient-Reported Outcomes and Artificial Intelligence-Driven Oncology Care Using Large Language Models.Cancers (Basel). 2025 Jul 15;17(14):2345. doi: 10.3390/cancers17142345. Cancers (Basel). 2025. PMID: 40723229 Free PMC article.
References
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical