Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Sep;89(3):921-929.
doi: 10.1007/s12020-025-04294-9. Epub 2025 Jun 5.

Enhancing patient-centered care with AI: a study of responses to neuroendocrine neoplasms queries

Collaborators, Affiliations

Enhancing patient-centered care with AI: a study of responses to neuroendocrine neoplasms queries

Francesco Panzuto et al. Endocrine. 2025 Sep.

Abstract

Introduction: Large Language Models (LLMs) are increasingly used in oncology, but their application in neuroendocrine neoplasms (NENs) is still unexplored.

Aim: To evaluate the accuracy, clarity, and completeness of LLM responses to clinically relevant NEN management questions.

Material and methods: A study was conducted from October to December 2024, during which a team of experts posed nine key NEN management questions to three LLMs: ChatGPT Plus, Microsoft Copilot, and Perplexity. Responses were assessed by 22 expert physicians across specialties using a 5-point Likert scale based on scientific accuracy, clarity, and completeness, and additionally evaluated by 24 NEN patients for clarity and relevance. Primary outcomes included LLM performance across evaluation criteria and factors influencing ratings, analyzed using a linear mixed-effects model.

Results: ChatGPT Plus scored highest (M = 3.72, SE = 0.19), followed by Copilot (M = 3.54, SE = 0.19) and Perplexity (M = 3.22, SE = 0.19), with clarity rating the highest. The chemotherapy indications question received the lowest scores, underscoring LLM challenges in handling complex clinical decisions.

Discussion: This study highlights LLMs' potential in NEN management as informative tools with clear but variably accurate responses. Continuous improvement and clinician oversight are essential for their successful integration into patient communication.

Keywords: Artificial intelligence; ChatGPT; Management; Neuroendocrine tumors.

PubMed Disclaimer

Conflict of interest statement

Compliance with ethical standards. Conflict of interest: The authors declare no competing interests.

References

    1. Y. Abboud, A. Shah, R. Sutariya et al. Gastroenteropancreatic neuroendocrine tumor incidence by sex and age in the US. JAMA Oncol. 11, 345–349 (2025). https://doi.org/10.1001/jamaoncol.2024.5937 . - DOI - PubMed
    1. F. Panzuto, E. Merola, M.E. Pavel et al. Stage IV gastro-entero-pancreatic neuroendocrine neoplasms: a risk score to predict clinical outcome. Oncologist 22, 409–415 (2017). https://doi.org/10.1634/theoncologist.2016-0351 . - DOI - PubMed - PMC
    1. L. Magi, F. Mazzuca, M. Rinzivillo et al. Multidisciplinary management of neuroendocrine neoplasia: a real-world experience from a referral center. J. Clin. Med. 8, 910 (2019). https://doi.org/10.3390/jcm8060910 . - DOI - PubMed - PMC
    1. M. Maida, D. Ramai, Y. Mori et al. The role of generative language systems in increasing patient awareness of colon cancer screening. Endoscopy. (2024). https://doi.org/10.1055/a-2388-6084 .
    1. O. Ruiz Sarrias, M.P. Martínez Del Prado, M.Á Sala Gonzalez et al. Leveraging large language models for precision monitoring of chemotherapy-induced toxicities: a pilot study with expert comparisons and future directions. Cancers 16, 2830 (2024). https://doi.org/10.3390/cancers16162830 . - DOI - PubMed - PMC

LinkOut - more resources