Why do users override alerts? Utilizing large language model to summarize comments and optimize clinical decision support
- PMID: 38452289
- PMCID: PMC11105133
- DOI: 10.1093/jamia/ocae041
Why do users override alerts? Utilizing large language model to summarize comments and optimize clinical decision support
Abstract
Objectives: To evaluate the capability of using generative artificial intelligence (AI) in summarizing alert comments and to determine if the AI-generated summary could be used to improve clinical decision support (CDS) alerts.
Materials and methods: We extracted user comments to alerts generated from September 1, 2022 to September 1, 2023 at Vanderbilt University Medical Center. For a subset of 8 alerts, comment summaries were generated independently by 2 physicians and then separately by GPT-4. We surveyed 5 CDS experts to rate the human-generated and AI-generated summaries on a scale from 1 (strongly disagree) to 5 (strongly agree) for the 4 metrics: clarity, completeness, accuracy, and usefulness.
Results: Five CDS experts participated in the survey. A total of 16 human-generated summaries and 8 AI-generated summaries were assessed. Among the top 8 rated summaries, five were generated by GPT-4. AI-generated summaries demonstrated high levels of clarity, accuracy, and usefulness, similar to the human-generated summaries. Moreover, AI-generated summaries exhibited significantly higher completeness and usefulness compared to the human-generated summaries (AI: 3.4 ± 1.2, human: 2.7 ± 1.2, P = .001).
Conclusion: End-user comments provide clinicians' immediate feedback to CDS alerts and can serve as a direct and valuable data resource for improving CDS delivery. Traditionally, these comments may not be considered in the CDS review process due to their unstructured nature, large volume, and the presence of redundant or irrelevant content. Our study demonstrates that GPT-4 is capable of distilling these comments into summaries characterized by high clarity, accuracy, and completeness. AI-generated summaries are equivalent and potentially better than human-generated summaries. These AI-generated summaries could provide CDS experts with a novel means of reviewing user comments to rapidly optimize CDS alerts both online and offline.
Keywords: alert fatigue; clinical decision support; health personnel; large language model.
© The Author(s) 2024. Published by Oxford University Press on behalf of the American Medical Informatics Association.
Conflict of interest statement
S.N. is on the advisory board for Baxter Health and Merative Micromedex. The other authors do not have conflicts of interest related to this study.
Figures



Similar articles
-
Using AI-generated suggestions from ChatGPT to optimize clinical decision support.J Am Med Inform Assoc. 2023 Jun 20;30(7):1237-1245. doi: 10.1093/jamia/ocad072. J Am Med Inform Assoc. 2023. PMID: 37087108 Free PMC article.
-
Assessing the Value of ChatGPT for Clinical Decision Support Optimization.medRxiv [Preprint]. 2023 Feb 23:2023.02.21.23286254. doi: 10.1101/2023.02.21.23286254. medRxiv. 2023. PMID: 36865144 Free PMC article. Preprint.
-
The use of artificial intelligence to optimize medication alerts generated by clinical decision support systems: a scoping review.J Am Med Inform Assoc. 2024 May 20;31(6):1411-1422. doi: 10.1093/jamia/ocae076. J Am Med Inform Assoc. 2024. PMID: 38641410 Free PMC article.
-
Clinician collaboration to improve clinical decision support: the Clickbusters initiative.J Am Med Inform Assoc. 2022 May 11;29(6):1050-1059. doi: 10.1093/jamia/ocac027. J Am Med Inform Assoc. 2022. PMID: 35244165 Free PMC article.
-
Optimizing clinical decision support alerts in electronic medical records: a systematic review of reported strategies adopted by hospitals.J Am Med Inform Assoc. 2021 Jan 15;28(1):177-183. doi: 10.1093/jamia/ocaa279. J Am Med Inform Assoc. 2021. PMID: 33186438 Free PMC article.
Cited by
-
Evaluation of a Digital Scribe: Conversation Summarization for Emergency Department Consultation Calls.Appl Clin Inform. 2024 May 15;15(3):600-11. doi: 10.1055/a-2327-4121. Online ahead of print. Appl Clin Inform. 2024. PMID: 38749477 Free PMC article.
-
Leveraging Retrieval-Augmented Large Language Models for Dietary Recommendations With Traditional Chinese Medicine's Medicine Food Homology: Algorithm Development and Validation.JMIR Med Inform. 2025 Aug 21;13:e75279. doi: 10.2196/75279. JMIR Med Inform. 2025. PMID: 40840437 Free PMC article.
-
Pearls and Pitfalls for LLMs 2.0.Radiology. 2024 Oct;313(1):e242512. doi: 10.1148/radiol.242512. Radiology. 2024. PMID: 39470427 No abstract available.
-
Detecting emergencies in patient portal messages using large language models and knowledge graph-based retrieval-augmented generation.J Am Med Inform Assoc. 2025 Jun 1;32(6):1032-1039. doi: 10.1093/jamia/ocaf059. J Am Med Inform Assoc. 2025. PMID: 40220286 Free PMC article.
-
Using large language model to guide patients to create efficient and comprehensive clinical care message.J Am Med Inform Assoc. 2024 Aug 1;31(8):1665-1670. doi: 10.1093/jamia/ocae142. J Am Med Inform Assoc. 2024. PMID: 38917441 Free PMC article.
References
-
- Parasrampuria S, Henry J.. Hospitals’ use of electronic health records data, 2015-2017. ONC Data Br. 2019;46:1-13. - PubMed
-
- Wright A, Sittig DF, Ash JS, et al.Development and evaluation of a comprehensive clinical decision support taxonomy: comparison of front-end tools in commercial and internally developed electronic health record systems. J Am Med Informatics Assoc. 2011;18(3):232-242. 10.1136/amiajnl-2011-000113 - DOI - PMC - PubMed
-
- Thomas Craig KJ, Fusco N, Lindsley K, et al.Rapid review: identification of digital health interventions in atherosclerotic-related cardiovascular disease populations to address racial, ethnic, and socioeconomic health disparities. Cardiovasc Digit Health J. 2020;1(3):139-148. 10.1016/j.cvdhj.2020.11.001 - DOI - PMC - PubMed