J Am Med Inform Assoc. 2025 Jun 1;32(6):1032-1039.

doi: 10.1093/jamia/ocaf059.

Detecting emergencies in patient portal messages using large language models and knowledge graph-based retrieval-augmented generation

Siru Liu¹,², Aileen P Wright¹,³, Allison B McCoy¹, Sean S Huang¹,³, Bryan Steitz¹, Adam Wright¹,³

Affiliations

¹ Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37212, United States.
² Department of Computer Science, Vanderbilt University, Nashville, TN 37240, United States.
³ Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States.

PMID: 40220286
PMCID: PMC12089757
DOI: 10.1093/jamia/ocaf059

Siru Liu et al. J Am Med Inform Assoc. 2025.

Abstract

Objectives: This study aims to develop and evaluate an approach that uses large language models (LLMs) and a knowledge graph to identify patient messages that indicate a need for emergency care. The goal is to improve patient safety by notifying patients when their messages indicate an emergency and guiding them to seek immediate help rather than use the patient portal.

Materials and methods: We selected 1020 messages sent to Vanderbilt University Medical Center providers between January 1, 2022 and March 7, 2023. We developed four models to triage these messages for emergencies: (1) Prompt-Only: the patient message was input with a prompt directly into the LLM; (2) Naïve Retrieval Augmented Generation (RAG): provided retrieved information as context to the LLM; (3) RAG from Knowledge Graph with Local Search: a knowledge graph was used to retrieve locally relevant information based on semantic similarities; (4) RAG from Knowledge Graph with Global Search: a knowledge graph was used to retrieve globally relevant information through hierarchical community detection. The knowledge base was a triage book covering 225 protocols.
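As a rough illustration of the naïve RAG idea in model (2), the sketch below retrieves the most similar protocol text for a message and places it in the prompt as context. It is not the authors' implementation: the protocol snippets, the function names, and the prompt wording are hypothetical, and a simple bag-of-words cosine similarity stands in for whatever retrieval method the study used. No LLM is called; the sketch stops at prompt construction.

```python
import math
import re
from collections import Counter

# Hypothetical protocol snippets for illustration only; the study's
# triage book (225 protocols) is not reproduced here.
PROTOCOLS = {
    "headache": "Headache with neck stiffness, facial numbness, or weakness: call 911.",
    "medication refill": "Medication refill requests are routine; respond within two business days.",
    "chest pain": "Chest pain with shortness of breath or sweating: call 911.",
}


def tokenize(text):
    """Lowercase the text and count its alphanumeric tokens."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(message, k=1):
    """Return the names of the k protocols most similar to the message."""
    query = tokenize(message)
    ranked = sorted(
        PROTOCOLS,
        key=lambda name: cosine(query, tokenize(PROTOCOLS[name])),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(message, k=1):
    """Assemble a prompt with retrieved protocol text as context (assumed wording)."""
    context = "\n".join(PROTOCOLS[name] for name in retrieve(message, k))
    return (
        "Triage guidance:\n"
        f"{context}\n\n"
        "Patient message:\n"
        f"{message}\n\n"
        "Does this message indicate an emergency? Answer yes or no."
    )


message = "I have a severe headache, neck tightness and lip numbness"
print(retrieve(message))
print(build_prompt(message))
```

The knowledge graph variants in models (3) and (4) would replace the flat `PROTOCOLS` lookup with graph-based retrieval, which this sketch does not attempt.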

Results: The RAG from Knowledge Graph model with global search outperformed the other models, achieving an accuracy of 0.99, a sensitivity of 0.98, and a specificity of 0.99. It showed significant improvements in triaging emergency messages compared with both the LLM without RAG and naïve RAG.
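For readers less familiar with the reported metrics, the following sketch shows how accuracy, sensitivity, and specificity are conventionally computed from binary triage labels (1 = emergency, 0 = non-emergency). The function name and the toy labels are assumptions for illustration; they are not the study's data or evaluation code.

```python
def triage_metrics(y_true, y_pred):
    """Compute accuracy, sensitivity, and specificity for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # emergencies caught
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # emergencies missed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # non-emergencies cleared
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false alarms
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn) if (tp + fn) else float("nan"),
        "specificity": tn / (tn + fp) if (tn + fp) else float("nan"),
    }


# Toy labels, invented for this example only.
y_true = [1, 1, 0, 0, 0]
y_pred = [1, 0, 0, 0, 1]
print(triage_metrics(y_true, y_pred))
```

In an emergency-detection setting, sensitivity reflects how many true emergencies are flagged, while specificity reflects how many routine messages are correctly left unflagged.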

Discussion: The traditional LLM without any retrieval mechanism underperformed compared to models with RAG, which aligns with the expected benefits of augmenting LLMs with domain-specific knowledge sources. Our results suggest that providing external knowledge, especially in a structured manner and in community summaries, can improve LLM performance in triaging patient portal messages.

Conclusion: LLMs can effectively assist in triaging emergency patient messages when integrated with a knowledge graph built from a nurse triage book. Future research should focus on expanding the knowledge graph and deploying the system to evaluate its impact on patient outcomes.

Keywords: clinical decision support; knowledge graph; large language model; message content; patient portal; patient-doctor communication; primary health care; retrieval augmented generation.

Conflict of interest statement

None declared.

Figures

**Figure 1.**
Screenshot of a patient message indicating a potential medical emergency. The form includes a warning to call 911 or visit the emergency department for emergencies, followed by text fields for “Enter a subject...” (filled in with “Headache”) and “Enter your message...” (containing a detailed note about severe headache, neck tightness, left-side facial twitching, and lip numbness). The interface also shows buttons labeled “Discard,” “Draft,” “Attach,” and “Send.”

**Figure 2.**
Study overview. A flowchart showing a study overview where a knowledge graph, developed using a GPT-4–powered nurse triage book, feeds into three RAG methods (naïve, local, global) and a prompt-only approach, with all outputs evaluated via statistical tests on key performance metrics. (RAG: retrieval augmented generation; LLM: large language model.)

Cited by

Reason T, Klijn S, Rawlinson W, Benbow E, Langham J, Teitsson S, Johannesen K, Malcolm B. Using Generative Artificial Intelligence in Health Economics and Outcomes Research: A Primer on Techniques and Breakthroughs. Pharmacoecon Open. 2025 Jul;9(4):501-517. doi: 10.1007/s41669-025-00580-4. Epub 2025 Apr 29. PMID: 40301283.
Bakken S. Harnessing the power of large language models for clinical tasks and synthesis of scientific literature. J Am Med Inform Assoc. 2025 Jun 1;32(6):983-984. doi: 10.1093/jamia/ocaf071. PMID: 40390622.
