Review

. 2024 Nov 12;5(2):137-150.

doi: 10.1016/j.jointm.2024.09.002. eCollection 2025 Apr.

Critical care studies using large language models based on electronic healthcare records: A technical note

Zhongheng Zhang^{1

2}, Hongying Ni³

Affiliations

¹ Department of Emergency Medicine, Provincial Key Laboratory of Precise Diagnosis and Treatment of Abdominal Infection, Sir Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China.
² School of Medicine, Shaoxing University, Shaoxing, Zhejiang, China.
³ Department of Critical Care Medicine, Zhejiang University School of Medicine, Affiliated Jinhua Hospital, Jinhua, China.

PMID: 40241837
PMCID: PMC11997556
DOI: 10.1016/j.jointm.2024.09.002

Review

Critical care studies using large language models based on electronic healthcare records: A technical note

Zhongheng Zhang et al. J Intensive Med. 2024.

. 2024 Nov 12;5(2):137-150.

doi: 10.1016/j.jointm.2024.09.002. eCollection 2025 Apr.

Authors

Zhongheng Zhang^{1

2}, Hongying Ni³

Affiliations

¹ Department of Emergency Medicine, Provincial Key Laboratory of Precise Diagnosis and Treatment of Abdominal Infection, Sir Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China.
² School of Medicine, Shaoxing University, Shaoxing, Zhejiang, China.
³ Department of Critical Care Medicine, Zhejiang University School of Medicine, Affiliated Jinhua Hospital, Jinhua, China.

PMID: 40241837
PMCID: PMC11997556
DOI: 10.1016/j.jointm.2024.09.002

Abstract

The integration of large language models (LLMs) in clinical medicine, particularly in critical care, has introduced transformative capabilities for analyzing and managing complex medical information. This technical note explores the application of LLMs, such as generative pretrained transformer 4 (GPT-4) and Qwen-Chat, in interpreting electronic healthcare records to assist with rapid patient condition assessments, predict sepsis, and automate the generation of discharge summaries. The note emphasizes the significance of LLMs in processing unstructured data from electronic health records (EHRs), extracting meaningful insights, and supporting personalized medicine through nuanced understanding of patient histories. Despite the technical complexity of deploying LLMs in clinical settings, this document provides a comprehensive guide to facilitate the effective integration of LLMs into clinical workflows, focusing on the use of DashScope's application programming interface (API) services for judgment on patient prognosis and organ support recommendations based on natural language in EHRs. By illustrating practical steps and best practices, this work aims to lower the technical barriers for clinicians and researchers, enabling broader adoption of LLMs in clinical research and practice to enhance patient care and outcomes.

Keywords: Critical care; Large language model.

PubMed Disclaimer

Figures

**Figure 1**
Patient outcomes based on adherence to recommended support. This bar plot illustrates the distribution of patient outcomes according to their adherence to organ support recommendations made by a LLM. The data categorize patients into four groups: (1) Recommended and Received: Patients for whom the LLM recommended organ support, and who actually received the support. (2) Recommended but Not Received: Patients for whom the LLM recommended organ support, but who did not receive the support. (3) Not Recommended but Received: Patients for whom the LLM did not recommend organ support, but who received the support regardless. (4) Not Recommended and Not Received: Patients for whom the LLM did not recommend organ support, and who did not receive the support. The plot shows the count of patients with outcomes classified as “improved” or “worsened” within each group. The “x” axis represents the four patient groups, while the “y” axis indicates the number of patients. The outcomes are color-coded: blue for “improved” and red for “worsened.” This visualization helps in understanding the effectiveness of adhering to the LLM's support recommendations. For instance, a higher count of “improved” outcomes in the “Recommended and Received” group compared to the “Recommended but Not Received” group would suggest that following the LLM's recommendations positively impacts patient outcomes. The data used for this analysis includes 100 patients, and the statistical significance of the differences in outcomes was assessed using a chi-squared test, with results indicating a potential association between adherence to recommendations and improved outcomes. LLM: Large language model.

See this image and copyright information in PMC

Cited by

Benchmarking vision-language models for diagnostics in emergency and critical care settings.
Kurz CF, Merzhevich T, Eskofier BM, Kather JN, Gmeiner B. Kurz CF, et al. NPJ Digit Med. 2025 Jul 10;8(1):423. doi: 10.1038/s41746-025-01837-2. NPJ Digit Med. 2025. PMID: 40640347 Free PMC article.

References

1. Sblendorio E., Dentamaro V., Lo Cascio A., Germini F., Piredda M., Cicolini G. Integrating human expertise & automated methods for a dynamic and multi-parametric evaluation of large language models’ feasibility in clinical decision-making. Int J Med Inform. 2024;188 doi: 10.1016/j.ijmedinf.2024.105501. - DOI - PubMed
1. Chung P., Fong C.T., Walters A.M., Aghaeepour N., Yetisgen M., O'Reilly-Shah V.N. Large language model capabilities in perioperative risk prediction and prognostication. JAMA Surg. 2024;159(8):928–937. doi: 10.1001/jamasurg.2024.1621. - DOI - PMC - PubMed
1. Iqbal U., Lee L.T., Rahmanti A.R., Celi L.A., Li Y.J. Can large language models provide secondary reliable opinion on treatment options for dermatological diseases? J Am Med Inform Assoc. 2024;31(6):1341–1347. doi: 10.1093/jamia/ocae067. - DOI - PMC - PubMed
1. Saner F.H., Saner Y.M., Abufarhaneh E., Broering D.C., Raptis D.A. Comparative analysis of artificial intelligence (AI) languages in predicting sequential organ failure assessment (SOFA) scores. Cureus. 2024;16(5):e59662. doi: 10.7759/cureus.59662. - DOI - PMC - PubMed
1. Amrollahi F., Shashikumar S.P., Razmi F., Nemati S. Contextual embeddings from clinical notes improves prediction of sepsis. AMIA Annu Symp Proc. 2021;2020:197–202. - PMC - PubMed

Publication types

Actions

LinkOut - more resources

Full Text Sources
- Elsevier Science
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Critical care studies using large language models based on electronic healthcare records: A technical note

Affiliations

Critical care studies using large language models based on electronic healthcare records: A technical note

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

Publication types

LinkOut - more resources

Full Text Sources

Abstract

Figures

Similar articles

Cited by

References

Publication types

Related information

LinkOut - more resources

Full Text Sources