Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Nov 12;5(2):137-150.
doi: 10.1016/j.jointm.2024.09.002. eCollection 2025 Apr.

Critical care studies using large language models based on electronic healthcare records: A technical note

Affiliations
Review

Critical care studies using large language models based on electronic healthcare records: A technical note

Zhongheng Zhang et al. J Intensive Med. .

Abstract

The integration of large language models (LLMs) in clinical medicine, particularly in critical care, has introduced transformative capabilities for analyzing and managing complex medical information. This technical note explores the application of LLMs, such as generative pretrained transformer 4 (GPT-4) and Qwen-Chat, in interpreting electronic healthcare records to assist with rapid patient condition assessments, predict sepsis, and automate the generation of discharge summaries. The note emphasizes the significance of LLMs in processing unstructured data from electronic health records (EHRs), extracting meaningful insights, and supporting personalized medicine through nuanced understanding of patient histories. Despite the technical complexity of deploying LLMs in clinical settings, this document provides a comprehensive guide to facilitate the effective integration of LLMs into clinical workflows, focusing on the use of DashScope's application programming interface (API) services for judgment on patient prognosis and organ support recommendations based on natural language in EHRs. By illustrating practical steps and best practices, this work aims to lower the technical barriers for clinicians and researchers, enabling broader adoption of LLMs in clinical research and practice to enhance patient care and outcomes.

Keywords: Critical care; Large language model.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Patient outcomes based on adherence to recommended support. This bar plot illustrates the distribution of patient outcomes according to their adherence to organ support recommendations made by a LLM. The data categorize patients into four groups: (1) Recommended and Received: Patients for whom the LLM recommended organ support, and who actually received the support. (2) Recommended but Not Received: Patients for whom the LLM recommended organ support, but who did not receive the support. (3) Not Recommended but Received: Patients for whom the LLM did not recommend organ support, but who received the support regardless. (4) Not Recommended and Not Received: Patients for whom the LLM did not recommend organ support, and who did not receive the support. The plot shows the count of patients with outcomes classified as “improved” or “worsened” within each group. The “x” axis represents the four patient groups, while the “y” axis indicates the number of patients. The outcomes are color-coded: blue for “improved” and red for “worsened.” This visualization helps in understanding the effectiveness of adhering to the LLM's support recommendations. For instance, a higher count of “improved” outcomes in the “Recommended and Received” group compared to the “Recommended but Not Received” group would suggest that following the LLM's recommendations positively impacts patient outcomes. The data used for this analysis includes 100 patients, and the statistical significance of the differences in outcomes was assessed using a chi-squared test, with results indicating a potential association between adherence to recommendations and improved outcomes. LLM: Large language model.

Similar articles

Cited by

References

    1. Sblendorio E., Dentamaro V., Lo Cascio A., Germini F., Piredda M., Cicolini G. Integrating human expertise & automated methods for a dynamic and multi-parametric evaluation of large language models’ feasibility in clinical decision-making. Int J Med Inform. 2024;188 doi: 10.1016/j.ijmedinf.2024.105501. - DOI - PubMed
    1. Chung P., Fong C.T., Walters A.M., Aghaeepour N., Yetisgen M., O'Reilly-Shah V.N. Large language model capabilities in perioperative risk prediction and prognostication. JAMA Surg. 2024;159(8):928–937. doi: 10.1001/jamasurg.2024.1621. - DOI - PMC - PubMed
    1. Iqbal U., Lee L.T., Rahmanti A.R., Celi L.A., Li Y.J. Can large language models provide secondary reliable opinion on treatment options for dermatological diseases? J Am Med Inform Assoc. 2024;31(6):1341–1347. doi: 10.1093/jamia/ocae067. - DOI - PMC - PubMed
    1. Saner F.H., Saner Y.M., Abufarhaneh E., Broering D.C., Raptis D.A. Comparative analysis of artificial intelligence (AI) languages in predicting sequential organ failure assessment (SOFA) scores. Cureus. 2024;16(5):e59662. doi: 10.7759/cureus.59662. - DOI - PMC - PubMed
    1. Amrollahi F., Shashikumar S.P., Razmi F., Nemati S. Contextual embeddings from clinical notes improves prediction of sepsis. AMIA Annu Symp Proc. 2021;2020:197–202. - PMC - PubMed

LinkOut - more resources