Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 1;8(7):e2522866.
doi: 10.1001/jamanetworkopen.2025.22866.

Operationalization of Artificial Intelligence Applications in the Intensive Care Unit: A Systematic Review

Affiliations

Operationalization of Artificial Intelligence Applications in the Intensive Care Unit: A Systematic Review

Willemijn E M Berkhout et al. JAMA Netw Open. .

Abstract

Importance: Artificial intelligence (AI) presents transformative opportunities to address the increasing challenges faced by health care systems globally. Particularly, in data-rich environments, such as intensive care units (ICUs), AI could assist in enhancing clinical decision-making, streamline workflows, and improve patient outcomes. Despite these promising applications, the practical implementation of AI in clinical settings remains limited.

Objective: To systematically evaluate AI system operationalization in the ICU, focusing on the AI field's progress over time, technical maturity, and risk of bias.

Evidence review: In this systematic review, 5 databases (Embase, MEDLINE ALL, Web of Science Core Collection, Cochrane Central Register of Controlled Trials, and Google Scholar) were searched for studies published from July 28, 2020, to June 10, 2024. Eligible studies evaluated AI applications designed for use within ICUs for adults (aged ≥16 years) and used data collected during ICU stays. Two reviewers independently screened titles and abstracts, with a third reviewer resolving disagreements. Data extraction included AI application aims, dataset origins, technology readiness level (TRL) categorization, and the use of reporting standards. Risk of bias was assessed using the PROBAST (Prediction Model Study Risk of Bias Assessment Tool).

Findings: Of 17 401 screened records, 1263 studies met the inclusion criteria. A total of 936 studies (74% of all studies) were classified as TRL 4 or below, indicating early-stage development or initial validation. Among these, 447 (37%) used internal datasets, 562 (46%) used MIMIC (Medical Information Mart for Intensive Care) datasets (I-IV), and 78 (6%) used the open-source eICU Collaborative Research Database. External validation (TRL 5) was achieved by 24% of studies. Only 25 (2%) progressed to clinical integration (TRL≥6), with no studies reaching full implementation (TRL 9). Although approximately half of generative AI models reached a higher TRL (14 [47%] with TRL 5), none reached clinical integration. Additionally, only 207 studies (16%) referenced reporting standards, with adherence modestly increasing from 14% in 2021 to 23% in 2024. High risk of bias was identified in 581 of 1103 studies (53%), primarily due to methodologic shortcomings in the analysis domain.

Conclusions and relevance: Despite substantial growth in AI research within intensive care medicine in recent years, the transition from development to clinical implementation still remains limited and has made little progress over time. A paradigm shift is urgently required in the medical literature-one that moves beyond retrospective validation toward the operationalization and prospective testing of AI for tangible clinical impact.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest Disclosures: Ms Workum reported receiving personal fees from Skipr Masterclass outside the submitted work. Dr Gommers reported receiving grants from SAS Software Inc outside the submitted work. No other disclosures were reported.

Figures

Figure 1.
Figure 1.. Flow Diagram of Study Review Process and the Exclusion of Studies
AI indicates artificial intelligence; ICU, intensive care unit. aThe sum of the exclusion reasons is higher than the total number of excluded studies (n = 1012) because studies can be excluded based on multiple reasons.
Figure 2.
Figure 2.. Risk of Bias According to the Different PROBAST (Prediction Model Study Risk of Bias Assessment Tool) Domains
Figure 3.
Figure 3.. Studies Published According to Their Technology Readiness Level (TRL) and Year of Publication
Light blue represents data until 2020; dark blue represents data from then onward (current findings). The total number of studies reporting on model development and prototyping (TRLs 3 and 4) increased rapidly from 184 studies in 2021 to 274 studies in 2023. Furthermore, the number of studies per year reporting on external validation (TRL 5) increased from 58 in 2021 to 93 in 2023.

References

    1. Page B, Irving D, Amalberti R, Vincent C. Health services under pressure: a scoping review and development of a taxonomy of adaptive strategies. BMJ Qual Saf. 2024;33(11):738-747. doi: 10.1136/bmjqs-2023-016686 - DOI - PMC - PubMed
    1. Al Kuwaiti A, Nazer K, Al-Reedy A, et al. A review of the role of artificial intelligence in healthcare. J Pers Med. 2023;13(6):951. doi: 10.3390/jpm13060951 - DOI - PMC - PubMed
    1. van de Sande D, van Genderen ME, Huiskens J, Gommers D, van Bommel J. Moving from bytes to bedside: a systematic review on the use of artificial intelligence in the intensive care unit. Intensive Care Med. 2021;47(7):750-760. doi: 10.1007/s00134-021-06446-7 - DOI - PMC - PubMed
    1. Muralidharan V, Adewale BA, Huang CJ, et al. A scoping review of reporting gaps in FDA-approved AI medical devices. NPJ Digit Med. 2024;7(1):273. doi: 10.1038/s41746-024-01270-x - DOI - PMC - PubMed
    1. Ouyang D, Hogan J. We need more randomized clinical trials of AI. NEJM AI. 2024;1(11):AIe2400881. doi: 10.1056/AIe2400881 - DOI

Publication types

MeSH terms