Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan 3;10(1):1.
doi: 10.1038/s41597-022-01899-x.

MIMIC-IV, a freely accessible electronic health record dataset

Affiliations

MIMIC-IV, a freely accessible electronic health record dataset

Alistair E W Johnson et al. Sci Data. .

Erratum in

Abstract

Digital data collection during routine clinical practice is now ubiquitous within hospitals. The data contains valuable information on the care of patients and their response to treatments, offering exciting opportunities for research. Typically, data are stored within archival systems that are not intended to support research. These systems are often inaccessible to researchers and structured for optimal storage, rather than interpretability and analysis. Here we present MIMIC-IV, a publicly available database sourced from the electronic health record of the Beth Israel Deaconess Medical Center. Information available includes patient measurements, orders, diagnoses, procedures, treatments, and deidentified free-text clinical notes. MIMIC-IV is intended to support a wide array of research studies and educational material, helping to reduce barriers to conducting clinical research.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
An overview of the development process for MIMIC. Data are acquired from the BIDMC data warehouse, the ICU information system (MetaVision), and external sources (“acquisition”). Structured Query Language (SQL) scripts merge the various data sources into a single schema (“transformation”). Finally, deidentification algorithms are applied to selectively remove protected health information from the reformatted schema. Tables present in MIMIC-IV are provided on the far right of the figure under their respective module. Issues raised on the MIMIC Code Repository are assessed and used to improve the build process as appropriate.
Fig. 2
Fig. 2
MIMIC-IV follows a modular structure. Modules can be linked by identifiers including subject_id, hadm_id, and deidentified date and time. Example content of each module is shown.
Fig. 3
Fig. 3
Visualization of data within MIMIC-IV for a single patient’s hospitalization: hadm_id 28503629. Three vertically stacked panels highlight the variety of information available. Vital signs are shown in the top panel: note the frequency of data collection for temperature is much higher at the start of the ICU stay due to the use of targeted temperature management. Procedures from multiple sources are shown in the middle panel, including from billing information, the provider order entry system, as well as the ICU information system. The bottom panel displays patient laboratory measurements. Note that while frequent vital signs are only available when the patient is in the ICU, laboratory measures are available throughout their hospitalization.
Fig. 4
Fig. 4
Visualization of medication information documented within MIMIC-IV for a single patient’s hospitalization: hadm_id 28503629. The annotated grey line indicates care units for the patient throughout their stay. Bolus medications are indicated by markers, continuous infusions as lines, and range doses as filled boxes. For example, on day 5 of their hospital stay, the patient had two active prescriptions for heparin (one for 1600–3500 units of heparin, brown filled box, and one for 1000 units of heparin, pink line with triangles). Additionally on day 5, the patient continued to received heparin according to emar (orange circle), and was imminently transferred to the medicine/cardiology intermediate ward.

References

    1. Moody, G. B. & Mark, R. G. A database to support development and evaluation of intelligent intensive care monitoring. In Computers in Cardiology 1996, 657–660 (IEEE, 1996).
    1. Saeed M, et al. Multiparameter intelligent monitoring in intensive care ii (MIMIC-II): a public-access intensive care unit database. Critical care medicine. 2011;39:952. doi: 10.1097/CCM.0b013e31820a92c6. - DOI - PMC - PubMed
    1. Johnson AE, et al. MIMIC-III, a freely accessible critical care database. Scientific data. 2016;3:1–9. doi: 10.1038/sdata.2016.35. - DOI - PMC - PubMed
    1. Pollard TJ, et al. The eICU collaborative research database, a freely available multi-center database for critical care research. Scientific data. 2018;5:1–13. doi: 10.1038/sdata.2018.178. - DOI - PMC - PubMed
    1. Thoral PJ, et al. Sharing ICU patient data responsibly under the society of critical care medicine/european society of intensive care medicine joint data science collaboration: the amsterdam university medical centers database (AmsterdamUMCdb) example. Critical care medicine. 2021;49:e563. doi: 10.1097/CCM.0000000000004916. - DOI - PMC - PubMed