Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Jan 1;25(1):32-39.
doi: 10.1093/jamia/ocx084.

The MIMIC Code Repository: enabling reproducibility in critical care research

Affiliations

The MIMIC Code Repository: enabling reproducibility in critical care research

Alistair Ew Johnson et al. J Am Med Inform Assoc. .

Abstract

Objective: Lack of reproducibility in medical studies is a barrier to the generation of a robust knowledge base to support clinical decision-making. In this paper we outline the Medical Information Mart for Intensive Care (MIMIC) Code Repository, a centralized code base for generating reproducible studies on an openly available critical care dataset.

Materials and methods: Code is provided to load the data into a relational structure, create extractions of the data, and reproduce entire analysis plans including research studies.

Results: Concepts extracted include severity of illness scores, comorbid status, administrative definitions of sepsis, physiologic criteria for sepsis, organ failure scores, treatment administration, and more. Executable documents are used for tutorials and reproduce published studies end-to-end, providing a template for future researchers to replicate. The repository's issue tracker enables community discussion about the data and concepts, allowing users to collaboratively improve the resource.

Discussion: The centralized repository provides a platform for users of the data to interact directly with the data generators, facilitating greater understanding of the data. It also provides a location for the community to collaborate on necessary concepts for research progress and share them with a larger audience. Consistent application of the same code for underlying concepts is a key step in ensuring that research studies on the MIMIC database are comparable and reproducible.

Conclusion: By providing open source code alongside the freely accessible MIMIC-III database, we enable end-to-end reproducible analysis of electronic health records.

Keywords: critical care; data mining; electronic health record; intensive care; mimic-iii; reproducibility.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Comparison of severity of illness score distributions.
Figure 2.
Figure 2.
Comparison of areas under the receiver operating curve for SOFA scores calculated from MIMIC code and a prior research report.
Figure 3.
Figure 3.
Logic behind the query for converting aperiodically recorded ventilator settings into durations of mechanical ventilation.
Figure 4.
Figure 4.
Example of a patient who was both mechanically ventilated and receiving vasopressors for cardiovascular support.
Figure 5.
Figure 5.
Comparison of 3 methods for calculating presence of a comorbidity for a patient using billing data: an updated coding from the AHRQ which uses diagnosis-related group (DRG) codes to mask non-comorbid conditions, the same coding without the DRG masking, and finally an alternative coding that does not use DRG masking, proposed by Quan et al.
Figure 6.
Figure 6.
Example of a notebook providing a tutorial with MIMIC-III data.

References

    1. Baker M. 1,500 scientists lift the lid on reproducibility. Nature. 2016;5331:452–54. - PubMed
    1. Gruber WH, Powell AC, Torous JB. Healthcare The power of capturing and using information at the point of care. Healthcare [Internet] 2016;(January):0–1. 10.1016/j.hjdsi.2016.09.004. Accessed May 25, 2017. - DOI - PubMed
    1. Johnson AEW, Pollard TJ, Shen L, et al. . MIMIC-III, a freely accessible critical care database. Scientific Data. 2016;3: 1–9. - PMC - PubMed
    1. Pollard TJ, Johnson AEW. The MIMIC-III Clinical Database. 10.13026/C2XW26 2016 Accessed May 25, 2017. - DOI
    1. Pollard TJ, Johnson AEW, Blundell J, et al. . MIT-LCP/mimic-code: MIMIC-III v1.4. 2017. doi:10.5281/zenodo.821872. Accessed May 25, 2017.

Publication types