Data Brief. 2022 Mar 31:42:108120.
doi: 10.1016/j.dib.2022.108120. eCollection 2022 Jun.

Cerner real-world data (CRWD) - A de-identified multicenter electronic health records database

Louis Ehwerhemuepha et al. Data Brief.

Abstract

Cerner Real-World Data™ (CRWD) is a de-identified big data source of multicenter electronic health records. Cerner Corporation secured appropriate data use agreements and permissions from more than 100 health systems in the United States contributing to the database as of March 2022. A subset of the database was extracted to include data only from patients with SARS-CoV-2 infections and is referred to as the Cerner COVID-19 Dataset. The December 2021 version of CRWD consists of 100 million patients and 1.5 billion encounters across all care settings. There are 2.3 billion, 2.9 billion, 486 million, and 11.5 billion records in the condition, medication, procedure, and lab (laboratory test) tables, respectively. The 2021 Q3 COVID-19 Dataset consists of 130.1 million encounters from 3.8 million patients. The size and longitudinal nature of CRWD can be leveraged for advanced analytics and artificial intelligence in medical research across all specialties, and the database is a rich source of novel discoveries on a wide range of conditions including but not limited to COVID-19.

Keywords: COVID-19; Cerner Real-World Data™ (CRWD); Cerner Learning Health Network℠ (LHN); Electronic Health Records (EHR); HealtheDataLab™; HealtheIntent; SARS-CoV-2.

Conflict of interest statement

The authors declare no conflict of interest for this article.

Figures

Fig. 1. Compilation of the CRWD database.
Fig. 2. Geographical distribution of the CRWD, encounters per U.S. region, December 2021.
Fig. 3. Age distribution of the COVID-19 data set.

