Sci Data. 2023 Jun 7;10(1):367.
doi: 10.1038/s41597-023-02276-y.

Unified real-time environmental-epidemiological data for multiscale modeling of the COVID-19 pandemic

Hamada S Badr et al. Sci Data. 2023.

Abstract

An impressive number of COVID-19 data catalogs exist. However, none are fully optimized for data science applications. Inconsistent naming and data conventions, uneven quality control, and lack of alignment between disease data and potential predictors pose barriers to robust modeling and analysis. To address this gap, we generated a unified dataset that integrates and implements quality checks of the data from numerous leading sources of COVID-19 epidemiological and environmental data. We use a globally consistent hierarchy of administrative units to facilitate analysis within and across countries. The dataset applies this unified hierarchy to align COVID-19 epidemiological data with a number of other data types relevant to understanding and predicting COVID-19 risk, including hydrometeorological data, air quality, information on COVID-19 control policies, vaccine data, and key demographic characteristics.
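The alignment idea in the abstract, attaching predictors to epidemiological records through a shared administrative-unit identifier, can be illustrated with a minimal sketch. The field names (`geo_id`, `date`, `cases`, `temperature_c`), the ID values, and the `align` helper are hypothetical and chosen for illustration only; they are not the dataset's actual schema or the authors' code.

```python
# Hypothetical records. Field names and ID values are illustrative
# assumptions, not the unified dataset's real schema.
cases = [
    {"geo_id": "US", "date": "2020-04-01", "cases": 100},
    {"geo_id": "US06", "date": "2020-04-01", "cases": 10},
    {"geo_id": "BR", "date": "2020-04-01", "cases": 50},
]
weather = [
    {"geo_id": "US", "date": "2020-04-01", "temperature_c": 12.5},
    {"geo_id": "US06", "date": "2020-04-01", "temperature_c": 16.0},
]


def align(epi_rows, predictor_rows, key=("geo_id", "date")):
    """Left-join predictor columns onto epidemiological rows by a shared key.

    Rows with no matching predictor record keep their epidemiological
    values and receive None for every predictor column.
    """
    # Index predictor rows by the (geo_id, date) key for O(1) lookup.
    lookup = {tuple(r[k] for k in key): r for r in predictor_rows}
    # Collect every predictor column so missing values are explicit.
    predictor_cols = sorted(
        {c for r in predictor_rows for c in r if c not in key}
    )
    merged = []
    for row in epi_rows:
        match = lookup.get(tuple(row[k] for k in key), {})
        extra = {c: match.get(c) for c in predictor_cols}
        merged.append({**row, **extra})
    return merged


aligned = align(cases, weather)
```

A left join keeps every epidemiological record even where no predictor is available, so gaps in coverage stay visible as explicit missing values instead of silently dropping units.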

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Flowchart of the data harmonization for the unified COVID-19 dataset.
Fig. 2
Spatial coverage map for the unified COVID-19 dataset. Admin 0 = national; Admin 1 = first administrative level (e.g., state, province); Admin 2–3 = second and third administrative levels (e.g., county, district).
Fig. 3
Geospatial ID used for the unified COVID-19 dataset.
Fig. 4
Epidemiological estimates and reported COVID-19 cases for the USA. (A) Estimated daily infections (dashed lines) and reported cases (vertical bars); (B) effective reproduction number (R) derived from the estimated daily infections.
Fig. 5
Global geographical distribution of the 10 hydrometeorological variables included in the dataset – average of all daily values for 2020.