Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jun 2;10(1):348.
doi: 10.1038/s41597-023-02229-5.

POLCOVID: a multicenter multiclass chest X-ray database (Poland, 2020-2021)

Collaborators, Affiliations

POLCOVID: a multicenter multiclass chest X-ray database (Poland, 2020-2021)

Aleksandra Suwalska et al. Sci Data. .

Abstract

The outbreak of the SARS-CoV-2 pandemic has put healthcare systems worldwide to their limits, resulting in increased waiting time for diagnosis and required medical assistance. With chest radiographs (CXR) being one of the most common COVID-19 diagnosis methods, many artificial intelligence tools for image-based COVID-19 detection have been developed, often trained on a small number of images from COVID-19-positive patients. Thus, the need for high-quality and well-annotated CXR image databases increased. This paper introduces POLCOVID dataset, containing chest X-ray (CXR) images of patients with COVID-19 or other-type pneumonia, and healthy individuals gathered from 15 Polish hospitals. The original radiographs are accompanied by the preprocessed images limited to the lung area and the corresponding lung masks obtained with the segmentation model. Moreover, the manually created lung masks are provided for a part of POLCOVID dataset and the other four publicly available CXR image collections. POLCOVID dataset can help in pneumonia or COVID-19 diagnosis, while the set of matched images and lung masks may serve for the development of lung segmentation solutions.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Exemplary images included in the POLCOVID dataset for one representative of each diagnosis group. Original CXR images (a), preprocessed lung area images (b), and lung masks (c) of normal, pneumonia and COVID-19 cases.
Fig. 2
Fig. 2
Demographic summary of the cohort. Proportions of sexes in diagnosis groups and in total are accompanied by numbers of images and numbers of missing records (a). Age distributions in diagnosis groups and in total (b).
Fig. 3
Fig. 3
Two-dimensional Gaussian mixture model (2D GMM) fitted on the results of nUMAP feature extraction for each diagnosis category. COVID-19 (a), other-type pneumonia (b), and normals (c).

References

    1. Haldane V, et al. Health systems resilience in managing the COVID-19 pandemic: lessons from 28 countries. Nat Med. 2021;27:964–980. doi: 10.1038/s41591-021-01381-y. - DOI - PubMed
    1. Vandenberg O, Martiny D, Rochas O, van Belkum A, Kozlakidis Z. Considerations for diagnostic COVID-19 tests. Nat Rev Microbiol. 2021;19:171–183. doi: 10.1038/s41579-020-00461-z. - DOI - PMC - PubMed
    1. Cleverley J, Piper J, Jones MM. The role of chest radiography in confirming covid-19 pneumonia. BMJ. 2020;370:m2426. doi: 10.1136/bmj.m2426. - DOI - PubMed
    1. Wong HYF, et al. Frequency and distribution of chest radiographic findings in patients positive for COVID-19. Radiology. 2020;296:E72–E78. doi: 10.1148/radiol.2020201160. - DOI - PMC - PubMed
    1. Wang L, Lin ZQ, Wong A. COVID-Net: a tailored deep convolutional neural network design for detection of COVID-19 cases from chest X-ray images. Sci Rep. 2020;10:19549. doi: 10.1038/s41598-020-76550-z. - DOI - PMC - PubMed

Grants and funding

LinkOut - more resources