Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Oct 9;11(1):1106.
doi: 10.1038/s41597-024-03938-1.

A large multi-focus dataset for white blood cell classification

Affiliations

A large multi-focus dataset for white blood cell classification

Seongjin Park et al. Sci Data. .

Abstract

The White Blood Cell (WBC) differential test ranks as the second most frequently performed diagnostic assay. It requires manual confirmation of the peripheral blood smear by experts to identify signs of abnormalities. Automated digital microscopy has emerged as a solution to reduce this labor-intensive process and improve efficiency. Several publicly available datasets provide various WBC subtypes of differing quality and resolution. These datasets have contributed to advancing WBC classification using machine learning techniques. However, digital microscopy of blood cells with high magnification often requires a wider depth of field, posing challenges for automatic digital microscopy that necessitates capturing multiple stacks of focal planes to obtain complete images of specific blood cells. Our dataset provides 25,773 image stacks from 72 patients. The image labels consist of 18 classes encompassing normal and abnormal cells, with two experts reviewing each label. Each image includes 10 z-stacks of cropped 200 by 200 pixel images, captured using a 50X microscope with 400 nm intervals. This study presents a comprehensive multi-focus dataset for WBC classification.

PubMed Disclaimer

Conflict of interest statement

Kyu-Hwan Jung receives a consulting fee from Noul Co., Ltd. There are no other conflicts of interest for other authors.

Figures

Fig. 1
Fig. 1
The process of multi-focus image data acquisition. A stack of 10 images is captured, and a U-net-based segmentation model is used on the best-focus image determined by the Laplacian filter. Then, the bounding box is extracted to find the location of the WBC. Experts examine all z-stacks to confirm the subtype.
Fig. 2
Fig. 2
Calculated R2 scores between miLab expert classification and manual microscope are shown. Five normal WBC subtypes (neutrophil, lymphocyte, monocyte, eosinophil, basophil) and other immature WBCs are listed. The x-axis corresponds to the percentage of WBC cells of a specific subtype that exist in a ground truth slide. The y-axis represents the percentage of WBCs that miLab detected and classified by our experts. The axes are in logarithmic scale to better visualize data points.

References

    1. Stouten, K., Riedl, J. A., Levin, M.-D. & Van Gelder, W. Examination of peripheral blood smears: Performance evaluation of a digital microscope system using a large-scale leukocyte database. International Journal of Laboratory Hematology37, e137–e140, 10.1111/ijlh.12391 (2015). - PubMed
    1. Koepke, J. A. Reference Leukocyte (WBC) Differential Count (Proportional) and Evaluation of Instrumental Methods (Clinical and Laboratory Standards Institute, Wayne, Pa., 2007), 2nd ed edn.
    1. Nam, M. et al. Digital Morphology Analyzer Sysmex DI-60 vs. Manual Counting for White Blood Cell Differentials in Leukopenic Samples: A Comparative Assessment of Risk and Turnaround Time. Annals of Laboratory Medicine42, 398–405, 10.3343/alm.2022.42.4.398 (2022). - PMC - PubMed
    1. Yoon, S. et al. Performance of digital morphology analyzer Vision Pro on white blood cell differentials. Clinical Chemistry and Laboratory Medicine (CCLM)59, 1099–1106, 10.1515/cclm-2020-1701 (2021). - PubMed
    1. Yoon, S. & Kim, H. R. Analytical performance of the digital morphology analyzer Sysmex DI-60 for body fluid cell differential counts. PLOS ONE18, e0288551, 10.1371/journal.pone.0288551 (2023). - PMC - PubMed

LinkOut - more resources