Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep 4;10(1):576.
doi: 10.1038/s41597-023-02484-6.

2DeteCT - A large 2D expandable, trainable, experimental Computed Tomography dataset for machine learning

Affiliations

2DeteCT - A large 2D expandable, trainable, experimental Computed Tomography dataset for machine learning

Maximilian B Kiss et al. Sci Data. .

Abstract

Recent research in computational imaging largely focuses on developing machine learning (ML) techniques for image reconstruction, which requires large-scale training datasets consisting of measurement data and ground-truth images. However, suitable experimental datasets for X-ray Computed Tomography (CT) are scarce, and methods are often developed and evaluated only on simulated data. We fill this gap by providing the community with a versatile, open 2D fan-beam CT dataset suitable for developing ML techniques for a range of image reconstruction tasks. To acquire it, we designed a sophisticated, semi-automatic scan procedure that utilizes a highly-flexible laboratory X-ray CT setup. A diverse mix of samples with high natural variability in shape and density was scanned slice-by-slice (5,000 slices in total) with high angular and spatial resolution and three different beam characteristics: A high-fidelity, a low-dose and a beam-hardening-inflicted mode. In addition, 750 out-of-distribution slices were scanned with sample and beam variations to accommodate robustness and segmentation tasks. We provide raw projection data, reference reconstructions and segmentations based on an open-source data processing pipeline.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Overview of the scope of the 2DeteCT dataset.
Fig. 2
Fig. 2
FleX-ray Lab: the computed tomography set-up used for the data acquisition. (1) cone-beam X-ray source; (2) Thoraeus filter sail; (3) Rotation stage; (4) Sample tube; (5) Flat panel detector. The objects 1, 3, 4, and 5 move from their red transparent front position to the mid position for the acquisitions of mode 3. In both positions 3,601 projection images per slice are taken while the object rotates 360 degrees.
Fig. 3
Fig. 3
Beam spectra of a tungsten target X-ray source with an X-ray exit window made of 300 μm Beryllium operated at 60 kV with no added filtration and 90 kV tube voltage with no added filtration and filtered with a Thoraeus filter of Sn = 0.1 mm, Cu = 0.2 mm, Al = 0.5 mm simulated by TASMIP software.
Fig. 4
Fig. 4
Visualization of the scanning procedure.
Fig. 5
Fig. 5
From left to right: Sinograms and reconstructions from slice 1661 (mode1 -low-dose), slice 4300 (mode2 - artifact-free), slice 560 (mode3 - artifact-inflicted) and segmentation of slice 4300 based on the mode2 reconstruction. To illustrate the variation in the sample mix we selected different slices for each mode.

References

    1. Ravishankar S, Ye JC, Fessler JA. Image reconstruction: From sparsity to data-adaptive methods and machine learning. Proceedings of the IEEE. 2020;108:86–109. doi: 10.1109/JPROC.2019.2936204. - DOI - PMC - PubMed
    1. Lecun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proceedings of the IEEE. 1998;86:2278–2324. doi: 10.1109/5.726791. - DOI
    1. Deng, J. et al. Imagenet: A large-scale hierarchical image database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition, 248–255, 10.1109/CVPR.2009.5206848 (2009).
    1. Knoll, F. et al. fastmri: A publicly available raw k-space and dicom dataset of knee images for accelerated mr image reconstruction using machine learning. Radiology: Artificial Intelligence2, e190007, 10.1148/ryai.2020190007 PMID: 32076662 (2020). - PMC - PubMed
    1. McCollough C. Tu-fg-207a-04: overview of the low dose ct grand challenge. Medical physics. 2016;43:3759–3760.