Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Apr 11;12(1):611.
doi: 10.1038/s41597-025-04954-5.

LUND-PROBE - LUND Prostate Radiotherapy Open Benchmarking and Evaluation dataset

Affiliations

LUND-PROBE - LUND Prostate Radiotherapy Open Benchmarking and Evaluation dataset

Viktor Rogowski et al. Sci Data. .

Abstract

Radiotherapy treatment for prostate cancer relies on computed tomography (CT) and/or magnetic resonance imaging (MRI) for segmentation of target volumes and organs at risk (OARs). Manual segmentation of these volumes is regarded as the gold standard for ground truth in machine learning applications, but to acquire such data is tedious and time-consuming. A publicly available clinical dataset is presented, comprising MRI- and synthetic CT (sCT) images, target and OARs segmentations, and radiotherapy dose distributions for 432 prostate cancer patients treated with MRI-guided radiotherapy. An extended dataset with 35 patients is also included, with the addition of deep learning (DL)-generated segmentations, DL segmentation uncertainty maps, and DL segmentations manually adjusted by four radiation oncologists. The publication of these resources aims to aid research in automated radiotherapy treatment planning, segmentation, inter-observer analyses, and DL model uncertainty investigation. The dataset is hosted on the AIDA Data Hub and offers a free-to-use resource for the scientific community, valuable for the advancement of medical imaging and prostate cancer radiotherapy research.

PubMed Disclaimer

Conflict of interest statement

Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Data overview. A hierarchical overview of the dataset and its content for base part (n = 432, blue color, left) and extended part (n = 35, gold color, right). The content in the base part was included in the extended part for each patient.
Fig. 2
Fig. 2
Image data types in the dataset. (a) MRI T2w volume with clinical radiotherapy segmentations overlaid. The center of the fiducial marker delineation in black shows the defined center-of-mass for one of the fiducial markers inserted into the prostate. (b) Synthetic CT (sCT) volume with radiotherapy segmentations overlaid. The sCT was created from the MRI T2w images in (a) using the Spectronic MriPlanner software. The high-density sphere in the prostate, seen as a high intensity object, is placed at the fiducial marker center-of-mass point defined in the MRI T2w volume. (c) Synthetic CT (sCT) with radiotherapy segmentations and overlaid dose distribution, see dose color wash legend to the right where 100% equals 42.7 Gy. It is the same patient and corresponding image slice in (a), (b), and (c).
Fig. 3
Fig. 3
Prostate segmentations. (a) MRI T2w image slice with prostate CTV deep learning segmentation uncertainty map overlaid. (b) T2w MRI zoomed in with prostate deep learning CTV segmentation. (c) Four different oncologist’s individual prostate CTV segmentations on zoomed in T2w MRI, visualized in separate colors. (d) deep learning prostate CTV segmentation uncertainty map visualized in color, zoomed in from (a). Corresponding data for rectum is available in the dataset.
Fig. 4
Fig. 4
Pelvis segmentation overview. One patient volume oriented as demonstrated by the green model. Overlaid are the available prostate targets where PTVT_427 (dark blue) encompasses the prostate CTVT_427 (purple) together with the Bladder (yellow), left (FemoralHead_L) and right (FemoralHead_R) femoral heads (green-yellow), PenileBulb (light blue), Rectum (brown) and Genitalia (red), all available with respective name in the dataset. The BODY segmentation, encompassing the whole scanned patient volume, is the largest volume. Fiducial marker delineations are not shown.

References

    1. Ferlay, J. et al. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. Int J Cancer136, E359–386, 10.1002/ijc.29210 (2015). - PubMed
    1. Widmark, A. et al. Ultra-hypofractionated versus conventionally fractionated radiotherapy for prostate cancer: 5-year outcomes of the HYPO-RT-PC randomised, non-inferiority, phase 3 trial. Lancet394, 385–395, 10.1016/S0140-6736(19)31131-6 (2019). - PubMed
    1. van Herk, M. Errors and margins in radiotherapy. Semin Radiat Oncol14, 52–64, 10.1053/j.semradonc.2003.10.003 (2004). - PubMed
    1. van Herk, M., Remeijer, P., Rasch, C. & Lebesque, J. V. The probability of correct target dosage: dose-population histograms for deriving treatment margins in radiotherapy. International journal of radiation oncology, biology, physics47, 1121–1135, 10.1016/s0360-3016(00)00518-6 (2000). - PubMed
    1. Persson, E. et al. Investigation of the clinical inter-observer bias in prostate fiducial marker image registration between CT and MR images. Radiation oncology16, 150, 10.1186/s13014-021-01865-8 (2021). - PMC - PubMed

LinkOut - more resources