Two-stage deep learning model for fully automated pancreas segmentation on computed tomography: Comparison with intra-reader and inter-reader reliability at full and reduced radiation dose on an external dataset
- PMID: 33595105
- DOI: 10.1002/mp.14782
Two-stage deep learning model for fully automated pancreas segmentation on computed tomography: Comparison with intra-reader and inter-reader reliability at full and reduced radiation dose on an external dataset
Abstract
Purpose: To develop a two-stage three-dimensional (3D) convolutional neural networks (CNNs) for fully automated volumetric segmentation of pancreas on computed tomography (CT) and to further evaluate its performance in the context of intra-reader and inter-reader reliability at full dose and reduced radiation dose CTs on a public dataset.
Methods: A dataset of 1994 abdomen CT scans (portal venous phase, slice thickness ≤ 3.75-mm, multiple CT vendors) was curated by two radiologists (R1 and R2) to exclude cases with pancreatic pathology, suboptimal image quality, and image artifacts (n = 77). Remaining 1917 CTs were equally allocated between R1 and R2 for volumetric pancreas segmentation [ground truth (GT)]. This internal dataset was randomly divided into training (n = 1380), validation (n = 248), and test (n = 289) sets for the development of a two-stage 3D CNN model based on a modified U-net architecture for automated volumetric pancreas segmentation. Model's performance for pancreas segmentation and the differences in model-predicted pancreatic volumes vs GT volumes were compared on the test set. Subsequently, an external dataset from The Cancer Imaging Archive (TCIA) that had CT scans acquired at standard radiation dose and same scans reconstructed at a simulated 25% radiation dose was curated (n = 41). Volumetric pancreas segmentation was done on this TCIA dataset by R1 and R2 independently on the full dose and then at the reduced radiation dose CT images. Intra-reader and inter-reader reliability, model's segmentation performance, and reliability between model-predicted pancreatic volumes at full vs reduced dose were measured. Finally, model's performance was tested on the benchmarking National Institute of Health (NIH)-Pancreas CT (PCT) dataset.
Results: Three-dimensional CNN had mean (SD) Dice similarity coefficient (DSC): 0.91 (0.03) and average Hausdorff distance of 0.15 (0.09) mm on the test set. Model's performance was equivalent between males and females (P = 0.08) and across different CT slice thicknesses (P > 0.05) based on noninferiority statistical testing. There was no difference in model-predicted and GT pancreatic volumes [mean predicted volume 99 cc (31cc); GT volume 101 cc (33 cc), P = 0.33]. Mean pancreatic volume difference was -2.7 cc (percent difference: -2.4% of GT volume) with excellent correlation between model-predicted and GT volumes [concordance correlation coefficient (CCC)=0.97]. In the external TCIA dataset, the model had higher reliability than R1 and R2 on full vs reduced dose CT scans [model mean (SD) DSC: 0.96 (0.02), CCC = 0.995 vs R1 DSC: 0.83 (0.07), CCC = 0.89, and R2 DSC:0.87 (0.04), CCC = 0.97]. The DSC and volume concordance correlations for R1 vs R2 (inter-reader reliability) were 0.85 (0.07), CCC = 0.90 at full dose and 0.83 (0.07), CCC = 0.96 at reduced dose datasets. There was good reliability between model and R1 at both full and reduced dose CT [full dose: DSC: 0.81 (0.07), CCC = 0.83 and reduced dose DSC:0.81 (0.08), CCC = 0.87]. Likewise, there was good reliability between model and R2 at both full and reduced dose CT [full dose: DSC: 0.84 (0.05), CCC = 0.89 and reduced dose DSC:0.83(0.06), CCC = 0.89]. There was no difference in model-predicted and GT pancreatic volume in TCIA dataset (mean predicted volume 96 cc (33); GT pancreatic volume 89 cc (30), p = 0.31). Model had mean (SD) DSC: 0.89 (0.04) (minimum-maximum DSC: 0.79 -0.96) on the NIH-PCT dataset.
Conclusion: A 3D CNN developed on the largest dataset of CTs is accurate for fully automated volumetric pancreas segmentation and is generalizable across a wide range of CT slice thicknesses, radiation dose, and patient gender. This 3D CNN offers a scalable tool to leverage biomarkers from pancreas morphometrics and radiomics for pancreatic diseases including for early pancreatic cancer detection.
Keywords: IM-CT: segmentation; biomarkers; machine learning/computer vision; quantitative imaging/analysis.
© 2021 American Association of Physicists in Medicine.
Similar articles
-
Bounding box-based 3D AI model for user-guided volumetric segmentation of pancreatic ductal adenocarcinoma on standard-of-care CTs.Pancreatology. 2023 Aug;23(5):522-529. doi: 10.1016/j.pan.2023.05.008. Epub 2023 May 26. Pancreatology. 2023. PMID: 37296006 Free PMC article. Clinical Trial.
-
nnU-Net-Based Pancreas Segmentation and Volume Measurement on CT Imaging in Patients with Pancreatic Cancer.Acad Radiol. 2024 Jul;31(7):2784-2794. doi: 10.1016/j.acra.2024.01.004. Epub 2024 Feb 12. Acad Radiol. 2024. PMID: 38350812
-
Pancreas segmentation using AI developed on the largest CT dataset with multi-institutional validation and implications for early cancer detection.Sci Rep. 2025 May 16;15(1):17096. doi: 10.1038/s41598-025-01802-9. Sci Rep. 2025. PMID: 40379726 Free PMC article.
-
Automatic Segmentation of Multiple Organs on 3D CT Images by Using Deep Learning Approaches.Adv Exp Med Biol. 2020;1213:135-147. doi: 10.1007/978-3-030-33128-3_9. Adv Exp Med Biol. 2020. PMID: 32030668 Review.
-
A deep learning-based approach to automatic proximal femur segmentation in quantitative CT images.Med Biol Eng Comput. 2022 May;60(5):1417-1429. doi: 10.1007/s11517-022-02529-9. Epub 2022 Mar 24. Med Biol Eng Comput. 2022. PMID: 35322343 Review.
Cited by
-
Artificial Intelligence in Pancreatic Imaging: A Systematic Review.United European Gastroenterol J. 2025 Feb;13(1):55-77. doi: 10.1002/ueg2.12723. Epub 2025 Jan 26. United European Gastroenterol J. 2025. PMID: 39865461 Free PMC article.
-
Volumetric Pancreas Segmentation on Computed Tomography: Accuracy and Efficiency of a Convolutional Neural Network Versus Manual Segmentation in 3D Slicer in the Context of Interreader Variability of Expert Radiologists.J Comput Assist Tomogr. 2022 Nov-Dec 01;46(6):841-847. doi: 10.1097/RCT.0000000000001374. Epub 2022 Sep 1. J Comput Assist Tomogr. 2022. PMID: 36055122 Free PMC article.
-
Superficial white matter analysis: An efficient point-cloud-based deep learning framework with supervised contrastive learning for consistent tractography parcellation across populations and dMRI acquisitions.Med Image Anal. 2023 Apr;85:102759. doi: 10.1016/j.media.2023.102759. Epub 2023 Jan 23. Med Image Anal. 2023. PMID: 36706638 Free PMC article.
-
Radiomics-based Machine-learning Models Can Detect Pancreatic Cancer on Prediagnostic Computed Tomography Scans at a Substantial Lead Time Before Clinical Diagnosis.Gastroenterology. 2022 Nov;163(5):1435-1446.e3. doi: 10.1053/j.gastro.2022.06.066. Epub 2022 Jul 1. Gastroenterology. 2022. PMID: 35788343 Free PMC article.
-
Radiomics-based machine learning (ML) classifier for detection of type 2 diabetes on standard-of-care abdomen CTs: a proof-of-concept study.Abdom Radiol (NY). 2022 Nov;47(11):3806-3816. doi: 10.1007/s00261-022-03668-1. Epub 2022 Sep 10. Abdom Radiol (NY). 2022. PMID: 36085379 Free PMC article.
References
REFERENCES
-
- DeSouza SV, Singh RG, Yoon HD, Murphy R, Plank LD, Petrov MS. Pancreas volume in health and disease: a systematic review and meta-analysis. Expert Rev Gastroenterol Hepatol. 2018;12:757-766.
-
- Lim S, Bae JH, Chun EJ, et al. Differences in pancreatic volume, fat content, and fat density measured by multidetector-row computed tomography according to the duration of diabetes. Acta Diabetol. 2014;51:739-748.
-
- Löhr JM, Panic N, Vujasinovic M, Verbeke CS. The ageing pancreas: a systematic review of the evidence and analysis of the consequences. J Intern Med. 2018;283:446-460.
-
- Petrov MS. Harnessing analytic morphomics for early detection of pancreatic cancer. Pancreas. 2018;47:1051-1054.
-
- Lautenbach A, Wernecke M, Riedel N, et al. Adaptive changes in pancreas post Roux-en-Y gastric bypass induced weight loss. Diabetes Metab Res Rev. 2018;34:e3025.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous