Improving Image Segmentation with Contextual and Structural Similarity
- PMID: 38645435
- PMCID: PMC11027435
- DOI: 10.1016/j.patcog.2024.110489
Improving Image Segmentation with Contextual and Structural Similarity
Abstract
Deep learning models for medical image segmentation are usually trained with voxel-wise losses, e.g., cross-entropy loss, focusing on unary supervision without considering inter-voxel relationships. This oversight potentially leads to semantically inconsistent predictions. Here, we propose a contextual similarity loss (CSL) and a structural similarity loss (SSL) to explicitly and efficiently incorporate inter-voxel relationships for improved performance. The CSL promotes consistency in predicted object categories for each image sub-region compared to ground truth. The SSL enforces compatibility between the predictions of voxel pairs by computing pair-wise distances between them, ensuring that voxels of the same class are close together whereas those from different classes are separated by a wide margin in the distribution space. The effectiveness of the CSL and SSL is evaluated using a clinical cone-beam computed tomography (CBCT) dataset of patients with various craniomaxillofacial (CMF) deformities and a public pancreas dataset. Experimental results show that the CSL and SSL outperform state-of-the-art regional loss functions in preserving segmentation semantics.
Keywords: Cone-beam computed tomography; Image segmentation; Inter-voxel relationships; Pancreas segmentation.
Conflict of interest statement
Conflicts of Interest All authors declare that they have no conflicts of interest.
Figures





References
-
- Ambellan F, Tack A, Ehlke M, Zachow S, 2019. Automated segmentation of knee bone and cartilage combining statistical shape knowledge and convolutional neural networks: Data from the osteoarthritis initiative. Medical Image Analysis 52, 109–118. - PubMed
-
- Chen H, Qi X, Yu L, Dou Q, Qin J, Heng PA, 2017a. DCAN: Deep contour-aware networks for object instance segmentation from histology images. Medical Image Analysis 36, 135–146. - PubMed
-
- Chen J, Lu Y, Yu Q, Luo X, Adeli E, Wang Y, Lu L, Yuille AL, Zhou Y, 2021. Transunet: Transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306.
-
- Chen LC, Papandreou G, Kokkinos I, Murphy K, Yuille AL, 2017b. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Transactions on Pattern Analysis and Machine Intelligence 40, 834–848. - PubMed
-
- Chen LC, Papandreou G, Kokkinos I, Murphy K, Yuille AL, 2018. Semantic image segmentation with deep convolutional nets and fully connected CRFs. IEEE Transactions on Pattern Analysis and Machine Intelligence 40, 834–848. - PubMed