Instance-level semantic segmentation of nuclei based on multimodal structure encoding
- PMID: 39915737
- PMCID: PMC11804060
- DOI: 10.1186/s12859-025-06066-8
Instance-level semantic segmentation of nuclei based on multimodal structure encoding
Abstract
Background: Accurate segmentation and classification of cell nuclei are crucial for histopathological image analysis. However, existing deep neural network-based methods often struggle to capture complex morphological features and global spatial distributions of cell nuclei due to their reliance on local receptive fields.
Methods: This study proposes a graph neural structure encoding framework based on a vision-language model. The framework incorporates: (1) A multi-scale feature fusion and knowledge distillation module utilizing the Contrastive Language-Image Pre-training (CLIP) model's image encoder; (2) A method to transform morphological features of cells into textual descriptions for semantic representation; and (3) A graph neural network approach to learn spatial relationships and contextual information between cell nuclei.
Results: Experimental results demonstrate that the proposed method significantly improves the accuracy of cell nucleus segmentation and classification compared to existing approaches. The framework effectively captures complex nuclear structures and global distribution features, leading to enhanced performance in histopathological image analysis.
Conclusions: By deeply mining the morphological features of cell nuclei and their spatial topological relationships, our graph neural structure encoding framework achieves high-precision nuclear segmentation and classification. This approach shows significant potential for enhancing histopathological image analysis, potentially leading to more accurate diagnoses and improved understanding of cellular structures in pathological tissues.
Keywords: Cell nucleus segmentation; Graph neural networks; Histopathological image; Multimodal fusion.
© 2025. The Author(s).
Conflict of interest statement
Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: Authors declare no conflict of interest.
Figures








References
-
- Morales S, Engan K, Naranjo V. Artificial intelligence in computational pathology-challenges and future directions. Digital Signal Process. 2021;119: 103196.
-
- Huang Y, Yang X, Liu L, Zhou H, Chang A, Zhou X, Chen R, Yu J, Chen J, Chen C. Segment anything model for medical images? Med Image Anal. 2024;92: 103061. - PubMed
-
- Hatamizadeh A, Nath V, Tang Y, Yang D, Roth HR, Xu D. Swin unetr: swin transformers for semantic segmentation of brain tumors in mri images. In: International MICCAI brainlesion workshop, 2021;272–84.
-
- Zhou Y, Graham S, Alemi Koohbanani N, Shaban M, Heng P-A, Rajpoot N. Cgc-net: cell graph convolutional network for grading of colorectal cancer histology images. In: Proceedings of the IEEE/CVF international conference on computer vision workshops, 2019;0–0
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources