Few-shot segmentation with duplex network and attention augmented module
- PMID: 37416851
- PMCID: PMC10320285
- DOI: 10.3389/fnbot.2023.1206189
Few-shot segmentation with duplex network and attention augmented module
Abstract
Establishing the relationship between a limited number of samples and segmented objects in diverse scenarios is the primary challenge in few-shot segmentation. However, many previous works overlooked the crucial support-query set interaction and the deeper information that needs to be explored. This oversight can lead to model failure when confronted with complex scenarios, such as ambiguous boundaries. To solve this problem, a duplex network that utilizes the suppression and focus concept is proposed to effectively suppress the background and focus on the foreground. Our network includes dynamic convolution to enhance the support-query interaction and a prototype match structure to fully extract information from support and query. The proposed model is called dynamic prototype mixture convolutional networks (DPMC). To minimize the impact of redundant information, we have incorporated a hybrid attentional module called double-layer attention augmented convolutional module (DAAConv) into DPMC. This module enables the network to concentrate more on foreground information. Our experiments on PASCAL-5i and COCO-20i datasets suggested that DPMC and DAAConv outperform traditional prototype-based methods by up to 5-8% on average.
Keywords: attention module; duplex mode; few-shot segmentation; mixture models; semantic segmentation.
Copyright © 2023 Zeng, Yang, Luo and Ruan.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures





Similar articles
-
CLIP-Driven Prototype Network for Few-Shot Semantic Segmentation.Entropy (Basel). 2023 Sep 18;25(9):1353. doi: 10.3390/e25091353. Entropy (Basel). 2023. PMID: 37761652 Free PMC article.
-
Multi-scale prototype convolutional network for few-shot semantic segmentation.PLoS One. 2025 Apr 15;20(4):e0319905. doi: 10.1371/journal.pone.0319905. eCollection 2025. PLoS One. 2025. PMID: 40233318 Free PMC article.
-
DRNet: Double Recalibration Network for Few-Shot Semantic Segmentation.IEEE Trans Image Process. 2022;31:6733-6746. doi: 10.1109/TIP.2022.3215905. Epub 2022 Oct 28. IEEE Trans Image Process. 2022. PMID: 36282824
-
Dual-Filter Cross Attention and Onion Pooling Network for Enhanced Few-Shot Medical Image Segmentation.Sensors (Basel). 2025 Mar 29;25(7):2176. doi: 10.3390/s25072176. Sensors (Basel). 2025. PMID: 40218686 Free PMC article.
-
Part-Based Semantic Transform for Few-Shot Semantic Segmentation.IEEE Trans Neural Netw Learn Syst. 2022 Dec;33(12):7141-7152. doi: 10.1109/TNNLS.2021.3084252. Epub 2022 Nov 30. IEEE Trans Neural Netw Learn Syst. 2022. PMID: 34101605
References
-
- Ao W., Zheng S., Meng Y. (2022). Few-shot semantic segmentation via mask aggregation. arXiv:2202.07231. 10.48550/arXiv.2202.07231 - DOI
-
- Bello I., Zoph B., Vaswani A., Shlens J., Le Q.V. (2019). “Attention augmented convolutional networks,” in Proceedings of the IEEE/CVF International Conference on Computer Vision (Seoul: IEEE; ), 3286–3295. 10.1109/ICCV.2019.00338 - DOI
-
- Boudiaf M., Kervadec H., Masud Z.I., Piantanida P., Ben Ayed I., Dolz J. (2021). “Few-shot segmentation without meta-learning: a good transductive inference is all you need?,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (Nashville, TN: IEEE; ), 13979–13988. 10.1109/CVPR46437.2021.01376 - DOI
-
- Chen C.-F.R., Fan Q., Panda R. (2021). “Crossvit: cross-attention multi-scale vision transformer for image classification,” in Proceedings of the IEEE/CVF International Conference on Computer Vision (Montreal, QC: IEEE; ), 357–366. 10.1109/ICCV48922.2021.00041 - DOI
LinkOut - more resources
Full Text Sources