Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation
- PMID: 33647536
- DOI: 10.1016/j.neunet.2021.01.021
Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation
Abstract
The encoder-decoder structure has been introduced into semantic segmentation to improve the spatial accuracy of the network by fusing high- and low-level feature maps. However, recent state-of-the-art encoder-decoder-based methods can hardly attain the real-time requirement due to their complex and inefficient decoders. To address this issue, in this paper, we propose a lightweight bilateral attention decoder for real-time semantic segmentation. It consists of two blocks and can fuse different level feature maps via two steps, i.e., information refinement and information fusion. In the first step, we propose a channel attention branch to refine the high-level feature maps and a spatial attention branch for the low-level ones. The refined high-level feature maps can capture more exact semantic information and the refined low-level ones can capture more accurate spatial information, which significantly improves the information capturing ability of these feature maps. In the second step, we develop a new fusion module named pooling fusing block to fuse the refined high- and low-level feature maps. This fusion block can take full advantages of the high- and low-level feature maps, leading to high-quality fusion results. To verify the efficiency of the proposed bilateral attention decoder, we adopt a lightweight network as the backbone and compare our proposed method with other state-of-the-art real-time semantic segmentation methods on the Cityscapes and Camvid datasets. Experimental results demonstrate that our proposed method can achieve better performance with a higher inference speed. Moreover, we compare our proposed network with several state-of-the-art non-real-time semantic segmentation methods and find that our proposed network can also attain better segmentation performance.
Keywords: Attention mechanism; Deep learning; Real time; Semantic segmentation.
Copyright © 2021 Elsevier Ltd. All rights reserved.
Conflict of interest statement
Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Similar articles
-
A Holistically-Guided Decoder for Deep Representation Learning With Applications to Semantic Segmentation and Object Detection.IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):11390-11406. doi: 10.1109/TPAMI.2021.3114342. Epub 2023 Sep 5. IEEE Trans Pattern Anal Mach Intell. 2023. PMID: 34587003
-
A Fast Attention-Guided Hierarchical Decoding Network for Real-Time Semantic Segmentation.Sensors (Basel). 2023 Dec 24;24(1):95. doi: 10.3390/s24010095. Sensors (Basel). 2023. PMID: 38202957 Free PMC article.
-
Feature-guided attention network for medical image segmentation.Med Phys. 2023 Aug;50(8):4871-4886. doi: 10.1002/mp.16253. Epub 2023 Feb 16. Med Phys. 2023. PMID: 36746870
-
Sparse Dynamic Volume TransUNet with multi-level edge fusion for brain tumor segmentation.Comput Biol Med. 2024 Apr;172:108284. doi: 10.1016/j.compbiomed.2024.108284. Epub 2024 Mar 15. Comput Biol Med. 2024. PMID: 38503086 Review.
-
ACCPG-Net: A skin lesion segmentation network with Adaptive Channel-Context-Aware Pyramid Attention and Global Feature Fusion.Comput Biol Med. 2023 Mar;154:106580. doi: 10.1016/j.compbiomed.2023.106580. Epub 2023 Jan 25. Comput Biol Med. 2023. PMID: 36716686 Review.
Cited by
-
Multiple-Attention Mechanism Network for Semantic Segmentation.Sensors (Basel). 2022 Jun 13;22(12):4477. doi: 10.3390/s22124477. Sensors (Basel). 2022. PMID: 35746258 Free PMC article.
-
SCFusion: Infrared and Visible Fusion Based on Salient Compensation.Entropy (Basel). 2023 Jun 27;25(7):985. doi: 10.3390/e25070985. Entropy (Basel). 2023. PMID: 37509931 Free PMC article.
-
DPACFuse: Dual-Branch Progressive Learning for Infrared and Visible Image Fusion with Complementary Self-Attention and Convolution.Sensors (Basel). 2023 Aug 16;23(16):7205. doi: 10.3390/s23167205. Sensors (Basel). 2023. PMID: 37631742 Free PMC article.
-
Progressive decomposition of infrared and visible image fusion network with joint transformer and Resnet.PLoS One. 2025 Aug 22;20(8):e0330328. doi: 10.1371/journal.pone.0330328. eCollection 2025. PLoS One. 2025. PMID: 40844972 Free PMC article.
-
Semantic-Aware Fusion Network Based on Super-Resolution.Sensors (Basel). 2024 Jun 5;24(11):3665. doi: 10.3390/s24113665. Sensors (Basel). 2024. PMID: 38894455 Free PMC article.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources