GLE-net: global-local information enhancement for semantic segmentation of remote sensing images
- PMID: 39455717
- PMCID: PMC11512047
- DOI: 10.1038/s41598-024-76622-4
GLE-net: global-local information enhancement for semantic segmentation of remote sensing images
Abstract
Remote sensing (RS) images contain a wealth of information with expansive potential for applications in image segmentation. However, Convolutional Neural Networks (CNN) face challenges in fully harnessing the global contextual information. Leveraging the formidable capabilities of global information modeling with Swin-Transformer, a novel RS images segmentation model with CNN (GLE-Net) was introduced. This integration gives rise to a revamped encoder structure. The subbranch initiates the process by extracting features at varying scales within the RS images using the Multiscale Feature Fusion Module (MFM), acquiring rich semantic information, discerning localized finer features, and adeptly handling occlusions. Subsequently, Feature Compression Module (FCM) is introduced in main branch to downsize the feature map, effectively reducing information loss while preserving finer details, enhancing segmentation accuracy for smaller targets. Finally, we integrate local features and global features through Spatial Information Enhancement Module (SIEM) for comprehensive feature modeling, augmenting the segmentation capabilities of model. We performed experiments on public datasets provided by ISPRS, yielding notably remarkable experimental outcomes. This underscores the substantial potential of our model in the realm of RS image segmentation within the context of scientific research.
Keywords: Multiscale feature; Remote sening; Swin-transformer convolutional neural networks.
© 2024. The Author(s).
Conflict of interest statement
The authors declare that they have no conflict of interest.
Figures
References
-
- Bi, H., Xu, F., Wei, Z., Xue, Y. & Xu, Z. An active deep learning approach for minimally supervised polsar image classification. IEEE Trans. Geosci. Remote Sens.57(11), 9378–9395 (2019). - DOI
-
- Yao, H., Qin, R. & Chen, X. Unmanned aerial vehicle for remote sensing applications-A review. Remote Sens.11(12), 1443 (2019). - DOI
-
- Li, R., Zheng, S., Duan, C., Wang, L. & Zhang, C. Land cover classification from remote sensing images based on multi-scale fully convolutional network. Geo-spatial Inform. Sci.25(2), 278–294 (2022). - DOI
-
- Ding, L., Zhang, J. & Bruzzone, L. Semantic segmentation of large-size VHR remote sensing images using a two-stage multiscale training architecture. IEEE Trans. Geosci. Remote Sens.58(8), 5367–5376 (2020). - DOI
-
- Pal, M. & Mather, P. M. Support vector machines for classification in remote sensing. Int. J. Remote Sens.26(5), 1007–1011 (2005). - DOI
Grants and funding
- No.2023TIAD-GPX0007/Chongqing Technology Innovation and Application Development Project
- SQ2024YFE0200856/National Key Research and Development Program of China
- KJQN202103407/Science and Technology Youth Project of Chongqing Municipal Education Commission
- 2023JDRC0033/Sichuan Science and Technology Program
- 2021-JYJ-92/Luzhou Science and Technology Program
LinkOut - more resources
Full Text Sources
