Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Dec 17;11(12):453.
doi: 10.3390/jimaging11120453.

Salient Object Detection in Optical Remote Sensing Images Based on Hierarchical Semantic Interaction

Affiliations

Salient Object Detection in Optical Remote Sensing Images Based on Hierarchical Semantic Interaction

Jingfan Xu et al. J Imaging. .

Abstract

Existing salient object detection methods for optical remote sensing images still face certain limitations due to complex background variations, significant scale discrepancies among targets, severe background interference, and diverse topological structures. On the one hand, the feature transmission process often neglects the constraints and complementary effects of high-level features on low-level features, leading to insufficient feature interaction and weakened model representation. On the other hand, decoder architectures generally rely on simple cascaded structures, which fail to adequately exploit and utilize contextual information. To address these challenges, this study proposes a Hierarchical Semantic Interaction Module to enhance salient object detection performance in optical remote sensing scenarios. The module introduces foreground content modeling and a hierarchical semantic interaction mechanism within a multi-scale feature space, reinforcing the synergy and complementarity among features at different levels. This effectively highlights multi-scale and multi-type salient regions in complex backgrounds. Extensive experiments on multiple optical remote sensing datasets demonstrate the effectiveness of the proposed method. Specifically, on the EORSSD dataset, our full model integrating both CA and PA modules improves the max F-measure from 0.8826 to 0.9100 (↑2.74%), increases maxE from 0.9603 to 0.9727 (↑1.24%), and enhances the S-measure from 0.9026 to 0.9295 (↑2.69%) compared with the baseline. These results clearly demonstrate the effectiveness of the proposed modules and verify the robustness and strong generalization capability of our method in complex remote sensing scenarios.

Keywords: channel adaptive enhancement; hierarchical semantic interaction; multi-scale feature fusion; optical remote sensing images; position-aware attention; salient object detection.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
HSIMNet model network structure diagram.
Figure 2
Figure 2
Channel Attention Mechanism diagram.
Figure 3
Figure 3
Position attention mechanism structure diagram.
Figure 4
Figure 4
Precision–Recall (PR) curves of the proposed HSIMNet and comparison methods on the ORSSD and EORSSD datasets.
Figure 5
Figure 5
F-measure curves of HSIMNet and other state-of-the-art methods on the ORSSD and EORSSD datasets.
Figure 6
Figure 6
Qualitative comparison of salient object detection results produced by HSIMNet and representative competing methods on challenging scenes from the ORSSD and EORSSD datasets.

References

    1. Yang H., Liu Z., Cui H., Ma N., Wang H., Zhang C., Song Y. An Electrified Railway Catenary Component Anomaly Detection Frame Based on Invariant Normal Region Prototype with Segment Anything Model. IEEE Trans. Transp. Electrif. 2025;early access doi: 10.1109/TTE.2025.3628607. - DOI
    1. Yan J., Cheng Y., Zhang F., Li M., Zhou N., Jin B., Wang H., Yang H., Zhang W. Research on Multimodal Techniques for Arc Detection in Railway Systems with Limited Data. Struct. Health Monit. 2025 doi: 10.1177/14759217251336797. - DOI
    1. Wang X., Song Y., Yang H., Wang H., Lu B., Liu Z. A Time-Frequency Dual-Domain Deep Learning Approach for High-Speed Pantograph-Catenary Dynamic Performance Prediction. Mech. Syst. Signal Process. 2025;238:113258. doi: 10.1016/j.ymssp.2025.113258. - DOI
    1. Liu H., Zhou S., Gu W., Zhuang W., Gao M., Chan C.C., Zhang X. Coordinated Planning Model for Multi-Regional Ammonia Industries Leveraging Hydrogen Supply Chain and Power Grid Integration: A Case Study of Shandong. Appl. Energy. 2025;377:124456. doi: 10.1016/j.apenergy.2024.124456. - DOI
    1. Chen N., Li B., Wang Y., Ying X., Wang L., Zhang C., Guo Y., Li M., An W. Motion and Appearance Decoupling Representation for Event Cameras. IEEE Trans. Image Process. 2025;34:5964–5977. doi: 10.1109/TIP.2025.3607632. - DOI - PubMed

LinkOut - more resources