Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Aug 25;17(8):e0272666.
doi: 10.1371/journal.pone.0272666. eCollection 2022.

Semantic segmentation method of underwater images based on encoder-decoder architecture

Affiliations

Semantic segmentation method of underwater images based on encoder-decoder architecture

Jinkang Wang et al. PLoS One. .

Abstract

With the exploration and development of marine resources, deep learning is more and more widely used in underwater image processing. However, the quality of the original underwater images is so low that traditional semantic segmentation methods obtain poor segmentation results, such as blurred target edges, insufficient segmentation accuracy, and poor regional boundary segmentation effects. To solve these problems, this paper proposes a semantic segmentation method for underwater images. Firstly, the image enhancement based on multi-spatial transformation is performed to improve the quality of the original images, which is not common in other advanced semantic segmentation methods. Then, the densely connected hybrid atrous convolution effectively expands the receptive field and slows down the speed of resolution reduction. Next, the cascaded atrous convolutional spatial pyramid pooling module integrates boundary features of different scales to enrich target details. Finally, the context information aggregation decoder fuses the features of the shallow network and the deep network to extract rich contextual information, which greatly reduces information loss. The proposed method was evaluated on RUIE, HabCam UID, and UIEBD. Compared with the state-of-the-art semantic segmentation algorithms, the proposed method has advantages in segmentation integrity, location accuracy, boundary clarity, and detail in subjective perception. On the objective data, the proposed method achieves the highest MIOU of 68.3 and OA of 79.4, and it has a low resource consumption. Besides, the ablation experiment also verifies the effectiveness of our method.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. The original underwater images with low image quality.
Fig 2
Fig 2. The pipeline of the proposed method.
Fig 3
Fig 3. The flowchart of the proposed underwater image enhancement algorithm.
Fig 4
Fig 4. Comparison of underwater image enhancement effect.
The upper line shows the original underwater images, and the lower line shows the enhanced images.
Fig 5
Fig 5. Covering effect of three-layer atrous convolution with equal atrous rate.
Fig 6
Fig 6. Covering effect of hybrid atrous convolution with unequal atrous rate.
Fig 7
Fig 7. The structure of the CASPP module.
Fig 8
Fig 8. The structure of the context information aggregation decoder.
Fig 9
Fig 9. The color representation of labeled categories.
Fig 10
Fig 10. Qualitative comparisons on colored cast underwater images.
From left to right are original images, the enhanced images, and the results generated by Deeplab V3+, DFANet, APCNet, the method proposed by Liu et al., our method, and the ground truth.
Fig 11
Fig 11. Qualitative comparisons on clear underwater images.
From left to right are the original images, the enhanced images, and the results generated by Deeplab V3+, DFANet, APCNet, the method proposed by Liu et al., our method, and the ground truth.
Fig 12
Fig 12. Failure examples of the proposed method.

Similar articles

Cited by

References

    1. Li C., Guo C., Ren W., Cong R., Hou J., Kwong S., et al.. An underwater image enhancement benchmark dataset and beyond. IEEE Trans. Image Process. 2019, 29, 4376–4389. doi: 10.1109/TIP.2019.2955241 - DOI - PubMed
    1. Li C., Saeed A., Fatih P. Underwater scene prior inspired deep underwater image and video enhancement. Pattern Recog. 2020, 98, 107038.
    1. Li Y., Chen R. UDA‐Net: Densely attention network for underwater image enhancement. IET Image Process. 2021, 15, 774–785.
    1. Wei X., Yu L., Tian S., Feng P., Ning X. Underwater target detection with an attention mechanism and improved scale. Multimedia Tools App. 2021, 80, 33747–33761.
    1. Chen L.; Liu Z.; Tong L. Underwater object detection using Invert Multi-Class Adaboost with deep learning. 2020 International Joint Conference on Neural Networks (IJCNN). IEEE, 2020.

Publication types