Semantic segmentation method of underwater images based on encoder-decoder architecture
- PMID: 36006956
- PMCID: PMC9409518
- DOI: 10.1371/journal.pone.0272666
Abstract
With the exploration and development of marine resources, deep learning is increasingly used in underwater image processing. However, the quality of raw underwater images is so low that traditional semantic segmentation methods yield poor results: blurred target edges, insufficient segmentation accuracy, and poor segmentation of region boundaries. To solve these problems, this paper proposes a semantic segmentation method for underwater images. First, image enhancement based on multi-spatial transformation improves the quality of the original images, a step uncommon in other advanced semantic segmentation methods. Then, densely connected hybrid atrous convolution effectively expands the receptive field and slows the loss of spatial resolution. Next, a cascaded atrous spatial pyramid pooling module integrates boundary features at different scales to enrich target details. Finally, a context information aggregation decoder fuses features from the shallow and deep networks to extract rich contextual information, greatly reducing information loss. The proposed method was evaluated on the RUIE, HabCam UID, and UIEBD datasets. Compared with state-of-the-art semantic segmentation algorithms, it shows advantages in segmentation integrity, localization accuracy, boundary clarity, and detail in subjective perception. Objectively, it achieves the highest MIoU (68.3) and OA (79.4) while maintaining low resource consumption. An ablation study further verifies the effectiveness of each component.
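The abstract's claim that atrous (dilated) convolution "expands the receptive field" without shrinking resolution can be illustrated with a minimal sketch. The function below is a generic, hypothetical illustration of 2-D atrous convolution in NumPy, not the paper's actual implementation; with kernel size k and rate r, the effective receptive field grows to r·(k−1)+1 while the number of kernel parameters stays at k².

```python
import numpy as np

def atrous_conv2d(x, kernel, rate):
    """2-D atrous (dilated) convolution with 'valid' padding.

    rate=1 is an ordinary convolution; rate>1 inserts rate-1 gaps
    between kernel taps, enlarging the receptive field without
    adding parameters. Illustrative sketch only (single channel,
    no padding or stride options).
    """
    k = kernel.shape[0]
    eff = rate * (k - 1) + 1                  # effective kernel size
    H, W = x.shape
    out = np.zeros((H - eff + 1, W - eff + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            # Sample the input with a stride of `rate` inside the window
            patch = x[i:i + eff:rate, j:j + eff:rate]
            out[i, j] = np.sum(patch * kernel)
    return out

# A 3x3 kernel at rate 2 covers a 5x5 region (effective size 5),
# so a 7x7 input yields a 3x3 output under 'valid' padding.
y = atrous_conv2d(np.ones((7, 7)), np.ones((3, 3)), rate=2)
```

A hybrid scheme like the one the abstract describes would stack such layers with different rates (e.g. 1, 2, 5) so that successive receptive fields tile the input densely instead of leaving gridding gaps.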
Conflict of interest statement
The authors have declared that no competing interests exist.
Similar articles
- A multiple-channel and atrous convolution network for ultrasound image segmentation. Med Phys. 2020 Dec;47(12):6270-6285. doi: 10.1002/mp.14512. PMID: 33007105
- Deep Neural Network-Based Semantic Segmentation of Microvascular Decompression Images. Sensors (Basel). 2021 Feb 7;21(4):1167. doi: 10.3390/s21041167. PMID: 33562275
- Multi-Scale Deep Neural Network Based on Dilated Convolution for Spacecraft Image Segmentation. Sensors (Basel). 2022 Jun 1;22(11):4222. doi: 10.3390/s22114222. PMID: 35684842
- ACCPG-Net: A skin lesion segmentation network with Adaptive Channel-Context-Aware Pyramid Attention and Global Feature Fusion. Comput Biol Med. 2023 Mar;154:106580. doi: 10.1016/j.compbiomed.2023.106580. PMID: 36716686. Review.
- [Application of semantic segmentation based on convolutional neural network in medical images]. Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2020 Jun 25;37(3):533-540. doi: 10.7507/1001-5515.201906067. PMID: 32597097. Review. Chinese.
Cited by
- Underwater Fish Segmentation Algorithm Based on Improved PSPNet Network. Sensors (Basel). 2023 Sep 25;23(19):8072. doi: 10.3390/s23198072. PMID: 37836901