Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Oct:11765:192-200.
doi: 10.1007/978-3-030-32245-8_22. Epub 2019 Oct 10.

Mixed-Supervised Dual-Network for Medical Image Segmentation

Affiliations

Mixed-Supervised Dual-Network for Medical Image Segmentation

Duo Wang et al. Med Image Comput Comput Assist Interv. 2019 Oct.

Abstract

Deep learning based medical image segmentation models usually require large datasets with high-quality dense segmentations to train, which are very time-consuming and expensive to prepare. One way to tackle this difficulty is using the mixed-supervised learning framework, where only a part of data is densely annotated with segmentation label and the rest is weakly labeled with bounding boxes. The model is trained jointly in a multi-task learning setting. In this paper, we propose Mixed-Supervised Dual-Network (MSDN), a novel architecture which consists of two separate networks for the detection and segmentation tasks respectively, and a series of connection modules between the layers of the two networks. These connection modules are used to transfer useful information from the auxiliary detection task to help the segmentation task. We propose to use a recent technique called 'Squeeze and Excitation' in the connection module to boost the transfer. We conduct experiments on two medical image segmentation datasets. The proposed MSDN model outperforms multiple baselines.

Keywords: Dual-network; Medical image segmentation; Mixed-supervised learning; Multi-task learning; Squeeze-and-Excitation.

PubMed Disclaimer

Figures

Fig.1.
Fig.1.
Illustration of the Channel Squeeze and Spatial Excitation (sSE) architecture of Unary form (a) and Binary form (b).
Fig.2.
Fig.2.
Structure of Mixed-Supervised Dual-Network (MSDN).
Fig.3.
Fig.3.
(a) Original image. (b) Ground truth. (c) U-Net trained in full-supervised manner. (d) U-Net trained with only strongly-annotated data. (e) U-Net+Unary sSE. (f) MSDN-. (g),(h) Segmentation and detection results of Variant MS-Net. (i),(j) Segmentation and detection results of MSDN.

References

    1. Ronneberger O, Fischer P, Brox T: U-net: convolutional networks for biomedical image segmentation In: Navab N, Hornegger J, Wells WM, Frangi AF (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham: (2015). 10.1007/978-3-319-24574-4_28 - DOI
    1. Milletari F, Navab N, Ahmadi S: V-net: fully convolutional neural networks for volumetric medical image segmentation. In: International Conference on 3D Vision, pp. 565–571 (2016)
    1. Jiang J, Hu YC, Liu CJ, et al.: Multiple resolution residually connected feature streams for automatic lung tumor segmentation from CT images. IEEE Trans. Med. Imaging 38(1), 134–144 (2019) - PMC - PubMed
    1. Fan G, Liu H, Wu Z, et al.: Deep learning-based automatic segmentation of lumbosacral nerves on CT for spinal intervention: a translational study. Am. J. Neuroradiol. 40(6), 1074–1081 (2019) - PMC - PubMed
    1. Wang X, You S, Li X, Ma H: Weakly-supervised semantic segmentation byiteratively mining common object features. CVPR 2018, 1354–1362 (2018)

LinkOut - more resources