Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 May:70:101992.
doi: 10.1016/j.media.2021.101992. Epub 2021 Feb 6.

Federated semi-supervised learning for COVID region segmentation in chest CT using multi-national data from China, Italy, Japan

Affiliations

Federated semi-supervised learning for COVID region segmentation in chest CT using multi-national data from China, Italy, Japan

Dong Yang et al. Med Image Anal. 2021 May.

Abstract

The recent outbreak of Coronavirus Disease 2019 (COVID-19) has led to urgent needs for reliable diagnosis and management of SARS-CoV-2 infection. The current guideline is using RT-PCR for testing. As a complimentary tool with diagnostic imaging, chest Computed Tomography (CT) has been shown to be able to reveal visual patterns characteristic for COVID-19, which has definite value at several stages during the disease course. To facilitate CT analysis, recent efforts have focused on computer-aided characterization and diagnosis with chest CT scan, which has shown promising results. However, domain shift of data across clinical data centers poses a serious challenge when deploying learning-based models. A common way to alleviate this issue is to fine-tune the model locally with the target domains local data and annotations. Unfortunately, the availability and quality of local annotations usually varies due to heterogeneity in equipment and distribution of medical resources across the globe. This impact may be pronounced in the detection of COVID-19, since the relevant patterns vary in size, shape, and texture. In this work, we attempt to find a solution for this challenge via federated and semi-supervised learning. A multi-national database consisting of 1704 scans from three countries is adopted to study the performance gap, when training a model with one dataset and applying it to another. Expert radiologists manually delineated 945 scans for COVID-19 findings. In handling the variability in both the data and annotations, a novel federated semi-supervised learning technique is proposed to fully utilize all available data (with or without annotations). Federated learning avoids the need for sensitive data-sharing, which makes it favorable for institutions and nations with strict regulatory policy on data privacy. Moreover, semi-supervision potentially reduces the annotation burden under a distributed setting. The proposed framework is shown to be effective compared to fully supervised scenarios with conventional data sharing instead of model weight sharing.

Keywords: COVID-19; Chest CT; Federated learning; Semi-supervision.

PubMed Disclaimer

Conflict of interest statement

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

None
Graphical abstract
Fig. 1
Fig. 1
Federated learning with privacy preserving in medical imaging. The central federated server communicates with clients from multi-national institutions by sharing weights and gradients of models, without exchanging any sensitive data information.
Fig. 2
Fig. 2
The axial planes of chest CT scans from three different sites. Areas inside green contours represent COVID-19 affected regions annotated by radiologist. The appearance of the affected region identified as “infiltrates” range from diffused ground glass opacity (COVID, upper row) to focal nodules (lower row). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 3
Fig. 3
The general architecture of fully convolutional network (FCN) for COVID-19 affected region in medical imaging. Dashed lines denote skip connections which feed earlier feature maps to later neural network layers.
Fig. 4
Fig. 4
FL is to find global low-error space of θglobal for client A and B after aggregation of “gradients” ΔA and ΔB. The low-error space is defined by each client, which could correspond to high accuracy, or high model consistency for self-supervision.
Algorithm 1
Algorithm 1
Federated learning for COVID region segmentation using weighted federated averaging.
Algorithm 2
Algorithm 2
Self-supervised learning algorithm at clients with unlabeled data for COVID region segmentation.
Fig. 5
Fig. 5
Visualizations of federated semi-supervised segmentation of COVID regions in 3D CT (from the testing set of unsupervised client Image_2). “Non-FL” indicates results from the model trained with Image_1 along, and “FL” denotes results from the model trained with federated semi-supervised learning on Image_1 and Image_2. The segmentation results using the proposed framework captures the ground truth shapes better and has less false positives.
Fig. 6
Fig. 6
Accuracy (Dice’s score) comparison of different learning rates of un-/unsupervised clients.
Fig. 7
Fig. 7
Accuracy (Dice’s score) comparison of different aggregation frequency (per 5, 10, 20, 40 rounds).
Fig. 8
Fig. 8
Model performance on LIDC dataset, capturing solid nodule (1, 2); ground glass nodule (2, 3); and other abnormal patterns (4).
Fig. 9
Fig. 9
Visualization of the COVID-19 affected region segmentation/prediction (red regions) together with lungs and airways (a) in 3D space, and (b,c,d) in different (axial, sagittal, coronal) planes of a raw CT image. Note that slice thickness is 5 mm (as compared with 0.8 mm in-plane), which is the case for most images in this work. . (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

References

    1. American College of Radiology,. ACR Recommendations for the use of Chest Radiography and Computed Tomography (CT) for Suspected COVID-19 Infection.
    1. Armato III S.G., McLennan G., Bidaut L., McNitt-Gray M.F., Meyer C.R., Reeves A.P., Zhao B., Aberle D.R., Henschke C.I., Hoffman E.A., Kazerooni E.A., MacMahon H., van Beek E.J.R., Yankelevitz D., Biancardi A.M., Bland P.H., Brown M.S., Engelmann R.M., Laderach G.E., Max D., Pais R.C., Qing D.P.-Y., Roberts R.Y., Smith A.R., Starkey A., Batra P., Caligiuri P., Farooqi A., Gladish G.W., Jude C.M., Munden R.F., Petkovska I., Quint L.E., Schwartz L.H., Sundaram B., Dodd L.E., Fenimore C., Gur D., Petrick N., Freymann J., Kirby J., Hughes B., Vande Casteele A., Gupte S., Sallam M., Heath M.D., Kuhn M.H., Dharaiya E., Burns R., Fryd D.S., Salganicoff M., Anand V., Shreter U., Vastagh S., Croft B.Y., Clarke L.P. The lung image database consortium (lidc) and image database resource initiative (idri): a completed reference database of lung nodules on ct scans. Med. Phys. 2011;38(2):915–931. - PMC - PubMed
    1. Bai H.X., Wang R., Xiong Z., Hsieh B., Chang K., Halsey K., Tran T.M.L., Choi J.W., Wang D.-C., Shi L.-B., et al. Ai augmentation of radiologist performance in distinguishing covid-19 from pneumonia of other etiology on chest ct. Radiology. 2020:201491. - PMC - PubMed
    1. Bai W., Oktay O., Sinclair M., Suzuki H., Rajchl M., Tarroni G., Glocker B., King A., Matthews P.M., Rueckert D. International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer; 2017. Semi-supervised learning for network-based cardiac mr image segmentation; pp. 253–260.
    1. van Berlo B., Saeed A., Ozcelebi T. Proceedings of the Third ACM International Workshop on Edge Systems, Analytics and Networking. 2020. Towards federated unsupervised representation learning; pp. 31–36.

Publication types