Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2021 Nov 18:arXiv:2111.09461v1.

Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence

Affiliations

Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence

Xiang Bai et al. ArXiv. .

Update in

  • Erratum: Author Correction: Advancing COVID-19 diagnosis with privacy-preserving collaboration in artificial intelligence.
    Bai X, Wang H, Ma L, Xu Y, Gan J, Fan Z, Yang F, Ma K, Yang J, Bai S, Shu C, Zou X, Huang R, Zhang C, Liu X, Tu D, Xu C, Zhang W, Wang X, Chen A, Zeng Y, Yang D, Wang MW, Holalkere N, Halin NJ, Kamel IR, Wu J, Peng X, Wang X, Shao J, Mongkolwat P, Zhang J, Liu W, Roberts M, Teng Z, Beer L, Sanchez LE, Sala E, Rubin DL, Weller A, Lasenby J, Zheng C, Wang J, Li Z, Schönlieb C, Xia T. Bai X, et al. Nat Mach Intell. 2022;4(4):413. doi: 10.1038/s42256-022-00485-5. Epub 2022 Apr 8. Nat Mach Intell. 2022. PMID: 37520117 Free PMC article.

Abstract

Artificial intelligence (AI) provides a promising substitution for streamlining COVID-19 diagnoses. However, concerns surrounding security and trustworthiness impede the collection of large-scale representative medical data, posing a considerable challenge for training a well-generalised model in clinical practices. To address this, we launch the Unified CT-COVID AI Diagnostic Initiative (UCADI), where the AI model can be distributedly trained and independently executed at each host institution under a federated learning framework (FL) without data sharing. Here we show that our FL model outperformed all the local models by a large yield (test sensitivity /specificity in China: 0.973/0.951, in the UK: 0.730/0.942), achieving comparable performance with a panel of professional radiologists. We further evaluated the model on the hold-out (collected from another two hospitals leaving out the FL) and heterogeneous (acquired with contrast materials) data, provided visual explanations for decisions made by the model, and analysed the trade-offs between the model performance and the communication costs in the federated training process. Our study is based on 9,573 chest computed tomography scans (CTs) from 3,336 patients collected from 23 hospitals located in China and the UK. Collectively, our work advanced the prospects of utilising federated learning for privacy-preserving AI in digital health.

PubMed Disclaimer

Conflict of interest statement

Competing Interests Statement

The authors declare no competing interests.

Figures

Fig. 1 |
Fig. 1 |. Conceptual architecture of UCADI.
The participants first download and train the 3D CNN models based on the data of local cohorts. The trained model parameters are then encrypted and transmitted back to the server. Finally, the server produces the federated model via aggregating the contributions from each participant while without explicit access to the parameters.
Fig. 2 |
Fig. 2 |. Deployment and workflow of UCADI participants.
a, Data. Construct a local dataset based on the high-quality, well-annotated and anonymised CTs. b, Flow. The backbone of the 3D DenseNet model mainly consists of six 3D dense blocks (in green), two 3D transmit blocks (in white), and an output layer (in grey). CTs of each case are converted into a (16,128,128) tensor after adaptive sampling, decentralisation and trilinear interpolation, then feed into the 3D CNN model for pneumonia classification. c, Process. During training, the model outputs are used to calculate the weighted cross entropy to update the network parameters. While testing, five independent predictions of each case are incorporated to report the predictive diagnostic results.
Fig. 3 |
Fig. 3 |. Overview of CTs.
a, Radiological features correlated with COVID-19 pneumonia cases: ground glass opacity, interlobular septal thickening and consolidation (from left to right). b, Other non-COVID-19 cases, incl. healthy, other viral and bacterial pneumonia. c, Localised class-discriminative regions generated by GradCAM (in the heatmap) and annotated by the professional radiologists (circled in red), for COVID-19 cases.
Fig. 4 |
Fig. 4 |. COVID-19 pneumonia identification performance of 3D CNN models trained on four different data resources (Main Campus, Optical Valley, Sino-French and NCCID) individually and federatively.
a, Receiver Operating Characteristic (ROC) curves when the models are tested on the data from China, in comparison with six professional radiologists, b, ROC curves of the CNN models tested on the data from the UK, c, Numeric results of the test sensitivity, specificity and area under the curve (AUC, with 95% confidence intervals and p-values)
Fig. 5 |
Fig. 5 |. Trade-off on the performance and communication cost in federated training.
a, Relationships between transmission expense and model generalisation, b, Estimated time spent at different communication/synchronisation intervals. The statistics is measured based on a joint FL training of two clients. Each client has 200 CTs and 100 CTs for training and testing, respectively. The client’s software infrastructure is a single-core of GPU (NVIDIA GTX 1080Ti) and a CPU (Xeon(R) CPU E5–2660 v4 @ 2.00GHz). The bandwidth for transmission is around 7.2Mb/s (900KB/s), which is the average broadband speed.

References

    1. Ai T. et al. Correlation of Chest CT and RT-PCR Testing for Coronavirus Disease 2019 (COVID-19) in China: A Report of 1014 Cases. Radiology 296, (2020). - PMC - PubMed
    1. Fang Y. et al. Sensitivity of chest CT for COVID-19: Comparison to RT-PCR. Radiology vol. 296 (2020). - PMC - PubMed
    1. Kanne J. P., Little B. P., Chung J. H., Elicker B. M. & Ketai L. H. Essentials for radiologists on COVID-19: An update-radiology scientific expert panel. Radiology vol. 296 (2020). - PMC - PubMed
    1. Kucirka L. M., Lauer S. A., Laeyendecker O., Boon D. & Lessler J. Variation in False-Negative Rate of Reverse Transcriptase Polymerase Chain Reaction-Based SARS-CoV-2 Tests by Time Since Exposure. Annals of internal medicine vol. 173 (2020). - PMC - PubMed
    1. Ackerman C. M. et al. Massively multiplexed nucleic acid detection with Cas13. Nature 582, (2020). - PMC - PubMed

Publication types

LinkOut - more resources