This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

[Preprint]. 2021 Nov 18:arXiv:2111.09461v1.

Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence

Xiang Bai¹,², Hanchen Wang³, Liya Ma¹, Yongchao Xu¹, Jiefeng Gan², Ziwei Fan², Fan Yang⁴, Ke Ma², Jiehua Yang², Song Bai², Chang Shu², Xinyu Zou², Renhao Huang², Changzheng Zhang⁵, Xiaowu Liu⁵, Dandan Tu⁵, Chuou Xu¹, Wenqing Zhang¹, Xi Wang⁶, Anguo Chen⁷, Yu Zeng⁸, Dehua Yang⁹, Ming-Wei Wang⁹, Nagaraj Holalkere¹⁰, Neil J Halin¹⁰, Ihab R Kamel¹¹, Jia Wu¹², Xuehua Peng¹³, Xiang Wang¹⁴, Jianbo Shao¹³, Pattanasak Mongkolwat¹⁵, Jianjun Zhang¹⁶,¹⁷, Weiyang Liu³, Michael Roberts¹⁸,¹⁹, Zhongzhao Teng²⁰, Lucian Beer²⁰, Lorena Escudero Sanchez²⁰, Evis Sala²⁰, Daniel Rubin²¹, Adrian Weller³,²², Joan Lasenby³, Chuangsheng Zheng⁴, Jianming Wang²³, Zhen Li¹, Carola-Bibiane Schönlieb¹⁸,²², Tian Xia²

Affiliations

¹ Department of Radiology, Tongji Hospital and Medical College, Huazhong University of Science and Technology, Wuhan, China.
² School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan, China.
³ Department of Engineering, University of Cambridge, Cambridge, UK.
⁴ Department of Radiology, Union Hospital of Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.
⁵ HUST-HW Joint Innovation Lab, Wuhan, China.
⁶ CalmCar Inc, Suzhou, China.
⁷ Wuhan Blood Centre, Wuhan, China.
⁸ MSA Capital, Beijing, China.
⁹ The National Centre for Drug Screening, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China.
¹⁰ CardioVascular and Interventional Radiology, Radiology for Quality and Operations, The Cardiovascular Centre at Tufts Medical Centre, Radiology, Tufts University School of Medicine, Medford, USA.
¹¹ Russell H Morgan Department of Radiology & Radiologic Science, Johns Hopkins Hospital & Medicine Institute, Baltimore, USA.
¹² Department of Radiation Oncology, School of Medicine, Stanford University, Palo Alto, USA.
¹³ Department of Radiology, Wuhan Central Hospital, Wuhan, China.
¹⁴ Department of Radiology, Wuhan Children's Hospital, Wuhan, China.
¹⁵ Faculty of Information and Communication Technology, Mahidol University, Thailand.
¹⁶ Thoracic/Head and Neck Medical Oncology, University of Texas MD Anderson Cancer Centre, Houston, USA.
¹⁷ Translational Molecular Pathology, University of Texas MD Anderson Cancer Centre, Houston, USA.
¹⁸ Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK.
¹⁹ Oncology R&D at AstraZeneca, Cambridge, UK.
²⁰ Department of Radiology, University of Cambridge, Cambridge, UK.
²¹ Department of Biomedical Data Science, Radiology and Medicine, Stanford University, Palo Alto, USA.
²² Alan Turing Institute, London, UK.
²³ Department of Hepatobiliary Pancreatic Surgery, Affiliated Tianyou Hospital, Wuhan University of Science and Technology, Wuhan, China.

PMID: 34815983
PMCID: PMC8609899

Update in

Erratum: Author Correction: Advancing COVID-19 diagnosis with privacy-preserving collaboration in artificial intelligence.
Bai X, Wang H, Ma L, Xu Y, Gan J, Fan Z, Yang F, Ma K, Yang J, Bai S, Shu C, Zou X, Huang R, Zhang C, Liu X, Tu D, Xu C, Zhang W, Wang X, Chen A, Zeng Y, Yang D, Wang MW, Holalkere N, Halin NJ, Kamel IR, Wu J, Peng X, Wang X, Shao J, Mongkolwat P, Zhang J, Liu W, Roberts M, Teng Z, Beer L, Sanchez LE, Sala E, Rubin DL, Weller A, Lasenby J, Zheng C, Wang J, Li Z, Schönlieb C, Xia T. Nat Mach Intell. 2022;4(4):413. doi: 10.1038/s42256-022-00485-5. Epub 2022 Apr 8. PMID: 37520117.

Abstract

Artificial intelligence (AI) offers a promising way to streamline COVID-19 diagnoses. However, concerns surrounding security and trustworthiness impede the collection of large-scale representative medical data, posing a considerable challenge for training a well-generalised model for clinical practice. To address this, we launch the Unified CT-COVID AI Diagnostic Initiative (UCADI), in which the AI model is trained in a distributed manner and executed independently at each host institution under a federated learning (FL) framework, without data sharing. Here we show that our FL model outperformed all the local models by a large margin (test sensitivity/specificity in China: 0.973/0.951; in the UK: 0.730/0.942), achieving performance comparable to that of a panel of professional radiologists. We further evaluated the model on hold-out data (collected from another two hospitals that did not take part in the FL) and heterogeneous data (acquired with contrast materials), provided visual explanations for decisions made by the model, and analysed the trade-offs between model performance and communication costs in the federated training process. Our study is based on 9,573 chest computed tomography scans (CTs) from 3,336 patients collected from 23 hospitals located in China and the UK. Collectively, our work advances the prospects of utilising federated learning for privacy-preserving AI in digital health.

Conflict of interest statement

The authors declare no competing interests.

Figures

**Fig. 1 | Conceptual architecture of UCADI.**
The participants first download the 3D CNN model and train it on the data of their local cohorts. The trained model parameters are then encrypted and transmitted back to the server. Finally, the server produces the federated model by aggregating the contributions from each participant, without explicit access to the parameters.
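The aggregation step in this caption can be illustrated with a minimal sketch of federated averaging. This is not the UCADI implementation: the function name `federated_average`, the weighting by local sample count, and the toy parameter vectors are assumptions made for illustration, and the encryption of the transmitted parameters described in the caption is omitted entirely.

```python
from typing import List, Sequence


def federated_average(
    client_params: Sequence[Sequence[float]],
    client_sizes: Sequence[int],
) -> List[float]:
    """Average flattened parameter vectors, weighted by local sample count.

    Assumption: weighting by sample count (as in standard FedAvg); the
    source caption does not state the weighting rule.
    """
    if not client_params or len(client_params) != len(client_sizes):
        raise ValueError("need one sample count per client")
    n_params = len(client_params[0])
    if any(len(p) != n_params for p in client_params):
        raise ValueError("all clients must send same-length parameter vectors")
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("total sample count must be positive")

    averaged = [0.0] * n_params
    for params, size in zip(client_params, client_sizes):
        weight = size / total  # contribution of this client
        for i, value in enumerate(params):
            averaged[i] += weight * value
    return averaged


# Toy example with two hypothetical participants.
hospital_a = [0.0, 1.0, 2.0]  # parameters trained on 300 local scans
hospital_b = [1.0, 1.0, 0.0]  # parameters trained on 100 local scans
global_params = federated_average([hospital_a, hospital_b], [300, 100])
print(global_params)
```

In a privacy-preserving deployment the server would operate on encrypted or masked parameters rather than on plain values as shown here.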

**Fig. 2 | Deployment and workflow of UCADI participants.**
a, Data. A local dataset is constructed from high-quality, well-annotated and anonymised CTs. b, Flow. The backbone of the 3D DenseNet model mainly consists of six 3D dense blocks (in green), two 3D transmit blocks (in white), and an output layer (in grey). The CTs of each case are converted into a (16,128,128) tensor after adaptive sampling, decentralisation and trilinear interpolation, and then fed into the 3D CNN model for pneumonia classification. c, Process. During training, the model outputs are used to calculate the weighted cross entropy for updating the network parameters. During testing, five independent predictions for each case are combined to report the predictive diagnostic result.
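The training and testing steps in panel c can be sketched as follows. This is an illustrative sketch only: the two-class setup, the class weights, and the choice of a simple mean to combine the five predictions are assumptions, since the caption does not specify them. The helper names `softmax`, `weighted_cross_entropy` and `average_predictions` are hypothetical.

```python
import math
from typing import List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over raw model outputs."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def weighted_cross_entropy(
    logits: Sequence[float], target: int, class_weights: Sequence[float]
) -> float:
    """Cross entropy for one case, scaled by the weight of its true class."""
    probs = softmax(logits)
    return -class_weights[target] * math.log(probs[target])


def average_predictions(prob_vectors: Sequence[Sequence[float]]) -> List[float]:
    """Combine several per-case probability vectors by taking their mean.

    Assumption: the caption says predictions are 'incorporated' without
    naming the rule; a plain mean is used here.
    """
    n = len(prob_vectors)
    return [sum(column) / n for column in zip(*prob_vectors)]


# Training: hypothetical class weights that up-weight class 1.
loss = weighted_cross_entropy(logits=[0.0, 0.0], target=1, class_weights=[1.0, 2.0])

# Testing: five hypothetical predictions for the same case.
five_runs = [[0.9, 0.1], [0.8, 0.2], [0.7, 0.3], [0.6, 0.4], [0.5, 0.5]]
final_probs = average_predictions(five_runs)
print(loss, final_probs)
```

Weighting the loss by class is a common way to compensate for class imbalance; whether that was the motivation here is not stated in this section.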

**Fig. 3 | Overview of CTs.**
a, Radiological features correlated with COVID-19 pneumonia cases: ground glass opacity, interlobular septal thickening and consolidation (from left to right). b, Other non-COVID-19 cases, including healthy cases and other viral and bacterial pneumonia. c, Localised class-discriminative regions for COVID-19 cases, generated by GradCAM (shown as heatmaps) and annotated by professional radiologists (circled in red).

**Fig. 4 | COVID-19 pneumonia identification performance of 3D CNN models trained on four different data resources (Main Campus, Optical Valley, Sino-French and NCCID) individually and federatively.**
a, Receiver operating characteristic (ROC) curves of the models tested on the data from China, in comparison with six professional radiologists. b, ROC curves of the CNN models tested on the data from the UK. c, Numeric results for test sensitivity, specificity and area under the curve (AUC, with 95% confidence intervals and p-values).
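For readers less familiar with the metrics in panel c, the sketch below shows how sensitivity and specificity follow from binary labels and predictions. The function name `sensitivity_specificity` and the toy labels are hypothetical and are not taken from the study data.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(
    y_true: Sequence[int], y_pred: Sequence[int]
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) for binary labels, 1 = positive."""
    if len(y_true) != len(y_pred):
        raise ValueError("label and prediction lists must have equal length")
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative case")
    sensitivity = tp / (tp + fn)  # fraction of positive cases detected
    specificity = tn / (tn + fp)  # fraction of negative cases correctly cleared
    return sensitivity, specificity


# Toy example: 4 positive and 6 negative cases (hypothetical labels).
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
sens, spec = sensitivity_specificity(y_true, y_pred)
print(sens, spec)
```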

**Fig. 5 | Trade-off between performance and communication cost in federated training.**
a, Relationship between transmission expense and model generalisation. b, Estimated time spent at different communication/synchronisation intervals. The statistics are measured on a joint FL training run with two clients. Each client has 200 CTs for training and 100 CTs for testing. Each client's hardware comprises a single GPU (NVIDIA GTX 1080Ti) and a CPU (Xeon(R) CPU E5-2660 v4 @ 2.00GHz). The transmission bandwidth is around 7.2 Mb/s (900 KB/s), which is the average broadband speed.
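The bandwidth figures in this caption, and the way the synchronisation interval drives communication time, can be checked with a short back-of-the-envelope sketch. The payload size (45 MB), the iteration count and the synchronisation interval below are hypothetical values chosen for illustration; this section does not report the model size, and the assumption that each round consists of one upload and one download is also ours.

```python
BITS_PER_BYTE = 8


def megabits_to_kilobytes_per_second(mbps: float) -> float:
    """Convert Mb/s to KB/s using decimal units (1 Mb = 1000 Kb)."""
    return mbps * 1000 / BITS_PER_BYTE


def transfer_seconds(payload_megabytes: float, bandwidth_mbps: float) -> float:
    """Time to move one payload across the link, ignoring latency and overhead."""
    kilobytes_per_second = megabits_to_kilobytes_per_second(bandwidth_mbps)
    return payload_megabytes * 1000 / kilobytes_per_second


def total_communication_seconds(
    payload_megabytes: float,
    bandwidth_mbps: float,
    n_iterations: int,
    sync_interval: int,
) -> float:
    """Total transfer time when clients synchronise every `sync_interval` iterations.

    Assumption: each synchronisation round is one upload plus one download
    of the full parameter payload.
    """
    if sync_interval <= 0:
        raise ValueError("sync_interval must be positive")
    rounds = n_iterations // sync_interval
    return rounds * 2 * transfer_seconds(payload_megabytes, bandwidth_mbps)


# The caption's conversion: 7.2 Mb/s corresponds to about 900 KB/s.
link_kbps = megabits_to_kilobytes_per_second(7.2)

# Hypothetical 45 MB payload, 100 training iterations, sync every 10 iterations.
one_way = transfer_seconds(45, 7.2)
total = total_communication_seconds(45, 7.2, n_iterations=100, sync_interval=10)
print(link_kbps, one_way, total)
```

Under these assumptions, doubling the synchronisation interval halves the number of rounds and hence the total transfer time, which is the cost side of the trade-off the figure describes.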
