Federated learning enables big data for rare cancer boundary detection
- PMID: 36470898
- PMCID: PMC9722782
- DOI: 10.1038/s41467-022-33407-5
Federated learning enables big data for rare cancer boundary detection
Erratum in
-
Author Correction: Federated learning enables big data for rare cancer boundary detection.Nat Commun. 2023 Jan 26;14(1):436. doi: 10.1038/s41467-023-36188-7. Nat Commun. 2023. PMID: 36702828 Free PMC article. No abstract available.
Abstract
Although machine learning (ML) has shown promise across disciplines, out-of-sample generalizability is concerning. This is currently addressed by sharing multi-site data, but such centralization is challenging/infeasible to scale due to various limitations. Federated ML (FL) provides an alternative paradigm for accurate and generalizable ML, by only sharing numerical model updates. Here we present the largest FL study to-date, involving data from 71 sites across 6 continents, to generate an automatic tumor boundary detector for the rare disease of glioblastoma, reporting the largest such dataset in the literature (n = 6, 314). We demonstrate a 33% delineation improvement for the surgically targetable tumor, and 23% for the complete tumor extent, over a publicly trained model. We anticipate our study to: 1) enable more healthcare studies informed by large diverse data, ensuring meaningful results for rare diseases and underrepresented populations, 2) facilitate further analyses for glioblastoma by releasing our consensus model, and 3) demonstrate the FL effectiveness at such scale and task-complexity as a paradigm shift for multi-site collaborations, alleviating the need for data-sharing.
© 2022. The Author(s).
Conflict of interest statement
The Intel-affiliated authors (B. Edwards, M. Sheller, S. Wang, G.A. Reina, P. Foley, A. Gruzdev, D. Karkada, P. Shah, J. Martin) would like to disclose the following (potential) competing interests as Intel employees. Intel may develop proprietary software that is related in reputation to the OpenFL open source project highlighted in this work. In addition, the work demonstrates feasibility of federated learning for brain tumor boundary detection models. Intel may benefit by selling products to support an increase in demand for this use-case. The remaining authors declare no competing interests.
Figures



References
-
- Mårtensson G, et al. The reliability of a deep learning model in clinical out-of-distribution MRI data: a multicohort study. Med. Image Anal. 2020;66:101714. - PubMed
-
- Marcus, G. Deep learning: a critical appraisal. arXiv preprint arXiv:1801.00631 (2018).
-
- Aggarwal, C. C. et al. Neural Networks and Deep Learning Vol. 10, 978–983 (Springer, 2018).
Publication types
MeSH terms
Grants and funding
- R01 CA270027/CA/NCI NIH HHS/United States
- U01 CA080098/CA/NCI NIH HHS/United States
- MR/W021684/1/MRC_/Medical Research Council/United Kingdom
- U10 CA079778/CA/NCI NIH HHS/United States
- UL1 TR001409/TR/NCATS NIH HHS/United States
- 28832/CRUK_/Cancer Research UK/United Kingdom
- UL1 TR001433/TR/NCATS NIH HHS/United States
- U01 CA242871/CA/NCI NIH HHS/United States
- P30 CA051008/CA/NCI NIH HHS/United States
- U01 CA248226/CA/NCI NIH HHS/United States
- U01 CA079778/CA/NCI NIH HHS/United States
- U10 CA037422/CA/NCI NIH HHS/United States
- U10 CA080098/CA/NCI NIH HHS/United States
- U24 CA189523/CA/NCI NIH HHS/United States
- U01 CA176110/CA/NCI NIH HHS/United States
- U10 CA180822/CA/NCI NIH HHS/United States
- U10 CA180868/CA/NCI NIH HHS/United States
- R21 EB030209/EB/NIBIB NIH HHS/United States
- R50 CA211270/CA/NCI NIH HHS/United States
- R01 CA233888/CA/NCI NIH HHS/United States
- P50 CA211015/CA/NCI NIH HHS/United States
- R37 CA214955/CA/NCI NIH HHS/United States
- R01 CA082500/CA/NCI NIH HHS/United States
- S10 OD023495/OD/NIH HHS/United States
- R01 NS042645/NS/NINDS NIH HHS/United States
- U10 CA021661/CA/NCI NIH HHS/United States
- U10 CA180820/CA/NCI NIH HHS/United States
- WT_/Wellcome Trust/United Kingdom
- R01 CA269948/CA/NCI NIH HHS/United States
- U10 CA180794/CA/NCI NIH HHS/United States
- U24 CA215109/CA/NCI NIH HHS/United States
- R01 LM013151/LM/NLM NIH HHS/United States