Systematic Review of Privacy-Preserving Distributed Machine Learning From Federated Databases in Health Care
- PMID: 32134684
- PMCID: PMC7113079
- DOI: 10.1200/CCI.19.00047
Systematic Review of Privacy-Preserving Distributed Machine Learning From Federated Databases in Health Care
Abstract
Big data for health care is one of the potential solutions to deal with the numerous challenges of health care, such as rising cost, aging population, precision medicine, universal health coverage, and the increase of noncommunicable diseases. However, data centralization for big data raises privacy and regulatory concerns.Covered topics include (1) an introduction to privacy of patient data and distributed learning as a potential solution to preserving these data, a description of the legal context for patient data research, and a definition of machine/deep learning concepts; (2) a presentation of the adopted review protocol; (3) a presentation of the search results; and (4) a discussion of the findings, limitations of the review, and future perspectives.Distributed learning from federated databases makes data centralization unnecessary. Distributed algorithms iteratively analyze separate databases, essentially sharing research questions and answers between databases instead of sharing the data. In other words, one can learn from separate and isolated datasets without patient data ever leaving the individual clinical institutes.Distributed learning promises great potential to facilitate big data for medical application, in particular for international consortiums. Our purpose is to review the major implementations of distributed learning in health care.
Conflict of interest statement
Fadila Zerka
Samir Barakat
Sean Walsh
Ralph T. H. Leijenaar
Arthur Jochems
Benjamin Miraglio
Philippe Lambin
David Townend
No other potential conflicts of interest were reported.
Figures
References
-
- Mitchell TM: Machine Learning International ed., [Reprint.]. New York, NY, McGraw-Hill, 1997.
-
- Boyd S, Parikh N, Chu E, et al: Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends in Machine Learning 3:1-122, 2010.
-
- Cardoso I, Almeida E, Allende-Cid H, et al: Analysis of machine learning algorithms for diagnosis of diffuse lung diseases. Methods Inf Med 57:272-279, 2018. - PubMed
-
- Wang X, Peng Y, Lu L, et al: ChestX-Ray8: Hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases. Presented at 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Honolulu, HI, July 21-26, 2017.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
