Privacy-preserving federated machine learning on FAIR health data: A real-world application
- PMID: 38434250
- PMCID: PMC10904920
- DOI: 10.1016/j.csbj.2024.02.014
Privacy-preserving federated machine learning on FAIR health data: A real-world application
Abstract
Objective: This paper introduces a privacy-preserving federated machine learning (ML) architecture built upon Findable, Accessible, Interoperable, and Reusable (FAIR) health data. It aims to devise an architecture for executing classification algorithms in a federated manner, enabling collaborative model-building among health data owners without sharing their datasets.
Materials and methods: Utilizing an agent-based architecture, a privacy-preserving federated ML algorithm was developed to create a global predictive model from various local models. This involved formally defining the algorithm in two steps: data preparation and federated model training on FAIR health data and constructing the architecture with multiple components facilitating algorithm execution. The solution was validated by five healthcare organizations using their specific health datasets.
Results: Five organizations transformed their datasets into Health Level 7 Fast Healthcare Interoperability Resources via a common FAIRification workflow and software set, thereby generating FAIR datasets. Each organization deployed a Federated ML Agent within its secure network, connected to a cloud-based Federated ML Manager. System testing was conducted on a use case aiming to predict 30-day readmission risk for chronic obstructive pulmonary disease patients and the federated model achieved an accuracy rate of 87%.
Discussion: The paper demonstrated a practical application of privacy-preserving federated ML among five distinct healthcare entities, highlighting the value of FAIR health data in machine learning when utilized in a federated manner that ensures privacy protection without sharing data.
Conclusion: This solution effectively leverages FAIR datasets from multiple healthcare organizations for federated ML while safeguarding sensitive health datasets, meeting legislative privacy and security requirements.
Keywords: Distributed datasets; FAIR data; Federated machine learning; Privacy-preserving machine learning.
© 2024 The Authors.
Conflict of interest statement
The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: A. Anil Sinaci reports financial support was provided by EU Framework Programme for Research and Innovation Science with and for Society. Celia Alvarez-Romero reports financial support was provided by European Regional Development Fund. If there are other authors, they declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Figures






Similar articles
-
Predicting 30-Day Readmission Risk for Patients With Chronic Obstructive Pulmonary Disease Through a Federated Machine Learning Architecture on Findable, Accessible, Interoperable, and Reusable (FAIR) Data: Development and Validation Study.JMIR Med Inform. 2022 Jun 2;10(6):e35307. doi: 10.2196/35307. JMIR Med Inform. 2022. PMID: 35653170 Free PMC article.
-
Advancing Privacy-Preserving Health Care Analytics and Implementation of the Personal Health Train: Federated Deep Learning Study.JMIR AI. 2025 Feb 6;4:e60847. doi: 10.2196/60847. JMIR AI. 2025. PMID: 39912580 Free PMC article.
-
FAIR4Health: Findable, Accessible, Interoperable and Reusable data to foster Health Research.Open Res Eur. 2022 May 31;2:34. doi: 10.12688/openreseurope.14349.2. eCollection 2022. Open Res Eur. 2022. PMID: 37645268 Free PMC article.
-
Federated Learning in Glaucoma: A Comprehensive Review and Future Perspectives.Ophthalmol Glaucoma. 2025 Jan-Feb;8(1):92-105. doi: 10.1016/j.ogla.2024.08.004. Epub 2024 Aug 29. Ophthalmol Glaucoma. 2025. PMID: 39214457 Free PMC article. Review.
-
Analysis of Privacy Preservation Enhancements in Federated Learning Frameworks.In: Sofia RC, Soldatos J, editors. Shaping the Future of IoT with Edge Intelligence: How Edge Computing Enables the Next Generation of IoT Applications. Abingdon (UK): River Publishers; 2024 Jan. Chapter 6. In: Sofia RC, Soldatos J, editors. Shaping the Future of IoT with Edge Intelligence: How Edge Computing Enables the Next Generation of IoT Applications. Abingdon (UK): River Publishers; 2024 Jan. Chapter 6. PMID: 38564559 Free Books & Documents. Review.
Cited by
-
Artificial intelligence in ischemic stroke images: current applications and future directions.Front Neurol. 2024 Jul 10;15:1418060. doi: 10.3389/fneur.2024.1418060. eCollection 2024. Front Neurol. 2024. PMID: 39050128 Free PMC article. Review.
-
Generation of Fundus Fluorescein Angiography Videos for Health Care Data Sharing.JAMA Ophthalmol. 2025 Aug 1;143(8):623-632. doi: 10.1001/jamaophthalmol.2025.1419. JAMA Ophthalmol. 2025. PMID: 40569610
-
A scalable and transparent data pipeline for AI-enabled health data ecosystems.Front Med (Lausanne). 2024 Jul 30;11:1393123. doi: 10.3389/fmed.2024.1393123. eCollection 2024. Front Med (Lausanne). 2024. PMID: 39139784 Free PMC article.
-
Development and validation of a deep learning-based framework for automated lung CT segmentation and acute respiratory distress syndrome prediction: a multicenter cohort study.EClinicalMedicine. 2024 Jul 26;75:102772. doi: 10.1016/j.eclinm.2024.102772. eCollection 2024 Sep. EClinicalMedicine. 2024. PMID: 39170939 Free PMC article.
-
Mapping the regulatory landscape for artificial intelligence in health within the European Union.NPJ Digit Med. 2024 Aug 27;7(1):229. doi: 10.1038/s41746-024-01221-6. NPJ Digit Med. 2024. PMID: 39191937 Free PMC article.
References
-
- Health Insurance Portability and Accountability Act of 1996 (HIPAA) n.d. https://www.cdc.gov/phlp/publications/topic/hipaa.html (accessed November 22, 2023).
-
- General Data Protection Regulation (GDPR) n.d. 〈https://gdpr.eu/〉 (Accessed November 22, 2023).
LinkOut - more resources
Full Text Sources