Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Feb 17:24:136-145.
doi: 10.1016/j.csbj.2024.02.014. eCollection 2024 Dec.

Privacy-preserving federated machine learning on FAIR health data: A real-world application

Affiliations

Privacy-preserving federated machine learning on FAIR health data: A real-world application

A Anil Sinaci et al. Comput Struct Biotechnol J. .

Abstract

Objective: This paper introduces a privacy-preserving federated machine learning (ML) architecture built upon Findable, Accessible, Interoperable, and Reusable (FAIR) health data. It aims to devise an architecture for executing classification algorithms in a federated manner, enabling collaborative model-building among health data owners without sharing their datasets.

Materials and methods: Utilizing an agent-based architecture, a privacy-preserving federated ML algorithm was developed to create a global predictive model from various local models. This involved formally defining the algorithm in two steps: data preparation and federated model training on FAIR health data and constructing the architecture with multiple components facilitating algorithm execution. The solution was validated by five healthcare organizations using their specific health datasets.

Results: Five organizations transformed their datasets into Health Level 7 Fast Healthcare Interoperability Resources via a common FAIRification workflow and software set, thereby generating FAIR datasets. Each organization deployed a Federated ML Agent within its secure network, connected to a cloud-based Federated ML Manager. System testing was conducted on a use case aiming to predict 30-day readmission risk for chronic obstructive pulmonary disease patients and the federated model achieved an accuracy rate of 87%.

Discussion: The paper demonstrated a practical application of privacy-preserving federated ML among five distinct healthcare entities, highlighting the value of FAIR health data in machine learning when utilized in a federated manner that ensures privacy protection without sharing data.

Conclusion: This solution effectively leverages FAIR datasets from multiple healthcare organizations for federated ML while safeguarding sensitive health datasets, meeting legislative privacy and security requirements.

Keywords: Distributed datasets; FAIR data; Federated machine learning; Privacy-preserving machine learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: A. Anil Sinaci reports financial support was provided by EU Framework Programme for Research and Innovation Science with and for Society. Celia Alvarez-Romero reports financial support was provided by European Regional Development Fund. If there are other authors, they declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

ga1
Graphical abstract
Fig. 1
Fig. 1
Graphical representation of the Federated ML Architecture: components, subcomponents, and main interactions among them.
Fig. 2
Fig. 2
Training phase of the federated ML algorithm.
Fig. 3
Fig. 3
Validation phase of the federated ML algorithm.
Fig. 4
Fig. 4
Aggregation phase of the federated ML algorithm.
Fig. 5
Fig. 5
The deployment setup of the federated machine learning architecture.

Similar articles

Cited by

References

    1. Vayena E. Value from health data: European opportunity to catalyse progress in digital health. Lancet (Lond, Engl) 2021;397:652–653. doi: 10.1016/S0140-6736(21)00203-8. - DOI - PubMed
    1. Alami H., Gagnon M.-P., Fortin J.-P. Digital health and the challenge of health systems transformation. MHealth. 2017;3:31. doi: 10.21037/mhealth.2017.07.02. - DOI - PMC - PubMed
    1. Pashazadeh A., Navimipour N.J. Big data handling mechanisms in the healthcare applications: a comprehensive and systematic literature review. J Biomed Inform. 2018;82:47–62. doi: 10.1016/J.JBI.2018.03.014. - DOI - PubMed
    1. Health Insurance Portability and Accountability Act of 1996 (HIPAA) n.d. https://www.cdc.gov/phlp/publications/topic/hipaa.html (accessed November 22, 2023).
    1. General Data Protection Regulation (GDPR) n.d. 〈https://gdpr.eu/〉 (Accessed November 22, 2023).

LinkOut - more resources