Predicting 30-Day Readmission Risk for Patients With Chronic Obstructive Pulmonary Disease Through a Federated Machine Learning Architecture on Findable, Accessible, Interoperable, and Reusable (FAIR) Data: Development and Validation Study
- PMID: 35653170
- PMCID: PMC9204581
- DOI: 10.2196/35307
Predicting 30-Day Readmission Risk for Patients With Chronic Obstructive Pulmonary Disease Through a Federated Machine Learning Architecture on Findable, Accessible, Interoperable, and Reusable (FAIR) Data: Development and Validation Study
Abstract
Background: Owing to the nature of health data, their sharing and reuse for research are limited by legal, technical, and ethical implications. In this sense, to address that challenge and facilitate and promote the discovery of scientific knowledge, the Findable, Accessible, Interoperable, and Reusable (FAIR) principles help organizations to share research data in a secure, appropriate, and useful way for other researchers.
Objective: The objective of this study was the FAIRification of existing health research data sets and applying a federated machine learning architecture on top of the FAIRified data sets of different health research performing organizations. The entire FAIR4Health solution was validated through the assessment of a federated model for real-time prediction of 30-day readmission risk in patients with chronic obstructive pulmonary disease (COPD).
Methods: The application of the FAIR principles on health research data sets in 3 different health care settings enabled a retrospective multicenter study for the development of specific federated machine learning models for the early prediction of 30-day readmission risk in patients with COPD. This predictive model was generated upon the FAIR4Health platform. Finally, an observational prospective study with 30 days follow-up was conducted in 2 health care centers from different countries. The same inclusion and exclusion criteria were used in both retrospective and prospective studies.
Results: Clinical validation was demonstrated through the implementation of federated machine learning models on top of the FAIRified data sets from different health research performing organizations. The federated model for predicting the 30-day hospital readmission risk was trained using retrospective data from 4.944 patients with COPD. The assessment of the predictive model was performed using the data of 100 recruited (22 from Spain and 78 from Serbia) out of 2070 observed (records viewed) patients during the observational prospective study, which was executed from April 2021 to September 2021. Significant accuracy (0.98) and precision (0.25) of the predictive model generated upon the FAIR4Health platform were observed. Therefore, the generated prediction of 30-day readmission risk was confirmed in 87% (87/100) of cases.
Conclusions: Implementing a FAIR data policy in health research performing organizations to facilitate data sharing and reuse is relevant and needed, following the discovery, access, integration, and analysis of health research data. The FAIR4Health project proposes a technological solution in the health domain to facilitate alignment with the FAIR principles.
Keywords: FAIR principles; chronic obstructive pulmonary disease; clinical validation; early predictive model; privacy-preserving distributed data mining; research data management.
©Celia Alvarez-Romero, Alicia Martinez-Garcia, Jara Ternero Vega, Pablo Díaz-Jimènez, Carlos Jimènez-Juan, María Dolores Nieto-Martín, Esther Román Villarán, Tomi Kovacevic, Darijo Bokan, Sanja Hromis, Jelena Djekic Malbasa, Suzana Beslać, Bojan Zaric, Mert Gencturk, A Anil Sinaci, Manuel Ollero Baturone, Carlos Luis Parra Calderón. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 02.06.2022.
Conflict of interest statement
Conflicts of Interest: None declared.
Figures



Similar articles
-
FAIR4Health: Findable, Accessible, Interoperable and Reusable data to foster Health Research.Open Res Eur. 2022 May 31;2:34. doi: 10.12688/openreseurope.14349.2. eCollection 2022. Open Res Eur. 2022. PMID: 37645268 Free PMC article.
-
Privacy-preserving federated machine learning on FAIR health data: A real-world application.Comput Struct Biotechnol J. 2024 Feb 17;24:136-145. doi: 10.1016/j.csbj.2024.02.014. eCollection 2024 Dec. Comput Struct Biotechnol J. 2024. PMID: 38434250 Free PMC article.
-
Initiatives, Concepts, and Implementation Practices of the Findable, Accessible, Interoperable, and Reusable Data Principles in Health Data Stewardship: Scoping Review.J Med Internet Res. 2023 Aug 28;25:e45013. doi: 10.2196/45013. J Med Internet Res. 2023. PMID: 37639292 Free PMC article.
-
From Raw Data to FAIR Data: The FAIRification Workflow for Health Research.Methods Inf Med. 2020 Jun;59(S 01):e21-e32. doi: 10.1055/s-0040-1713684. Epub 2020 Jul 3. Methods Inf Med. 2020. PMID: 32620019
-
Challenges in mapping European rare disease databases, relevant for ML-based screening technologies in terms of organizational, FAIR and legal principles: scoping review.Front Public Health. 2023 Sep 15;11:1214766. doi: 10.3389/fpubh.2023.1214766. eCollection 2023. Front Public Health. 2023. PMID: 37780450 Free PMC article.
Cited by
-
FAIR principles to improve the impact on health research management outcomes.Heliyon. 2023 May 3;9(5):e15733. doi: 10.1016/j.heliyon.2023.e15733. eCollection 2023 May. Heliyon. 2023. PMID: 37205991 Free PMC article.
-
Privacy-preserving federated data access and federated learning: Improved data sharing and AI model development in transfusion medicine.Transfusion. 2025 Jan;65(1):22-28. doi: 10.1111/trf.18077. Epub 2024 Nov 29. Transfusion. 2025. PMID: 39610333 Free PMC article. Review.
-
Conceptual Framework and Documentation Standards of Cystoscopic Media Content for Artificial Intelligence.ArXiv [Preprint]. 2023 Jan 18:arXiv:2301.05991v2. ArXiv. 2023. Update in: J Biomed Inform. 2023 Jun;142:104369. doi: 10.1016/j.jbi.2023.104369. PMID: 36713258 Free PMC article. Updated. Preprint.
-
FAIR4Health: Findable, Accessible, Interoperable and Reusable data to foster Health Research.Open Res Eur. 2022 May 31;2:34. doi: 10.12688/openreseurope.14349.2. eCollection 2022. Open Res Eur. 2022. PMID: 37645268 Free PMC article.
-
Conceptual framework and documentation standards of cystoscopic media content for artificial intelligence.J Biomed Inform. 2023 Jun;142:104369. doi: 10.1016/j.jbi.2023.104369. Epub 2023 Apr 22. J Biomed Inform. 2023. PMID: 37088456 Free PMC article.
References
-
- Wilkinson M, Dumontier M, Aalbersberg IJ, Appleton G, Axton M, Baak A, Blomberg N, Boiten J-W, da Silva Santos LB, Bourne PE, Bouwman J, Brookes AJ, Clark T, Crosas M, Dillo I, Dumon O, Edmunds S, Evelo CT, Finkers R, Gonzalez-Beltran A, Gray AJ, Groth P, Goble C, Grethe JS, Heringa J, 't Hoen PA, Hooft R, Kuhn T, Kok R, Kok J, Lusher SJ, Martone ME, Mons A, Packer AL, Persson B, Rocca-Serra P, Roos M, van Schaik R, Sansone S-A, Schultes E, Sengstag T, Slater T, Strawn G, Swertz MA, Thompson M, van der Lei J, van Mulligen E, Velterop J, Waagmeester A, Wittenburg P, Wolstencroft K, Zhao J, Mons B. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data. 2016 Mar 15;3:160018–9. doi: 10.1038/sdata.2016.18. doi: 10.1038/sdata.2016.18.sdata201618 - DOI - DOI - PMC - PubMed
-
- Couture JL, Blake RE, McDonald G, Ward CL. A funder-imposed data publication requirement seldom inspired data sharing. PLoS One. 2018 Jul 6;13(7):e0199789. doi: 10.1371/journal.pone.0199789. https://dx.plos.org/10.1371/journal.pone.0199789 PONE-D-17-41143 - DOI - DOI - PMC - PubMed
LinkOut - more resources
Full Text Sources
Miscellaneous