The impact of commercial health datasets on medical research and health-care algorithms
- PMID: 37100543
- PMCID: PMC10155113
- DOI: 10.1016/S2589-7500(23)00025-0
Abstract
As the health-care industry emerges into a new era of digital health driven by cloud data storage, distributed computing, and machine learning, health-care data have become a premium commodity with value for private and public entities. Current frameworks of health data collection and distribution, whether from industry, academia, or government institutions, are imperfect and do not allow researchers to leverage the full potential of downstream analytical efforts. In this Health Policy paper, we review the current landscape of commercial health data vendors, with special emphasis on the sources of their data, challenges associated with data reproducibility and generalisability, and ethical considerations for data vending. We argue for sustainable approaches to curating open-source health data to enable global populations to be included in the biomedical research community. However, to fully implement these approaches, key stakeholders should come together to make health-care datasets increasingly accessible, inclusive, and representative, while balancing the privacy and rights of individuals whose data are being collected.
Copyright © 2023 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY 4.0 license. All rights reserved.
Conflict of interest statement
Declaration of interests: We declare no competing interests.