Differential privacy in health research: A scoping review
- PMID: 34333623
- PMCID: PMC8449619
- DOI: 10.1093/jamia/ocab135
Differential privacy in health research: A scoping review
Abstract
Objective: Differential privacy is a relatively new method for data privacy that has seen growing use due its strong protections that rely on added noise. This study assesses the extent of its awareness, development, and usage in health research.
Materials and methods: A scoping review was conducted by searching for ["differential privacy" AND "health"] in major health science databases, with additional articles obtained via expert consultation. Relevant articles were classified according to subject area and focus.
Results: A total of 54 articles met the inclusion criteria. Nine articles provided descriptive overviews, 31 focused on algorithm development, 9 presented novel data sharing systems, and 8 discussed appraisals of the privacy-utility tradeoff. The most common areas of health research where differential privacy has been discussed are genomics, neuroimaging studies, and health surveillance with personal devices. Algorithms were most commonly developed for the purposes of data release and predictive modeling. Studies on privacy-utility appraisals have considered economic cost-benefit analysis, low-utility situations, personal attitudes toward sharing health data, and mathematical interpretations of privacy risk.
Discussion: Differential privacy remains at an early stage of development for applications in health research, and accounts of real-world implementations are scant. There are few algorithms for explanatory modeling and statistical inference, particularly with correlated data. Furthermore, diminished accuracy in small datasets is problematic. Some encouraging work has been done on decision making with regard to epsilon. The dissemination of future case studies can inform successful appraisals of privacy and utility.
Conclusions: More development, case studies, and evaluations are needed before differential privacy can see widespread use in health research.
Keywords: confidentiality; data sharing; differential privacy; privacy; statistical disclosure limitation.
© The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Figures


References
-
- Holdren J. Memorandum for the Heads of Executive Departments and Agencies: Increasing Access to the Results of Federally Funded Scientific Research. 2013. https://obamawhitehouse.archives.gov/sites/default/files/microsites/ostp.... Accessed April 5, 2019.
-
- National Institutes of Health. Final NIH Statement on Sharing Research Data. 2003. https://grants.nih.gov/grants/guide/notice-files/NOT-OD-03-032.html. Accessed April 8, 2019.
-
- National Science Foundation. Proposal & Award Policies & Procedures Guide. 2019. https://www.nsf.gov/pubs/policydocs/pappg19_1/pappg_11.jsp#XID4. Accessed April 8, 2019.
-
- Taichman DB, Sahni P, Pinborg A, et al.Data sharing statements for clinical trials: a requirement of the international committee of medical journal editors. Ann Intern Med 2017; 167 (1): 63–5. - PubMed
-
- U.S. Department of Health and Human Services. Guidance Regarding Methods for De-Identification of Protected Health Information in Accordance with the Health Insurance Portability and Accountability Act (HIPAA) Privacy Rule. 2012. https://www.hhs.gov/hipaa/for-professionals/privacy/special-topics/de-id.... Accessed April 18, 2019.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous