Anonymization of whole slide images in histopathology for research and education
- PMID: 37205164
- PMCID: PMC10185865
- DOI: 10.1177/20552076231171475
Abstract
Objective: The exchange of health-related data is subject to regional laws and regulations, such as the General Data Protection Regulation (GDPR) in the EU or the Health Insurance Portability and Accountability Act (HIPAA) in the United States, which poses non-trivial challenges for researchers and educators working with these data. In pathology, the digitization of diagnostic tissue samples inevitably generates identifying data, which can include both sensitive and acquisition-related information, stored in vendor-specific file formats. Whole Slide Images (WSIs) are usually distributed and used outside the clinic in these formats, because industry-wide standardization such as DICOM has so far been adopted only tentatively and slide scanner vendors currently do not provide anonymization functionality.
Methods: We developed a guideline for the proper handling of histopathological image data, particularly for research and education, with regard to the GDPR. In this context, we evaluated existing anonymization methods and examined proprietary format specifications to identify all sensitive information for the most common WSI formats. This work resulted in a software library that enables GDPR-compliant anonymization of WSIs while preserving the native formats.
Results: Based on the analysis of proprietary formats, all occurrences of sensitive information were identified for the file formats frequently used in clinical routine. An open-source programming library with an executable CLI tool and wrappers for different programming languages was then developed.
Conclusions: Our analysis showed that there is no straightforward software solution to anonymize WSIs in a GDPR-compliant way while maintaining the data format. We closed this gap with our extensible open-source library that works instantaneously and offline.
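As a rough illustration of what anonymizing a file "while maintaining the data format" can mean at the byte level, the following minimal Python sketch overwrites a fixed-length metadata field in place with filler bytes, so the file size and all byte offsets stay unchanged. It is not the authors' library or its API. The toy file layout, the `PatientName=` marker, the 16-byte field length, and the function names `overwrite_field` and `demo` are all hypothetical and chosen only for illustration. Real WSI formats store metadata in format-specific structures that must be located according to each vendor's specification.

```python
import os
import tempfile


def overwrite_field(path, marker, field_len, filler=b"X"):
    """Overwrite the field_len bytes after each `marker` with `filler`, in place.

    Returns the number of fields overwritten. The file size and all byte
    offsets stay unchanged, so the container layout is preserved.
    """
    if len(filler) != 1:
        raise ValueError("filler must be a single byte")
    count = 0
    with open(path, "r+b") as fh:
        # Sketch only: real WSIs are large, so a real tool would scan only
        # the metadata regions instead of reading the whole file.
        data = fh.read()
        pos = data.find(marker)
        while pos != -1:
            start = pos + len(marker)
            end = min(start + field_len, len(data))
            fh.seek(start)
            fh.write(filler * (end - start))
            count += 1
            pos = data.find(marker, end)
    return count


def demo():
    """Build a toy file (hypothetical layout), anonymize it, return (before, after)."""
    before = b"HEADER;PatientName=" + b"Jane Doe".ljust(16) + b";PIXELDATA"
    fd, path = tempfile.mkstemp()
    try:
        with os.fdopen(fd, "wb") as fh:
            fh.write(before)
        overwrite_field(path, b"PatientName=", 16)
        with open(path, "rb") as fh:
            after = fh.read()
    finally:
        os.remove(path)
    return before, after
```

Calling `demo()` returns the toy file's bytes before and after the overwrite. The two have the same length, and the bytes outside the overwritten field are untouched, which is the property that lets existing viewers keep reading the file.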
Keywords: Pathology; data anonymization; health education; privacy; research.
© The Author(s) 2023.
Conflict of interest statement
The authors declare that no potential conflicts of interest exist with respect to the research, authorship, and/or publication of this article.