Current recommendations/practices for anonymising data from clinical trials in order to make it available for sharing: A scoping review
- PMID: 35730910
- PMCID: PMC9373195
- DOI: 10.1177/17407745221087469
Current recommendations/practices for anonymising data from clinical trials in order to make it available for sharing: A scoping review
Abstract
Background/aims: There are increasing pressures for anonymised datasets from clinical trials to be shared across the scientific community, and differing recommendations exist on how to perform anonymisation prior to sharing. We aimed to systematically identify, describe and synthesise existing recommendations for anonymising clinical trial datasets to prepare for data sharing.
Methods: We systematically searched MEDLINE®, EMBASE and Web of Science from inception to 8 February 2021. We also searched other resources to ensure the comprehensiveness of our search. Any publication reporting recommendations on anonymisation to enable data sharing from clinical trials was included. Two reviewers independently screened titles, abstracts and full text for eligibility. One reviewer extracted data from included papers using thematic synthesis, which then was sense-checked by a second reviewer. Results were summarised by narrative analysis.
Results: Fifty-nine articles (from 43 studies) were eligible for inclusion. Three distinct themes are emerging: anonymisation, de-identification and pseudonymisation. The most commonly used anonymisation techniques are: removal of direct patient identifiers; and careful evaluation and modification of indirect identifiers to minimise the risk of identification. Anonymised datasets joined with controlled access was the preferred method for data sharing.
Conclusions: There is no single standardised set of recommendations on how to anonymise clinical trial datasets for sharing. However, this systematic review shows a developing consensus on techniques used to achieve anonymisation. Researchers in clinical trials still consider that anonymisation techniques by themselves are insufficient to protect patient privacy, and they need to be paired with controlled access.
Keywords: Clinical trials; data anonymisation; data curation; datasets; guidelines; patient identification systems; personally identifiable information; systematic review.
Conflict of interest statement
Figures

Similar articles
-
A survey on UK researchers' views regarding their experiences with the de-identification, anonymisation, release methods and re-identification risk estimation for clinical trial datasets.Clin Trials. 2025 Feb;22(1):11-23. doi: 10.1177/17407745241259086. Epub 2024 Jun 19. Clin Trials. 2025. PMID: 39927449 Free PMC article.
-
Protecting patient privacy when sharing patient-level data from clinical trials.BMC Med Res Methodol. 2016 Jul 8;16 Suppl 1(Suppl 1):77. doi: 10.1186/s12874-016-0169-4. BMC Med Res Methodol. 2016. PMID: 27410040 Free PMC article.
-
Sharing traumatic stress research data: assessing and reducing the risk of re-identification.Eur J Psychotraumatol. 2025 Dec;16(1):2499296. doi: 10.1080/20008066.2025.2499296. Epub 2025 May 19. Eur J Psychotraumatol. 2025. PMID: 40387730 Free PMC article. Review.
-
Use and Understanding of Anonymization and De-Identification in the Biomedical Literature: Scoping Review.J Med Internet Res. 2019 May 31;21(5):e13484. doi: 10.2196/13484. J Med Internet Res. 2019. PMID: 31152528 Free PMC article.
-
Data sharing in clinical trials - practical guidance on anonymising trial datasets.Trials. 2018 Jan 10;19(1):25. doi: 10.1186/s13063-017-2382-9. Trials. 2018. PMID: 29321053 Free PMC article.
Cited by
-
A survey on UK researchers' views regarding their experiences with the de-identification, anonymisation, release methods and re-identification risk estimation for clinical trial datasets.Clin Trials. 2025 Feb;22(1):11-23. doi: 10.1177/17407745241259086. Epub 2024 Jun 19. Clin Trials. 2025. PMID: 39927449 Free PMC article.
-
A Scalable Pseudonymization Tool for Rapid Deployment in Large Biomedical Research Networks: Development and Evaluation Study.JMIR Med Inform. 2024 Apr 23;12:e49646. doi: 10.2196/49646. JMIR Med Inform. 2024. PMID: 38654577 Free PMC article.
-
Leveraging Synthetic Data to Facilitate Research: A Collaborative Model for Analyzing Sensitive National Cancer Registry Data in England.Ther Innov Regul Sci. 2025 Jun 5. doi: 10.1007/s43441-025-00820-z. Online ahead of print. Ther Innov Regul Sci. 2025. PMID: 40474047
-
Qualitative data sharing practices in clinical trials in the UK and Ireland: towards the production of good practice guidance.HRB Open Res. 2023 Feb 6;6:10. doi: 10.12688/hrbopenres.13667.1. eCollection 2023. HRB Open Res. 2023. PMID: 37456658 Free PMC article.
-
From Bedside to Desktop: A Data Protocol for Normative Intracranial EEG and Abnormality Mapping.Bio Protoc. 2025 May 20;15(10):e5321. doi: 10.21769/BioProtoc.5321. eCollection 2025 May 20. Bio Protoc. 2025. PMID: 40432754 Free PMC article.
References
-
- Song F, Hooper L, Loke Y. Publication bias: what is it? How do we measure it? How do we avoid it? Open Access J Clin Trials 2013; 2013: 71–81.
-
- Berlin JA, Morris S, Rockhold F, et al.. Bumps and bridges on the road to responsible sharing of clinical trial data. Clin Trials 2014; 11(1): 7–12. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources