Towards a privacy preserving cohort discovery framework for clinical research networks
- PMID: 28007583
- PMCID: PMC5316314
- DOI: 10.1016/j.jbi.2016.12.008
Abstract
Background: The last few years have witnessed an increasing number of clinical research networks (CRNs) focused on building large collections of data from electronic health records (EHRs), claims, and patient-reported outcomes (PROs). Many of these CRNs provide a service for the discovery of research cohorts with various health conditions, which is especially useful for rare diseases. Supporting patient privacy can enhance the scalability and efficiency of such processes; however, current practice mainly relies on policy, such as guidelines defined in the Health Insurance Portability and Accountability Act (HIPAA), which are insufficient for CRNs (e.g., HIPAA does not require encryption of data, a measure that can mitigate insider threats). By combining policy with privacy-enhancing technologies, we can enhance the trustworthiness of CRNs. The goal of this research is to determine whether searchable encryption can instill privacy in CRNs without sacrificing their usability.
Methods: We developed a technique, implemented in working software, to enable privacy-preserving cohort discovery (PPCD) services in large distributed CRNs based on elliptic curve cryptography (ECC). This technique also incorporates a block indexing strategy to improve the performance (in terms of computational running time) of PPCD. We evaluated the PPCD service with three real cohort definitions of varied query complexity: (1) elderly cervical cancer patients who underwent radical hysterectomy, (2) oropharyngeal and tongue cancer patients who underwent robotic transoral surgery, and (3) female breast cancer patients who underwent mastectomy. These definitions were tested in an encrypted database of 7.1 million records derived from the publicly available Healthcare Cost and Utilization Project (HCUP) Nationwide Inpatient Sample (NIS). We assessed the performance of the PPCD service in terms of (1) accuracy in cohort discovery, (2) computational running time, and (3) privacy afforded to the underlying records during PPCD.
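The abstract does not spell out how the block index works, so the following is only a minimal sketch of one plausible reading: records are grouped into fixed-size blocks, each block carries a summary of the search tokens it contains, and a conjunctive cohort query skips any block whose summary lacks a required token. Everything here is an assumption made for illustration. In particular, keyed HMAC-SHA256 tokens stand in for the ECC-based searchable encryption of the paper, and the function names (`token`, `build_blocks`, `search`), the field names, and the toy records are hypothetical.

```python
import hashlib
import hmac


def token(key: bytes, field: str, value: str) -> bytes:
    """Keyed, deterministic search token for one field=value pair.

    Assumption: HMAC-SHA256 is a stand-in, NOT the ECC scheme of the paper.
    """
    return hmac.new(key, f"{field}={value}".encode(), hashlib.sha256).digest()


def protect_record(key: bytes, record: dict) -> frozenset:
    """Replace a plaintext record by the set of its search tokens."""
    return frozenset(token(key, f, v) for f, v in record.items())


def build_blocks(protected: list, block_size: int) -> list:
    """Group records into blocks; each block stores the union of its tokens."""
    blocks = []
    for start in range(0, len(protected), block_size):
        chunk = protected[start:start + block_size]
        summary = frozenset().union(*chunk)
        blocks.append((start, summary, chunk))
    return blocks


def search(blocks: list, trapdoors: list) -> tuple:
    """Conjunctive query: return (matching record ids, number of blocks scanned)."""
    wanted = set(trapdoors)
    hits, scanned = [], 0
    for start, summary, chunk in blocks:
        if not wanted <= summary:
            continue  # some required token is absent from the whole block
        scanned += 1
        for offset, rec in enumerate(chunk):
            if wanted <= rec:
                hits.append(start + offset)
    return hits, scanned


# Toy data, invented for this sketch only (not from HCUP NIS).
KEY = b"demo-key-not-for-production"
RECORDS = [
    {"dx": "breast_cancer", "proc": "mastectomy", "sex": "F"},
    {"dx": "cervical_cancer", "proc": "hysterectomy", "sex": "F"},
    {"dx": "tongue_cancer", "proc": "transoral_surgery", "sex": "M"},
    {"dx": "tongue_cancer", "proc": "none", "sex": "F"},
    {"dx": "breast_cancer", "proc": "none", "sex": "F"},
    {"dx": "cervical_cancer", "proc": "mastectomy", "sex": "F"},
]
BLOCKS = build_blocks([protect_record(KEY, r) for r in RECORDS], block_size=2)
QUERY = [
    token(KEY, "dx", "breast_cancer"),
    token(KEY, "proc", "mastectomy"),
    token(KEY, "sex", "F"),
]
hits, scanned = search(BLOCKS, QUERY)
print(hits, scanned)
```

In this toy run the middle block is skipped without touching its records, while the last block is scanned even though it holds no match, because its summary happens to contain every requested token. Deterministic tokens also leak equality patterns, which is one reason a real deployment would rely on a proper searchable encryption scheme rather than this stand-in. Since blocks are independent of each other, the loop over blocks could be distributed across workers.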
Results: The empirical results indicate that the proposed PPCD can execute cohort discovery queries in a reasonable amount of time, with query runtimes in the range of 165-262 s for the three use cases and no loss of accuracy. We further show that the search performance is practical because it supports a highly parallelized design for secure evaluation over encrypted records. Additionally, our security analysis shows that the proposed construction is resilient to standard adversaries.
Conclusions: PPCD services can be designed for clinical research networks. The security construction presented in this work achieves high privacy guarantees by preventing threats originating both from within and from beyond the network.
Keywords: Clinical research network (CRN); Data privacy; OneFlorida Clinical Data Research Network (CDRN); Patient-Centered Clinical Research Network (PCORnet); Privacy-preserving cohort discovery; Searchable encryption.
Copyright © 2016 Elsevier Inc. All rights reserved.