Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Nov;23(6):1040-1045.
doi: 10.1093/jamia/ocw001. Epub 2016 Mar 24.

Preserving temporal relations in clinical data while maintaining privacy

Affiliations

Preserving temporal relations in clinical data while maintaining privacy

George Hripcsak et al. J Am Med Inform Assoc. 2016 Nov.

Abstract

Objective: Maintaining patient privacy is a challenge in large-scale observational research. To assist in reducing the risk of identifying study subjects through publicly available data, we introduce a method for obscuring date information for clinical events and patient characteristics.

Methods: The method, which we call Shift and Truncate (SANT), obscures date information to any desired granularity. Shift and Truncate first assigns each patient a random shift value, such that all dates in that patient's record are shifted by that amount. Data are then truncated from the beginning and end of the data set.

Results: The data set can be proven to not disclose temporal information finer than the chosen granularity. Unlike previous strategies such as a simple shift, it remains robust to frequent - even daily - updates and robust to inferring dates at the beginning and end of date-shifted data sets. Time-of-day may be retained or obscured, depending on the goal and anticipated knowledge of the data recipient.

Conclusions: The method can be useful as a scientific approach for reducing re-identification risk under the Privacy Rule of the Health Insurance Portability and Accountability Act and may contribute to qualification for the Safe Harbor implementation.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
Shift and Truncate. Each row is a unique patient, each number is a unique data point for a patient, and each rectangle represents the time that the patient was potentially observed. ( a ) Original data set. Patients are potentially observed for 3 years (each vertical line marks 1 year). Patients need not have data, but simply the potential to have been observed (even if they lived elsewhere or were not born yet, someone had the potential to have been observed). ( b ) Shifted data set. Patient records are shifted forward by 1–366 days. Data points that were previously aligned across patients are no longer aligned, but points within a given patient remain at the same relative distances from each other. ( c ) Shifted and truncated data set. Data points from the first 366 days of the shifted data set and from the last 366 days of the shifted data set are removed from the data set.

References

    1. Hripcsak G Duke JD Shah NH et al. . Observational Health Data Sciences and Informatics (OHDSI): opportunities for observational researchers . Stud Health Technol Inform . 2015. ; 216 : 574 – 578 . - PMC - PubMed
    1. Fleurence RL Curtis LH Califf RM Platt R Selby J Brown JS . Launching PCORnet, a national patient-centered clinical research network . J Am Med Inform Assoc. 2014. ; 4 : 578 – 582 . - PMC - PubMed
    1. Gottesman O Kuivaniemi H Tromp G et al. . The Electronic Medical Records and Genomics (eMERGE) network: past, present, and future . Genet Med. 2013. ; 15 : 761 – 771 . - PMC - PubMed
    1. Cavoukian A El Emam K . Dispelling the Myths Surrounding De-identification: Anonymization Remains a Strong Tool for Protecting Privacy . Office of the Ontario Informational and Privacy Commissioner, Toronto, Canada. June 2011. https://www.ipc.on.ca/images/Resources/anonymization.pdf . Accessed August 31, 2015 .
    1. McGraw D . Building public trust in uses of Health Insurance Portability and Accountability Act de-identified data . J Am Med Inform Assoc. 2013. ; 20 : 29 – 34 . - PMC - PubMed

Publication types