Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Apr 7:8:35.
doi: 10.3389/fninf.2014.00035. eCollection 2014.

Sharing privacy-sensitive access to neuroimaging and genetics data: a review and preliminary validation

Affiliations

Sharing privacy-sensitive access to neuroimaging and genetics data: a review and preliminary validation

Anand D Sarwate et al. Front Neuroinform. .

Abstract

The growth of data sharing initiatives for neuroimaging and genomics represents an exciting opportunity to confront the "small N" problem that plagues contemporary neuroimaging studies while further understanding the role genetic markers play in the function of the brain. When it is possible, open data sharing provides the most benefits. However, some data cannot be shared at all due to privacy concerns and/or risk of re-identification. Sharing other data sets is hampered by the proliferation of complex data use agreements (DUAs) which preclude truly automated data mining. These DUAs arise because of concerns about the privacy and confidentiality for subjects; though many do permit direct access to data, they often require a cumbersome approval process that can take months. An alternative approach is to only share data derivatives such as statistical summaries-the challenges here are to reformulate computational methods to quantify the privacy risks associated with sharing the results of those computations. For example, a derived map of gray matter is often as identifiable as a fingerprint. Thus alternative approaches to accessing data are needed. This paper reviews the relevant literature on differential privacy, a framework for measuring and tracking privacy loss in these settings, and demonstrates the feasibility of using this framework to calculate statistics on data distributed at many sites while still providing privacy.

Keywords: collaborative research; data integration; data sharing; neuroimaging; privacy.

PubMed Disclaimer

Figures

Figure 1
Figure 1
System for differentially private classifier aggregation from many sites. The N sites each train a classifier on their local data to learn vectors {wi}. These are used by an aggregator to compute new features for its own data set. The aggregator can learn a classifier using its own data using a non-private algorithm (if its data is public) or a differentially private algorithm (if its data is private).
Figure 2
Figure 2
Classification error rates for the mixed private-public case (A) and the fully-private case (B). In both cases the combined differentially private classifier performs significantly better than the individual classifiers. The difference is statistically significant even after Bonferroni correction (to account for multiple sites) with corrected p-values below 1.8 × 10−33. Results thus motivate the use of differential privacy for sharing of brain imaging and genetic data to enable quick access to data which is either hard to access for logical reasons or not available for open sharing at all.

References

    1. Allen E. A., Erhardt E. B., Damaraju E., Gruner W., Segall J. M., Silva R. F., et al. (2011). A baseline for the multivariate comparison of resting state networks. Front. Syst. Neurosci. 5:2 10.3389/fnsys.2011.00002 - DOI - PMC - PubMed
    1. Arbabshirani M. R., Kiehl K., Pearlson G., Calhoun V. D. (2013). Classification of schizophrenia patients based on resting-state functional network connectivity. Front. Neurosci. 7:133 10.3389/fnins.2013.00133 - DOI - PMC - PubMed
    1. Bießmann F., Plis S., Meinecke F. C., Eichele T., Müller K. R. (2011). Analysis of multimodal neuroimaging data. IEEE Rev. Biomed. Eng. 4, 6 10.1109/RBME.2011.2170675 - DOI - PubMed
    1. Bridwell D. A., Wu L., Eichele T., Calhoun V. D. (2013). The spatiospectral characterization of brain networks: fusing concurrent EEG spectra and fMRI maps. Neuroimage 69, 101–111 10.1016/j.neuroimage.2012.12.024 - DOI - PMC - PubMed
    1. Chaudhuri K., Mishra N. (2006). When random sampling preserves privacy, in Advances in Cryptology - CRYPTO 2006. Lecture notes in computer science, Vol. 4117, ed Dwork C. (Berlin: Springer-Verlag; ), 198–213 10.1007/11818175_12 - DOI