Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jul 6;19(1):26.
doi: 10.1186/s12942-020-00219-z.

Street masking: a network-based geographic mask for easily protecting geoprivacy

Affiliations

Street masking: a network-based geographic mask for easily protecting geoprivacy

David Swanlund et al. Int J Health Geogr. .

Abstract

Background: Geographic masks are techniques used to protect individual privacy in published maps but are highly under-utilized in research. This leads to continual violations of individual privacy, as sensitive health records are put at risk in unmasked maps. New approaches to geographic masking are required that foster accessibility and ease of use, such that they become more widely adopted. This article describes a new geographic masking method, called street masking, that reduces the burden on users of finding supplemental population data by instead automatically retrieving OpenStreetMap data and using the road network as a basis for masking. We compare it to donut geomasking, both with and without population density taken into account, to evaluate its efficacy against geographic masks that require slightly less and slightly more supplemental data. Our analysis is performed on synthetic data in three different Canadian cities.

Results: Street masking performs similarly to population-based donut geomasking with regard to privacy protection, achieving comparable k-anonymity values at similar median displacement distances. As expected, distance-based donut geomasking performs worst at privacy protection. Street masking also performs very well regarding information loss, achieving far better cluster preservation and landcover agreement than population-based donut geomasking. Distance-based donut geomasking performs similarly to street masking, though at the cost of reduced privacy protection.

Conclusion: Street masking competes with, if not out-performs population-based donut geomasking and does so without requiring any supplemental data from users. Moreover, unlike most other geographic masks, it significantly minimizes the risk of false attribution and inherently takes many geographic barriers into account. It is easily accessible for Python users and provides the foundation for interfaces to be built for non-coding users, such that privacy can be better protected in sensitive geospatial research.

Keywords: Donut geomasking; Geographic masking; Geomasking; Geoprivacy; OpenStreetMap; Osmnx; Street masking.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
A map depicting the points that were generated in Vancouver (a), Surrey (b), and Kamloops (c, Inset) and used for evaluating each geographic mask
Fig. 2
Fig. 2
A graphic illustrating the street masking algorithm. Note that the top point is in a low-density area, the middle point is in a high-density area, and the bottom point is in a medium-density area. As a result, the top point is moved the furthest distance after masking, the middle point is moved the least distance, and the bottom point is moved a medium distance
Fig. 3
Fig. 3
Results of Ripley’s K function analysis for clustering at different spatial scales in Vancouver. Results between the masks are fairly comparable, though population-based donut geomasking tends to increase clustering more than other methods
Fig. 4
Fig. 4
Results of Ripley’s K function analysis for clustering at different spatial scales in Surrey. Results here diverge more than in Vancouver due to increased population heterogeneity. Population-based donut geomasking significantly increases clustering compared to other masks. Street masking and distance-based donut geomasking produce similar results to the original data
Fig. 5
Fig. 5
Results of Ripley’s K function analysis for clustering at different spatial scales in Kamloops. With the most heterogeneously distributed population out of the three study areas, results here are most dramatic. Population-based donut geomasking greatly altered clustering. Distance-based donut geomasking is most like the original data, though street masking is not far behind

References

    1. Allshouse WB, Fitch MK, Hampton KH, Gesink DC, Doherty IA, Leone PA, Serre ML, Miller WC. Geomasking sensitive health data and privacy protection: an evaluation using an E911 database. Geocarto Int. 2010;25(6):443–452. doi: 10.1080/10106049.2010.496496. - DOI - PMC - PubMed
    1. Armstrong MP, Rushton G, Zimmerman DL. Geographically masking health data to preserve confidentiality. Stat Med. 1999;18(5):497–525. doi: 10.1002/(SICI)1097-0258(19990315)18:5<497::AID-SIM45>3.0.CO;2-#. - DOI - PubMed
    1. Barrington-Leigh C, Millard-Ball A. The world’s user-generated road map is more than 80% complete. PLoS ONE. 2017;12(8):e0180698. doi: 10.1371/journal.pone.0180698. - DOI - PMC - PubMed
    1. Boeing G (n.d.). Osmnx package. Readthedocs.Io. Retrieved January 3, 2020, from https://osmnx.readthedocs.io/en/stable/osmnx.html#module-osmnx.core.
    1. Boeing G. OSMnx: new methods for acquiring, constructing, analyzing, and visualizing complex street networks. Comput Environ Urban Syst. 2017;65:126–139. doi: 10.1016/j.compenvurbsys.2017.05.004. - DOI

Publication types