When Excessive Perturbation Goes Wrong and Why IPUMS-International Relies Instead on Sampling, Suppression, Swapping, and Other Minimally Harmful Methods to Protect Privacy of Census Microdata
- PMID: 28393150
- PMCID: PMC5382996
- DOI: 10.1007/978-3-642-33627-0_14
When Excessive Perturbation Goes Wrong and Why IPUMS-International Relies Instead on Sampling, Suppression, Swapping, and Other Minimally Harmful Methods to Protect Privacy of Census Microdata
Abstract
IPUMS-International disseminates population census microdata at no cost for 69 countries. Currently, a series of 212 samples totaling almost a half billion person records are available to researchers. Registration is required for researchers to gain access to the microdata. Statistics from Google Analytics show that IPUMS-International's lengthy, probing registration form is an effective deterrent for unqualified applicants. To protect data privacy, we rely principally on sampling, suppression of geographic detail, swapping of records across geographic boundaries, and other minimally harmful methods such as top and bottom coding. We do not use excessively perturbative methods. A recent case of perturbation gone wrong- the household samples of the 2000 census of the USA (PUMS), the 2003-2006 American Community Survey, and the 2004-2009 Current Population Survey-, an empirical study of the impact of perturbation on the usability of UK census microdata-the Individual SARs of the 1991 census of the UK-, and a mathematical demonstration in a timely compendium of statistical confidentiality practices confirm the wisdom of IPUMS microdata management protocols and statistical disclosure controls.
Keywords: IPUMS-International; data dissemination; data privacy; microdata samples; population census; statistical disclosure controls.
Figures
Similar articles
-
IPUMS-International High Precision Population Census Microdata Samples: Balancing the Privacy-Quality Tradeoff by Means of Restricted Access Extracts.Priv Stat Databases. 2006 Dec;4302:375-382. doi: 10.1007/11930242_31. Priv Stat Databases. 2006. PMID: 28393148 Free PMC article.
-
IPUMS-International Statistical Disclosure Controls: 159 Census Microdata Samples in Dissemination, 100+ in Preparation.Priv Stat Databases. 2010 Sep;6344:74-84. doi: 10.1007/978-3-642-15838-4_7. Priv Stat Databases. 2010. PMID: 28393149 Free PMC article.
-
Creating Statistically Literate Global Citizens: The Use of IPUMS-International Integrated Census Microdata in Teaching.Stat J IAOS. 2011;27(3-4):145-156. doi: 10.3233/SJI-2011-0733. Stat J IAOS. 2011. PMID: 25279022 Free PMC article.
-
A tutorial in assessing disclosure risk in microdata.Stat Med. 2018 Nov 10;37(25):3693-3706. doi: 10.1002/sim.7667. Epub 2018 Jun 21. Stat Med. 2018. PMID: 29931695 Review.
-
Quo vadis, data privacy?Ann N Y Acad Sci. 2012 Jul;1260:45-54. doi: 10.1111/j.1749-6632.2012.06630.x. Ann N Y Acad Sci. 2012. PMID: 22809458 Review.
Cited by
-
"It's None of Their Damn Business": Privacy and Disclosure Control in the U.S. Census, 1790-2020.Popul Dev Rev. 2023 Sep;49(3):651-679. doi: 10.1111/padr.12580. Epub 2023 Jul 24. Popul Dev Rev. 2023. PMID: 37928237 Free PMC article.
-
The Big Census Data Revolution: IPUMS-International. Trans-Border Access to Decades of Census Samples for Three-Fourths of the World and more.Rev Demogr Hist. 2013;30(1):69-88. Rev Demogr Hist. 2013. PMID: 25506369 Free PMC article.
-
The shortcomings of synthetic census microdata.Proc Natl Acad Sci U S A. 2025 Mar 18;122(11):e2424655122. doi: 10.1073/pnas.2424655122. Epub 2025 Mar 6. Proc Natl Acad Sci U S A. 2025. PMID: 40048290
-
IPUMS International: A review and future prospects of a unique global statistical cooperation programme.Stat J IAOS. 2016;32(4):715-727. doi: 10.3233/SJI-161022. Epub 2016 Nov 15. Stat J IAOS. 2016. PMID: 28835781 Free PMC article.
-
THE IPUMS COLLABORATION: INTEGRATING AND DISSEMINATING THE WORLD'S POPULATION MICRODATA.J Demogr Economics. 2015 Jun;81(2):203-216. J Demogr Economics. 2015. PMID: 26236495 Free PMC article.
References
-
- McCaa R, Ruggles S, Davern M, Swenson T, Mohan Palipudi K. IPUMS-International High Precision Population Census Microdata Samples: Balancing the Privacy-Quality Tradeoff by Means of Restricted Access Extracts. In: Domingo-Ferrer J, Franconi L, editors. Privacy in Statistical Databases. PSD2006 Proceedings, LNCS 4302. Berlin: Springer-Verlag; 2006. pp. 375–382. - PMC - PubMed
-
- McCaa R, Ruggles S, Sobek M. IPUMS-International Statistical Disclosure Controls: 159 Census Microdata Samples In Dissemination, 100+ In Preparation. In: Domingo-Ferrer J, Magkos E, editors. Privacy in Statistical Databases. PSD2010 Proceedings, LNCS 6344. Heidelberg: Springer; 2010. pp. 74–84. - PMC - PubMed
-
- United Nations Economic Commission for Europe. Managing Statistical Confidentiality & Microdata Access: Principles and Guidelines of Good Practice. Geneva: United Nations; 2007. Conference of European Statisticians. See online edition Annex 1.23: http://www.unece.org/fileadmin/DAM/stats/publications/Managing.statistic....
-
- Reiter JP. Statistical Approaches to Protecting Confidentiality for Microdata and Their Effects on the Quality of Statistical Inferences. Public Opinion Quarterly. 2012;76(1):163–181.
-
- Duncan GT, Elliot M, Salazar-González J-J. Statistical Confidentiality: Principles and Practice. Heidelberg: Springer; 2011.
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous