A data pipeline for secure extraction and sharing of social determinants of health
- PMID: 39888883
- PMCID: PMC11785280
- DOI: 10.1371/journal.pone.0317215
A data pipeline for secure extraction and sharing of social determinants of health
Abstract
Objectives: Linking neighborhood- and patient-level data provides valuable information about the influence of upstream social determinants of health (SDOH). However, sharing of these data across health systems presents challenges. We set out to develop a pipeline to acquire, deidentify, and share neighborhood-level SDOH data across multiple health systems.
Methods: We created a pipeline centered around Decentralized Geomarker Assessment for Multi-Site Studies (DeGAUSS) that utilizes containerization to geocode patient addresses and obtain neighborhood-level SDOH variables. We compared DeGAUSS to a third-party vendor geocoding tool available at Duke Health using a cohort of adult patients referred for abdominal transplant from January 1, 2016, to December 31, 2022. We calculated Cohen's Kappa and percent disagreement at census block group and tract levels, and by Area Deprivation Index, urbanicity, and year.
Results: The pipeline successfully generated SDOH data for 97.8% of addresses. There was high concordance between DeGAUSS and the vendor tool at the census block group (0.93) and tract levels (0.95). At the block group level, disagreement proportion differed by year and urbanicity, with larger disagreement in the rural category than in micropolitan and metropolitan categories (13%, 7%, 6.2%, respectively).
Discussion and conclusion: We describe a novel pipeline that can facilitate the secure acquisition and sharing of neighborhood-level SDOH without sharing PHI. The pipeline can be scaled to include additional social, climate, and environmental variables, and can be extended to an unlimited number of health systems.
Copyright: © 2025 Schappe et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures
References
-
- Health CoSDo. Closing the gap in a generation: health equity through action on the social determinants of health: final report of the commission on social determinants of health: World Health Organization; 2008. - PubMed
-
- Paskett E, Thompson B, Ammerman AS, Ortega AN, Marsteller J, Richardson D. Multilevel Interventions To Address Health Disparities Show Promise In Improving Population Health. Health Aff (Millwood). 2016;35(8):1429–34. doi: 10.1377/hlthaff.2015.1360 ; PubMed Central PMCID: PMC5553289. - DOI - PMC - PubMed
-
- Chow TE, Dede-Bamfo N., & Dahal K. R. Geographic disparity of positional errors and matching rate of residential addresses among geocoding solutions. Annals of GIS. 2015;22(1):29–42. doi: 10.1080/19475683.2015.1085437 - DOI
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
