Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jun 16:2023:62-70.
eCollection 2023.

High Resolution and Spatiotemporal Place-Based Computable Exposures at Scale

Affiliations

High Resolution and Spatiotemporal Place-Based Computable Exposures at Scale

Erika Rasnick et al. AMIA Jt Summits Transl Sci Proc. .

Abstract

Place-based exposures, termed "geomarkers", are powerful determinants of health but are often understudied because of a lack of open data and integration tools. Existing DeGAUSS (Decentralized Geomarker Assessment for Multisite Studies) software has been successfully implemented in multi-site studies, ensuring reproducibility and protection of health information. However, DeGAUSS relies on transporting geomarker data, which is not feasible for high-resolution spatiotemporal data too large to store locally or download over the internet. We expanded the DeGAUSS framework for high-resolution spatiotemporal geomarkers. Our approach stores data subsets based on coarsened location and year in an online repository, and appropriate subsets are downloaded to complete exposure assessment locally using exact date and location. We created and validated two free and open-source DeGAUSS containers for estimation of high-resolution, daily ambient air pollutant exposures, transforming published exposure assessment models into computable exposures for geomarker assessment at scale.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
An overview of the DeGAUSS approach used for large spatiotemporal data. Input start and end dates are expanded into a daily time series for each row in the input data. The calendar year and a coarsened geohash are used to download a spatiotemporal subset of exposure estimates. The files are cached locally, and exposure estimates are merged in based on exact date and location of the input data. Spatiotemporal pseudo-identifiers are removed by the user and exposures can be optionally averaged over specific time periods to reduce the risk of reidentification if sharing data.
Figure 2.
Figure 2.
(Left) S2 and H3 geohashes covering the contiguous United States colored by estimated population. (Right) “Safe Harbor” geohashes aggregated to prevent any single geohash from containing less than 20,000 estimated individuals. Aggregated geohashes are outlined in red.
Figure 3.
Figure 3.
System Usability Survey Results. In the four listed questions, respondents summarized their experience using DeGAUSS.

References

    1. Vineis P. A self-fulfilling prophecy: Are we underestimating the role of the environment in gene– environment interaction research? International Journal of Epidemiology. 2004;33(5):945–6. - PubMed
    1. Thessen AE, Grondin CJ, Kulkarni RD, Brander S, Truong L, Vasilevsky NA, et al. Community approaches for integrating environmental exposures into human models of disease. Environmental Health Perspectives. 2020;128(12):125002. - PMC - PubMed
    1. Martin Sanchez F, Gray K, Bellazzi R, Lopez-Campos G. Exposome informatics: Considerations for the design of future biomedical research information systems. Journal of the American Medical Informatics Association. 2013;21(3):386–90. - PMC - PubMed
    1. Wilkinson MD, Dumontier M, Aalbersberg IJ, Appleton G, Axton M, Baak A, et al. The fair guiding principles for scientific data management and stewardship. Sci Data. 2016;3:160018. - PMC - PubMed
    1. Brokamp C, Wolfe C, Lingren T, Harley J, Ryan P. Decentralized and reproducible geocoding and characterization of community and environmental exposures for multisite studies. Journal of the American Medical Informatics Association. 2017;25(3):309–14. - PMC - PubMed

LinkOut - more resources