Advancing Air Pollution Exposure Models with Open-Vocabulary Object Detection and Semantic Segmentation of Street-View Images

Zhendong Yuan¹, Jules Kerckhoffs¹, Pi-I Debby Lin², Esra Suel³, Hao Li⁴, Li Yi², Marcia Pescador Jimenez⁵, Peter James^{6

7}, Kees de Hoogh^{8

9}, Gerard Hoek¹, Roel Vermeulen^{1

10}

Affiliations

¹ Institute for Risk Assessment Sciences, Utrecht University, Utrecht 3584 CM, Netherlands.
² Division of Chronic Disease Research Across the Lifecourse (CoRAL), Department of Population Medicine, Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, Massachusetts 02215, United States.
³ Centre for Advanced Spatial Analysis (CASA), University College London, London W1T4TJ, United Kingdom.
⁴ Department of Geography, National University of Singapore, Singapore 119077, Singapore.
⁵ Department of Epidemiology, Boston University School of Public Health, Boston, Massachusetts 02118, United States.
⁶ Department of Environmental Health, Harvard TH Chan School of Public Health, Harvard University, Boston, Massachusetts 02115, United States.
⁷ Division of Environmental and Occupational Health, Department of Public Health Sciences, Davis School of Medicine, University of California, Davis, California 95616, United States.
⁸ Swiss Tropical and Public Health Institute, Allschwil 4123, Switzerland.
⁹ University of Basel, 4001 Basel, Switzerland.
¹⁰ Julius Centre for Health Sciences and Primary Care, University Medical Centre, Utrecht University, Utrecht 3584CX, Netherlands.

PMID: 41014621
DOI: 10.1021/acs.est.5c09687

Advancing Air Pollution Exposure Models with Open-Vocabulary Object Detection and Semantic Segmentation of Street-View Images

Zhendong Yuan et al. Environ Sci Technol. 2025.

. 2025 Sep 27.

doi: 10.1021/acs.est.5c09687. Online ahead of print.

Authors

Zhendong Yuan¹, Jules Kerckhoffs¹, Pi-I Debby Lin², Esra Suel³, Hao Li⁴, Li Yi², Marcia Pescador Jimenez⁵, Peter James^{6

7}, Kees de Hoogh^{8

9}, Gerard Hoek¹, Roel Vermeulen^{1

10}

Affiliations

¹ Institute for Risk Assessment Sciences, Utrecht University, Utrecht 3584 CM, Netherlands.
² Division of Chronic Disease Research Across the Lifecourse (CoRAL), Department of Population Medicine, Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, Massachusetts 02215, United States.
³ Centre for Advanced Spatial Analysis (CASA), University College London, London W1T4TJ, United Kingdom.
⁴ Department of Geography, National University of Singapore, Singapore 119077, Singapore.
⁵ Department of Epidemiology, Boston University School of Public Health, Boston, Massachusetts 02118, United States.
⁶ Department of Environmental Health, Harvard TH Chan School of Public Health, Harvard University, Boston, Massachusetts 02115, United States.
⁷ Division of Environmental and Occupational Health, Department of Public Health Sciences, Davis School of Medicine, University of California, Davis, California 95616, United States.
⁸ Swiss Tropical and Public Health Institute, Allschwil 4123, Switzerland.
⁹ University of Basel, 4001 Basel, Switzerland.
¹⁰ Julius Centre for Health Sciences and Primary Care, University Medical Centre, Utrecht University, Utrecht 3584CX, Netherlands.

PMID: 41014621
DOI: 10.1021/acs.est.5c09687

Abstract

Mobile monitoring campaigns combined with land use regression (LUR) models effectively capture fine-scale spatial variations in urban air pollution. However, traditional predictor variables often fail to capture the nuances of the built environment and undocumented emission sources. To address this, we developed a framework integrating customizable object-level and segmentation-level visual features from street-view images into stepwise regression and random-forest-based LUR models. Using 5.7 million mobile air pollution measurements (2019-2020) and 0.37 million street-view images (2008-2024), we mapped nitrogen dioxide (NO₂), black carbon (BC), and ultrafine particles (UFP) across 46,664 road segments in Amsterdam, The Netherlands. Incorporating street-view images improved model performance, increasing R² by 0.01-0.05 and reducing mean absolute errors by 0.7-10.3%. Sensitivity analyses indicated that key street-view-derived visual features remained stable across years and seasons. Using images from nearby years expanded training instances, thereby enhancing alignment with mobile measurements at fine granularity. Our open-vocabulary object detection module identified influential but previously unrecognized object predictors, such as chimneys, traffic lights, and shops. Combined with segmentation-derived features (e.g., walls, roads, grass), street-view images contributed 8-18% feature importance to model predictions. These findings highlight the potential of visual data in enhancing hyperlocal air pollution mapping and exposure assessment.

Keywords: air pollution; deep learning; exposure assessment; land use regression (LUR); mobile sensing; street-view image; vision-language model (VLM); vision-transformer models (ViT).

PubMed Disclaimer

LinkOut - more resources

Full Text Sources
- American Chemical Society

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Advancing Air Pollution Exposure Models with Open-Vocabulary Object Detection and Semantic Segmentation of Street-View Images

Affiliations

Advancing Air Pollution Exposure Models with Open-Vocabulary Object Detection and Semantic Segmentation of Street-View Images

Authors

Affiliations

Abstract

LinkOut - more resources

Full Text Sources