Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr 6;192(4):644-657.
doi: 10.1093/aje/kwac220.

Bias Amplification and Variance Inflation in Distributed Lag Models Using Low-Spatial-Resolution Data

Bias Amplification and Variance Inflation in Distributed Lag Models Using Low-Spatial-Resolution Data

Michael Leung et al. Am J Epidemiol. .

Abstract

Distributed lag models (DLMs) are often used to estimate lagged associations and identify critical exposure windows. In a simulation study of prenatal nitrogen dioxide (NO2) exposure and birth weight, we demonstrate that bias amplification and variance inflation can manifest under certain combinations of DLM estimation approaches and time-trend adjustment methods when using low-spatial-resolution exposures with extended lags. Our simulations showed that when using high-spatial-resolution exposure data, any time-trend adjustment method produced low bias and nominal coverage for the distributed lag estimator. When using either low- or no-spatial-resolution exposures, bias due to time trends was amplified for all adjustment methods. Variance inflation was higher in low- or no-spatial-resolution DLMs when using a long-term spline to adjust for seasonality and long-term trends due to concurvity between a distributed lag function and secular function of time. NO2-birth weight analyses in a Massachusetts-based cohort showed that associations were negative for exposures experienced in gestational weeks 15-30 when using high-spatial-resolution DLMs; however, associations were null and positive for DLMs with low- and no-spatial-resolution exposures, respectively, which is likely due to bias amplification. DLM analyses should jointly consider the spatial resolution of exposure data and the parameterizations of the time trend adjustment and lag constraints.

Keywords: Z-bias; air pollution; bias amplification; birth weight; concurvity; distributed lag models; spatial resolution; variance inflation.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Continues
Figure 1
Figure 1
The time-varying association between weekly nitrogen dioxide (NO2) exposure and birth weight in term deliveries at Beth Israel Deaconess Medical Center (n = 46,153), Boston, Massachusetts, 2000–2016. Estimates were made using natural spline–based distributed lag models (ns-DLMs) (A) and tree-based distributed lag models (TDLMs) (B) under scenarios with 3 different spatial resolutions for exposure (high, low, and no resolution) and 6 methods to adjust for time trends (no adjustment, long-term spline, year and spline, year and harmonics, year and month, and year and season). Black solid lines show the lag-response estimates, gray shaded areas show the 95% confidence intervals for ns-DLMs and 95% Bayesian credible intervals for TDLMs, and the black dashed line indicates the null hypothesis of no effect across all weeks. Web Figure 3 is a zoomed-out version (expanded y-axis) of the no-spatial-resolution ns-DLM with a long-term spline. g, grams; ppb, parts per billion
Figure 2
Figure 2
Continues
Figure 2
Figure 2
Simulation results, showing the time-varying association between nitrogen dioxide (NO2) exposure and birth weight, of natural spline–based distributed lag models (A) and tree-based distributed lag models (B), across all weeks using 6 methods to adjust for time trends (no adjustment, long-term spline, year and spline, year and harmonics, year and month, and year and season). Simulation inputs include time trends and NO2 effects from the Massachusetts data. Gray lines show the lag-response relationships from each iteration of the simulation, the black solid line indicates the average across all simulations (500 replicates), the black dashed line indicates the null hypothesis of no effect across all weeks, and the black dotted line indicates the true simulated lag-response relationship. g, grams; ppb, parts per billion.
Figure 3
Figure 3
Continues
Figure 3
Figure 3
Coverage of lag-specific associations between nitrogen dioxide (NO2) and birth weight in simulations using natural spline–based distributed lag models (DLMs) (A) and tree-based distributed lag models (TDLMs) (B), with 6 methods to adjust for time trends (no adjustment, long-term spline, year and spline, year and harmonics, year and month, and year and season). Simulation inputs include time trends and NO2 effects from the Massachusetts data. The solid line and points indicate the proportion of simulations over 500 replicates in which the lag-specific 95% confidence interval (or credible intervals for TDLMs) contained the true simulated lag-specific effect, and the black dashed line indicates the 95% nominal coverage.

References

    1. He MZ, Kinney PL, Li T, et al. Short- and intermediate-term exposure to NO2 and mortality: a multi-county analysis in China. Environ Pollut. 2020;261:114165. - PMC - PubMed
    1. Wilson A, Chiu Y-HM, Hsu H-HL, et al. Potential for bias when estimating critical windows for air pollution in children’s health. Am J Epidemiol. 2017;186(11):1281–1289. - PMC - PubMed
    1. Kioumourtzoglou MA, Raz R, Wilson A, et al. Traffic-related air pollution and pregnancy loss. Epidemiology. 2019;30(1):4–10. - PMC - PubMed
    1. Jahn JL, Krieger N, Agénor M, et al. Gestational exposure to fatal police violence and pregnancy loss in US core based statistical areas, 2013–2015. EClinicalMedicine. 2021;36:100901. - PMC - PubMed
    1. Huang Y, Kioumourtzoglou M-A, Mittleman MA, et al. Air pollution and risk of placental abruption: a study of births in New York City, 2008–2014. Am J Epidemiol. 2021;190(6):1021–1033. - PMC - PubMed

Publication types