Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021;35(8):1701-1713.
doi: 10.1007/s00477-020-01965-z. Epub 2021 Jan 3.

The impact of modelling choices on modelling outcomes: a spatio-temporal study of the association between COVID-19 spread and environmental conditions in Catalonia (Spain)

Affiliations

The impact of modelling choices on modelling outcomes: a spatio-temporal study of the association between COVID-19 spread and environmental conditions in Catalonia (Spain)

Álvaro Briz-Redón. Stoch Environ Res Risk Assess. 2021.

Abstract

The choices that researchers make while conducting a statistical analysis usually have a notable impact on the results. This fact has become evident in the ongoing research of the association between the environment and the evolution of the coronavirus disease 2019 (COVID-19) pandemic, in light of the hundreds of contradictory studies that have already been published on this issue in just a few months. In this paper, a COVID-19 dataset containing the number of daily cases registered in the regions of Catalonia (Spain) since the start of the pandemic to the end of August 2020 is analysed using statistical models of diverse levels of complexity. Specifically, the possible effect of several environmental variables (solar exposure, mean temperature, and wind speed) on the number of cases is assessed. Thus, the first objective of the paper is to show how the choice of a certain type of statistical model to conduct the analysis can have a severe impact on the associations that are inferred between the covariates and the response variable. Secondly, it is shown how the use of spatio-temporal models accounting for the nature of the data allows understanding the evolution of the pandemic in space and time. The results suggest that even though the models fitted to the data correctly capture the evolution of COVID-19 in space and time, determining whether there is an association between the spread of the pandemic and certain environmental conditions is complex, as it is severely affected by the choice of the model.

Keywords: COVID-19; Environmental covariates; Integrated nested Laplace approximation; Relative risk; Space-time interaction; Spatio-temporal models.

PubMed Disclaimer

Conflict of interest statement

Conflict of interestThe author declare that they have no conflict of interest.

Figures

Fig. 1
Fig. 1
Map of peninsular Spain at the province level (a) and map of Catalonia at the region level (b). In a, the four provinces of Catalonia are highlighted. In b, the region of Barcelonès, where the capital city of Catalonia (Barcelona) is located, is also highlighted
Fig. 2
Fig. 2
Histograms of the PIT scores obtained for Models 3 to 8 (from left to right), considering a 0-day, a 7-day, and a 14-day lagged effect (from top to bottom) on the covariates
Fig. 3
Fig. 3
Summary of the estimates obtained for the coefficients associated with environmental covariates for each of the 12 models fitted, considering a lagged effect on the covariates of 0, 7, or 14 days
Fig. 4
Fig. 4
Relative risks on a weekly and a daily basis according to the structured and unstructured temporal random effects estimated through Models 3 (a) and 4 (b). The relative risk corresponding to the structured component is computed as either exp(γw(t)) or exp(γt), whereas the one corresponding to the unstructured component is computed as either exp(ϕw(t)) or exp(ϕt)
Fig. 5
Fig. 5
Global relative risks at the region level estimated for the period under study (computed as exp(ui+vi)) considering Model 3 (a) and Model 4 (b)
Fig. 6
Fig. 6
Relative risks at the region level (computed as exp(ui+vi+γt+ϕt+δit)) estimated for a selection of days within the period under study with Model 9
Fig. 7
Fig. 7
Evolution of the relative risks (computed as exp(ui+vi+γt+ϕt+δit)), according to the estimates provided by Model 9 in the six regions of Catalonia with the highest global relative risks (according to the estimates provided by Models 3 and 4). To make this plot, the relative risks provided by Model 9 have been smoothed through a locally estimated scatterplot smoothing (LOESS) regression (Fox and Weisberg 2018) for ease of visualisation and interpretation

Similar articles

Cited by

References

    1. Arauzo-Carod J-M (2020) A first insight about spatial dimension of COVID-19: analysis at municipality level. J Publ Health, fdaa140 - PMC - PubMed
    1. Baker RE, Yang W, Vecchi GA, Metcalf CJE, Grenfell BT (2020) Susceptible supply limits the role of climate in the early SARS-CoV-2 pandemic. Science - PMC - PubMed
    1. Besag J, York J, Mollié A. Bayesian image restoration, with two applications in spatial statistics. Ann Inst Stat Math. 1991;43(1):1–20. doi: 10.1007/BF00116466. - DOI
    1. Bivand RS, Pebesma EJ, Gomez-Rubio V, Pebesma EJ. Applied spatial data analysis with R. Berlin: Springer; 2008.
    1. Bivand R, Keitt T, Rowlingson B (2019). rgdal: bindings for the ‘Geospatial’ Data Abstraction Library. R package version 1.4-6

LinkOut - more resources