Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jun:49:100659.
doi: 10.1016/j.sste.2024.100659. Epub 2024 May 12.

Assessing and attenuating the impact of selection bias on spatial cluster detection studies

Affiliations

Assessing and attenuating the impact of selection bias on spatial cluster detection studies

Joseph Boyle et al. Spat Spatiotemporal Epidemiol. 2024 Jun.

Abstract

Spatial cluster analyses are commonly used in epidemiologic studies of case-control data to detect whether certain areas in a study region have an excess of disease risk. Case-control studies are susceptible to potential biases including selection bias, which can result from non-participation of eligible subjects in the study. However, there has been no systematic evaluation of the effects of non-participation on the findings of spatial cluster analyses. In this paper, we perform a simulation study assessing the effect of non-participation on spatial cluster analysis using the local spatial scan statistic under a variety of scenarios that vary the location and rates of study non-participation and the presence and intensity of a zone of elevated risk for disease for simulated case-control studies. We find that geographic areas of lower participation among controls than cases can greatly inflate false-positive rates for identification of artificial spatial clusters. Additionally, we find that even modest non-participation outside of a true zone of elevated risk can decrease spatial power to identify the true zone. We propose a spatial algorithm to correct for potentially spatially structured non-participation that compares the spatial distributions of the observed sample and underlying population. We demonstrate its ability to markedly decrease false positive rates in the absence of elevated risk and resist decreasing spatial sensitivity to detect true zones of elevated risk. We apply our method to a case-control study of non-Hodgkin lymphoma. Our findings suggest that greater attention should be paid to the potential effects of non-participation in spatial cluster studies.

Keywords: Case-control study; Epidemiology; Local spatial scan; Non-participation; Selection bias; Spatial cluster.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest There has no conflicts of interest from any authors.

Figures

Figure 1.
Figure 1.
Illustration of example case-control dataset from Spatial Risk 1, Participation N, Odds Ratio 3, with cases and controls represented by filled-in and empty dots respectively and the red region defining the zone of elevated risk.
Figure 2.
Figure 2.
Most likely cluster in Iowa study center (in red) from NCI-SEER NHL study and observed sample. Participant locations have been randomly jittered to maintain confidentiality.
Figure 3.
Figure 3.
Most likely cluster in Iowa study center (in red) from NCI-SEER NHL study and proposed method which added 10 participants to observed sample. Participant locations have been randomly jittered to maintain confidentiality.

Similar articles

References

    1. Waller LA, Gotway CA. Applied Spatial Statistics for Public Health Data. John Wiley & Sons; 2004.
    1. Hastie TJ, Tibshirani RJ. Generalized Additive Models. Routledge; 2017.
    1. Richardson S, Thomson A, Best N, Elliott P. Interpreting posterior relative risk estimates in disease-mapping studies. Environmental health perspectives. 2004;112(9):1016–1025. - PMC - PubMed
    1. Wheeler DC, De Roos AJ, Cerhan JR, et al. Spatial-temporal analysis of non-Hodgkin lymphoma in the NCI-SEER NHL case-control study. Environmental Health. 2011;10(1):1–13. - PMC - PubMed
    1. Boyle J, Ward MH, Cerhan JR, Rothman N, Wheeler DC. Estimating mixture effects and cumulative spatial risk over time simultaneously using a Bayesian index low-rank kriging multiple membership model. Statistics in Medicine. 2022;41(29):5679–5697. - PMC - PubMed

LinkOut - more resources