Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Dec 7:12:54.
doi: 10.1186/1476-072X-12-54.

Detecting cancer clusters in a regional population with local cluster tests and Bayesian smoothing methods: a simulation study

Affiliations

Detecting cancer clusters in a regional population with local cluster tests and Bayesian smoothing methods: a simulation study

Dorothea Lemke et al. Int J Health Geogr. .

Abstract

Background: There is a rising public and political demand for prospective cancer cluster monitoring. But there is little empirical evidence on the performance of established cluster detection tests under conditions of small and heterogeneous sample sizes and varying spatial scales, such as are the case for most existing population-based cancer registries. Therefore this simulation study aims to evaluate different cluster detection methods, implemented in the open source environment R, in their ability to identify clusters of lung cancer using real-life data from an epidemiological cancer registry in Germany.

Methods: Risk surfaces were constructed with two different spatial cluster types, representing a relative risk of RR = 2.0 or of RR = 4.0, in relation to the overall background incidence of lung cancer, separately for men and women. Lung cancer cases were sampled from this risk surface as geocodes using an inhomogeneous Poisson process. The realisations of the cancer cases were analysed within small spatial (census tracts, N = 1983) and within aggregated large spatial scales (communities, N = 78). Subsequently, they were submitted to the cluster detection methods. The test accuracy for cluster location was determined in terms of detection rates (DR), false-positive (FP) rates and positive predictive values. The Bayesian smoothing models were evaluated using ROC curves.

Results: With moderate risk increase (RR = 2.0), local cluster tests showed better DR (for both spatial aggregation scales > 0.90) and lower FP rates (both < 0.05) than the Bayesian smoothing methods. When the cluster RR was raised four-fold, the local cluster tests showed better DR with lower FPs only for the small spatial scale. At a large spatial scale, the Bayesian smoothing methods, especially those implementing a spatial neighbourhood, showed a substantially lower FP rate than the cluster tests. However, the risk increases at this scale were mostly diluted by data aggregation.

Conclusion: High resolution spatial scales seem more appropriate as data base for cancer cluster testing and monitoring than the commonly used aggregated scales. We suggest the development of a two-stage approach that combines methods with high detection rates as a first-line screening with methods of higher predictive ability at the second stage.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Overview of the study area (Regierungsbezirk Münster) with the modelled cluster areas. Two different spatial aggregation scales are shown: (a) the 78 communities and (b) the 1983 census tracts with the associated population density. Location of the study area in Germany (c).
Figure 2
Figure 2
Overview of the simulation process/design. In the defined cluster areas relative risks of two resp. four and in the remaining study a relative risk of one were assigned. The geocoded observed cases (Oi) were generated using an inhomogeneous Poisson process.
Figure 3
Figure 3
Spatial aggregation scales of the realised observed cases. The observed cases (Oi) were aggregated to the census tracts (a), and to the communities (b).
Figure 4
Figure 4
Averaged ROC curves of the four applied Bayesian smoothing models at census tract level. The letter indicating the different risk realizations: (a) urban cluster in the male population (RR = 2.0); (b) rural cluster the male population (RR = 2.0); (c) urban cluster in the female population (RR = 2.0); (d) rural cluster the male population (RR = 2.0); (e) urban cluster in the male population (RR = 4.0); (f) rural cluster in the male population (RR = 4.0); and (g) rural cluster in the female population (RR = 4.0).
Figure 5
Figure 5
Averaged ROC curves of the four applied Bayesian smoothing models at community level. The letter indicating the different risk realizations: (a) urban cluster in the male population (RR = 2.0); (b) rural cluster the male population (RR = 2.0); (c) urban cluster in the female population (RR = 2.0); (d) rural cluster the male population (RR = 2.0); (e) urban cluster in the male population (RR = 4.0); (f) rural cluster in the male population (RR = 4.0); and (g) rural cluster in the female population (RR = 4.0).

Similar articles

Cited by

References

    1. Monitoring in epidemiologischen Krebsregistern: Möglichkeiten – Grenzen – Risiken. http://www.krebsregister-niedersachsen.de/registerstelle/?page_id=2071.
    1. Kieschke J. In: EKN-Bericht. Epidemiologisches Krebsregister Niedersachsen, editor. Oldenburg, Germany; 2010. Auswertung des EKN zur Krebshäufigkeit in der Samtgemeinde Asse.
    1. Dreesmann J. In: Monitoring-Workshop. Niedersächsisches Landesgesundheitsamt, editor. Oldenburg, Germany; 2012. Statistische Erkennung Räumlicher Cluster- Erfahrungen aus der Infektionsepidemiologie.
    1. Olsen SF, Martuzzi M, Elliott P. Cluster analysis and disease mapping–why, when, and how? A step by step guide. BMJ. 1996;12(7061):863–866. doi: 10.1136/bmj.313.7061.863. - DOI - PMC - PubMed
    1. Waller LA, Gotway CA. Applied Spatial Statistics for Public Health Data. Hoboken, N.J.: John Wiley & Sons; 2004.

Publication types