Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005 Sep 15:4:19.
doi: 10.1186/1476-069X-4-19.

Cluster detection methods applied to the Upper Cape Cod cancer data

Affiliations

Cluster detection methods applied to the Upper Cape Cod cancer data

Al Ozonoff et al. Environ Health. .

Abstract

Background: A variety of statistical methods have been suggested to assess the degree and/or the location of spatial clustering of disease cases. However, there is relatively little in the literature devoted to comparison and critique of different methods. Most of the available comparative studies rely on simulated data rather than real data sets.

Methods: We have chosen three methods currently used for examining spatial disease patterns: the M-statistic of Bonetti and Pagano; the Generalized Additive Model (GAM) method as applied by Webster; and Kulldorff's spatial scan statistic. We apply these statistics to analyze breast cancer data from the Upper Cape Cancer Incidence Study using three different latency assumptions.

Results: The three different latency assumptions produced three different spatial patterns of cases and controls. For 20 year latency, all three methods generally concur. However, for 15 year latency and no latency assumptions, the methods produce different results when testing for global clustering.

Conclusion: The comparative analyses of real data sets by different statistical methods provides insight into directions for further research. We suggest a research program designed around examining real data sets to guide focused investigation of relevant features using simulated data, for the purpose of understanding how to interpret statistical methods applied to epidemiological data with a spatial component.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Breast cancer cases and controls. Distribution of breast cancer cases (in red) and controls (in blue). Each point represents the residence of one participant. Locations in this map have been geographically altered to preserve confidentiality. Actual residences were used in the analysis.
Figure 2
Figure 2
Breast cancer 20 year latency (GAM). Breast cancer 20 year latency, GAM smoothed rate map. Solid lines delineate areas where the point-wise GAM deviance statistic is less than 0.05.
Figure 3
Figure 3
Breast cancer 20 year latency (scan). Breast cancer 20 year latency, scan statistic most likely cluster. Estimated relative risk for the indicated cluster is 0.823.
Figure 4
Figure 4
Breast cancer 15 year latency (GAM). Breast cancer 15 year latency.
Figure 5
Figure 5
Breast cancer 15 year latency (scan). Breast cancer 15 year latency. RR = 4.629.
Figure 6
Figure 6
Breast cancer no latency (GAM). Breast cancer no latency.
Figure 7
Figure 7
Breast cancer no latency (scan). Breast cancer no latency. RR = 0.453.

References

    1. Lawson A, Biggeri A, Böhning D, Lesaffre E, Viel JF, Bertollini R. Disease Mapping and Risk Assessment for Public Health. Wiley; 1999.
    1. Forsberg L. A review of clustering literature for public health. [Preprint]
    1. Kulldorff M, Tango T, Park PJ. Power comparisons for disease clustering tests. Comp Stat Data Anal. 2003;42:665–684. doi: 10.1016/S0167-9473(02)00160-3. - DOI
    1. Ozonoff A, Bonetti M, Forsberg L, Pagano M. Power comparisons for an improved disease clustering test. Comp Stat Data Anal. 2005;48:679–684. doi: 10.1016/j.csda.2004.03.012. - DOI
    1. Bonetti M, Pagano M. The interpoint distance distribution as a descriptor of point patterns: An application to cluster detection. Stat Med. - PubMed

Publication types