Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011;6(8):e23903.
doi: 10.1371/journal.pone.0023903. Epub 2011 Aug 31.

Bayesian classification and regression trees for predicting incidence of cryptosporidiosis

Affiliations

Bayesian classification and regression trees for predicting incidence of cryptosporidiosis

Wenbiao Hu et al. PLoS One. 2011.

Abstract

Background: Classification and regression tree (CART) models are tree-based exploratory data analysis methods which have been shown to be very useful in identifying and estimating complex hierarchical relationships in ecological and medical contexts. In this paper, a Bayesian CART model is described and applied to the problem of modelling the cryptosporidiosis infection in Queensland, Australia.

Methodology/principal findings: We compared the results of a Bayesian CART model with those obtained using a Bayesian spatial conditional autoregressive (CAR) model. Overall, the analyses indicated that the nature and magnitude of the effect estimates were similar for the two methods in this study, but the CART model more easily accommodated higher order interaction effects.

Conclusions/significance: A Bayesian CART model for identification and estimation of the spatial distribution of disease risk is useful in monitoring and assessment of infectious diseases prevention and control.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. The best tree identified from Bayesian regression trees.
At each terminal node the mean (μ) and number of individuals (n) are displayed.
Figure 2
Figure 2. The observed spatial distribution of SEIFA, temperature, rainfall and annual average incidence rates of cryptosporidiosis.
Figure 3
Figure 3. The best tree identified from Bayesian classification trees.
At each terminal node the predicted category of presence or absence is denoted respectively by pres or abs. The two numbers directly below this are in general a/b (e.g. 16/0) which denotes the number of observed absences “a” and presences “b” that are classified into this particular node.

Similar articles

Cited by

References

    1. Meinhardt P, Casemore D, Miller K. Epidemiologic aspects of human cryptosporidiosis and the role of waterborne transmission. Epidemiol Rev. 1996;18:118–136. - PubMed
    1. Mabaso M, Vounatsou P, Midzi S, Silva J, Smith T. Spatio-temporal analysis of the role of climate in inter-annual variation of malaria incidence in Zimbabwe. Int J Health Geog. 2006;5:20. - PMC - PubMed
    1. Moore D, Carpenter T. Spatial analytical methods and geographic information systems: use in health research and epidemiology. Epidemiol Rev. 1999;21:143–161. - PubMed
    1. Anselin L. Under the hood - Issues in the specification and interpretation of spatial regression models. Agric Econo. 2002;27:247–267.
    1. Anselin L. Exploring spatial data with GeoDa: a workbook. Urbana, USA.: 2005.