Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2017 Aug 24;12(8):e0183626.
doi: 10.1371/journal.pone.0183626. eCollection 2017.

Species distribution models: A comparison of statistical approaches for livestock and disease epidemics

Affiliations
Comparative Study

Species distribution models: A comparison of statistical approaches for livestock and disease epidemics

Tracey Hollings et al. PLoS One. .

Abstract

In livestock industries, reliable up-to-date spatial distribution and abundance records for animals and farms are critical for governments to manage and respond to risks. Yet few, if any, countries can afford to maintain comprehensive, up-to-date agricultural census data. Statistical modelling can be used as a proxy for such data but comparative modelling studies have rarely been undertaken for livestock populations. Widespread species, including livestock, can be difficult to model effectively due to complex spatial distributions that do not respond predictably to environmental gradients. We assessed three machine learning species distribution models (SDM) for their capacity to estimate national-level farm animal population numbers within property boundaries: boosted regression trees (BRT), random forests (RF) and K-nearest neighbour (K-NN). The models were built from a commercial livestock database and environmental and socio-economic predictor data for New Zealand. We used two spatial data stratifications to test (i) support for decision making in an emergency response situation, and (ii) the ability for the models to predict to new geographic regions. The performance of the three model types varied substantially, but the best performing models showed very high accuracy. BRTs had the best performance overall, but RF performed equally well or better in many simulations; RFs were superior at predicting livestock numbers for all but very large commercial farms. K-NN performed poorly relative to both RF and BRT in all simulations. The predictions of both multi species and single species models for farms and within hypothetical quarantine zones were very close to observed data. These models are generally applicable for livestock estimation with broad applications in disease risk modelling, biosecurity, policy and planning.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Spatial variogram of farm-level RMSPE, using Euclidean distance between farm centroids.
The solid line gives the variogram of the observed data, with the dotted lines indicating the 95% confidence intervals of the null distribution obtained by random permutation of RMSPE between the farm locations.
Fig 2
Fig 2. Maps of the actual livestock units (LSU) and the modelled region level livestock predictions for three regions.
Maps show results from Random Forests and Boosted Regression Trees for Waikato, Canterbury and Wellington for region predictions. The gradient colour axis is on a log scale for display purposes. KNN results are not shown due the extremely poor fit of the models relative to other model types.
Fig 3
Fig 3. Maps of the actual livestock units (LSU) and the modelled zone level livestock predictions for three regions.
Maps show results from Random Forests and Boosted Regression Trees for Waikato, Canterbury and Wellington for zone level predictions. The gradient colour axis is on a log scale for display purposes.
Fig 4
Fig 4. Scatterplot of the observed against the predicted farm-level.
Results are plotted by region. Total quarantine zone LSU was calculated by summing the observed and predicted values for all farms within the 3km quarantine zone. Cattle results are not shown as plots they are a subset of LSU.
Fig 5
Fig 5. Scatterplot of the observed against the predicted total quarantine zone LSU.
Results are plotted by region. Total quarantine zone LSU was calculated by summing the observed and predicted values for all farms within the 3km quarantine zone. Cattle results are not shown as plots they are a subset of LSU.
Fig 6
Fig 6. Relative influence of predictor variables from random forest models for LSU and cattle.
Plots show the increase in MSE of predictions if the variable of interest were randomly permuted. See S1 Table for full variable names and descriptions.

References

    1. Robinson TP, Franceschini G, Wint W. The Food and Agriculture Organization’s gridded livestock of the world. Veterinaria Italiana. 2007;43(3):745–51. - PubMed
    1. Prosser DJ, Wu J, Ellis EC, Gale F, Van Boeckel TP, Wint W, et al. Modelling the distribution of chickens, ducks, and geese in China. Agriculture, Ecosystems & Environment. 2011;141(3):381–9. - PMC - PubMed
    1. Robinson TP, Wint GW, Conchedda G, Van Boeckel TP, Ercoli V, Palamara E, et al. Mapping the Global Distribution of Livestock. PloS One. 2014;9(5):e96084 doi: 10.1371/journal.pone.0096084 - DOI - PMC - PubMed
    1. Levine JM D 'Antonio CM. Forecasting biological invasions with increasing international trade. Conservation Biology. 2003;17(1):322–6.
    1. Anderson I. Foot and mouth disease 2001: Lessons to be learned inquiry report. London, UK.: House of Commons, 2002 Contract No.: 7 September.

Publication types