Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Dec 3;96(12):4935-4943.
doi: 10.1093/jas/sky359.

Prediction of slaughter age in pigs and assessment of the predictive value of phenotypic and genetic information using random forest

Affiliations

Prediction of slaughter age in pigs and assessment of the predictive value of phenotypic and genetic information using random forest

Ahmad Alsahaf et al. J Anim Sci. .

Abstract

The weight of a pig and the rate of its growth are key elements in pig production. In particular, predicting future growth is extremely useful, since it can help in determining feed costs, pen space requirements, and the age at which a pig reaches a desired slaughter weight. However, making these predictions is challenging, due to the natural variation in how individual pigs grow, and the different causes of this variation. In this paper, we used machine learning, namely random forest (RF) regression, for predicting the age at which the slaughter weight of 120 kg is reached. Additionally, we used the variable importance score from RF to quantify the importance of different types of input data for that prediction. Data of 32,979 purebred Large White pigs were provided by Topigs Norsvin, consisting of phenotypic data, estimated breeding values (EBVs), along with pedigree and pedigree-genetic relationships. Moreover, we presented a 2-step data reduction procedure, based on random projections (RPs) and principal component analysis (PCA), to extract features from the pedigree and genetic similarity matrices for use as inputs in the prediction models. Our results showed that relevant phenotypic features were the most effective in predicting the output (age at 120 kg), explaining approximately 62% of its variance (i.e., R2 = 0.62). Estimated breeding value, pedigree, or pedigree-genetic features interchangeably explain 2% of additional variance when added to the phenotypic features, while explaining, respectively, 38%, 39%, and 34% of the variance when used separately.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The proportion of explained variance (EV) and accumulated EV of the first 40 principal components of An×500RP and Hn×500RP .
Figure 2.
Figure 2.
The random forest feature importance scores, when X (all input features) is used as an input matrix. The scores are normalized by the score of the most important feature.
Figure 3.
Figure 3.
The performance of random forest regression for different input matrices, measured by the R2, and averaged over 10 test subsets of 10-fold cross-validation. The proportion of the colors within each bar represents the relative accumulated importance of the input matrix that the color represents: Xph (light gray), XEBV (dark gray), XP (dashed lines), XG (cross-pattern).

References

    1. Anaraki F. P., and Hughes S.. 2014. Memory and computation efficient PCA via very 371 sparse random projections. In International Conference on Machine Learning 2014 Jan 27 372 (pp. 1341–1349).
    1. Alsahaf A., Azzopardi G., Ducro B., Veerkamp R. F., and Petkov N.. 2018. Assigning pigs to uniform target weight groups using machine learning. Proceedings of the World Congress on Genetics Applied to Livestock Production, vol. Species - Porcine1:112.
    1. Apichottanakul A., Pathumnakul S., and Piewthongngam K.. 2012. The role of pig size prediction in supply chain planning. Biosyst. Eng. 113:298–307. doi: 10.1016/j.biosystemseng.2012.07.008
    1. Avogadri R., and Valentini G.. 2009. Fuzzy ensemble clustering based on random projections for DNA microarray data analysis. Artif. Intell. Med. 45:173–183. doi:10.1016/j.artmed.2008.07.014 - PubMed
    1. Bingham E., and Mannila H.. 2001. Random projection in dimensionality reduction. Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD 01. doi:10.1145/502512.502546

LinkOut - more resources