Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun 27:13:921057.
doi: 10.3389/fpls.2022.921057. eCollection 2022.

An Industrial-Grade Solution for Crop Disease Image Detection Tasks

Affiliations

An Industrial-Grade Solution for Crop Disease Image Detection Tasks

Guowei Dai et al. Front Plant Sci. .

Abstract

Crop leaf diseases can reflect the current health status of the crop, and the rapid and automatic detection of field diseases has become one of the difficulties in the process of industrialization of agriculture. In the widespread application of various machine learning techniques, recognition time consumption and accuracy remain the main challenges in moving agriculture toward industrialization. This article proposes a novel network architecture called YOLO V5-CAcT to identify crop diseases. The fast and efficient lightweight YOLO V5 is chosen as the base network. Repeated Augmentation, FocalLoss, and SmoothBCE strategies improve the model robustness and combat the positive and negative sample ratio imbalance problem. Early Stopping is used to improve the convergence of the model. We use two technical routes of model pruning, knowledge distillation and memory activation parameter compression ActNN for model training and identification under different hardware conditions. Finally, we use simplified operators with INT8 quantization for further optimization and deployment in the deep learning inference platform NCNN to form an industrial-grade solution. In addition, some samples from the Plant Village and AI Challenger datasets were applied to build our dataset. The average recognition accuracy of 94.24% was achieved in images of 59 crop disease categories for 10 crop species, with an average inference time of 1.563 ms per sample and model size of only 2 MB, reducing the model size by 88% and the inference time by 72% compared with the original model, with significant performance advantages. Therefore, this study can provide a solid theoretical basis for solving the common problems in current agricultural disease image detection. At the same time, the advantages in terms of accuracy and computational cost can meet the needs of agricultural industrialization.

Keywords: activate quantitative; convolutional neural network; crop disease detection; knowledge distillation; model compression; model deployment.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Technical route diagram.
Figure 2
Figure 2
Crop disease YOLOV5 network structure diagram.
Figure 3
Figure 3
FPGM pruning process.
Figure 4
Figure 4
Step of knowledge distillation.
Figure 5
Figure 5
ActNN compression and decompression process.
Figure 6
Figure 6
Integration of YOLO V5 model structure using ActNN.
Figure 7
Figure 7
layer fusion and data reuse.
Figure 8
Figure 8
Sample images of the PFD dataset.
Figure 9
Figure 9
Confidence accuracy curve and recall accuracy curve. (A) Precision-confidence curves of the YOLO V5s-CAcT3 model on the PFD dataset. (B) PR curves of YOLO V5s-CAcT3 model on the PFD dataset.
Figure 10
Figure 10
(A) AI Challenger accuracy, parameter drop, and sparsity curves vs. (B) PFD accuracy, parameter drop, and sparsity curves.
Figure 11
Figure 11
Histogram of model weight change during sparse training.
Figure 12
Figure 12
Effect of temperature parameters on knowledge distillation.
Figure A1
Figure A1
Confusion matrix for the predictions of the best model trained on the PFD dataset.
Figure A2
Figure A2
Prediction of the best model trained on the PFD dataset and live labeling of the sample images.
Figure A3
Figure A3
Prediction of the best model trained on the PFD dataset and live labeling of the sample images.

References

    1. Afzaal H., Farooque A. A., Schumann A. W., Hussain N., McKenzie-Gopsill A., Esau T., et al. . (2021). Detection of a potato disease (early blight) using artificial intelligence. Remote Sens. 13, 411. 10.3390/rs13030411 - DOI - PubMed
    1. Al-Wesabi F. N., Albraikan A. A., Hilal A. M., Eltahir M. M., Hamza M. A., Zamani A. S. (2022). Artificial intelligence enabled apple leaf disease classification for precision agriculture. Comput. Mat. Contin. 70, 6223–6238. 10.32604/cmc.2022.021299 - DOI
    1. Bakurov I., Buzzelli M., Schettini R., Castelli M., Vanneschi L. (2022). Structural similarity index (SSIM) revisited: a data-driven approach. Expert Syst. Appl. 189, 116087. 10.1016/j.eswa.2021.116087 - DOI
    1. Bochkovskiy A., Wang C.-Y., Liao H.-Y. M. (2020). YOLOv4: optimal speed and accuracy of object detection. ArXiv:200410934 [Cs, Eess]. 10.48550/arXiv.2004.10934 - DOI
    1. Chakrabarti A., Moseley B. (2019). Backprop with approximate activations for memory-efficient network training, in Advances in Neural Information Processing Systems, Vol. 32 (Curran Associates Inc). Available online at: https://proceedings.neurips.cc/paper/2019/hash/30c8e1ca872524fbf7ea5c519...