Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2025 Apr 12;25(8):2433.
doi: 10.3390/s25082433.

A Comprehensive Review of Deep Learning in Computer Vision for Monitoring Apple Tree Growth and Fruit Production

Affiliations
Review

A Comprehensive Review of Deep Learning in Computer Vision for Monitoring Apple Tree Growth and Fruit Production

Meng Lv et al. Sensors (Basel). .

Abstract

The high nutritional and medicinal value of apples has contributed to their widespread cultivation worldwide. Unfavorable factors in the healthy growth of trees and extensive orchard work are threatening the profitability of apples. This study reviewed deep learning combined with computer vision for monitoring apple tree growth and fruit production processes in the past seven years. Three types of deep learning models were used for real-time target recognition tasks: detection models including You Only Look Once (YOLO) and faster region-based convolutional network (Faster R-CNN); classification models including Alex network (AlexNet) and residual network (ResNet); segmentation models including segmentation network (SegNet), and mask regional convolutional neural network (Mask R-CNN). These models have been successfully applied to detect pests and diseases (located on leaves, fruits, and trunks), organ growth (including fruits, apple blossoms, and branches), yield, and post-harvest fruit defects. This study introduced deep learning and computer vision methods, outlined in the current research on these methods for apple tree growth and fruit production. The advantages and disadvantages of deep learning were discussed, and the difficulties faced and future trends were summarized. It is believed that this research is important for the construction of smart apple orchards.

Keywords: apple tree growth; computer vision; fruit production; smart orchard; target recognition.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
Summary of literature searches for monitoring apple tree growth and fruit yield based on deep learning and computer vision techniques.
Figure 2
Figure 2
The PMD for identification of pests. (a) The structure of the EfficientDet neural network. (b) The PMD. (c) Automatic counting results for PMD models (red border box—detecting classe malifoliella, green border box—detecting classes other insects) [27].
Figure 3
Figure 3
Accurate localization and identification of apple leaf diseases using YOLOV5-CBAM-C3TR. (A) The overall architecture of YOLOV5-CBAM-C3TR. (B) The detailed structure of the added optimization module: (1) CBAM, (2) Transformer. (C) Comparison of recognition results of different models in real scenes: (3) original image, (4) recognition results using YOLOV5, and (5) recognition results using YOLOV5-CBAM-C3TR [39].
Figure 4
Figure 4
The improved YOLOV7 model for identifying apple blossoms. (a) Parameter configuration and overall structure diagram of the improved YOLOV7 model. (b) Comparison of the accuracy of different models for detecting apple blossoms using thermodynamic force maps [57].
Figure 5
Figure 5
Counting apple production in the orchard using YOLOV7 + MAM. (A) The specific architecture of the YOLOV7 + MAM model. (B) Performance demonstration of YOLOV7 + MAM model for apple fruit recognition: (1) the original image in orchard, (2) the image of YOLOV7 + MAM model for identification apples in orchard. (C) The apple tracking results of the model for different video frames: (3) apple tracking results for Video ID1 using YOLOV7 + MAM + ByteTrack, (4) apple tracking results for Video ID2 using YOLOV7 + MAM + ByteTrack, (5) apple tracking results for Video ID3 using YOLOV7 + MAM + ByteTrack [75].
Figure 6
Figure 6
(A) Flowchart of apple fruit defect detection using ASDINet. (B) The comparison of AP metrics versus time with other SOTA models. (C) The comparison of loss metrics with other SOTA models. (D) Class activation map: (1) original image, (2) recognition results of apple fruit defects by U-Net [85].

Similar articles

References

    1. De la Peña-Armada R., Mateos-Aparicio I. Sustainable Approaches Using Green Technologies for Apple By-Product Valorisation as A New Perspective into the History of the Apple. Molecules. 2022;27:6937. doi: 10.3390/molecules27206937. - DOI - PMC - PubMed
    1. Wang W., Yang J., Xiao J., Li S., Zhou D. Face Recognition Based on Deep Learning; Proceedings of the International Conference on Human Centered Computing; Phnom Penh, Cambodia. 27–29 November 2014; pp. 812–820. - DOI
    1. Yang D., Li S., Peng Z., Wang P., Wang J., Yang H. MF-CNN: Traffic Flow Prediction Using Convolutional Neural Network and Multi-Features Fusion. IEICE Trans. Inf. Syst. 2019;102:1526–1536. doi: 10.1587/transinf.2018EDP7330. - DOI
    1. Doulamis N., Voulodimos A. FAST-MDL: Fast Adaptive Supervised Training of Multi-Layered Deep Learning Models for Consistent Object Tracking and Classification; Proceedings of the 2016 IEEE International Conference on Imaging Systems and Techniques (IST); Chania, Greece. 4–6 October 2016; pp. 318–323. - DOI
    1. Toshev A., Szegedy C. DeepPose: Human Pose Estimation via Deep Neural Networks; Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition; Columbus, OH, USA. 23–28 June 2014; pp. 1653–1660. - DOI

MeSH terms

LinkOut - more resources