YOLOv8s-Longan: a lightweight detection method for the longan fruit-picking UAV
- PMID: 39911656
- PMCID: PMC11794187
- DOI: 10.3389/fpls.2024.1518294
YOLOv8s-Longan: a lightweight detection method for the longan fruit-picking UAV
Abstract
Introduction: Due to the limited computing power and fast flight speed of the picking of unmanned aerial vehicles (UAVs), it is important to design a quick and accurate detecting algorithm to obtain the fruit position.
Methods: This paper proposes a lightweight deep learning algorithm, named YOLOv8s-Longan, to improve the detection accuracy and reduce the number of model parameters for fruitpicking UAVs. To make the network lightweight and improve its generalization performance, the Average and Max pooling attention (AMA) attention module is designed and integrated into the DenseAMA and C2f-Faster-AMA modules on the proposed backbone network. To improve the detection accuracy, a crossstage local network structure VOVGSCSPC module is designed, which can help the model better understand the information of the image through multiscale feature fusion and improve the perception and expression ability of the model. Meanwhile, the novel Inner-SIoU loss function is proposed as the loss function of the target bounding box.
Results and discussion: The experimental results show that the proposed algorithm has good detection ability for densely distributed and mutually occluded longan string fruit under complex backgrounds with a mAP@0.5 of 84.3%. Compared with other YOLOv8 models, the improved model of mAP@0.5 improves by 3.9% and reduces the number of parameters by 20.3%. It satisfies the high accuracy and fast detection requirements for fruit detection in fruit-picking UAV scenarios.
Keywords: YOLOv8-Longan network; attention mechanism; lightweight network; longan; target detection.
Copyright © 2025 Li, Wu, Zhang, Chen, Lin, Mai and Shi.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures















References
-
- Cai L., Li H., Dong W., Fang H. (2022). Micro-expression recognition using 3d densenet fused squeeze-and-excitation networks. Appl. Soft Comput. 119. doi: 10.1016/j.asoc.2022.108594 - DOI
-
- Chen W., Liu M., Zhao C., Li X., Wang Y. (2024. b). Mtd-yolo: Multi-task deep convolutional neural network for cherry tomato fruit bunch maturity detection. Comput. Electron. Agric. 216, 108533. doi: 10.1016/j.compag.2023.108533 - DOI
-
- Dairath M. H., Akram M. W., Mehmood M. A., Sarwar H. U., Akram M. Z., Omar M. M., et al. . (2023). Computer vision-based prototype robotic picking cum grading system for fruits. Smart Agric. Technol. 4, 100210. doi: 10.1016/j.atech.2023.100210 - DOI
-
- Ding Y., Zhang Z., Hu H., He F., Cheng S., Zhang Y. (2024). “Multi-feature fusion: Graph neural network and cnn combining for hyperspectral image classification,” In: Graph Neural Network for Feature Extraction and Classification of Hyperspectral Remote Sensing Images. (Springer, Singapore: ).
LinkOut - more resources
Full Text Sources