Small Target-YOLOv5: Enhancing the Algorithm for Small Object Detection in Drone Aerial Imagery Based on YOLOv5
- PMID: 38202996
- PMCID: PMC10781303
- DOI: 10.3390/s24010134
Small Target-YOLOv5: Enhancing the Algorithm for Small Object Detection in Drone Aerial Imagery Based on YOLOv5
Abstract
Object detection in drone aerial imagery has been a consistent focal point of research. Aerial images present more intricate backgrounds, greater variation in object scale, and a higher occurrence of small objects compared to standard images. Consequently, conventional object detection algorithms are often unsuitable for direct application in drone scenarios. To address these challenges, this study proposes a drone object detection algorithm model based on YOLOv5, named SMT-YOLOv5 (Small Target-YOLOv5). The enhancement strategy involves improving the feature fusion network by incorporating detection layers and implementing a weighted bidirectional feature pyramid network. Additionally, the introduction of the Combine Attention and Receptive Fields Block (CARFB) receptive field feature extraction module and DyHead dynamic target detection head aims to broaden the receptive field, mitigate information loss, and enhance perceptual capabilities in spatial, scale, and task domains. Experimental validation on the VisDrone2021 dataset confirms a significant improvement in the target detection accuracy of SMT-YOLOv5. Each improvement strategy yields effective results, raising the average precision by 12.4 percentage points compared to the original method. Detection improvements for large, medium, and small targets increase by 6.9%, 9.5%, and 7.7%, respectively, compared to the original method. Similarly, applying the same improvement strategies to the low-complexity YOLOv8n results in SMT-YOLOv8n, which is comparable in complexity to SMT-YOLOv5s. The results indicate that, relative to SMT-YOLOv8n, SMT-YOLOv5s achieves a 2.5 percentage point increase in average precision. Furthermore, comparative experiments with other enhancement methods demonstrate the effectiveness of the improvement strategies.
Keywords: drone aerial imagery; dynamic object detection head; feature fusion network; receptive field feature extraction module; small objects.
Conflict of interest statement
The authors declare no conflicts of interest.
Figures














Similar articles
-
ASG-YOLOv5: Improved YOLOv5 unmanned aerial vehicle remote sensing aerial images scenario for small object detection based on attention and spatial gating.PLoS One. 2024 Jun 3;19(6):e0298698. doi: 10.1371/journal.pone.0298698. eCollection 2024. PLoS One. 2024. PMID: 38829850 Free PMC article.
-
YOLOv5_mamba: unmanned aerial vehicle object detection based on bidirectional dense feedback network and adaptive gate feature fusion.Sci Rep. 2024 Sep 27;14(1):22396. doi: 10.1038/s41598-024-73241-x. Sci Rep. 2024. PMID: 39333360 Free PMC article.
-
YOLOv5s-DSD: An Improved Aerial Image Detection Algorithm Based on YOLOv5s.Sensors (Basel). 2023 Aug 3;23(15):6905. doi: 10.3390/s23156905. Sensors (Basel). 2023. PMID: 37571688 Free PMC article.
-
Research on improved YOLOv8n based potato seedling detection in UAV remote sensing images.Front Plant Sci. 2024 May 1;15:1387350. doi: 10.3389/fpls.2024.1387350. eCollection 2024. Front Plant Sci. 2024. PMID: 38751836 Free PMC article.
-
An insulator target detection algorithm based on improved YOLOv5.Sci Rep. 2025 Jan 2;15(1):496. doi: 10.1038/s41598-024-84623-6. Sci Rep. 2025. PMID: 39747537 Free PMC article.
Cited by
-
Research on the Method of Foreign Object Detection for Railway Tracks Based on Deep Learning.Sensors (Basel). 2024 Jul 11;24(14):4483. doi: 10.3390/s24144483. Sensors (Basel). 2024. PMID: 39065881 Free PMC article.
-
SEB-YOLO: An Improved YOLOv5 Model for Remote Sensing Small Target Detection.Sensors (Basel). 2024 Mar 29;24(7):2193. doi: 10.3390/s24072193. Sensors (Basel). 2024. PMID: 38610404 Free PMC article.
-
Automatic detection of foreign object intrusion along railway tracks based on MACENet.PLoS One. 2025 Aug 6;20(8):e0329303. doi: 10.1371/journal.pone.0329303. eCollection 2025. PLoS One. 2025. PMID: 40768523 Free PMC article.
References
-
- Pietikäinen M. Local binary patterns. Scholarpedia. 2010;5:9775. doi: 10.4249/scholarpedia.9775. - DOI
-
- Lindeberg T. Scale invariant feature transform. Scholarpedia. 2012;7:10491. doi: 10.4249/scholarpedia.10491. - DOI
-
- Schapire R.E. The strength of weak learnability. Mach. Learn. 1990;5:197–227. doi: 10.1007/BF00116037. - DOI
-
- Breiman L. Bagging predictors. Mach. Learn. 1996;24:123–140. doi: 10.1007/BF00058655. - DOI
Grants and funding
LinkOut - more resources
Full Text Sources