Sensors (Basel). 2024 Sep 25;24(19):6209.

doi: 10.3390/s24196209.

SOD-YOLOv8-Enhancing YOLOv8 for Small Object Detection in Aerial Imagery and Traffic Scenes

Boshra Khalili¹, Andrew W Smyth¹

PMID: 39409249
PMCID: PMC11478522
DOI: 10.3390/s24196209

Boshra Khalili et al. Sensors (Basel). 2024.

Affiliation

¹ Department of Civil Engineering and Engineering Mechanics, Columbia University, New York, NY 10027, USA.

Abstract

Object detection, as a crucial aspect of computer vision, plays a vital role in traffic management, emergency response, autonomous vehicles, and smart cities. Despite the significant advancements in object detection, detecting small objects in images captured by high-altitude cameras remains challenging, due to factors such as object size, distance from the camera, varied shapes, and cluttered backgrounds. To address these challenges, we propose small object detection YOLOv8 (SOD-YOLOv8), a novel model specifically designed for scenarios involving numerous small objects. Inspired by efficient generalized feature pyramid networks (GFPNs), we enhance multi-path fusion within YOLOv8 to integrate features across different levels, preserving details from shallower layers and improving small object detection accuracy. Additionally, we introduce a fourth detection layer to effectively utilize high-resolution spatial information. The efficient multi-scale attention module (EMA) in the C2f-EMA module further enhances feature extraction by redistributing weights and prioritizing relevant features. We introduce powerful-IoU (PIoU) as a replacement for CIoU, focusing on moderate quality anchor boxes and adding a penalty based on differences between predicted and ground truth bounding box corners. This approach simplifies calculations, speeds up convergence, and enhances detection accuracy. SOD-YOLOv8 significantly improves small object detection, surpassing widely used models across various metrics, without substantially increasing the computational cost or latency compared to YOLOv8s. Specifically, it increased recall from 40.1% to 43.9%, precision from 51.2% to 53.9%, $mAP_{0.5}$ from 40.6% to 45.1%, and $mAP_{0.5:0.95}$ from 24% to 26.6%. Furthermore, experiments conducted in dynamic real-world traffic scenes illustrated SOD-YOLOv8's significant enhancements across diverse environmental conditions, highlighting its reliability and effective object detection capabilities in challenging scenarios.
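
The reported improvements over YOLOv8s can be restated as absolute gains (percentage points) and relative gains. The short sketch below only redoes that arithmetic from the four pairs of figures quoted in the abstract; the dictionary layout and the helper name `gains` are illustrative choices, not part of the paper.

```python
# Baseline (YOLOv8s) and SOD-YOLOv8 scores in percent, as quoted in the abstract.
reported = {
    "recall": (40.1, 43.9),
    "precision": (51.2, 53.9),
    "mAP_0.5": (40.6, 45.1),
    "mAP_0.5:0.95": (24.0, 26.6),
}


def gains(baseline, improved):
    """Return (absolute gain in percentage points, relative gain in percent)."""
    absolute = improved - baseline
    relative = 100.0 * absolute / baseline
    return round(absolute, 1), round(relative, 1)


# Hypothetical summary table: metric name -> (absolute gain, relative gain).
summary = {name: gains(*pair) for name, pair in reported.items()}

for name, (absolute, relative) in summary.items():
    print(f"{name}: +{absolute} points ({relative}% relative)")
```

Read this way, the largest absolute gain is in mAP at IoU 0.5 (4.5 points), and every metric improves by roughly 5% to 11% relative to the baseline.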

Keywords: YOLOv8; attention mechanism; bounding box regression; feature pyramid network; small object detection.

Conflict of interest statement

The authors declare no conflict of interest.

Figures

**Figure 1**
The network structure of YOLOv8, including the following modules: (a) C2F; (b) Bottleneck; (c) Convolution (conv); (d) Spatial Pyramid Pooling Fast (SPPF); and (e) Detection Layer.

**Figure 2**
Proposed improved YOLOv8 for small object detection, with original YOLOv8 in gray and highlighted improved modules.

**Figure 3**
Skip-layer links: (a) dense-link: concatenates features from all preceding layers; (b) $\log_2 n$-link: concatenates features from up to $\log_2(l) + 1$ layers at each level.
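
The difference between the two linking schemes in this caption can be sketched by listing which earlier layers feed layer `l`. The caption only states the bound of log2(l) + 1 inputs; the convention used here (layers numbered from 0, sources at power-of-two offsets `l - 1, l - 2, l - 4, ...`) is an assumption based on the usual description of the GFPN log2n-link, and both function names are hypothetical.

```python
def dense_link_sources(l):
    """Dense-link: layer l takes input from every preceding layer 0..l-1."""
    return list(range(l))


def log2n_link_sources(l):
    """log2n-link (assumed convention): layer l takes input from layers
    l - 2**n for n = 0, 1, 2, ... while l - 2**n >= 0."""
    sources = []
    step = 1
    while step <= l:
        sources.append(l - step)
        step *= 2
    return sources


# For layer 8 the dense variant needs 8 inputs, the log2n variant only 4.
print(dense_link_sources(8))
print(log2n_link_sources(8))
```

Under this convention the number of inputs grows logarithmically with depth rather than linearly, which is what makes the log2n-link the cheaper of the two.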

**Figure 4**
Different feature pyramid network designs: (a) FPN uses a top-down strategy; (b) PANet enhances FPN with a bottom-up pathway; (c) BiFPN integrates cross-scale pathways bidirectionally; (d) GFPN includes a queen-fusion style pathway and skip-layer connections.

**Figure 5**
Enhanced and efficient GFPN structure.

**Figure 6**
Efficient multi-scale attention mechanism.

**Figure 8**
Anchor box regression process guided by (a) complete IoU-based loss function (CIoU), (b) penalty term in powerful-IoU (PIoU) loss function without attention function.
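
To make the penalty in panel (b) concrete, here is a minimal plain-Python sketch of a PIoU-style loss without the attention function. The formula is an assumption taken from the commonly cited form of the PIoU loss (edge offsets between predicted and ground-truth boxes, normalized by the ground-truth width and height, then passed through `1 - exp(-P**2)`); it is not taken from this record and is not the authors' implementation. Boxes are assumed to be `(x1, y1, x2, y2)` tuples, and all function names are hypothetical.

```python
import math


def iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)


def piou_penalty(pred, target):
    """Assumed penalty P: mean of the four edge offsets, each normalized
    by the ground-truth box width (x offsets) or height (y offsets)."""
    w_gt = target[2] - target[0]
    h_gt = target[3] - target[1]
    dw = abs(pred[0] - target[0]) + abs(pred[2] - target[2])
    dh = abs(pred[1] - target[1]) + abs(pred[3] - target[3])
    return (dw / w_gt + dh / h_gt) / 4.0


def piou_loss(pred, target):
    """Assumed loss: (1 - IoU) + (1 - exp(-P^2)), which lies in [0, 2]."""
    p = piou_penalty(pred, target)
    return (1.0 - iou(pred, target)) + (1.0 - math.exp(-p * p))


# A prediction shifted by one unit against a 2x2 ground-truth box.
print(piou_loss((1.0, 1.0, 3.0, 3.0), (0.0, 0.0, 2.0, 2.0)))
```

Because the penalty depends only on the ground-truth box size and the edge offsets, it needs no enclosing-box or aspect-ratio terms of the kind CIoU uses, which is consistent with the abstract's remark that PIoU simplifies the calculation.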

**Figure 9**
Information regarding the manual annotation process for objects in the VisDrone2019 dataset.

**Figure 10**
Training progress plots comparing YOLOv8s-GFPN-EMA, YOLOv8s-GFPN, and YOLOv8s based on (a) $mAP_{0.5}$ and (b) precision.

**Figure 11**
(a) Confusion matrix of YOLOv8s; (b) confusion matrix of proposed model.

**Figure 12**
Inference results for (a) YOLOv8s and (b) SOD-YOLOv8s across diverse scenarios including distant and high-density objects, as well as nighttime scenarios, using the VisDrone2019 dataset.

**Figure 13**
The perspective captured by COSMOS cameras on the 12th floor of Columbia’s Mudd building overlooking the intersection [56].

**Figure 14**
Inference results for (a) YOLOv8s and (b) SOD-YOLOv8s across various scenarios, including scenes with distant and high-density objects, as well as nighttime scenarios, using the traffic scene dataset.

Cited by

Optimized small object detection in low resolution infrared images using super resolution and attention based feature fusion.
Wang W, Xu J, Zhang R. PLoS One. 2025 Jul 18;20(7):e0328223. doi: 10.1371/journal.pone.0328223. eCollection 2025. PMID: 40680051. Free PMC article.
Improved model MASW YOLO for small target detection in UAV images based on YOLOv8.
Meng X, Yuan F, Zhang D. Sci Rep. 2025 Jul 11;15(1):25027. doi: 10.1038/s41598-025-10428-w. PMID: 40646143. Free PMC article.
Detecting subsurface diseases on airport road surface based on an improved SSD algorithm.
Pan M, Chen H, Yang L, Jiang X. PLoS One. 2025 Jul 23;20(7):e0327522. doi: 10.1371/journal.pone.0327522. eCollection 2025. PMID: 40700465. Free PMC article.
Accuracy-Efficiency Trade-Off: Optimizing YOLOv8 for Structural Crack Detection.
Zhang J, Beliaeva ZV, Huang Y. Sensors (Basel). 2025 Jun 21;25(13):3873. doi: 10.3390/s25133873. PMID: 40648132. Free PMC article.
Rice-SVBDete: a detection algorithm for small vascular bundles in rice stem's cross-sections.
Zhu X, Zhou W, Li J, Yang M, Zhou H, Huang J, Shi J, Shen J, Pang G, Wang L. Front Plant Sci. 2025 May 26;16:1589161. doi: 10.3389/fpls.2025.1589161. eCollection 2025. PMID: 40491816. Free PMC article.

Grants and funding

EEC-2133516/National Science Foundation
