Corner-Point and Foreground-Area IoU Loss: Better Localization of Small Objects in Bounding Box Regression
- PMID: 37430876
- PMCID: PMC10223589
- DOI: 10.3390/s23104961
Abstract
Bounding box regression is a crucial step in object detection and directly affects the localization performance of detected objects. In small object detection especially, a well-designed bounding box regression loss can significantly reduce the number of missed small objects. However, the broad family of Intersection over Union (IoU) losses used in bounding box regression, referred to here as Broad IoU losses (BIoU losses), has two major problems: (i) BIoU losses cannot provide effective fitting information as the predicted box approaches the target box, resulting in slow convergence and inaccurate regression results; (ii) most localization loss functions do not fully utilize the spatial information of the target, namely the target's foreground area, during the fitting process. To overcome these issues, this paper proposes the Corner-point and Foreground-area IoU loss (CFIoU loss) function. First, we use the normalized corner-point distance between the two boxes instead of the normalized center-point distance used in BIoU losses, which effectively suppresses the degradation of BIoU losses to the plain IoU loss when the two boxes are close. Second, we add adaptive target information to the loss function to provide richer target information for optimizing the bounding box regression process, especially for small object detection. We then conducted simulation experiments on bounding box regression to validate our hypothesis. We also quantitatively compared the current mainstream BIoU losses with our proposed CFIoU loss on the public small object datasets VisDrone2019 and SODA-D, using the latest anchor-based YOLOv5 and anchor-free YOLOv8 object detection algorithms.
The experimental results show that YOLOv5s (+3.12% Recall, +2.73% mAP@0.5, and +1.91% mAP@0.5:0.95) and YOLOv8s (+1.72% Recall and +0.60% mAP@0.5), both incorporating the CFIoU loss, achieved the highest performance improvement on the VisDrone2019 test set. Similarly, YOLOv5s (+6% Recall, +13.08% mAP@0.5, and +14.29% mAP@0.5:0.95) and YOLOv8s (+3.36% Recall, +3.66% mAP@0.5, and +4.05% mAP@0.5:0.95), both incorporating the CFIoU loss, achieved the highest performance improvement on the SODA-D test set. These results indicate the effectiveness and superiority of the CFIoU loss in small object detection. Additionally, we conducted comparative experiments in which the CFIoU loss and the BIoU losses were each integrated into the SSD algorithm, which is weak at small object detection. The SSD algorithm incorporating the CFIoU loss achieved the highest improvement in the AP (+5.59%) and AP75 (+5.37%) metrics, indicating that the CFIoU loss can also improve the performance of algorithms that do not handle small objects well.
Keywords: bounding box regression; loss function; object detection; small object.
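To illustrate the first point of the abstract, the following is a minimal sketch of why a center-point distance term can vanish while a corner-point distance term does not. It is not the CFIoU formulation from the paper, which the abstract does not spell out. The center-point term follows the well-known DIoU-style penalty; the function `corner_penalty` and its normalization (average of the two squared corner distances over the squared diagonal of the enclosing box) are assumptions made only for this illustration.

```python
# Illustrative sketch only: NOT the paper's CFIoU loss.
# Boxes are (x1, y1, x2, y2) with x1 < x2 and y1 < y2.

def iou(a, b):
    """Plain Intersection over Union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

def enclosing_diag_sq(a, b):
    """Squared diagonal of the smallest box enclosing both boxes."""
    cw = max(a[2], b[2]) - min(a[0], b[0])
    ch = max(a[3], b[3]) - min(a[1], b[1])
    return cw ** 2 + ch ** 2

def center_penalty(a, b):
    """DIoU-style normalized squared distance between box centers."""
    acx, acy = (a[0] + a[2]) / 2, (a[1] + a[3]) / 2
    bcx, bcy = (b[0] + b[2]) / 2, (b[1] + b[3]) / 2
    return ((acx - bcx) ** 2 + (acy - bcy) ** 2) / enclosing_diag_sq(a, b)

def corner_penalty(a, b):
    """Hypothetical normalized corner-point distance (assumed form):
    mean squared distance of the top-left and bottom-right corners."""
    top_left = (a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2
    bottom_right = (a[2] - b[2]) ** 2 + (a[3] - b[3]) ** 2
    return (top_left + bottom_right) / (2 * enclosing_diag_sq(a, b))

# A prediction that shares the target's center but has the wrong size.
target = (0.0, 0.0, 10.0, 10.0)
pred = (2.0, 2.0, 8.0, 8.0)

print("IoU:", iou(target, pred))
print("center penalty:", center_penalty(target, pred))
print("corner penalty:", corner_penalty(target, pred))
```

In this concentric case the center-point term is exactly zero, so a loss built from IoU plus that term reduces to the plain IoU loss, whereas the corner-point term remains positive until the two boxes coincide.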
Conflict of interest statement
The authors declare no conflict of interest.