Multi-scale object detection in UAV images based on adaptive feature fusion
- PMID: 38536859
- PMCID: PMC10971329
- DOI: 10.1371/journal.pone.0300120
Abstract
With the widespread use of UAVs, target detection in UAV aerial images has practical applications in the military, traffic planning, personnel search and rescue, and other fields. In this paper, we propose a multi-scale detection method for UAV aerial images, based on adaptive feature fusion, to address the problem of detecting small target objects. The method adds an adaptive feature extraction module (AFEM) to the backbone network, which automatically adjusts the receptive field of the convolution kernels and reduces redundant image background. This allows the network to extract small-target feature information more accurately and effectively. In addition, we design an adaptive feature weighted fusion network (SBiFPN) to enhance the representation of shallow feature information for small targets. Finally, we add a small-target detection scale to the original network to expand its receptive field and strengthen the detection of small target objects. Training and testing are carried out on the public VisDrone dataset. The experimental results show that the proposed method achieves 38.5% mAP, which is 2.0% higher than the baseline network YOLOv5s, and that it still detects objects in UAV aerial images well in complex scenes.
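The abstract does not say how SBiFPN computes its fusion weights. As a rough illustration of what adaptive weighted feature fusion can look like, the sketch below assumes the fast normalized fusion used in BiFPN-style networks: each input feature map gets a learnable scalar weight, which is clipped to be non-negative and normalized before the maps are summed. The function name `fuse_features`, the `eps` value, and the toy inputs are hypothetical and are not taken from the paper.

```python
def fuse_features(features, weights, eps=1e-4):
    """Fuse same-shaped 2D feature maps with normalized non-negative weights.

    Assumption: BiFPN-style fast normalized fusion, not necessarily the
    scheme used by the paper's SBiFPN.
    """
    if len(features) != len(weights):
        raise ValueError("need exactly one weight per feature map")

    # Clip weights at zero (ReLU) so no input is subtracted from the result.
    clipped = [max(0.0, w) for w in weights]

    # Normalize so the weights sum to (almost) one; eps avoids division by zero.
    total = sum(clipped) + eps
    norm = [w / total for w in clipped]

    rows = len(features[0])
    cols = len(features[0][0])

    # Weighted element-wise sum over all input feature maps.
    fused = [
        [sum(n * f[r][c] for n, f in zip(norm, features)) for c in range(cols)]
        for r in range(rows)
    ]
    return fused, norm


# Toy example: a shallow and a deep feature map of the same size.
shallow = [[1.0, 2.0]]
deep = [[3.0, 4.0]]
fused, norm = fuse_features([shallow, deep], weights=[1.0, 3.0])
```

In a trained network the weights would be learned parameters, so the model itself decides how much each scale contributes. Here they are fixed by hand only to show the arithmetic.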
Copyright: © 2024 Tan et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Conflict of interest statement
The authors have declared that no competing interests exist.
