CAFE-YOLO: an object detection algorithm from UAV perspective fusing channel attention and fine-grained feature enhancement
- PMID: 41062552
- PMCID: PMC12508220
- DOI: 10.1038/s41598-025-18881-3
CAFE-YOLO: an object detection algorithm from UAV perspective fusing channel attention and fine-grained feature enhancement
Abstract
In aerial imagery captured by drones, object detection tasks often face challenges such as a high proportion of small objects, complex background interference, and insufficient lighting conditions, all of which substantially affect feature representation and detection accuracy. To address these challenges, a novel object detection algorithm named channel attention and fine-grained enhancement YOLO (CAFE-YOLO) is proposed. This algorithm incorporates a channel attention mechanism into the backbone network to enhance the focus on critical features while suppressing redundant information. Furthermore, a fine-grained feature enhancement module is introduced to extract local detail features, improving the perception of small and occluded objects. In the detection head, a lightweight attention-guided feature fusion strategy is designed to further optimize object localization and classification performance. Experimental results on the VisDrone2019 dataset show that the proposed method achieves significantly better detection performance than most existing advanced algorithms in complex drone-captured imaging scenarios. While maintaining a lightweight architecture, it reaches a mean average precision at IoU threshold 0.5 of 44.6%, demonstrating substantial improvements in both overall detection accuracy and robustness.
Keywords: Channel attention mechanism; Drone-captured imagery; Fine-grained feature enhancement; Object detection.
© 2025. The Author(s).
Conflict of interest statement
Declarations. Competing interests: The authors declare no competing interests.
Figures












References
-
- Zhang, Y., Wu, C., Zhang, T., Liu, Y. & Zheng, Y. Self-Attention Guidance and Multiscale Feature Fusion-Based UAV Image Object Detection. IEEE Geosci. Remote Sens. Lett.20, 1–5 (2023).
-
- Wang, X., He, N., Hong, C., Wang, Q. & Chen, M. Improved YOLOXX Based UAV Aerial Photography Object Detection Algorithm. Image Vis. Comput.135, 104697 (2023).
-
- Zeng, S., Yang, W., Jiao, Y., Geng, L. & Chen, X. SCA-YOLO: A New Small Object Detection Model for UAV Images. Vis. Comput.40, 1787–1803 (2023).
-
- Gao, P. et al. “Double FCOS: A Two-Stage Model Utilizing FCOS for Vehicle Detection in Various Remote Sensing Scenes’’. IEEE J. Sel. Topics Appl. Earth Obs. Remote Sens15, 4730–4743 (2022).
-
- Huyan, N., Zhang, X., Quan, D., Chanussot, J. & Jiao, L. Cluster-Memory Augmented Deep Autoencoder via Optimal Transportation for Hyperspectral Anomaly Detection. IEEE Trans. Geosci. Remote Sens.60, 5531916 (2022).
LinkOut - more resources
Full Text Sources