A Multi-Task Network Based on Dual-Neck Structure for Autonomous Driving Perception

Guopeng Tan et al. Sensors (Basel). 2024 Feb 28;24(5):1547. doi: 10.3390/s24051547.

Abstract

A vision-based autonomous driving perception system must accomplish a suite of tasks, including vehicle detection, drivable area segmentation, and lane line segmentation. Given the limited computational resources available, multi-task learning has become the dominant methodology for building such systems. In this article, we introduce an efficient end-to-end multi-task learning model that performs well on all fronts. Our approach builds a reliable feature extraction network by introducing a feature extraction module called C2SPD. Moreover, to account for the disparities among the tasks, we propose a dual-neck architecture. Finally, we present an optimized design for the decoders of each task. Our model performs strongly on the challenging BDD100K dataset, attaining high accuracy (Acc) in vehicle detection and high mean intersection over union (mIoU) in drivable area segmentation. In addition, this is the first work to process these three visual perception tasks simultaneously in real time on an Atlas 200I A2 embedded device while maintaining excellent accuracy.
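The abstract gives no implementation details, so the following is a minimal PyTorch sketch of the dual-neck idea it describes: a shared backbone feeding a detection neck and a segmentation neck, each ending in its own task decoder. Everything here is an illustrative assumption; in particular, the stand-in backbone does not implement the paper's C2SPD module, and the module names, channel widths, and strides are hypothetical.

import torch
import torch.nn as nn


class SharedBackbone(nn.Module):
    """Stand-in feature extractor (NOT the paper's C2SPD-based backbone):
    three stride-2 conv blocks, producing stride-8 features."""

    def __init__(self, channels=(32, 64, 128)):
        super().__init__()
        layers, in_ch = [], 3
        for out_ch in channels:
            layers += [
                nn.Conv2d(in_ch, out_ch, 3, stride=2, padding=1, bias=False),
                nn.BatchNorm2d(out_ch),
                nn.SiLU(),
            ]
            in_ch = out_ch
        self.body = nn.Sequential(*layers)

    def forward(self, x):
        return self.body(x)


class Neck(nn.Module):
    """One neck branch; a single conv block stands in for whatever
    feature fusion the paper's necks actually perform."""

    def __init__(self, ch=128):
        super().__init__()
        self.refine = nn.Sequential(
            nn.Conv2d(ch, ch, 3, padding=1, bias=False),
            nn.BatchNorm2d(ch),
            nn.SiLU(),
        )

    def forward(self, x):
        return self.refine(x)


class DualNeckMultiTaskNet(nn.Module):
    """Shared backbone -> two task-specific necks -> three decoders:
    vehicle detection, drivable area, and lane line segmentation."""

    def __init__(self, ch=128, num_classes=1):
        super().__init__()
        self.backbone = SharedBackbone()
        self.det_neck = Neck(ch)  # features specialized for detection
        self.seg_neck = Neck(ch)  # features shared by both segmentation tasks
        # Dense detection decoder: 4 box coords + objectness + class scores per cell.
        self.det_head = nn.Conv2d(ch, 5 + num_classes, 1)
        # Binary-mask decoders, upsampled x8 back to input resolution.
        self.da_head = nn.Sequential(
            nn.Conv2d(ch, 1, 1),
            nn.Upsample(scale_factor=8, mode="bilinear", align_corners=False),
        )
        self.ll_head = nn.Sequential(
            nn.Conv2d(ch, 1, 1),
            nn.Upsample(scale_factor=8, mode="bilinear", align_corners=False),
        )

    def forward(self, x):
        feat = self.backbone(x)
        det = self.det_head(self.det_neck(feat))
        seg = self.seg_neck(feat)
        return det, self.da_head(seg), self.ll_head(seg)


if __name__ == "__main__":
    net = DualNeckMultiTaskNet()
    det, da, ll = net(torch.randn(1, 3, 384, 640))
    print(det.shape, da.shape, ll.shape)  # (1,6,48,80) (1,1,384,640) (1,1,384,640)

The point of the dual-neck split is the one the abstract makes: the backbone cost is paid once, while detection and segmentation features are refined separately to accommodate the disparities between the tasks.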

Keywords: drivable area segmentation; lane line segmentation; multi-task learning; vehicle detection.

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1. Performing real-time inference on Atlas 200I A2.
Figure 2. The network of YOLOP-DN.
Figure 3. The backbone of feature extraction.
Figure 4. Dual-neck structure.
Figure 5. Decoder structure.
Figure 6. Methodology flow chart.
Figure 7. The daytime results.
Figure 8. The nighttime results.
