Research on Road Scene Understanding of Autonomous Vehicles Based on Multi-Task Learning
- PMID: 37448087
- PMCID: PMC10346996
- DOI: 10.3390/s23136238
Research on Road Scene Understanding of Autonomous Vehicles Based on Multi-Task Learning
Abstract
Road scene understanding is crucial to the safe driving of autonomous vehicles. Comprehensive road scene understanding requires a visual perception system to deal with a large number of tasks at the same time, which needs a perception model with a small size, fast speed, and high accuracy. As multi-task learning has evident advantages in performance and computational resources, in this paper, a multi-task model YOLO-Object, Drivable Area, and Lane Line Detection (YOLO-ODL) based on hard parameter sharing is proposed to realize joint and efficient detection of traffic objects, drivable areas, and lane lines. In order to balance tasks of YOLO-ODL, a weight balancing strategy is introduced so that the weight parameters of the model can be automatically adjusted during training, and a Mosaic migration optimization scheme is adopted to improve the evaluation indicators of the model. Our YOLO-ODL model performs well on the challenging BDD100K dataset, achieving the state of the art in terms of accuracy and computational efficiency.
Keywords: autonomous vehicles; drivable area detection; lane line detection; multi-task learning; traffic object detection; visual perception.
Conflict of interest statement
The authors declare no conflict of interest.
Figures









Similar articles
-
Multi-Task Environmental Perception Methods for Autonomous Driving.Sensors (Basel). 2024 Aug 28;24(17):5552. doi: 10.3390/s24175552. Sensors (Basel). 2024. PMID: 39275463 Free PMC article.
-
A Review of Vision-Based Multi-Task Perception Research Methods for Autonomous Vehicles.Sensors (Basel). 2025 Apr 20;25(8):2611. doi: 10.3390/s25082611. Sensors (Basel). 2025. PMID: 40285295 Free PMC article. Review.
-
A Multi-Task Road Feature Extraction Network with Grouped Convolution and Attention Mechanisms.Sensors (Basel). 2023 Sep 30;23(19):8182. doi: 10.3390/s23198182. Sensors (Basel). 2023. PMID: 37837012 Free PMC article.
-
Mobip: a lightweight model for driving perception using MobileNet.Front Neurorobot. 2023 Dec 4;17:1291875. doi: 10.3389/fnbot.2023.1291875. eCollection 2023. Front Neurorobot. 2023. PMID: 38111713 Free PMC article.
-
Off-Road Detection Analysis for Autonomous Ground Vehicles: A Review.Sensors (Basel). 2022 Nov 3;22(21):8463. doi: 10.3390/s22218463. Sensors (Basel). 2022. PMID: 36366160 Free PMC article. Review.
Cited by
-
A panoramic driving perception fusion algorithm based on multi-task learning.PLoS One. 2024 Jun 4;19(6):e0304691. doi: 10.1371/journal.pone.0304691. eCollection 2024. PLoS One. 2024. PMID: 38833435 Free PMC article.
-
Optimal Configuration of Multi-Task Learning for Autonomous Driving.Sensors (Basel). 2023 Dec 9;23(24):9729. doi: 10.3390/s23249729. Sensors (Basel). 2023. PMID: 38139577 Free PMC article.
-
Multi-object detection for crowded road scene based on ML-AFP of YOLOv5.Sci Rep. 2023 Oct 12;13(1):17310. doi: 10.1038/s41598-023-43458-3. Sci Rep. 2023. PMID: 37828051 Free PMC article.
-
Multi-Task Environmental Perception Methods for Autonomous Driving.Sensors (Basel). 2024 Aug 28;24(17):5552. doi: 10.3390/s24175552. Sensors (Basel). 2024. PMID: 39275463 Free PMC article.
References
-
- Qian Y., Dolan J.M., Yang M. DLT-Net: Joint detection of drivable areas, lane lines, and traffic objects. IEEE Trans. Intell. Transp. Syst. (IVS) 2019;21:4670–4679. doi: 10.1109/TITS.2019.2943777. - DOI
-
- Teichmann M., Weber M., Zollner M., Cipolla R., Urtasun R. MultiNet: Real-time joint semantic reasoning for autonomous driving; Proceedings of the 2018 IEEE Intelligent Vehicles Symposium (IV); Changshu, China. 26–30 June 2018; pp. 1013–1020.
-
- Sun Z., Bebis G., Miller R. On-road vehicle detection: A review. IEEE Trans. Pattern Anal. Mach. Intell. 2006;28:694–711. - PubMed
-
- Owais M. Traffic sensor location problem: Three decades of research. Expert Syst. Appl. 2022;208:118134. doi: 10.1016/j.eswa.2022.118134. - DOI
-
- Bhaggiaraj S., Priyadharsini M., Karuppasamy K., Snegha R. Deep Learning Based Self Driving Cars Using Computer Vision; Proceedings of the 2023 International Conference on Networking and Communications (ICNWC); Chennai, India. 5–6 April 2023; pp. 1–9.
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials