Sensors (Basel). 2021 Jun 8;21(12):3964. doi: 10.3390/s21123964.

Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud


Muhammad Imad et al. Sensors (Basel).

Abstract

Three-dimensional object detection using LiDAR point cloud data is an indispensable part of autonomous driving perception systems. Point cloud-based 3D object detection achieves higher accuracy than camera-based detection, particularly at night. However, most LiDAR-based 3D object detection methods work in a supervised manner, which means their state-of-the-art performance relies heavily on large-scale, well-labeled datasets, while such annotated datasets can be expensive to obtain and are only available for limited scenarios. Transfer learning is a promising approach to reduce the need for large-scale training datasets, but existing transfer learning object detectors are designed primarily for 2D rather than 3D object detection. In this work, we use 3D point cloud data more effectively by representing the scene as a bird's-eye-view (BEV) map and propose transfer learning based point cloud semantic segmentation for 3D object detection. The proposed model minimizes the need for large-scale training datasets and consequently reduces training time. First, a preprocessing stage filters the raw point cloud data into a BEV map within a specific field of view. Second, the transfer learning stage uses knowledge from a previously learned classification task (with more data available for training) and generalizes it to the semantic segmentation-based 2D object detection task. Finally, the 2D detection results from the BEV image are back-projected into 3D in the postprocessing stage. We verify the results on two datasets, the KITTI 3D object detection dataset and the Ouster LiDAR-64 dataset, demonstrating that the proposed method is highly competitive in terms of mean average precision (mAP up to 70%) while running at more than 30 frames per second (FPS).
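The preprocessing stage described above (filtering the raw point cloud to a BEV map within a field of view) can be sketched as follows. This is a minimal illustrative sketch, not the authors' implementation: the channel assignment (height, intensity, density), the metric ranges, and the 0.1 m resolution are all assumptions chosen to resemble common BEV encodings for KITTI-style LiDAR data.

```python
import numpy as np

def pointcloud_to_bev_rgb(points, x_range=(0.0, 50.0), y_range=(-25.0, 25.0),
                          z_range=(-2.0, 1.25), resolution=0.1):
    """Discretize a LiDAR point cloud into a bird's-eye-view RGB map.

    points: (N, 4) array of [x, y, z, intensity] in the LiDAR frame.
    The three channels encode max height, max intensity, and log point
    density per cell (channel layout and ranges are assumptions).
    """
    # Keep only points inside the chosen field of view.
    mask = ((points[:, 0] >= x_range[0]) & (points[:, 0] < x_range[1]) &
            (points[:, 1] >= y_range[0]) & (points[:, 1] < y_range[1]) &
            (points[:, 2] >= z_range[0]) & (points[:, 2] < z_range[1]))
    pts = points[mask]

    h = int((x_range[1] - x_range[0]) / resolution)
    w = int((y_range[1] - y_range[0]) / resolution)
    bev = np.zeros((h, w, 3), dtype=np.float32)

    # Map metric coordinates to pixel indices.
    xi = ((pts[:, 0] - x_range[0]) / resolution).astype(np.int32)
    yi = ((pts[:, 1] - y_range[0]) / resolution).astype(np.int32)

    # Height channel: maximum height per cell, normalized to [0, 1].
    z_norm = (pts[:, 2] - z_range[0]) / (z_range[1] - z_range[0])
    np.maximum.at(bev[:, :, 0], (xi, yi), z_norm)

    # Intensity channel: maximum reflectance per cell.
    np.maximum.at(bev[:, :, 1], (xi, yi), pts[:, 3])

    # Density channel: log-normalized point count per cell.
    counts = np.zeros((h, w), dtype=np.float32)
    np.add.at(counts, (xi, yi), 1.0)
    bev[:, :, 2] = np.minimum(1.0, np.log1p(counts) / np.log(64.0))
    return bev
```

The resulting three-channel image can be fed to a standard 2D segmentation network, which is what makes the classification-to-segmentation transfer described in the abstract possible.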

Keywords: 3D object detection; point cloud processing; semantic segmentation; transfer learning.


Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Overview of the proposed 3D object detection architecture. The proposed model directly uses LiDAR-based bird's-eye-view (BEV) images to estimate and localize 3D bounding volumes. The whole pipeline consists of a preprocessing module, a deep learning module, and a back-projection module.
Figure 2
Schematic representation of the bird's-eye-view RGB map.
Figure 3
Transfer learning from classification to segmentation.
Figure 4
Schematic illustration of an encoder-decoder architecture. The left-hand side is a bird's-eye-view RGB map that is passed through a series of computational layers, and the right-hand side is the output decoder feature map. The arrows are skip connections, through which encoder features are concatenated directly into the decoder.
Figure 5
Comparison between a model trained from scratch and a model initialized with pretrained classification weights.
Figure 6
Comparison between a model trained from scratch and a model initialized with pretrained classification weights: (a,b) show predictions using pretrained weights; (c,d) show predictions using the model trained from scratch.
Figure 7
Visualization results on the KITTI dataset using the proposed method. Subfigures (a–d) show the ground truth on the right-hand side and the output results on the left-hand side. The images show that the proposed model performs well in different scenarios.
Figure 8
Subfigures (a–d) show samples of predictions on the left-hand side and extracted contours for the car class on the right-hand side. Subfigures (a,b) show simple scenarios, and (c,d) show images where the proposed model achieves accurate results in more complex scenarios with rotated bounding boxes.
Figure 9
Qualitative results of the proposed model on the KITTI 3D object detection dataset in the LiDAR frame.
Figure 10
Qualitative results of the proposed model on the Ouster LiDAR-64 dataset in the LiDAR frame.
Figure 11
Performance comparison. This plot shows mean average precision (mAP) against run-time (FPS) in the LiDAR frame. We compare the proposed model with existing 3D object detection models and measure its performance on a dedicated embedded platform (Intel PC) with real-time efficiency.
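The back-projection module in the pipeline (Figure 1) recovers 3D boxes from rotated 2D detections on the BEV image by inverting the grid discretization. A minimal sketch, under assumed parameters: the 0.1 m resolution, the metric ranges, and the fixed ground level and object height priors are illustrative, not taken from the paper, since the BEV image discards exact elevation.

```python
def bev_box_to_3d(px, py, pw, pl, yaw, x_min=0.0, y_min=-25.0,
                  resolution=0.1, z_bottom=-1.7, height=1.5):
    """Back-project a rotated 2D box detected on the BEV map into 3D.

    (px, py) is the box center in BEV pixels, (pw, pl) its size in
    pixels, and yaw its rotation in radians. z_bottom and height are
    assumed priors (e.g. a fixed car height). Returns the 3D box as
    (x, y, z, w, l, h, yaw) in metric LiDAR coordinates.
    """
    x = x_min + px * resolution   # pixel row -> forward distance (m)
    y = y_min + py * resolution   # pixel col -> lateral offset (m)
    w = pw * resolution           # box width in meters
    l = pl * resolution           # box length in meters
    z = z_bottom + height / 2.0   # box center height above ground prior
    return (x, y, z, w, l, height, yaw)
```

With contours extracted from the segmentation mask (as in Figure 8), a rotated rectangle fitted to each contour supplies (px, py, pw, pl, yaw), and this inverse mapping places the detection back into the LiDAR frame.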
