LiDAR-as-Camera for End-to-End Driving

Ardi Tampuu et al. Sensors (Basel). 2023 Mar 6;23(5):2845. doi: 10.3390/s23052845.

Abstract

The core task of any autonomous driving system is to transform sensory inputs into driving commands. In end-to-end driving, this is achieved via a neural network, with one or multiple cameras as the most commonly used input and low-level driving commands, e.g., steering angle, as output. However, simulation studies have shown that depth-sensing can make the end-to-end driving task easier. On a real car, combining depth and visual information can be challenging due to the difficulty of obtaining good spatial and temporal alignment of the sensors. To alleviate alignment problems, Ouster LiDARs can output surround-view LiDAR images with depth, intensity, and ambient radiation channels. These measurements originate from the same sensor, rendering them perfectly aligned in time and space. The main goal of our study is to investigate how useful such images are as inputs to a self-driving neural network. We demonstrate that such LiDAR images are sufficient for the real-car road-following task. Models using these images as input perform at least as well as camera-based models in the tested conditions. Moreover, LiDAR images are less sensitive to weather conditions and lead to better generalization. In a secondary research direction, we reveal that the temporal smoothness of off-policy prediction sequences correlates with actual on-policy driving ability as well as the commonly used mean absolute error does.
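
The abstract's secondary claim concerns two off-policy metrics. As a minimal sketch, assuming "temporal smoothness" means the mean absolute change between consecutive predicted steering angles (the exact definition is given in the paper body, not here), the two metrics could be computed roughly as follows; the arrays and model names are hypothetical:

import numpy as np

def mean_absolute_error(predicted: np.ndarray, ground_truth: np.ndarray) -> float:
    # Standard off-policy MAE between predicted and recorded steering angles.
    return float(np.mean(np.abs(predicted - ground_truth)))

def temporal_smoothness(predicted: np.ndarray) -> float:
    # Assumed definition: mean absolute change between consecutive predictions
    # (lower values = smoother steering command sequences).
    return float(np.mean(np.abs(np.diff(predicted))))

# Illustration with made-up prediction sequences from two hypothetical models.
rng = np.random.default_rng(0)
t = np.linspace(0, 10, 500)
ground_truth = 0.2 * np.sin(t)                                   # recorded human steering
smooth_preds = ground_truth + 0.01 * np.sin(3 * t)               # small, slowly varying error
jittery_preds = ground_truth + 0.03 * rng.standard_normal(500)   # noisy error

for name, preds in [("smooth", smooth_preds), ("jittery", jittery_preds)]:
    print(name,
          "MAE:", round(mean_absolute_error(preds, ground_truth), 4),
          "smoothness:", round(temporal_smoothness(preds), 4))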

Keywords: LiDAR in autonomous driving; autonomous driving; end-to-end driving; evaluation; generalization.

Conflict of interest statement

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Figures

Figure A1
LiDAR and camera images in summer, autumn, and winter (top to bottom for LiDAR, left to right for the camera). The area used as model input is marked with a red rectangle. In the LiDAR images, the red channel corresponds to intensity, green to depth, and blue to ambient radiation.
Figure A2
LiDAR channels at the same location across the three seasons, from top to bottom: summer, autumn, winter. (a) In the intensity channel, the road itself looks markedly different across seasons, while vegetation is surprisingly similar despite deciduous plants having no leaves in autumn and winter. (b) The depth image is stable across seasons but rather uninformative, as road and low-vegetation areas are hard to discern. (c) Ambient radiation images vary strongly in brightness across the seasons and display strong noise, akin to white noise or salt-and-pepper noise; the authors do not know its cause.
Figure A3
Interventions for a LiDAR v1 model in winter. Interventions are far more frequent in open fields, whereas the model handles driving in the forest much better. Furthermore, the middle section of the route, which has bushes by the roadside, is driven well.
Figure 1
The location of sensors used in this work. There are other sensors on the vehicle not illustrated here.
Figure 2
Input modalities. The red box marks the area used as model input. Top: surround-view LiDAR image (red: intensity, blue: depth, green: ambient). Bottom: 120-degree FOV camera.
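
The Figure 2 caption describes how the three Ouster channels form a single camera-like image. A minimal sketch of assembling such an image and cropping the red-box region is given below; the panorama resolution, channel ordering, and crop coordinates are assumptions for illustration, and only the crop size of 264 x 68 pixels (from the Figure 3 caption) comes from the paper:

import numpy as np

# Stand-ins for the real Ouster channel arrays (assumed panorama resolution).
H, W = 128, 1024
intensity = np.random.rand(H, W)
depth = np.random.rand(H, W)
ambient = np.random.rand(H, W)

# Stack the channels into one 3-channel image, mirroring the color mapping
# described in the caption above.
lidar_image = np.stack([intensity, depth, ambient], axis=-1)

# Crop the forward-facing region marked by the red box (coordinates assumed).
top, left = 30, 380
model_input = lidar_image[top:top + 68, left:left + 264, :]
print(model_input.shape)  # (68, 264, 3)
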
Figure 3
The modified PilotNet architecture. Each box represents the output of a layer, with the first box corresponding to the input of size (264, 68, 3). The model consists of 5 convolutional layers and 4 fully connected layers. The flattening operation is not shown here. See Table 1 for filter sizes, batch normalization usage, and activation functions.
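
The Figure 3 caption pins down the layer counts and input size but defers filter sizes, batch normalization, and activations to Table 1 (not reproduced here). A minimal PyTorch sketch that matches the stated counts, assuming original-PilotNet-style filter sizes and strides and ReLU activations, could look like this:

import torch
import torch.nn as nn

class PilotNetLike(nn.Module):
    # 5 convolutional + 4 fully connected layers for a (264, 68, 3) input,
    # as in the Figure 3 caption; all other hyperparameters are assumptions.
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 24, kernel_size=5, stride=2), nn.BatchNorm2d(24), nn.ReLU(),
            nn.Conv2d(24, 36, kernel_size=5, stride=2), nn.BatchNorm2d(36), nn.ReLU(),
            nn.Conv2d(36, 48, kernel_size=5, stride=2), nn.BatchNorm2d(48), nn.ReLU(),
            nn.Conv2d(48, 64, kernel_size=3, stride=1), nn.BatchNorm2d(64), nn.ReLU(),
            nn.Conv2d(64, 64, kernel_size=3, stride=1), nn.BatchNorm2d(64), nn.ReLU(),
        )
        self.regressor = nn.Sequential(
            nn.Flatten(),                            # flattening step not drawn in Figure 3
            nn.Linear(64 * 1 * 26, 100), nn.ReLU(),  # 64 x 1 x 26 feature map for a 68 x 264 input
            nn.Linear(100, 50), nn.ReLU(),
            nn.Linear(50, 10), nn.ReLU(),
            nn.Linear(10, 1),                        # predicted steering angle
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, 3, 68, 264), i.e., the (264, 68, 3) input in channels-first layout
        return self.regressor(self.features(x))

model = PilotNetLike()
print(model(torch.zeros(1, 3, 68, 264)).shape)  # torch.Size([1, 1])
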
Figure 4
Safety-driver interventions in the experiments where the test track was not included in the training set. Interventions from 3 test runs with different versions of the same model and from both driving directions are overlaid on one map. Interventions due to traffic are not filtered out from these maps, unlike in Table 2. Left: camera models v1–v3 (first 3 rows of Table 2). Middle: LiDAR models v1–v3 (rows 4–6 of Table 2). Right: an example of a situation where the safety driver has to take over due to traffic. Such situations are not counted as interventions in Table 2.
