Review
Sensors (Basel). 2019 Sep 26;19(19):4188.
doi: 10.3390/s19194188.

Deep Learning on Point Clouds and Its Application: A Survey

Weiping Liu et al. Sensors (Basel).

Abstract

The point cloud is a widely used 3D data format that can be produced by depth sensors such as Light Detection and Ranging (LIDAR) and RGB-D cameras. Because point clouds are unordered and irregular, many researchers have focused on feature engineering for them. Deep learning, which can learn complex hierarchical structures, has achieved great success with camera images, and many researchers have recently adapted it to point cloud applications. In this paper, recent point cloud feature learning methods are classified as point-based or tree-based. Point-based methods take the raw point cloud directly as the input to a deep learning model. Tree-based methods first use a k-dimensional tree (Kd-tree) to give the point cloud a regular representation and then feed that representation into a deep learning model. The advantages and disadvantages of both categories are analyzed. The applications related to point cloud feature learning, including 3D object classification, semantic segmentation, and 3D object detection, are introduced, and the relevant datasets and evaluation metrics are collected. Finally, future research trends are predicted.

Keywords: application of point cloud; deep learning; feature learning; point cloud.
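
The tree-based idea described in the abstract can be illustrated with a minimal sketch. It assumes a simple median split that cycles through the x, y, and z axes; the surveyed methods may choose split axes differently, and the names `KdNode`, `build_kdtree`, and `flatten` are hypothetical helpers introduced only for this illustration. The point is that the same set of points yields the same tree, and therefore the same flattened sequence, regardless of the order in which the points arrive.

```python
from typing import List, Optional, Tuple

Point = Tuple[float, float, float]


class KdNode:
    """One node of a Kd-tree: a splitting point, its axis, and two subtrees."""

    def __init__(self, point: Point, axis: int,
                 left: Optional["KdNode"], right: Optional["KdNode"]) -> None:
        self.point = point
        self.axis = axis
        self.left = left
        self.right = right


def build_kdtree(points: List[Point], depth: int = 0) -> Optional[KdNode]:
    """Build a Kd-tree by median split, cycling through axes 0, 1, 2.

    Assumption: axis cycling is used here for simplicity only.
    """
    if not points:
        return None
    axis = depth % 3
    # Sort on the split axis; the full tuple breaks ties so the result
    # does not depend on the input order of the points.
    ordered = sorted(points, key=lambda p: (p[axis], p))
    mid = len(ordered) // 2
    return KdNode(
        ordered[mid],
        axis,
        build_kdtree(ordered[:mid], depth + 1),
        build_kdtree(ordered[mid + 1:], depth + 1),
    )


def flatten(node: Optional[KdNode]) -> List[Point]:
    """Pre-order traversal: a fixed, regular ordering of the points."""
    if node is None:
        return []
    return [node.point] + flatten(node.left) + flatten(node.right)


if __name__ == "__main__":
    cloud = [(0.0, 0.0, 0.0), (1.0, 2.0, 3.0), (2.0, 1.0, 0.5),
             (3.0, 3.0, 3.0), (0.5, 2.5, 1.5)]
    shuffled = list(reversed(cloud))
    # Both orderings of the same cloud give the same flattened sequence.
    print(flatten(build_kdtree(cloud)) == flatten(build_kdtree(shuffled)))
```

A point-based method would instead consume `cloud` directly and would have to handle the arbitrary point ordering inside the network itself.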

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1. The main models for feature learning with raw point clouds as input.
Figure 2. The architecture of a recurrent neural network (RNN).
Figure 3. The architecture of an autoencoder.
