Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 8;22(14):5128.
doi: 10.3390/s22145128.

NrtNet: An Unsupervised Method for 3D Non-Rigid Point Cloud Registration Based on Transformer

Affiliations

NrtNet: An Unsupervised Method for 3D Non-Rigid Point Cloud Registration Based on Transformer

Xiaobo Hu et al. Sensors (Basel). .

Abstract

Self-attention networks have revolutionized the field of natural language processing and have also made impressive progress in image analysis tasks. Corrnet3D proposes the idea of first obtaining the point cloud correspondence in point cloud registration. Inspired by these successes, we propose an unsupervised network for non-rigid point cloud registration, namely NrtNet, which is the first network using a transformer for unsupervised large deformation non-rigid point cloud registration. Specifically, NrtNet consists of a feature extraction module, a correspondence matrix generation module, and a reconstruction module. Feeding a pair of point clouds, our model first learns the point-by-point features and feeds them to the transformer-based correspondence matrix generation module, which utilizes the transformer to learn the correspondence probability between pairs of point sets, and then the correspondence probability matrix conducts normalization to obtain the correct point set corresponding matrix. We then permute the point clouds and learn the relative drift of the point pairs to reconstruct the point clouds for registration. Extensive experiments on synthetic and real datasets of non-rigid 3D shapes show that NrtNet outperforms state-of-the-art methods, including methods that use grids as input and methods that directly compute point drift.

Keywords: NrtNet; non-rigid point cloud; registration; self-attentive; transformer; unsupervised.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
The idea of our proposed NrtNet. The point cloud is first rearranged using transformer, and then the permuted point cloud is reconstructed to achieve the registration.
Figure 2
Figure 2
NrtNet is an unsupervised, end-to-end network for non-rigid point cloud registration. The source point cloud ARn×3 and the target point cloud BRn×3 are fed into the feature extraction module and the transformer module to generate the point set correspondence matrix PRn×n. Then, the permuted point cloud is fed into the reconstruction module to generate the exact same point cloud Alast as B, which achieves the purpose of registration.
Figure 3
Figure 3
Transformer module. The probability matrix Prand can be obtained by feeding the high-dimensional features of the point cloud FaRn×d and FbRn×d into the transformer encoder and transformer decoder, respectively. Then, the probability matrix Prand is fed into the smooth module to obtain the inverted exact correspondence matrix P.
Figure 4
Figure 4
Transformer Encoder. φ, ψ is a linear layer, α is an mlp and they are all feature transform layers, δ is a linear layer which is a position encoder and γ is a mapping function.
Figure 5
Figure 5
Transformer decoder and the smooth module. These two modules convert the transformed features obtained from the transformer encoder into an exact point set correspondence matrix.
Figure 6
Figure 6
The reconstruction module concatenates the global features vb of the target point cloud to each permuted point cloud Are_order, and feeds them into the MLP to reconstruct the point cloud Alast.
Figure 7
Figure 7
Quantitative comparison of point set correspondence rates for non-rigid registration under different methods.
Figure 8
Figure 8
In a qualitative comparison between Nrtnet and other methods in a large deformed human pose, the experiments show the effectiveness of different methods for non-rigid point cloud registration.
Figure 9
Figure 9
Quantitative comparison of point set correspondence rates for rigid registration under different methods.
Figure 10
Figure 10
Registration performance of Nrtnet in small deformation dataset paper, cloth, sweater, and t-shirt.
Figure 11
Figure 11
The point set correspondence rate under different losses. The experiments qualitatively compare the correspondence differences between NrtNet’s losses and ordinary losses.
Figure 12
Figure 12
The registration effect of NrtNet on the real scan data.

References

    1. Apostolopoulos J.G., Chou P.A., Culbertson B., Kalker T., Trott M.D., Wee S. The road to immersive communication. Proc. IEEE. 2012;100:974–990. doi: 10.1109/JPROC.2011.2182069. - DOI
    1. Raviteja T., Vedaraj I.R. An introduction of autonomous vehicles and a brief survey. J. Crit. Rev. 2020;7:196–202.
    1. Silva R., Oliveira J.C., Giraldi G.A. Introduction to augmented reality. Natl. Lab. Sci. Comput. 2003;11:1–11.
    1. Hackel T., Savinov N., Ladicky L., Wegner J.D., Schindler K., Pollefeys M. Semantic3d. net: A new large-scale point cloud classification benchmark. arXiv. 20171704.03847
    1. Reisi A.R., Moradi M.H., Jamasb S. Classification and comparison of maximum power point tracking techniques for photovoltaic system: A review. Renew. Sustain. Energy Rev. 2013;19:433–443. doi: 10.1016/j.rser.2012.11.052. - DOI

LinkOut - more resources