Sensors (Basel). 2024 Jul 20;24(14):4707. doi: 10.3390/s24144707.

Deformation Estimation of Textureless Objects from a Single Image

Sahand Eivazi Adli et al. Sensors (Basel).

Abstract

Deformations introduced during the production of plastic components degrade the accuracy of their 3D geometric information, a critical aspect of object inspection processes. This phenomenon is prevalent among primary plastic products from manufacturers. This work proposes a solution for estimating the deformation of textureless plastic objects from only a single RGB image. The solution encompasses a unique image dataset of five deformed parts, a novel mesh-label generation method called sequential deformation, and a training model based on graph convolution. The proposed sequential deformation method outperforms the prevalent chamfer distance algorithm in generating precise mesh labels. The training model projects object vertices onto features extracted from the input image and then predicts vertex location offsets based on the projected features. The meshes predicted using these offsets achieve sub-millimeter accuracy on synthetic images and approximately 2.0 mm on real images.
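The core prediction step described in the abstract can be illustrated with a minimal sketch (the function name and toy data below are hypothetical, not from the paper): the network outputs a per-vertex location offset, and the estimated deformed mesh is simply the undeformed template shifted by those offsets.

```python
import numpy as np

def apply_offsets(template_vertices, predicted_offsets):
    """Both arguments are (N, 3) arrays in the same length units (mm here)."""
    return template_vertices + predicted_offsets

# toy example: a flat 4-vertex template lifted by 1 mm in z
template = np.zeros((4, 3))
offsets = np.tile([0.0, 0.0, 1.0], (4, 1))  # stand-in for the network output
deformed = apply_offsets(template, offsets)
```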

Keywords: deformation estimation; graph convolution; image dataset; label generation; single image; textureless deformed object.


Conflict of interest statement

Authors Joshua K. Pickard and Ganyun Sun were employed by the company Eigen Innovations. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Images of the five textureless plastic objects studied in this paper: (a) skateboard, (b) bracket, (c) round flat receptacle lid (rfrl), (d) oval flange, and (e) vent cover.
Figure 2
Images of the deformed versions of the skateboard; the wireframe illustrates the undeformed model: (a) F=22.5 N, MD=10.08 mm; (b) F=44.5 N, MD=19.93 mm; (c) F=22.5 N, MD=10.08 mm; (d) F=44.5 N, MD=19.93 mm.
Figure 3
Images of the deformed versions of the bracket; the wireframe illustrates the undeformed model: (a) F=24 N, MD=5.14 mm; (b) F=66 N, MD=14.15 mm; (c) F=24 N, MD=5.14 mm; (d) F=66 N, MD=14.15 mm.
Figure 4
Images of the deformed versions of the round flat receptacle lid; the wireframe illustrates the undeformed model: (a) F=1050 N, MD=6.09 mm; (b) F=2250 N, MD=13.06 mm; (c) F=3450 N, MD=20.03 mm; (d) F=4300 N, MD=4300 mm.
Figure 5
Images of the deformed versions of the oval flange; the wireframe illustrates the undeformed model: (a) F=1300 N, MD=8.93 mm; (b) F=2650 N, MD=18.20 mm; (c) F=1300 N, MD=8.93 mm; (d) F=2650 N, MD=18.20 mm.
Figure 6
Images of the deformed versions of the vent cover; the wireframe illustrates the undeformed model: (a) F=150 N, MD=3.88 mm; (b) F=540 N, MD=13.97 mm; (c) F=150 N, MD=3.88 mm; (d) F=540 N, MD=13.97 mm.
Figure 7
This schematic depicts the camera position relative to the deformed vent cover, located at the origin (0,0,0) of Blender’s global coordinate system. The red, green, and blue axes represent the X, Y, and Z axes, respectively.
Figure 8
Images of the deformed vent cover with F=540.0 N, MD=13.97 mm, and RZO=0°: (a) TXL=0 mm, (b) TXL=25 mm, (c) TXL=35 mm, (d) TXL=45 mm, (e) RXC=0°, (f) RXC=10°, (g) RXC=20°, and (h) RXC=30°. Unless otherwise stated, RXC=60° and TXL=0 mm.
Figure 9
Images of the 3D-printed deformed models captured by a smartphone camera. (a) Deformed model of the bracket with F=66.0 N, MD=14.15 mm. (b) Deformed bracket component with F=66.0 N, MD=14.15 mm. (c) Deformed vent cover component with F=270 N, MD=6.98 mm. (d) Deformed model of the vent cover object with F=500.0 N, MD=12.93 mm.
Figure 10
Setup devised to capture images of the 3D-printed deformed models.
Figure 11
Real-world images of a 3D-printed, deformed vent cover (F=500.0 N, MD=12.93 mm) used for training the machine learning model. (a) RZO=0.70° and RXC=85.36°. (b) RZO=36.46° and RXC=82.09°. (c) RZO=34.40° and RXC=72.71°. (d) RZO=3.49° and RXC=76.82°.
Figure 12
Mesh comparison: (a) Deformed mesh model, Ansys output with 6268 vertices and 12,544 faces. (b) Initial undeformed mesh model (training model input) with 1145 vertices and 2298 faces.
Figure 13
The chamfer distance algorithm applied to two 2D point distributions: the red dashed lines show false correspondences for the middle blue point, and the green dashed line shows the true correspondence.
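A common symmetric form of the chamfer distance sketched in Figure 13 can be written in a few lines (this is the standard formulation, not necessarily the paper's exact implementation). Each point is matched to its nearest neighbour in the other set, which is precisely what produces false correspondences under large deformations.

```python
import numpy as np

def chamfer_distance(a, b):
    """a: (N, d) and b: (M, d) point arrays; returns a scalar distance."""
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)  # (N, M) pairwise distances
    return d.min(axis=1).mean() + d.min(axis=0).mean()          # both matching directions

pts = np.array([[0.0, 0.0], [1.0, 0.0], [2.0, 0.0]])
zero = chamfer_distance(pts, pts)          # identical sets give 0.0
shift = chamfer_distance(pts, pts + 0.1)   # a small rigid shift gives a small distance
```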
Figure 14
Sequential deformation (lower path) vs. direct application of the chamfer distance (upper path) on a vent cover object with significant geometric variation (e). (a) Initial undeformed model. (b) Least deformed model with MD=0.25 mm, F=10 N. (c) Average deformed model with MD=7.76 mm, F=300 N. (d) Deformed model with MD=14.75 mm, F=570 N. (e) Most deformed model with MD=15.00 mm, F=580 N. Labels were generated using (f) the direct chamfer distance and (g) the sequential deformation algorithm.
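The idea behind the sequential path in Figure 14 can be sketched as follows (helper names are ours, not the paper's, and the real method operates on full meshes rather than point lists): instead of matching the undeformed template directly to the most deformed mesh, correspondences are chained through intermediate deformation levels, where each nearest-neighbour step is small and therefore reliable.

```python
import numpy as np

def nearest_indices(src, dst):
    """Index in dst of the nearest neighbour of each point in src."""
    d = np.linalg.norm(src[:, None, :] - dst[None, :, :], axis=-1)
    return d.argmin(axis=1)

def sequential_labels(template, deformation_sequence):
    """deformation_sequence: vertex arrays ordered from least to most deformed."""
    current = template
    for target in deformation_sequence:
        current = target[nearest_indices(current, target)]  # small, reliable step
    return current  # per-vertex label positions on the final deformed mesh

# 1D toy: points drift 0.1 per level, so chaining tracks each point correctly
template = np.array([[0.0], [1.0], [2.0]])
levels = [template + 0.1 * k for k in range(1, 6)]
labels = sequential_labels(template, levels)
```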
Figure 15
Sequential deformation (SD) vs. chamfer distance (CD) on skateboard object with MD=19.93 mm (first row) and MD=19.93 mm (second row). (a,e) Initial undeformed model. (b,f) Deformed models (Ansys output). (c,g) CD algorithm outputs. (d,h) SD algorithm outputs.
Figure 16
Sequential deformation (SD) vs. chamfer distance (CD) on bracket object with MD=14.15 mm (first row) and MD=14.15 mm (second row). (a,e) Initial undeformed model. (b,f) Deformed models (Ansys output). (c,g) CD algorithm outputs. (d,h) SD algorithm outputs.
Figure 17
Sequential deformation (SD) vs. chamfer distance (CD) on RFRL object with MD=24.97 mm. (a) Initial undeformed model. (b) Deformed model (Ansys output). (c) CD algorithm output. (d) SD algorithm output.
Figure 18
Sequential deformation (SD) vs. chamfer distance (CD) on oval flange object with MD=18.20 mm (first row) and MD=18.20 mm (second row). (a,e) Initial undeformed model. (b,f) Deformed models (Ansys output). (c,g) CD algorithm outputs. (d,h) SD algorithm outputs.
Figure 19
Sequential deformation (SD) vs. chamfer distance (CD) on vent cover object with MD=15.00 mm. (a) Initial undeformed model. (b) Deformed model (Ansys output). (c) CD algorithm output. (d) SD algorithm output.
Figure 20
Training model pipeline: Blue cubes represent 2D convolutional layers, and yellow cubes denote max-pooling layers. The “cam” block represents the camera model’s intrinsic and extrinsic parameters. Green rectangles symbolize graph convolutional layers, while magenta rectangles represent dense layers.
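The "cam" block in Figure 20 applies the camera's intrinsic and extrinsic parameters to map mesh vertices into the image so that per-vertex features can be sampled. A standard pinhole projection illustrates this step (a sketch only; the paper's exact calibration values are not reproduced here):

```python
import numpy as np

def project_vertices(vertices, K, R, t):
    """vertices: (N, 3) world points; K: (3, 3) intrinsics; R, t: extrinsics."""
    cam = vertices @ R.T + t           # world frame -> camera frame
    uvw = cam @ K.T                    # camera frame -> homogeneous pixel coords
    return uvw[:, :2] / uvw[:, 2:3]    # perspective divide -> pixel coordinates

# a vertex on the optical axis projects to the principal point (320, 240)
K = np.array([[800.0, 0.0, 320.0], [0.0, 800.0, 240.0], [0.0, 0.0, 1.0]])
uv = project_vertices(np.array([[0.0, 0.0, 2.0]]), K, np.eye(3), np.zeros(3))
```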
Figure 21
Visualization of actual Euclidean distance error between the predicted mesh and the ground truth for each vertex of the predicted skateboard [F=33.5 N, MD=15.0 mm]. (a) Input mesh of the training network (undeformed). (b) Input image of the deformed skateboard fed to the training model. (c–e) Predicted mesh of the deformed skateboard from different viewpoints. (f) Color bar representing the magnitude of the Euclidean distance error in millimeters (mm).
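The per-vertex quantity visualized in these error figures is the Euclidean distance between each predicted vertex and its ground-truth counterpart; a minimal sketch (function name is ours):

```python
import numpy as np

def per_vertex_error(pred, gt):
    """pred, gt: (N, 3) vertex arrays in mm; returns (N,) distances in mm."""
    return np.linalg.norm(pred - gt, axis=1)

# a 3-4-5 toy example: one vertex displaced by 3 mm in x and 4 mm in y
errs = per_vertex_error(np.array([[0.0, 0.0, 0.0]]), np.array([[3.0, 4.0, 0.0]]))
```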
Figure 22
Visualization of actual Euclidean distance error between the predicted mesh and the ground truth for each vertex of the predicted skateboard [F=33.5 N, MD=15.0 mm]. (a) Input mesh of the training network (undeformed). (b) Input image of the deformed skateboard fed to the training model. (c–e) Predicted mesh of the deformed skateboard from different viewpoints. (f) Color bar representing the magnitude of the Euclidean distance error in millimeters (mm).
Figure 23
Visualization of actual Euclidean distance error between the predicted mesh and the ground truth for each vertex of the predicted bracket [F=28 N, MD=6.00 mm]. (a) Input mesh of the training network (undeformed). (b) Input image of the deformed bracket fed to the training model. (c–e) Predicted mesh of the deformed bracket from different viewpoints. (f) Color bar representing the magnitude of the Euclidean distance error in millimeters (mm).
Figure 24
Visualization of actual Euclidean distance error between the predicted mesh and the ground truth for each vertex of the predicted bracket [F=46 N, MD=9.86 mm]. (a) Input mesh of the training network (undeformed). (b) Input image of the deformed bracket fed to the training model. (c–e) Predicted mesh of the deformed bracket from different viewpoints. (f) Color bar representing the magnitude of the Euclidean distance error in millimeters (mm).
Figure 25
Visualization of actual Euclidean distance error between the predicted mesh and the ground truth for each vertex of the predicted RFRL [F=2750 N, MD=15.96 mm]. (a) Input mesh of the training network (undeformed). (b) Input image of the deformed RFRL fed to the training model. (c–e) Predicted mesh of the deformed RFRL from different viewpoints. (f) Color bar representing the magnitude of the Euclidean distance error in millimeters (mm).
Figure 26
Visualization of actual Euclidean distance error between the predicted mesh and the ground truth for each vertex of the predicted RFRL [F=3950 N, MD=22.93 mm]. (a) Input mesh of the training network (undeformed). (b) Input image of the deformed RFRL fed to the training model. (c–e) Predicted mesh of the deformed RFRL from different viewpoints. (f) Color bar representing the magnitude of the Euclidean distance error in millimeters (mm).
Figure 27
Visualization of actual Euclidean distance error between the predicted mesh and the ground truth for each vertex of the predicted oval flange [F=1750 N, MD=12.02 mm]. (a) Input mesh of the training network (undeformed). (b) Input image of the deformed oval flange fed to the training model. (c–e) Predicted mesh of the deformed oval flange from different viewpoints. (f) Color bar representing the magnitude of the Euclidean distance error in millimeters (mm).
Figure 28
Visualization of actual Euclidean distance error between the predicted mesh and the ground truth for each vertex of the predicted oval flange [F=2200 N, MD=15.11 mm]. (a) Input mesh of the training network (undeformed). (b) Input image of the deformed oval flange fed to the training model. (c–e) Predicted mesh of the deformed oval flange from different viewpoints. (f) Color bar representing the magnitude of the Euclidean distance error in millimeters (mm).
Figure 29
Visualization of actual Euclidean distance error between the predicted mesh and the ground truth for each vertex of the predicted vent cover [F=310 N, MD=8.02 mm]. (a) Input mesh of the training network (undeformed). (b) Input image of the deformed vent cover fed to the training model. (c–e) Predicted mesh of the deformed vent cover from different viewpoints. (f) Color bar representing the magnitude of the Euclidean distance error in millimeters (mm).
Figure 30
Visualization of actual Euclidean distance error between the predicted mesh and the ground truth for each vertex of the predicted vent cover [F=390 N, MD=10.09 mm]. (a) Input mesh of the training network (undeformed). (b) Input image of the deformed vent cover fed to the training model. (c–e) Predicted mesh of the deformed vent cover from different viewpoints. (f) Color bar representing the magnitude of the Euclidean distance error in millimeters (mm).
Figure 31
Visualization of actual Euclidean distance error between the predicted mesh and the ground truth for each vertex of the predicted bracket [F=28 N, MD=6.00 mm]. (a) Input mesh of the training network (undeformed). (b) Real image of the actual deformed bracket fed to the training model. (c–e) Predicted mesh of the deformed bracket from different viewpoints. (f) Color bar representing the magnitude of the Euclidean distance error in millimeters (mm).
Figure 32
Visualization of actual Euclidean distance error between the predicted mesh and the ground truth for each vertex of the predicted vent cover [F=500 N, MD=10.09 mm]. (a) Input mesh of the training network (undeformed). (b) Real image of the 3D-printed deformed vent cover fed to the training model. (c–e) Predicted mesh of the deformed vent cover from different viewpoints. (f) Color bar representing the magnitude of the Euclidean distance error in millimeters (mm).

